Science ❯ Computer Science ❯ AI Research ❯ Behavioral Analysis
The revision elevates misalignment plus persuasion into formal risk thresholds with mandatory safety reviews before release.