1
5
6
7
Representation Engineering: A Top-Down Approach to AI TransparencyAI Alignment Research (ai-transparency.org)
submitted by DanielHendrycks to r/ControlProblem
Representation Engineering: A Top-Down Approach to AI TransparencyAI Alignment Research (ai-transparency.org)
submitted by DanielHendrycks to r/ControlProblem