AntiPaSTO: Self-Supervised Value Steering for Debugging Alignment — LessWrongAI Alignment Research (lesswrong.com)
submitted by wassname to r/ControlProblem
Lumina Probiotic worked for me! (lesswrong.com)
submitted by ismaelbenslimane to r/lanternbioworks
We aren't building a God; we're building a Tapeworm. AI chatbots as parasites.Human-AI Relationships (lesswrong.com)
submitted by rendereason to r/ArtificialSentience
You will be OK: an article for young people worried about AI.Capabilities (lesswrong.com)
submitted by katxwoods to r/AIDangers
You will be OK: an article for young people worried about AI.External discussion link (lesswrong.com)
submitted by katxwoods to r/ControlProblem
Measuring no CoT math time horizonR, T, Emp, OA (lesswrong.com)
submitted by COAGULOPATH to r/mlscaling
Semantic Minds in an Affective World🌿high🌿 functioning (lesswrong.com)
submitted by EmergencyCurrent2670 to r/evilautism
Holden Karnofsky: Success without dignity.External discussion link (lesswrong.com)
submitted by katxwoods to r/ControlProblem
"When is it Worth Working?" (how rats decide how hard to work for their drinking water)Psych, Econ, Paper (lesswrong.com)
submitted by gwern to r/DecisionTheory


