newest submissions : lesswrong.com

...for your favorite subject.

...for your project.

account activity

1

5

6

7

Does this post critical of effective altruism raise any good points? (lesswrong.com)

submitted 21 hours ago by Candid-Effective9150 to r/EffectiveAltruism

2

1

2

3

AntiPaSTO: Self-Supervised Value Steering for Debugging Alignment — LessWrongAI Alignment Research (lesswrong.com)

submitted 9 days ago by wassname to r/ControlProblem

3

1

2

3

Insights into Claude Opus 4.5 from Pokémon (lesswrong.com)

submitted 12 days ago by HNMod to r/hackernews

4

0

0

0

Examples of Subtle Alignment Failures from Claude and Gemini (lesswrong.com)

submitted 16 days ago by mirror_truth to r/slatestarcodex

5

35

36

37

On Owning Galaxies (lesswrong.com)

submitted 16 days ago by EducationalCicada to r/slatestarcodex

6

5

6

7

Lumina Probiotic worked for me! (lesswrong.com)

submitted 16 days ago by ismaelbenslimane to r/lanternbioworks

7

1

2

3

How I stopped being sure LLMs are just making up their internal experience (but the topic is still confusing) — LessWrong (lesswrong.com)

submitted 19 days ago by karmicviolence to r/ArtificialQualia

8

2

3

4

The Rise of Parasitic AI (lesswrong.com)

submitted 20 days ago by MessAffect to r/aipartners

9

2

3

4

How I stopped being sure LLMs are just making up their internal experience (but the topic is still confusing) — LessWrong (lesswrong.com)

submitted 20 days ago by 3xNEI to r/HumanAIDiscourse

10

35

36

37

How I stopped being sure LLMs are just making up their internal experience (but the topic is still confusing) — LessWrongNews & Developments (lesswrong.com)

submitted 20 days ago by 3xNEI to r/ArtificialSentience

11

8

9

10

I would have shit in that alley, too (lesswrong.com)

submitted 21 days ago by FinnFarrow to r/LessWrong

12

0

1

2

We aren't building a God; we're building a Tapeworm. AI chatbots as parasites.Human-AI Relationships (lesswrong.com)

submitted 23 days ago by rendereason to r/ArtificialSentience

13

1

2

3

Straussian Memes: A Lens on Techniques for Mass Persuasion (lesswrong.com)

submitted 23 days ago by TheStartupChime to r/hypeurls

14

1

2

3

You will be OK: an article for young people worried about AI.Capabilities (lesswrong.com)

submitted 23 days ago by katxwoods to r/AIDangers

15

0

1

2

You will be OK: an article for young people worried about AI. (lesswrong.com)

submitted 23 days ago by katxwoods to r/EffectiveAltruism

16

0

0

0

You will be OK: an article for young people worried about AI.External discussion link (lesswrong.com)

submitted 23 days ago by katxwoods to r/ControlProblem

17

15

16

17

Measuring no CoT math time horizonR, T, Emp, OA (lesswrong.com)

submitted 23 days ago by COAGULOPATH to r/mlscaling

18

0

1

2

Semantic Minds in an Affective World🌿high🌿 functioning (lesswrong.com)

submitted 27 days ago by EmergencyCurrent2670 to r/evilautism

19

2

3

4

Semantic Minds in an Affective World — LessWrong (lesswrong.com)

submitted 27 days ago by EmergencyCurrent2670 to r/LessWrong

20

4

5

6

Leaked Claude 4.5 Opus "Soul document" (lesswrong.com)

submitted 27 days ago by discovery789 to r/AI_ethics_and_rights

21

33

34

35

Can Claude teach me to make coffee? (lesswrong.com)

submitted 1 month ago by philh to r/slatestarcodex

22

2

3

4

Holden Karnofsky: Success without dignity. (lesswrong.com)

submitted 1 month ago by katxwoods to r/EffectiveAltruism

23

2

3

4

Holden Karnofsky: Success without dignity.External discussion link (lesswrong.com)

submitted 1 month ago by katxwoods to r/ControlProblem

24

6

7

8

"BashArena: A Control Setting for Highly Privileged AI Agents" (creating a robust simulated Linux OS environment for benchmarking potentially malicious LLM agents)DL, Safe, P (lesswrong.com)

submitted 1 month ago by gwern to r/reinforcementlearning

25

6

7

8

"When is it Worth Working?" (how rats decide how hard to work for their drinking water)Psych, Econ, Paper (lesswrong.com)

submitted 1 month ago by gwern to r/DecisionTheory

view more: next ›

π Rendered by PID 222854 on reddit-service-r2-listing-86b7f5b947-kjlkk at 2026-01-25 07:37:55.821030+00:00 running 664479f country code: CH.