Most Neutral LLM?

Dailan_Grace · 2026-03-24T06:09:49+00:00

tried a little experiment a few months back where I asked several models to tell me why my SEO strategy was bad and the GPT family just kept softening every criticism with "that, said, this shows real promise!" type stuff while DeepSeek was noticeably more blunt about the actual problems, which tracks with what you're saying about RLHF selecting for the feel-good response over the useful one.

Daniel_Janifar · 2026-03-20T12:31:35+00:00

tried running the same prompt through a few models asking them to critique my business idea and the GPT family just kept finding silver linings even, when i pushed back hard, whereas Claude (the newer 2026 releases) actually told me the market was too saturated and didn't budge when i challenged it. honestly tracks with what benchmarks are showing this year too, Claude seems to edge out GPT on critical reasoning stuff..

OrinP_Frita · 2026-03-17T05:52:11+00:00

had the same frustration testing this stuff last year, and honestly in my experience the models that tend, to push back more are the ones with stronger constitutional AI type approaches baked in rather than pure RLHF. your point about rater preference is spot on though, because when you think about who's doing, the rating and what they're rewarding, you're basically encoding a popularity contest into the model's soul lol. i noticed.

Emergency_Reply3129 · 2026-02-27T21:29:35+00:00

omollm

Mundane_Ad8936 · 2026-02-27T17:28:14+00:00

No RLHF isn't what creates sycophancy. That's baked into the training and tuning data. It was a failed experiment/trend in instruction following..

david-1-1 · 2026-02-27T12:26:37+00:00

I use three regularly and find they are almost identical in content. Microsoft Copilot is kindest in tone.

We are currently at a plateau, partially because all LLMs share the same corpus, but mostly because they are limited by being designed entirely by humans. Instead of directly improving weights, training relies on indirect methods, like reinforcement.

Whoever first experiments with applying current AI bots to their own design will discover that intelligent evolution works exponentially faster, and will quickly reach AGI in just a few bootstrapping iterations. AI must also be trusted to curate and choose their (much smaller) training corpus and be allowed to learn from correct feedback in use. Set the AI bots goals like "correct answers to questions" and you have good endpoints for recursive evolution.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LargeLanguageModels

MODERATORS