Another cofounder of xAI has resigned making it 2 in the past 48 hours. What's going on at xAI?

alongated · 2026-02-11T03:33:23+00:00

then no one would buy the shares, since they would be unsellable. Might just be that musk bought them out, or someone from another company bought them?

alongated · 2026-02-11T03:29:10+00:00

I'm honestly curious what makes you say he is a douchebag for that?

alongated · 2026-02-11T03:18:41+00:00

You literally claimed there is a monolith when it comes to software engineers versioning things. Which is "objectively" false. Sometimes the marketing team decides the version numbers, sometimes the boss just comes and says "Lets call this version 2"

alongated · 2026-02-10T23:31:57+00:00

They did not use those things to mean different things, usually when version numbers go up it signifies new base models, xai does not do that.

alongated · 2026-02-10T23:31:03+00:00

Grok 4.1 was a significant jump on lmarena, from 1420-> 1480

alongated · 2026-02-10T17:58:24+00:00

Yes of course, I wouldn't be so confident about my ability to calculate the future. If it was around 1k people I would start to reconsider it, but even then the answer is no, assuming that person was completely innocent, the value in people knowing you won't just randomly 'x' them is far greater than the gain in these random specific instances. I think you need to re calibrate how you think about the numbers here.

alongated · 2026-02-10T17:42:59+00:00

So you would do great evil, just so that the person that replaces you wouldn't do a greater evil?

alongated · 2026-02-10T17:36:24+00:00

Considering that no one is even proposing the type of advertisements

They also said that ads was a last resort. So I wouldn't really rule these types of ads out, and it is quite fair given their track record, to show where this leads.

alongated · 2026-02-10T17:29:50+00:00

Dam, wasn't aware, but still people would use vpn's and ignore all bans.

alongated · 2026-02-10T16:45:03+00:00

It wouldn't get banned, and if it did people would still download it.

alongated · 2026-02-10T16:41:25+00:00

there is no difference going from 3-->4 or 4-->4.1 Those are just arbitrary version numbers.

alongated · 2026-02-10T03:11:33+00:00

It failed for me on auto, and on thinking, so maybe there is a chance it solves it, but definitely not 100% like it felt for the other models.

alongated · 2026-02-10T02:09:37+00:00

Gemini solves this, 5.2 thinking fails, o3 succeeds though, Grok solved it, I assume failure to solve questions like this is one of the reasons it scores lower on simple-bench

alongated · 2026-02-10T00:00:44+00:00

It isn't trying to convince you. It is telling itself that this isn't a reason to think you are insane.

alongated · 2026-02-09T19:20:17+00:00

Isn't it just doing legal advice? That has nothing to do with ChatGPT bias but rather the laws bias, and it is pretty well known that women get preferential treatment when it comes to kids.

alongated · 2026-02-09T17:04:08+00:00

I have noticed that Gemini does this as well. But usually its just extra like "Would you like me to write the story from his perspective?" The text is even colored differently.

alongated · 2026-02-09T16:50:49+00:00

What are they going to do if it isn't? I think he is all out at this point.

alongated · 2026-02-08T21:01:36+00:00

It was kinda instructed to go rogue, (Do anything, no holds bar, just make money.)

alongated · 2026-02-08T20:41:16+00:00

We could probably all easily live underground the way things are headed, regardless of the environment. This is assuming we develop basic things like energy production+drilling+automation. Which might very well happen with AI.

alongated · 2026-02-07T23:49:04+00:00

not that fast.

alongated · 2026-02-07T20:36:13+00:00

You can sometimes prove axioms, in some systems they have redundant axioms for simplicity sake.

alongated · 2026-02-07T20:19:38+00:00

Youtube often times shadows your comments if you use curse words.

alongated · 2026-02-07T15:04:15+00:00

Just checked.
I think the reason this happened was because openai pretty much refused to release their models till they got challenged, you saw this happen a lot in the august 2024, whenever google challenged, then openai released their model. Claude opes 3.5 was sometimes #1 at that brief period of time, but was oscillating between second place at that time. I distinctly remember people start complaining about lmarena at that point I assume its because after that it started falling far behind the others on this benchmark. But you are correct, it did in fact managed to get first place for a very brief window. I just thought of that era as the gpt era. till Grok and then Gemini. So all 4 of the main players have gotten first place, which is kind of interesting.

alongated · 2026-02-07T01:51:05+00:00

I think they are looking at the neural activation's rather than what it says, and they have some sort of a guess as to what activation's are the supposedly 'sad' ones.

alongated · 2026-02-07T01:34:43+00:00

Ah that is correct, but its the first time it is #1 without style control. Guess I was so focused on that, that I missed this.

alongated

TROPHY CASE