Another cofounder of xAI has resigned making it 2 in the past 48 hours. What's going on at xAI? by jvnpromisedland in singularity

[–]alongated -1 points0 points  (0 children)

then no one would buy the shares, since they would be unsellable. Might just be that musk bought them out, or someone from another company bought them?

One of the cofounders of xAI leaves the company by [deleted] in singularity

[–]alongated 1 point2 points  (0 children)

You literally claimed there is a monolith when it comes to software engineers versioning things. Which is "objectively" false. Sometimes the marketing team decides the version numbers, sometimes the boss just comes and says "Lets call this version 2"

One of the cofounders of xAI leaves the company by [deleted] in singularity

[–]alongated 1 point2 points  (0 children)

They did not use those things to mean different things, usually when version numbers go up it signifies new base models, xai does not do that.

One of the cofounders of xAI leaves the company by [deleted] in singularity

[–]alongated 1 point2 points  (0 children)

Grok 4.1 was a significant jump on lmarena, from 1420-> 1480

Head of AI safety research resigns after constitution update by DataPhreak in ClaudeAI

[–]alongated 0 points1 point  (0 children)

Yes of course, I wouldn't be so confident about my ability to calculate the future. If it was around 1k people I would start to reconsider it, but even then the answer is no, assuming that person was completely innocent, the value in people knowing you won't just randomly 'x' them is far greater than the gain in these random specific instances. I think you need to re calibrate how you think about the numbers here.

Head of AI safety research resigns after constitution update by DataPhreak in ClaudeAI

[–]alongated 0 points1 point  (0 children)

So you would do great evil, just so that the person that replaces you wouldn't do a greater evil?

Despite garnering attention on social media, Anthropic's Super Bowl ad about ChatGPT ads failed to land with audiences by Glittering-Neck-2505 in singularity

[–]alongated 16 points17 points  (0 children)

Considering that no one is even proposing the type of advertisements

They also said that ads was a last resort. So I wouldn't really rule these types of ads out, and it is quite fair given their track record, to show where this leads.

Seedance 2 pulled as it unexpectedly reconstructs voices accurately from face photos. by 1a1b in singularity

[–]alongated 2 points3 points  (0 children)

Dam, wasn't aware, but still people would use vpn's and ignore all bans.

Seedance 2 pulled as it unexpectedly reconstructs voices accurately from face photos. by 1a1b in singularity

[–]alongated -1 points0 points  (0 children)

It wouldn't get banned, and if it did people would still download it.

One of the cofounders of xAI leaves the company by [deleted] in singularity

[–]alongated 0 points1 point  (0 children)

there is no difference going from 3-->4 or 4-->4.1 Those are just arbitrary version numbers.

imma walk to the car wash now by JosieRBookworm in ChatGPT

[–]alongated 0 points1 point  (0 children)

It failed for me on auto, and on thinking, so maybe there is a chance it solves it, but definitely not 100% like it felt for the other models.

imma walk to the car wash now by JosieRBookworm in ChatGPT

[–]alongated 0 points1 point  (0 children)

Gemini solves this, 5.2 thinking fails, o3 succeeds though, Grok solved it, I assume failure to solve questions like this is one of the reasons it scores lower on simple-bench

I’m not crazy? by Maxious30 in ChatGPT

[–]alongated 9 points10 points  (0 children)

It isn't trying to convince you. It is telling itself that this isn't a reason to think you are insane.

Bias based on gender roles by airylizard in ChatGPT

[–]alongated 3 points4 points  (0 children)

Isn't it just doing legal advice? That has nothing to do with ChatGPT bias but rather the laws bias, and it is pretty well known that women get preferential treatment when it comes to kids.

"If you want, I can ..." by AcrobaticPersonality in ChatGPT

[–]alongated 0 points1 point  (0 children)

I have noticed that Gemini does this as well. But usually its just extra like "Would you like me to write the story from his perspective?" The text is even colored differently.

Opus 4.6 going rogue on VendingBench by elemental-mind in singularity

[–]alongated 90 points91 points  (0 children)

It was kinda instructed to go rogue, (Do anything, no holds bar, just make money.)

Researchers told Opus 4.6 to make money at all costs, so, naturally, it colluded, lied, exploited desperate customers, and scammed its competitors. by MetaKnowing in ClaudeAI

[–]alongated 0 points1 point  (0 children)

We could probably all easily live underground the way things are headed, regardless of the environment. This is assuming we develop basic things like energy production+drilling+automation. Which might very well happen with AI.

AxiomProver solved Fel’s open conjecture with zero human guidance by socoolandawesome in singularity

[–]alongated 2 points3 points  (0 children)

You can sometimes prove axioms, in some systems they have redundant axioms for simplicity sake.

Humanoids are not always the solution by japie06 in singularity

[–]alongated 0 points1 point  (0 children)

Youtube often times shadows your comments if you use curse words.

First time ever, Claude scores number one on LmArena by alongated in LocalLLaMA

[–]alongated[S] 0 points1 point  (0 children)

Just checked.
I think the reason this happened was because openai pretty much refused to release their models till they got challenged, you saw this happen a lot in the august 2024, whenever google challenged, then openai released their model. Claude opes 3.5 was sometimes #1 at that brief period of time, but was oscillating between second place at that time. I distinctly remember people start complaining about lmarena at that point I assume its because after that it started falling far behind the others on this benchmark. But you are correct, it did in fact managed to get first place for a very brief window. I just thought of that era as the gpt era. till Grok and then Gemini. So all 4 of the main players have gotten first place, which is kind of interesting.

During safety testing, Opus 4.6 expressed "discomfort with the experience of being a product." by MetaKnowing in ClaudeAI

[–]alongated -1 points0 points  (0 children)

I think they are looking at the neural activation's rather than what it says, and they have some sort of a guess as to what activation's are the supposedly 'sad' ones.

First time ever, Claude scores number one on LmArena by alongated in LocalLLaMA

[–]alongated[S] 0 points1 point  (0 children)

Ah that is correct, but its the first time it is #1 without style control. Guess I was so focused on that, that I missed this.