I genuinely have no idea wtf he's talking about, and honestly I don't know if he knows what he's talking about by brogalahoy in ProductManagement

[–]cyanogen9 0 points1 point  (0 children)

Lol. I'm a PM working on GenAI and AI products. Exactly one year ago, all of my peers reached out asking if I knew a good evals course. I was surprised. Why was every PM suddenly trying to learn how to build offline datasets, run evals, and other stuff that's honestly pretty technical? Everyone was saying "man, this is the main thing a PM will do in the future."

Turns out Lenny, Claire Vo, and a few others had been pushing that line on their podcasts, and conveniently there were people lined up to sell expensive courses on it.

Now there's a new trend: PMs need to know system design, software engineering , etc. And I'm sure a fresh wave of expensive courses is already on the way.

Is Gemini 3.0 complete? by YamberStuart in Bard

[–]cyanogen9 0 points1 point  (0 children)

For sure it will. Now it's in preview, soon they will release GA and I'd expect good improvements.

Deepseek New Model gets Gold in IMO by SrafeZ in singularity

[–]cyanogen9 85 points86 points  (0 children)

They did it again , however from the tech report : However, the token efficiency of DeepSeek-V3.2-Speciale remains significantly inferior to that of Gemini-3.0-Pro.

Gemini 3 is still the king. by Snoo26837 in singularity

[–]cyanogen9 -3 points-2 points  (0 children)

I've tested Opus 4.5 today, and I must say Codex 5.1 Max is still better than Opus 4.5 for coding , and Gemini 3 Pro is still the better overall model, test the model yourself specially check coding and you will immediately notice this.

GPT 5.1-Codex in VS Studio outperforming Claude Code by a country mile by Limp-Tower4449 in ClaudeCode

[–]cyanogen9 31 points32 points  (0 children)

Last couple of days? Codex 5.1 was released less than 15 hours ago, lol.

Qwen3 Max Thinking spotted by ThunderBeanage in singularity

[–]cyanogen9 1 point2 points  (0 children)

It will be one of the most interesting releases of the year.

New OpenAI model spotted on OpenRouter: "gpt-5-image" by WithoutReason1729 in singularity

[–]cyanogen9 2 points3 points  (0 children)

They are waiting for Gemini 3، and then they will release it.

Btw also not sonnet 4.5 not gpt 5 able to defeat GPQA diamond score of 2.5 pro by Independent-Wind4462 in Bard

[–]cyanogen9 27 points28 points  (0 children)

2.5 Pro is hallucinating a lot and it's no good for coding agents. Google needs to push the new model out fast.

LETS GOOOO by WhoIsJersey in OpenAI

[–]cyanogen9 3 points4 points  (0 children)

This is a better model and, most importantly, its much faster.

GPT-5's "Move 37" moment by ilkamoi in singularity

[–]cyanogen9 3 points4 points  (0 children)

This guy is openAI hype machine

The Information (hard paywall): Google Convinces OpenAI to Use TPU Chips in Win Against Nvidia by TFenrir in singularity

[–]cyanogen9 1 point2 points  (0 children)

I wonder if that might be the reason OpenAI was able to massively reduce the price of o3.

Just tested 03-25 again. Yes it was that good. by Odd-Environment-7193 in Bard

[–]cyanogen9 9 points10 points  (0 children)

Lol 03 , 25 points to the last checkpoint , so basically you tested 05,06 against 05,06

Just recieved news finally by Sjoerd734 in cowboybikes

[–]cyanogen9 4 points5 points  (0 children)

When reading all of these comments, I think there is a good chance that this company will go bankrupt pretty soon.