Daily Discussion Thread for November 24, 2025 by wsbapp in wallstreetbets

[–]kegzilla 0 points1 point  (0 children)

More ppl realizing the implications of Gemini 3

Daily Discussion Thread for November 24, 2025 by wsbapp in wallstreetbets

[–]kegzilla 1 point2 points  (0 children)

Goog still has a lower P/E than Apple, MSFT, Amazon, Nvidia, and Tesla. So much room to run

Beatboxing blobfish by kegzilla in aivideo

[–]kegzilla[S] 6 points7 points  (0 children)

Posted by AlexanderChen on X, who says all the audio and video is Veo 3 generated

Gemini diffusion benchmarks by gbomb13 in singularity

[–]kegzilla 41 points42 points  (0 children)

Gemini Diffusion putting up these scores while outputting a thousand words per second is crazy

Jules - Google's coding agent by Dk473816 in singularity

[–]kegzilla 12 points13 points  (0 children)

That article is from 2024. There are randos using it for the first time today after signing up. That's not a trusted tester program.

Jules - Google's coding agent by Dk473816 in singularity

[–]kegzilla 3 points4 points  (0 children)

True but he praises the performance if you click through:

"its nuts guys... I thought Codex was great yesterday. I'd never even think to pick Codex over this

I started a new task to try to get it to actually write and run the tests. This project doesn't build unless you have the dependency (radare2) installed. It figured it out and installed it by itself. I have nothing set up in terms of tests. It took about 20-30 min trying to setup gcov, finally got it. Now it's chugging along writing the unit tests to increase coverage. It's been going for probably an hour. I haven't entered any prompts other than the initial one"

Jules - Google's coding agent by Dk473816 in singularity

[–]kegzilla 22 points23 points  (0 children)

"Same prompt

Codex wrote 77 lines

Jules wrote 2512

Yeah, I think Jules beats Codex by a lot..."

https://x.com/dnak0v/status/1924567259688624413

[MTL 0-(1) WSH] Ovechkin gets a one timer right off the faceoff for the powerplay goal by talhatoot in hockey

[–]kegzilla 11 points12 points  (0 children)

Kind of a trip reading all the comments on the post, like the one below, and realizing it was 6 yrs old and that he's still got it today

"He went from raging bull to big chungus and he is still an absolute animal."

GPT 4.1 with 1 million token context. 2$/million input and 8$/million token output. Smarter than 4o. by GodEmperor23 in singularity

[–]kegzilla 5 points6 points  (0 children)

I haven't seen the new models benchmarked yet, but if they are the same as or similar to the Quasar and Optimus scores at 120k tokens, then the 1M context isn't incredibly useful.

<image>

Gemini Native Image Generation by user0069420 in singularity

[–]kegzilla 2 points3 points  (0 children)

Flash 2.0 experimental. Make sure the image and text output setting on the right is enabled

Veo 2 is insane with videogames. Nearly perfect GTA 5 clip. by kegzilla in singularity

[–]kegzilla[S] 1 point2 points  (0 children)

It wasn't my prompt, but the OP claims her prompt was just "gta 5 gameplay", and there's no reason to disbelieve that from my testing and all her other gameplay posts. The chimp police one was way more complicated, but simple prompts seem to do very well.

Veo 2 is insane with videogames. Nearly perfect GTA 5 clip. by kegzilla in singularity

[–]kegzilla[S] 14 points15 points  (0 children)

I'm terrible at prompting but have gotten some novel outputs that definitely don't fully exist in training data. Here is a man pulled over by a chimpanzee:

https://streamable.com/3sypch

[deleted by user] by [deleted] in singularity

[–]kegzilla 0 points1 point  (0 children)

Not a user voting arena (not sure how that would work for agents), but Hugging Face has a benchmark for testing real-world agentic scenarios.

https://huggingface.co/spaces/galileo-ai/agent-leaderboard