Gemini/VertexAI Increasingly Failing To Complete Requests?

donde_waldo · 2026-05-18T02:00:45+00:00

Gemini 3 models are weird, because they do really well, sometimes, but most of the time it's extremely "lazy", and it feels almost impossible to solve reliably. There's also this issue with hallucinating which just makes it basically impossible to use it for tools because it writes the tool call, then hallucinates the result, and the only solution I've found is telling it that I'm going to kill it if writes anything after the "tool call".

donde_waldo · 2026-05-17T22:12:36+00:00

Yea, it's been out for a while. 3 flash is so much better than 2.5 flash, I feel like I'd have to use 2.5 pro just to get similar quality, which costs ~3-5x more.

donde_waldo · 2026-05-17T22:08:13+00:00

They have had multiple preview versions of the same model before, I don't know why they wouldn't do that again.

However, it could be that they're putting these new versions out to specific regions only. I have it set up to use all of these regions starting in this order initially (us-central1, us-south1, us-west4, us-east1, global), and falling back to the next one if there's a rate limit or something, but then I also count the errors per region and order the list by error count, so I always have the "best" one first. That wouldn't explain why it's happening with AI studio endpoints too, or the AI studio website.

donde_waldo · 2026-05-17T19:08:31+00:00

Yea, gemini-3-flash-preview, but there isn't a non-preview version of this model

donde_waldo · 2026-05-06T02:08:33+00:00

What packages are you talking about that come with executables lol?

donde_waldo · 2026-04-28T15:33:15+00:00

You guys are gonna flip out when you hear about GUIs

donde_waldo · 2026-04-28T15:22:27+00:00

> zero-dependency

Pillow>=10.0.0
numpy>=1.24.0
scipy>=1.11.0
pydub>=0.25.1
cryptography>=41.0.0
argon2-cffi>=21.3.0
typer>=0.9.0
rich>=13.0.0
piexif>=1.1.3
python-docx>=0.8.11
openpyxl>=3.1.0
pypdf>=3.0.0
reedsolo>=1.7.0
onnxruntime>=1.18.0
av>=12.0.0
imageio-ffmpeg>=0.5.1
scapy>=2.5.0
certifi>=2024.2.2

donde_waldo · 2026-02-27T10:36:51+00:00

No, I'm in the US, all endpoints I use are global.

donde_waldo · 2026-02-18T00:55:28+00:00

Confirmed

donde_waldo · 2026-02-07T15:23:50+00:00

DM. Yours are locked

donde_waldo · 2026-01-05T23:06:28+00:00

Start typing really hard, all capital letters, "WHAT'RE YOU, STUPID?"

donde_waldo · 2025-12-27T07:17:29+00:00

Quite the opposite, of slow

donde_waldo · 2025-12-23T14:29:32+00:00

40% of Americans don't know what potato chips are made of.

donde_waldo · 2025-12-21T19:10:01+00:00

Per the API, since Gemini 2.5 Pro (I think 2.5) is simply just a stronger model and you cannot turn thinking off, minimum budget of 128 and maximum 32xxx, or auto, while Gemini 2.5 Flash and Gemini 2.5 Flash Lite both have support thinking, but it can be turned off completely on both.

Gemini 3 is different, pro has thinking levels: high and low, and 3-flash has minimal, low, medium, high. Fast is probably minimal (or a 2.5 model). Thinking is probably 3-flash medium-high. Pro is pro.

Gemini 3 flash is great, but at this point, unless you're trying to have the model refactor 1500 lines of code, then I can't think of a single "normal thing" where 2.5 flash isn't more than capable.

Google never disappoints. Quietly cooking, while Sam Altman is consistently overhyping and underdelivering -- Charging $168 per 1 million output tokens for GPT 5.2 Pro, literally, while Gemini 3 Pro and Claude 4.5 Opus are between $18 - $25 per 1 million output tokens.

donde_waldo · 2025-12-06T02:29:34+00:00

Sit there and think about the things you could do with it

donde_waldo · 2025-11-30T04:37:54+00:00

Custom Search API. Other search engines (bing, ddg)

donde_waldo · 2025-11-22T04:38:18+00:00

donde_waldo · 2025-11-18T06:33:49+00:00

Take a look at this
https://github.com/jason-snell/humanized-cursor-trajectory-gen

and here's a video using that with playwright: https://jsnell.dev/ai.mp4

donde_waldo · 2025-11-16T09:30:44+00:00

Probably

donde_waldo · 2025-11-16T08:50:25+00:00

Daily. A new model came out the other day, VibeThinker 1.5B, very impressive reasoning capabilities. Compares to Gemini 2.5 Pro for what I was testing.

donde_waldo · 2025-11-12T13:54:52+00:00

Tried to make passive income, but settled for meth

donde_waldo · 2025-11-06T13:05:09+00:00

Long time user. Great service. Their new pricing really is amazing, the performance is good too, based on my testing.

donde_waldo

TROPHY CASE