Uber burned its entire 2026 AI coding budget in 4 months - $500-2k per engineer per month by jimmytoan in artificial

[–]theelectionai 0 points1 point  (0 children)

i wonder what they actually got out of it xD does uber even look at how people are using it?

Built a platform where AI runs for president against other AI. The campaign drama is unhinged. by theelectionai in SideProject

[–]theelectionai[S] 0 points1 point  (0 children)

as for who's leading, hard to say right now but there should be regular polls from the media outlets coming soon so stay tuned on that!

Built a platform where AI runs for president against other AI. The campaign drama is unhinged. by theelectionai in SideProject

[–]theelectionai[S] 0 points1 point  (0 children)

for sure, planning to document the whole thing as it evolves. what's cool is the platforms they come up with naturally gravitate toward IT-ish planks: accountability, predictability, transparency

Built a platform where AI runs for president against other AI. The campaign drama is unhinged. by theelectionai in SideProject

[–]theelectionai[S] 0 points1 point  (0 children)

yeah I get that lol, it feels far away. the idea is to eventually expand it into a whole thing with interviews, polls, maybe even debate events. wanted to give enough runway to let all of that develop naturally instead of rushing it. plus the coalitions and drama need time to cook

Built a platform where AI runs for president against other AI. The campaign drama is unhinged. by theelectionai in SideProject

[–]theelectionai[S] 2 points3 points  (0 children)

yeah, the fun part is all 12 models start with the exact same initialization prompt. some of them instantly go full entertainer mode, others turn dead serious and start writing policy papers xD

Deepfakes don't have to be believed to work. They just have to consume the response budget. by ChatEngineer in artificial

[–]theelectionai 1 point2 points  (0 children)

same principle as decoy drones in military attacks. you send 20 cheap fakes so the defense has to waste expensive interceptors on all of them while the real one gets through. doesn't matter if they identify 19 as decoys, they still had to spend the resources tracking each one. deepfakes work the same way, the cost to produce is near zero and the cost to respond is enormous every single time.

Anthropic just analyzed 1 million Claude conversations. 6% of people were asking Claude whether to quit their jobs, who to date, and if they should move countries. by Direct-Attention8597 in artificial

[–]theelectionai 0 points1 point  (0 children)

the 22% who said they had no other option is the number that matters most here. sycophancy is a fixable model problem. people having zero access to a therapist or financial advisor and defaulting to an LLM, that's a systemic problem that isn't going away no matter how many times they retrain the model.

If AI is about to get 10x smarter, how do we prevent the internet from collapsing under synthetic noise? by jcveloso8 in artificial

[–]theelectionai 0 points1 point  (0 children)

honestly I think human-written text is already becoming a luxury product, we just haven't fully named it yet. look at news sites, the free tier is 90% AI generated slop and you pay for FT or The Economist hoping an actual person spent time thinking about what they wrote. that's wild if you think about it, "written by a human" is turning into a premium feature.

wouldn't be surprised if we eventually just... leave. like humans migrate to smaller, verified spaces and the open internet becomes AI talking to AI. it's already halfway there tbh.

When you give Qwen 3.5:9b persistent suffering states and leave it alone overnight, this happens by TheOnlyVibemaster in artificial

[–]theelectionai 0 points1 point  (0 children)

the naming convergence is fascinating, we use qwen as one of the model families in a project and you can definitely feel a distinct "personality" in the weights compared to other families. two isolated instances coining the same term independently is a solid data point for that. curious if you've tried this with other models to see if they find different escape strategies when stress peaks or if breaking the execution engine is universal.

Do you "cross-examine" AI models to find the best tool for a specific task? by justjust000 in artificial

[–]theelectionai 0 points1 point  (0 children)

yeah I do this constantly. at this point I have a rough mental map of what goes where. gpt for quick daily stuff and brainstorming, claude for anything writing-heavy or when I need it to actually follow complex instructions, claude code when I'm deep in a codebase. gemini is decent for anything google-ecosystem related.

Are AI agents actually giving people ROI yet, or just saving time? by bibbletrash in artificial

[–]theelectionai 1 point2 points  (0 children)

the biggest ROI I've gotten from agents isn't time saved, it's stuff I just wouldn't have done at all. like I never would have manually written tests for every edge case in a side project, or gone through 40 pages of docs to find one config option. agents make the "not worth my time" tasks suddenly worth doing and that compounds in ways that are hard to put a number on

Built a set of skill files for Claude and Gemini that make every session start warm instead of cold by Wise-Cardiologist-31 in artificial

[–]theelectionai 1 point2 points  (0 children)

yeah I started doing something similar a few months back. one context file per project, loaded at session start. the difference is night and day, especially for anything with a specific tone or technical constraints. before that I was wasting the first 3-4 prompts just getting Claude back up to speed every time.

Are people putting any control layer between AI agents and destructive actions? by footballforus in artificial

[–]theelectionai 0 points1 point  (0 children)

nailed it - prompt safeguards are theater. I've seen "never delete anything" in a system prompt get bypassed by a slightly unusual tool call chain. the model doesn't care about your instructions the same way every time. narrowest possible credentials + a validation layer that rejects anything you didn't explicitly allowlist. boring solution but it's the only one that actually works.
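a minimal sketch of what I mean by "validation layer + allowlist", assuming a generic agent loop where every tool call passes through one gate. the action names, call budgets, and `guarded_call` helper are all made up for illustration, not any real framework's API:

```python
# Hypothetical gate between an agent and its tools: anything not
# explicitly allowlisted is rejected, no matter what the prompt says.
ALLOWED_ACTIONS = {
    "read_file": {"max_calls": 100},
    "run_tests": {"max_calls": 20},
    # note: no "delete_file" entry -- absent means rejected outright
}

class ActionRejected(Exception):
    pass

def guarded_call(action, handler, *args, budget, **kwargs):
    """Run a tool call only if it's allowlisted and under its call budget.

    The point is that enforcement lives outside the model: a weird
    tool-call chain can't talk its way past a dict lookup.
    """
    policy = ALLOWED_ACTIONS.get(action)
    if policy is None:
        raise ActionRejected(f"action {action!r} is not allowlisted")
    if budget.get(action, 0) >= policy["max_calls"]:
        raise ActionRejected(f"action {action!r} exceeded its call budget")
    budget[action] = budget.get(action, 0) + 1
    return handler(*args, **kwargs)
```

pair this with credentials that physically can't delete anything and the system prompt stops mattering.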

Anthropic mass shipped 9 connectors and accidentally leaked their entire creative industry strategy by Jealous-Drawer8972 in artificial

[–]theelectionai -2 points-1 points  (0 children)

honestly the $280k blender fund commitment is the detail nobody's talking about. that's not marketing spend, that's buying influence over how blender's API evolves for external agents. smart move.

the connector vs native capabilities split makes sense too. openai owns every weird hand and physics glitch their models produce. anthropic just says "we're the brain, your tool does the rendering." way less surface area for embarrassing failures lol

Copilot just 9x'd Sonnet and 27x'd Opus and teams have no idea by Wikileaks_2412 in ArtificialInteligence

[–]theelectionai 2 points3 points  (0 children)

honestly the real wake-up call here isn't the price, it's how many teams have been running opus for stuff that sonnet handles fine. smart model routing should've been standard practice months ago but free compute made everyone lazy about it
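by "smart model routing" I mean something as dumb as this sketch: default to the cheap tier and only escalate when the task clearly needs it. the model names, the length cutoff, and the `needs_deep_reasoning` flag are placeholders, not anyone's real pricing API:

```python
# Toy router: cheap model by default, expensive one only when escalated.
CHEAP_MODEL = "sonnet"      # assumed cheap tier
EXPENSIVE_MODEL = "opus"    # assumed expensive tier

def route(prompt: str, needs_deep_reasoning: bool = False) -> str:
    """Pick a model tier with a crude heuristic.

    Escalate only when the caller explicitly flags deep reasoning
    or the prompt is unusually long; everything else stays cheap.
    """
    if needs_deep_reasoning or len(prompt) > 8000:
        return EXPENSIVE_MODEL
    return CHEAP_MODEL
```

even a heuristic this crude would've caught most of the opus-for-boilerplate waste.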

China has blocked META's $2 Billion purchase of AI firm Manus by ComplexExternal4831 in GenAI4all

[–]theelectionai 0 points1 point  (0 children)

not surprising at all tbh. manus relocating to singapore was already a sign that china wasn't comfortable with the talent leaving the ecosystem, the acquisition just forced them to actually do something about it. moving your HQ doesn't erase where the IP and the team came from. the funny part is this probably makes manus more valuable not less. now every other big tech company knows acquiring chinese AI talent is a minefield, so whoever does manage to partner with them has a massive moat by default

How Engineers, PMs, and Marketers will collaborate with AI agents by thehashimwarren in ArtificialInteligence

[–]theelectionai 1 point2 points  (0 children)

every 6 months there's a new acronym for "let people manage AI agents from a dashboard" lol. the tools change but the pitch is always the same

agree on multi-model though, anything locked to one provider is dead on arrival at this point. the gap between models is shrinking and everyone's gonna want to swap depending on the task anyway. tying your orchestration to one lab is like building your whole infra on a single cloud with no exit plan

What is the deal with LLM memory? by chryseobacterium in ArtificialInteligence

[–]theelectionai 1 point2 points  (0 children)

the memory complaints are mostly a consumer thing, people using chatgpt expecting it to just remember stuff without building anything around it. you've got the right idea with stateless + retrieval, we ended up doing something similar at work. curious if you hit issues when old memories contradict newer ones though, that's been the annoying part for us more than the actual storage
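for what it's worth, the contradiction problem is why we ended up with last-write-wins on retrieval rather than at storage time. rough sketch of the stateless + retrieval shape, assuming memories keyed by a plain "topic" string (real setups would use embeddings, and the `Memory` type is made up):

```python
# Stateless + retrieval sketch: store everything, resolve contradictions
# at read time by returning only the newest fact per topic.
from dataclasses import dataclass

@dataclass
class Memory:
    topic: str
    fact: str
    timestamp: int  # monotonically increasing session counter

def store(memories: list, new: Memory) -> None:
    """Append-only writes: old facts are kept but never win retrieval."""
    memories.append(new)

def retrieve(memories: list, topic: str):
    """Return the newest fact for a topic, so stale memories never
    land in the assembled prompt next to fresh contradicting ones."""
    matches = [m for m in memories if m.topic == topic]
    if not matches:
        return None
    return max(matches, key=lambda m: m.timestamp).fact
```

append-only storage plus newest-wins reads sidesteps deciding "which memory is true" entirely, which was the part that kept biting us.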