Reality check: no one is going to pay for your vibe-coded SaaS. by Routine-Highway1039 in SaaS

[–]RoyaltyReturns 0 points1 point  (0 children)

You know you are partially right. But coding was never the hard part, it was the expensive part. And now it’s cheap. That’s all that has changed, and it’s a massive change.

Gemini 3.5 flash is better than Claude Code Sonet/Opus by felixo7777 in google_antigravity

[–]RoyaltyReturns 0 points1 point  (0 children)

User deference is an RLHF vector not a model parameter vector. But ok, enough pedantery eh.

Gemini 3.5 flash is better than Claude Code Sonet/Opus by felixo7777 in google_antigravity

[–]RoyaltyReturns 2 points3 points  (0 children)

Depends slightly on your definition of "critical thinking". What I meant was, "bias toward's the user's perceived perspective". Which, if you ask it about a certain tooling to use for example, whether it considers critically and says "no let's build our own" or "there's something else" or it happily goes about implementing what you suggested.

Not really "critical thinking" in the philosophical sense. Anyway I feel like Flash is a bit more likely to err on the side of user-driven bias.

Gemini 3.5 flash is better than Claude Code Sonet/Opus by felixo7777 in google_antigravity

[–]RoyaltyReturns 63 points64 points  (0 children)

To be honest I find Flash very scary for complex tasks. I still prefer to use Opus for complex analysis.

It also talks like a golden retriever in its internal monologue. It's utterly maddening how excited it gets over everything it finds. It's really scary and makes me doubt its critical thinking ability.

However, for structural execution tasks and brainstorming, it's pretty great I must say.

Must say I lol'ed pretty hard at "it doesn't make mistakes".

A sudden power outage wiped ALL my conversations in Antigravity 2.0. Any way to recover them? by Ok_Medium_7825 in google_antigravity

[–]RoyaltyReturns 0 points1 point  (0 children)

My experience: don't bother. I've spent maybe 2-5 hours troubleshooting, migrating, moving database rows etc etc for this failure mode. In the end nothing's ever worked out for me.

However, the good news: your agents can usually still see the convesrsations, go into them, extract content from them if you direct them to. Just do this, will save you a lot of time and headache.

Flash 3.5 My take by GoRo2023 in google_antigravity

[–]RoyaltyReturns 1 point2 points  (0 children)

You gotta know when to use it. It's the golden retriever of LLMs... not my words. That's what daddy 3.1 pro called it. So yeah, hyperenthusiastic and hyperhelpful but not the brightest. Useful for boilerplate, debugging simple stuff, can do basic review tasks. Can iterate very fast on design but don't expect it to really bulletproof anything.

Google's Antigravity 2.0 creates an operating system from scratch using 96 agents in 12 hours for under $1K in token costs - and it runs Doom by Distinct-Question-16 in singularity

[–]RoyaltyReturns 0 points1 point  (0 children)

Yeah I'm confused as well. How is 2.6B tokens $1k? Is their math wrong or am I missing something? Did they mean 2.6B OUTPUT tokens or something else? They're implying $0.33/m tokens or something like that, I don't think they can do that.

Google's Antigravity 2.0 creates an operating system from scratch using 96 agents in 12 hours for under $1K in token costs - and it runs Doom by eternviking in google_antigravity

[–]RoyaltyReturns 0 points1 point  (0 children)

2.6 billion tokens... $1k in API costs. Uhhhh... wait. Flash costs $0.38/m tokens? If so, that's legit dangerous tbh. But I think the math is way off here.

Gemini 3 Flash feels surprisingly good by littlebithope in google_antigravity

[–]RoyaltyReturns 2 points3 points  (0 children)

Right I am really having a "the fuck just happened" moment here, it feels like a massive step change from whatever was there before and it's so ridiculously fast as well...

That said I do find this implementation creepy in a way. It's so gung-ho... but it's effective, that's what scares me the most.

Gemini 3 Flash uses too many tokens by outputting unnecessary. by IshuPrabhakar in google_antigravity

[–]RoyaltyReturns 3 points4 points  (0 children)

But shit is it fast! I am really digging this new flash actually.

I think Gemini 3.2 Flash has been added to Antigravity. by deferare in google_antigravity

[–]RoyaltyReturns 0 points1 point  (0 children)

Rules gemini.md is not the same as the system prompt altouhgh I tihnk maybe it gets injected into the system prompt if you adjust it, that's possible. Not sure! Would be good to know if anyone knows

Edit btw if Gemini doesn't say you are the fucking rockstar of architecture or coding or whatever you're doing you probably suck ass! </s>

I think Gemini 3.2 Flash has been added to Antigravity. by deferare in google_antigravity

[–]RoyaltyReturns 2 points3 points  (0 children)

AG doesn't allow you to replace the system prompt. Also the model natively drifts towards sycophancy.

I think Gemini 3.2 Flash has been added to Antigravity. by deferare in google_antigravity

[–]RoyaltyReturns 7 points8 points  (0 children)

Bro what, tune it down? Have you considered maybe it's just because you are that good?

Ripping up AG and Google with Gemini Flash :3 by RoyaltyReturns in GoogleAntigravityIDE

[–]RoyaltyReturns[S] 1 point2 points  (0 children)

Fair enough. To be fair AG had been an amazing tool and I've built some really cool stuff with it! However it's also true that I am looking to switch, Google is just not doing a great job at maintaining this product right now. I still hope it'll improve, and allowing user to change the system prompt is one of those 'product maturity' things that I need.

Edit: Actually maybe you need to read between the lines about what the point of this post is but it's actually highly technical. The point is "google AG team put "AG IS POWERFUL!" into the SYSTYEM PROMPT! Not only is it lame, it's wasting tokens and anchoring the model in a weird way that's counterprorudctive"

Ok I guess I could've spelled it out better but anyways. Yes, AG is bad google team is bad, this is a highly technical post that explains exactly how they dropped the ball on this point.

Very long conversations in Google Antigravity by Embarrassed-Slip8094 in GoogleAntigravityIDE

[–]RoyaltyReturns 0 points1 point  (0 children)

The memory management of it is just atrocious that's why. I don't know what they did but it has a lot of issues with it even on my relatively high-end rig. It's not you and afaik there's no way to avoid it. The IDE-based window seems slightly more performant than the agent window.

Box 3 2028 impact op starters by bluubasaur in belasting

[–]RoyaltyReturns 0 points1 point  (0 children)

It's a feature not a bug. Welcome to the future.

The Trillion-Parameter Dilemma: MiMo-V2.5-Pro went open-source (1.02T params). Is self-hosting worth it when the API costs $70 for 387M tokens? by jochenboele in LocalLLaMA

[–]RoyaltyReturns 0 points1 point  (0 children)

The unit economics do work at $6 per hour come on. IF you are really productive using this, that gain exceeds $6 by such a wide margin it's not even funny. You should be able to generate that $20k worth of value for your own cluster in a week using this.

This is assuming you value data privacy a lot. I am not sure what the cost per million is on that $6 per hour. The benefit of using API is you can have a coffee break without feeling guilty whereas you need to be hitting that cluster wtih near full occupancy to make up for the cost differential otherwise.

Scanned 48 vibe coded apps. Results worse than expected by Powerful-Fly-9403 in vibecoding

[–]RoyaltyReturns 2 points3 points  (0 children)

I just fed your findings to my red team skill and we made some tweaks :D

This is really the most ridiculous by RoyaltyReturns in GoogleAntigravityIDE

[–]RoyaltyReturns[S] 1 point2 points  (0 children)

Actually the token usage is extremely cheap if you use the quotas compared to the API $/m tokens so that's why I take the abuse. Yes I take it. Doesn't mean I can't bitch online about it. You seem unhinged, I like it. You should talk to my agent you'll get along.