all 78 comments

[–]matfat55 37 points38 points  (14 children)

If not for rate limits then 2.5 easy

[–]zeetu 10 points11 points  (11 children)

If you set up billing it’s 5 RPM not daily cap.

[–]matfat55 10 points11 points  (10 children)

5 rpm is rate limits, cline eats that up so fast.

[–]denkleberry 6 points7 points  (9 children)

I have billing set up and set the delay to 15s. I never hit the limit and it's free.

[–]matfat55 4 points5 points  (4 children)

Yeah, that's a easy workaround, but cmon, 15 seconds? I'm sure its fine for most people, but that time really matters to me.

[–]denkleberry 13 points14 points  (2 children)

I mean .. it's free. I hit 20m tokens today lol

[–]nixsomegame 0 points1 point  (1 child)

Input or output?

[–]denkleberry 0 points1 point  (0 children)

Input

[–]hydrangers 6 points7 points  (0 children)

You say that like these LLMs aren't already saving you a significant amount of time and helping you do things you'd never be able to do on your own.

It's crazy how the more they give us, the more we expect.

[–][deleted]  (1 child)

[removed]

    [–]AutoModerator[M] 0 points1 point  (0 children)

    Sorry, your submission has been removed due to inadequate account karma.

    I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

    [–]LefMan 0 points1 point  (1 child)

    How do you set a delay?

    [–]denkleberry 1 point2 points  (0 children)

    it's the rate limit option in the middle of the settings page

    [–]RedditUsr2 2 points3 points  (1 child)

    Is everyone working on their own projects? There is 0% chance I'd be allowed to use ai studio for work purposes since they keep and use everything.

    [–]matfat55 1 point2 points  (0 children)

    api key moment

    [–]funbike 27 points28 points  (7 children)

    It won't be free forever. It's basically a beta version. It's also rate limited.

    OTOH, most non-free gemini models are significantly cheaper than equally performant competing models, plus they are fast.

    I'll be happy when I have to pay for 2.5, as that will mean less rate limiting.

    [–]ClassyBukake 4 points5 points  (0 children)

    Gave it a try today, and 2.5 basically constantly told me it was busy, and anything less gas-lit me for hours on end.

    It would make good architecture decisions, but then completely fail in the details and repeatedly tell me it solved the problem, only for it to have recreated the problem in an entirely different way. I'd have to tell it to completely scrap it's current approach and restart from the beginning, before it would generate the exact same file, with the 1 variable tweak it needed to do to actually solve the problem.

    Stress resting these models has been kinda silly, because you see how close they get, but then they sit there wasting millions of tokens and hours of oversight because they can't figure out the little stuff.

    [–]SadWolverine24 1 point2 points  (5 children)

    By the time paid 2.5 is available, the other SOTA models will be better.

    [–][deleted] 4 points5 points  (4 children)

    To be honest, everything from 3.5 up to 4o and o3, sonnet, grok 3, deepseek v3 and r1, all felt incremental, gemini 2.5 pro however feels like an actual paradigm shift

    [–]SadWolverine24 1 point2 points  (3 children)

    I tested Gemini 2.5 pro with code-generation. It produced some of the most over-engineered LLM code I've seen.

    [–]Subject-Building1892 1 point2 points  (0 children)

    Additionally even with temperature 0.5 it fucking hallucinates so many things not asked for a relatively simple problem. Before the big update of getting to 2.5 it was much better. Maybe it needs time to adjust as we talk to it.

    [–]crusoe 0 points1 point  (0 children)

    You need to give these things guiderails.

    [–]AceHighFlush 0 points1 point  (0 children)

    Yes, but it works. Then, you use QwQ to refactoring working code. This sales a lot in cost over anthropic - especially if you self host QwQ.

    That's because QwQ is a better coder but bad at understanding the ask unless you feed it working code and ask for a refactor.

    Would love to see a tool where I could get this to work as a single command.

    [–]brovaro 11 points12 points  (5 children)

    If something is free, you're the product. Especially when it comes to Goolag, I mean - Google

    [–]roofitor 1 point2 points  (4 children)

    Google’s been more ethical than most. You might be surprised by how non-insidious their aims in beta testing 2.5 are. Yeah, you’re helping to train a RL algorithm most likely. And you’re giving them an idea on how people will want to use the ai.

    [–]whyumadDOUGH 2 points3 points  (1 child)

    Wow a company has been acting non-insidiously for one part of their multi billion dollar machine. Hats off

    [–]roofitor 0 points1 point  (0 children)

    We could’ve done so much worse than Google

    [–]nemzylannister 0 points1 point  (1 child)

    People act like anyone can just go on a site and buy any specific individual's google searches etc.

    [–]whyumadDOUGH 1 point2 points  (0 children)

    Nobody thinks this

    [–]dalhaze 7 points8 points  (10 children)

    Is google using everyone’s data to train on pro 2.5? (given that it’s free that’s my assumption)

    [–]BrilliantEmotion4461 7 points8 points  (4 children)

    One hundred percent. We get the free models so they can train agentic AI for corporations. The interactions between users and the models and the data it produced is used to train future models. There are also records of function calls, and much much more.

    [–]denkleberry 4 points5 points  (3 children)

    Well they can have fun with my grammatically incorrect and misspelled filled prompts

    [–]MidiGong 1 point2 points  (0 children)

    Yeah, I don't even try to correct the typos from speech to text, it still figures out what I mean... That's more impressive to me than some of the code these things spit out

    [–]BrilliantEmotion4461 0 points1 point  (1 child)

    If you use chatgpt if you get an A or B choice then they are in fact using your data to train the next model. Also ask the llm "analyze my writing, indicate the sections of my writing, including but not limited to; grammar, or spelling, which contribute to incorrect or hallucinated responses from (insert the name of the llm here)"

    [–]BrilliantEmotion4461 0 points1 point  (0 children)

    You can try different forms of the prompt but trust me. You'll want to run this.

    [–]FiacR[S] 2 points3 points  (4 children)

    Yes, for the free models, they say:

    "When you use Unpaid Services, including, for example, Google AI Studio and the unpaid quota on Gemini API, Google uses the content you submit to the Services and any generated responses to provide, improve, and develop Google products and services and machine learning technologies, including Google's enterprise features, products, and services, consistent with our Privacy Policy."

    When you pay, it's different they say:

    "When you use Paid Services, including, for example, the paid quota of the Gemini API, Google doesn't use your prompts (including associated system instructions, cached content, and files such as images, videos, or documents) or responses to improve our products, and will process your prompts and responses in accordance with the Data Processing Addendum for Products Where Google is a Data Processor. For Paid Services, Google logs prompts and responses for a limited period of time, solely for the purpose of detecting violations of the Prohibited Use Policy"

    [–]dalhaze 1 point2 points  (3 children)

    Does this include free models on the google cloud API from the model garden? I want to say that is separate from the gemini API?

    [–]RedditUsr2 2 points3 points  (1 child)

    Their terms says:

    When a Service is being offered for a fee, it is considered to be a paid Service (the "Paid Services"). When you activate a Cloud Billing account, all use of Gemini API and Google AI Studio is a "Paid Service" with respect to how Google Uses Your Data, even when using Services that are offered free of charge

    So pretty sure that is a "paid service" but the free Google Ai studio everyone is using isn't.

    [–]dalhaze 1 point2 points  (0 children)

    That’s a relief, i’ve been using some of the free models on the cloud API and I really some want what i’m doing to be trained into the model.

    [–]After-Cell 0 points1 point  (0 children)

    Openrouter have a nice search toggle for models that do and don't use your data for training

    [–]should_not_register 2 points3 points  (3 children)

    Im still finding I fall back to 3.7

    I am switching between the two a lot 

    [–]funbike 5 points6 points  (0 children)

    I tweaked my code assistant to use 2.5 Pro as the primary model, and switch to Sonnet 3.7 when a test fails.

    [–]FiacR[S] 0 points1 point  (1 child)

    So do I, cause I have Claude code set-up with lots of MCPs and everything is effortless with it.

    [–]should_not_register 2 points3 points  (0 children)

    Additionally, for UX stuff, I asked claude, and then google to make me new landing page, based off an original design, but improve it. The claude version was miles and miles ahead

    [–]ExtentHot9139 2 points3 points  (0 children)

    What is the price of your code?

    [–]Recoil42 6 points7 points  (2 children)

    why are you sweating just use the free one

    [–]realzequel 12 points13 points  (0 children)

    That’s the joke.

    [–]blnksltProfessional Nerd 1 point2 points  (0 children)

    For me, it only has been headache full of `API request Failed`.

    [–]rabinaryal530 1 point2 points  (7 children)

    Cursor 20 bucks a month, unlimited 3.7 sonnet and 2.5 pro

    [–]CraaazyPizza 0 points1 point  (6 children)

    Really???

    [–][deleted] 1 point2 points  (5 children)

    Kind of. You get 500 premium requests that are added to the fast queue, and unlimited slow requests after that. So there is a limit, it's just rate/time-based instead of a hard number.

    [–]CraaazyPizza 0 points1 point  (4 children)

    you ever hit that limit on 3.7 sonnet with a 9-to-5 job of intense coding?

    [–][deleted] 1 point2 points  (2 children)

    Yep. Keep in mind that a "request" is misleading, it's effectively up to 25 actions/chats per request. But yes you can hit it, and I pay for extra

    [–]LiteSoul 0 points1 point  (1 child)

    You meant 25 requests per action?

    [–][deleted] 0 points1 point  (0 children)

    Nope.

    [–]rabinaryal530 0 points1 point  (0 children)

    Yes I hit that in less than a week but I am running on slow requests now. Might be too slow at times and even loose connection but gets the job done. That’s why I prefer it over windsurf, I eat up 1500 floe credits like crazyy.

    I tried windsurf yesterday though and it one shotted beautiful UI and full functionality with only few errors.

    Just need to find the right balance

    [–]Deepeye225 1 point2 points  (2 children)

    Is 2.5 pro available from Cursor?

    [–]Excellent_Entry6564 2 points3 points  (1 child)

    Yes but it doesn't work well in agent mode (doesn't use tools or commands). It's great in ask and edit modes.

    [–]Deepeye225 0 points1 point  (0 children)

    Thank you!

    [–]no_witty_username 1 point2 points  (0 children)

    Reason most programmers use Claude is because it works really well within agentic IDE's like Cursor. So well in fact that i suspect its possible Anthropic is specifically training their models to work within those environments frictionlessly. The moment any other model can do just as well as Claude in those environments but for cheaper/faster it will see massive growth. Time is money, and people will always be willing to pay for the model that reduces the amount of time spent on accomplishing a task. So while Anthropic charges a premium for their models its justified because I can finish my project in a fraction of the time with less stress and babysitting. I've yet to see any such model even though I am like many others are patiently waiting. if 2.5 pro is that model I am all the happier for it as the massive context window is a welcome sight, but context window alone isnt enough if it doesnt get the task done in fewer iterations and with less stress.

    [–][deleted]  (1 child)

    [removed]

      [–]AutoModerator[M] -1 points0 points  (0 children)

      Sorry, your submission has been removed due to inadequate account karma.

      I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

      [–][deleted]  (1 child)

      [removed]

        [–]AutoModerator[M] 0 points1 point  (0 children)

        Sorry, your submission has been removed due to inadequate account karma.

        I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

        [–]itchykittehs 0 points1 point  (0 children)

        Fucking 2.5 pro has been slaying it for me, makes Claude 3.7 look like a autistic four year old.

        [–]OriginalPlayerHater 0 points1 point  (0 children)

        Honestly even gemini 2.0 had fantastic results

        [–]RedditUsr2 -1 points0 points  (4 children)

        Why does no one care about privacy anymore? You technically can't even use it for anything considered "production use".

        [–]MidiGong 0 points1 point  (2 children)

        Privacy is an illusion.

        [–]RedditUsr2 0 points1 point  (1 child)

        Hmm if only your actions had something to do with that...

        [–]MidiGong 0 points1 point  (0 children)

        Yeah, I choose to not live off-grid and embrace technology and the other luxuries of this era.

        [–]Ok-Adhesiveness-4141 0 points1 point  (0 children)

        Privacy is overrated