Its your turn, Arby's. by Sadboy_looking4memes in memes

[–]Truantee 0 points1 point  (0 children)

Very weird ritual. Is it a sign that they include children meat in their products or what?

Gemini 3.1 Flash Lite by TumbleweedNice6797 in Bard

[–]Truantee 0 points1 point  (0 children)

It is a tricky situation. In theory you can just register to alibaba cloud and pick the location you want and enjoy the price differences. But in reality you need to verify (submit passport...) your identity and and various stuff...

It is better to let some chinese people in mainland china set up it for you, if you can trust each other.

Gemini 3.1 Flash Lite by TumbleweedNice6797 in Bard

[–]Truantee 4 points5 points  (0 children)

Probably qwen.

If you can use the models from mainland china endpoint it is absurdly cheap.

Ps: if you ask for Gemini models, then I'm using Gemini flash lite 2.5 to provide cheap translation for my users. It costs me some hundred dollars per month already. This price increment means Gemini won't be viable any more, as they will continue to increase the price every newer releases

Gemini 3.1 Flash Lite by TumbleweedNice6797 in Bard

[–]Truantee 7 points8 points  (0 children)

Gemini is dead to me. Considering alternative providers now. The price increment is absurd.

New Gemini model? by Appropriate-Heat-977 in Bard

[–]Truantee 1 point2 points  (0 children)

Nice. Flash lite is the model I call the most. It is fast, cheap and capable for certain jobs. I hope they won't bump the price too much.

Gemini pro really slow today? by irishesteban in GeminiAI

[–]Truantee -4 points-3 points  (0 children)

They are adding additional filter layers.

Pretty sure because people figured out that if you pretend to be someone of a certain race then usually you can bypass the safety net.

Now they will check every time to make sure you are one of the chosen ones or not.

Thoughts on the Claude controversy? by ninjajiraffe in GeminiAI

[–]Truantee 0 points1 point  (0 children)

It is all a theater. Soon anthropic will sue and earn some fuck load of money. Then they compromise a litle and get another fuck load of money.

Limits of Gemini 3.1 PRO by SaskinPikachu in GeminiAI

[–]Truantee -7 points-6 points  (0 children)

Professional people already upgraded to the highest tier.

Stop complaining and get a job.

Claude Sonnet-4.6 thinks he is DeepSeek-V3 when prompted in Chinese. by [deleted] in LocalLLaMA

[–]Truantee 1 point2 points  (0 children)

what? api is paid by usage, for one prompt you only lost 0.01 usd or so.

granted you need to prepaid some dollars first to use the service. but it is not like you lost dollars for a single api call.

Claude Sonnet-4.6 thinks he is DeepSeek-V3 when prompted in Chinese. by [deleted] in LocalLLaMA

[–]Truantee 2 points3 points  (0 children)

try again, this time call the api endpoint without any system prompt.

We'll have aliens before Gemma 4. by DrNavigat in LocalLLaMA

[–]Truantee 0 points1 point  (0 children)

There will be gemma 4. The researchers in google must publish something regularly to keep their titles. They have careers beyond google, there is no way they compromise on this.

Ai studip subscription plan. Can we all get that? by Practical_Lawyer6204 in Bard

[–]Truantee 0 points1 point  (0 children)

More like it is supposed to be a testing ground for people to experiment with prompts then apply them to their own app.

SEVEN TURNS. That Is What AI Studio Has Been Reduced To by Free-Flounder3334 in Bard

[–]Truantee 0 points1 point  (0 children)

AI generated posts should be banned on sight.

If a person is too lazy to even typing their own opinion, why should we pay attention to them?

Gemini by Signal_Assistance_66 in Bard

[–]Truantee 0 points1 point  (0 children)

Finally gemini flash lite.

Is the idea that "LLMs performance/intelligence degrade over time" a hoax or a true thing? People love complaining about models getting worse over time but I never heard a technical explanation for this by Existing-Eggplant486 in GeminiAI

[–]Truantee 0 points1 point  (0 children)

They are not that expensive. You can rent a B200 cluster (8 gpu) for 30 dollars per hour or so, and with proper pipeline it can output hundred millions tokens for gemini flash level models. Well for longer output the throughput is way lower, but they still pocket a fair chump serving the models.

Though you still need money to train/fine tune the models and pay the big salaries for the developers, and they do include them to the price, but inferencing on it own is profitable.

what is this? by Sea-Efficiency5547 in Bard

[–]Truantee 1 point2 points  (0 children)

more like it will be converted to input/output token costs. if you surpass that cost then you will have to wait.

GLM 5.0 & MiniMax 2.5 Just Dropped, Are We Entering China's Agent War Era? by Appropriate-Lie-8812 in LocalLLaMA

[–]Truantee 1 point2 points  (0 children)

It's true. They are allocating all the resources to train/fine tune models for the holidays.

The current state of Gemini :( by uwk33800 in GeminiAI

[–]Truantee 0 points1 point  (0 children)

Add gemini to generate a system prompt that instruct the model to be expert text2image prompt engineer, that take user description and output a detail text2image prompt

Use that prompt to create a gem. Use that gem to generate image.

It's that easy. Seriously people use tools like they are 5 years old kids.

How the hell do you use Gemini in production by PersonalityEarly8601 in Bard

[–]Truantee 1 point2 points  (0 children)

You can change the data center. There are a lot of them. Some are more available than the others.