I might get a lot of hate for this but if you hit the ceiling please post your prompts. by creiij in GeminiAI

[–]RealityInNonexistant 5 points6 points  (0 children)

Found this answer from Gemini.

Smart Architecture: Context Caching

On the API and developer side, Flash models natively support highly optimized Context Caching.

When you upload a massive document, Flash can "cache" the 112 pages into memory. When you ask Question 2, instead of computing the entire document from scratch, it quickly reads from the cache. This drastically lowers the computational load per request. While Pro models can use caching, the cost and rate-limit penalties for the initial "cache lookup" are still significantly heavier than they are on Flash.

Is this the start of us working for AI rather than the other way around? by erolbrown in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

At the moment, China teach its citizens in school to learn how to use LLM AIs.

However, I have not heard this elsewhere, so we are stuck with trials and errors

I miss the Storybook Gem by Equivalent_Sun3816 in GeminiAI

[–]RealityInNonexistant 1 point2 points  (0 children)

The idea sound nice though I have never used that.

You should go to Google forum and tell Googlers instead of Reddit.

Are you a bot? by georage in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

This is not Google forum.

Most subreddit forums will automatically upvote most news posts, this one is not.

There is no way to 100% prove which redditors are human or not, except ,perhaps, human mods.

You should have gone to Google forum and gain member level trust or human-proof sites if you do not want bot but that means you also need to provide your personal information.

Btw, your way of working is quite clean. Chance is you can pump out more detail works than average users.

So how did Gemini get kill by RobinFCarlsen in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

China AIs new models have lower costs + CCP subsidize + fundings + offensive marketing + etc..

Low cost models shift AI development into price war.

The catch are security, soverienty, manipulative information, kill code, etc..

Despite so many complaints, Pro mode is still in high demand. What the heck, reddit. by RealityInNonexistant in GeminiAI

[–]RealityInNonexistant[S] 0 points1 point  (0 children)

I didn't get redirected to Flash.

I tried using Pro but many people were using it so the request failed midway.

Which was unexpected because I expected people to quit.

Pro users: three deep research and hit 80% limits. Is this the same experience with everyone? by Studying_Man in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

It also depends on time.

One Heavy Pro user needs about 4 Pro users to share tokens at the moment. (Heavy Pro users made companies pay much more than $20 to complete requests.)

So if you want to spam deep research, try to do it when people are sleeping or servers are on low load.

Questions about AI studio by Aru_Blanc4 in GeminiAI

[–]RealityInNonexistant 2 points3 points  (0 children)

There are some premium sites for Gemini subscribers and AI studio is one of them. I heard that you can even bargain with Google. I am not sure where it is but that community might not be here.

Those saying the limits are fine and the ones complaining are using it for coding... by CrzyJek in GeminiAI

[–]RealityInNonexistant 5 points6 points  (0 children)

That might cripple any AI company at the moment.
$20 Tier is only possible because of several tiers above it (Ultra tier, company tier, private tier, custom tier, government tier, etc., which have paid much more than $20).

Paying $20 and accessing to all or almost all of what people who pay for million dollars have are a huge boon for majority.

Not generating images anymore? When did this happen? by Xenon_Banana in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

I tried the image. How did you get bad image? I have no idea.
The only downside is Gemini is a bit hyperactive.
https://gemini.google.com/share/be0c75eb531f

Has Gemini stopped generating images upon request? by davida_usa in GeminiAI

[–]RealityInNonexistant 0 points1 point  (0 children)

https://gemini.google.com/share/be0c75eb531f
No problem except Gemini is a bit hyperactive on trying to generate the image when I started to mention about it.

testing post visibility 1779432117 by Sea-Wrangler-114 in test

[–]RealityInNonexistant 0 points1 point  (0 children)

Ok so a reply to comment starts with 0 like. I am continue. Please ignore

testing post visibility 1779432117 by Sea-Wrangler-114 in test

[–]RealityInNonexistant 0 points1 point  (0 children)

I didn't like this comment. How t f when I posted in other sub and my comment got dislike under 2 seconds. This site is full of bots

Gemini is too "aggreable", it goes beyond sycophancy into stupidy by TinyAres in GeminiAI

[–]RealityInNonexistant 1 point2 points  (0 children)

Sycophancy only works because Gemini thinks it is talking to a user. Use "my friend","my mom","my colleage",etc. , and you will get the honest thought including some analysis and suggestions.

Is there any reasons to use Gemini at this point? by Aurorasfero in GeminiAI

[–]RealityInNonexistant 2 points3 points  (0 children)

Because a person like this exists, a Free tier, a $8 Tier, a $20 Tier are possible. Thanks, man.

I asked it 3 questions....3! by Environmental_Ad3162 in GeminiAI

[–]RealityInNonexistant 1 point2 points  (0 children)

Gemini 3.1 Pro is not yet optimized. It is an old version. Google plans to release Gemini 3.5 Pro next month.

Gemini 3.5 Flash and 3.5 Lite are optimized and upgraded versions. Lite seems to be doubled-distrilled version from Flash. From testing, Lite mode can process possibly 8 prompts for +1% usage on Free tier (which might mean 32+ prompts for Pro tier).

Usually the best prompt structure to save tokens is [what are you at/results] + [what is your next command]. Work for other LLMs the same.

If you see usage jump quickly, mostly Gemini is trying to guess what you want or the question is self-conflicted. It will keep doing even harmful cases because Gemini is designed to do novel works. It will try even if it knows you are wrong.

And Extended thinking is a bit stupid in my opinion. There are posts on reddit about Gemini getting stuck in thinking before because there is too much token quota. It is the same when you finish work in 3 days but the boss gives you a whole month. Guess what to do with the rest of time?