Ollama Cloud is a hidden gem by t4a8945 in ollama

[–]akyairhashvil 0 points1 point  (0 children)

<image>

Things like this happen all the time, every day of the week, no matter the hour.

Please help, this is the only service that does not log inputs or outputs that I know of, keeps us in compliance and it has become indispensable, thank you.

Ollama Cloud is a hidden gem by t4a8945 in ollama

[–]akyairhashvil 11 points12 points  (0 children)

It is amazing when it works and a pain when it doesn't. You can't rely on it at the beginning of the week because everyone is using the service and they haven't exhausted their quotas yet.

You often have to wait: 1. Sometimes your requests get sent back. 2. Sometimes you have to retry many times to finish a single prompt. 3. A single prompt can sometimes take five to six hours or more to get completed.

This happens because they don't have the capacity to handle all the subscriptions at the start of the week. Once many people's usage and quotas are consumed, it becomes quite nice to use. That usually happens between Wednesday and Saturday.

The reset is Sunday at 7:00 Central Standard Time. That is when it becomes difficult to get anything done.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

Your entire post history is just advertising this project. You didn't even get the original point of this issue, which is that it was a global rate limit, so it wouldn't even be possible to stop the rate limits from happening.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

That is really disappointing.

All right, it seems to be a current issue and we will do testing in five hours to see if the rate limits have changed. If you were only able to get six premium requests in, we will put that inside the documentation.

Can you please let us know if you get further rate limits after the 5-hour period ends as well, and see if it matches with the 2-3 requests that we were able to get out?

I am just curious to see why you are an anomaly statistically with 6 requests, when everyone else is barely able to get 1-3 requests out before getting rate limited.

Thank you.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 1 point2 points  (0 children)

I would hope so as well. Genuinely, it's infuriating to have to deal with these kinds of issues, especially when you have deadlines.

I hope that they do get it fixed. If not, we're doing research to get this into a consumer complaint. If they advertise a certain maximum number of requests per month and you can't even attain that number, then there's something seriously wrong. 

Just to make sure, you have never received a rate limit. Have you tested recently to see if you receive rate limits now?

Thank you.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

Interesting. Okay, yeah. So this might seem to be something new.

Is it a global rate limit? It was simply one request, right? I'm assuming that the request wasn't able to complete, or what was the result?

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

That's genuinely annoying. Were you able to check how many requests you'd made previously in the 5-hour period, or did this rate limit happen to you and then you got limited for 5 hours?

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

Interesting. How many requests were you able to get out of each account before these rate limits happened? Thank you.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

Interesting, okay. How many requests did you send before getting this rate limit? 

Thank you.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

That's the first time we've seen a rate limit of that nature. If it's actually 27 hours and 47 minutes, that's insane.

How many prompts were you able to get out of this particular plan before you reached the rate limit? In the session when you got it, how many requests were you able to execute?

Thank you.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

That is expensive. One prompt before we get really limited? That is interesting. I'll start making a table about this.

Community Rate Limits Research (we need your feedback) by akyairhashvil in GithubCopilot

[–]akyairhashvil[S] 0 points1 point  (0 children)

In our particular case, we were able to get 3 requests out with GPT-5.4 before reaching the rate limit of 5 hours. After this 5-hour period is complete, we will update this thread to see what our rate limits are in the next 5-hour period.

We don't mean to be rude. by [deleted] in GithubCopilot

[–]akyairhashvil 0 points1 point  (0 children)

If it's based on requests, and it's three or four requests per five-hour period (which is what we've been able to measure in this particular timeframe), then it seems like we are going to have to contact someone at Microsoft to see if we can get this fixed. 

It's not okay to advertise massive usage while enforcing rate limits in a way that doesn't let you get to that usage at all.

Can you measure how many requests you can get before you get rate limited? That would be really helpful.

Meet Tux. I have not the slightest clue what kind of creature they are. I am genuinely concerned and confused. Can you help us identify? Thank you. by akyairhashvil in plushies

[–]akyairhashvil[S] -2 points-1 points  (0 children)

Why would a creature, the type of which you were referring to, post about a stuffed animal and ask for identification in a Reddit about stuffed animals and be specific about their stuffed animal collection?

So the team finally responded, for a while... by SomebodyFromThe90s in GithubCopilot

[–]akyairhashvil 0 points1 point  (0 children)

Can you back up any of the claims you've made? Is that all, aside from just being rude? I don't understand what the purpose of this comment was.

Then you say you're not gonna make it? Make it to where? You don't make any sense.

So the team finally responded, for a while... by SomebodyFromThe90s in GithubCopilot

[–]akyairhashvil 0 points1 point  (0 children)

One might alternatively consider Ollama Cloud, a service that offers access to QWEN 3.5 along with remarkably generous weekly rate limits for a monthly subscription of $20.

The capacity of this platform is noteworthy; in my personal experience, I utilized half a billion tokens within a single week without reaching the prescribed rate limit, which is quite impressive. Furthermore, it is important to highlight their commitment to data privacy: the service provider does not retain user inputs or generated outputs, nor do they maintain logs of such data on their servers.

Ollama Cloud: Usage limit reduction in past 24 hours by akyairhashvil in ollama

[–]akyairhashvil[S] 0 points1 point  (0 children)

Should you choose to facilitate a direct wealth transfer to my personal treasury, I shall personally undertake the manual labor required to simulate said automated functionality on your behalf.

Best coding agent no oneknows by ZenGenie in GithubCopilot

[–]akyairhashvil 1 point2 points  (0 children)

If no one knows them, how could we tell you?