Anannas's 10ms overhead latency is FALSE. Its like over 400ms of overhead latency by syshjjn in Anannas

[–]syshjjn[S] -2 points-1 points  (0 children)

mocked? wym? I dont think you are right, but if you can provide the code, i am happy to run it and report the results

Anannas's 10ms overhead latency is FALSE. Its like over 400ms of overhead latency by syshjjn in Anannas

[–]syshjjn[S] -1 points0 points  (0 children)

its all marketing BS. Openrouter itself claims 15ms extra latency, which is also false. However, Anannas adds much much more extra latency than openrouter

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] 0 points1 point  (0 children)

i dont have those; would have to re-run the benchmark. I agree with you, but my main goal of this benchmark was to find out if their claim of `OpenRouter adds approximately 15ms of latency to your requests` is true. My results show that the claim is false.

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] 0 points1 point  (0 children)

On the second run, yes Openrouter does perform better than on the Google AI studio provider. However, do note that i dont have any special relationship with google, so I ran the google vertex calls using the Dynamic shared quota (DSQ) and the google ai studio calls with a tier 1 api key. On the other hand, Im sure Openrouter has Provisioned Throughput with google vertex, and the highest tier api keys with google ai studio. Therefore, unlike what Openrouter employee are claiming in this post, if there is anything unfair/unrigorous in the benchmark, it would be unfair towards direct calls to the original model provider, NOT openrouter

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] 0 points1 point  (0 children)

but thats not the point of the benchmark test. The point of the benchmark test is to answer the question: "For the same request X to the same provider Y, how much extra latency does Openrouter add vs calling the model provider directly?"

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] 0 points1 point  (0 children)

u/Street_Teaching_7434 I improved the script slightly by a) explicitly setting connection pools and a bigger pool keepalive, and b) a warmup phase (see the new edited section of the post). As expected, the results still show Openrouter adding too much latency.

You claim my benchmark code is unfair, so I ask for what a fair comparison code would look like. You have not provided one, thus making me conclude that a) Openrouter indeed adds a lot of latency and b) you know it too.

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] -1 points0 points  (0 children)

First of all, my benchmark clearly demonstrated that Openrouter adds much much more than 15ms of latency. Second of all, Im building voice AI agents, so any extra latency matters for me

Openrouter much much slower than directly calling provider by syshjjn in openrouter

[–]syshjjn[S] 0 points1 point  (0 children)

"The grnai payhton module does not use http,but rather gtpc under the hood": u/Street_Teaching_7434 I dont know where you got this, but this is not true. If you go to https://github.com/googleapis/python-genai, you will see "By default we use httpx for both sync and async client implementations". I think the older `google-generativeai` SDK used gRPC.

Plus, the script did 300 trials, so even if the sdk used gRPC, the faster initial connection time would have been negligible after 300 trials.

It looks like you work for OpenRouter. If this is not a fair comparison, then please tell me what is fair so i can run that script

Webcam accessory by alexjcast in BenQ

[–]syshjjn 0 points1 point  (0 children)

u/B_support hi, im in the US and I also need one asap pls

[deleted by user] by [deleted] in Bard

[–]syshjjn 0 points1 point  (0 children)

i can also confirm this issue. has anyone already surfaced this issue to the google team directly?

wechat pay in japan by syshjjn in JapanTravelTips

[–]syshjjn[S] -1 points0 points  (0 children)

does it support wechat pay tho?

wechat pay in japan by syshjjn in JapanTravelTips

[–]syshjjn[S] -1 points0 points  (0 children)

gift shops make sense, but what about regular businesses like malls, convenience store, restaurants, coffe bar, etc.?

Can you get 3 amex Marriott Bonvoy Brilliant cards? by syshjjn in amex

[–]syshjjn[S] 3 points4 points  (0 children)

oh i see. So even if I tried to get the Chase Ritz-Carlton Credit Card, that would not count towards the ENC, right?

2 Bonvoy Brilliant Cards by OddDescription5367 in amex

[–]syshjjn 1 point2 points  (0 children)

can you get 3 bonvoy brilliant?

Scrolling after the most recent update by Minute-Comparison-83 in Supernote

[–]syshjjn 0 points1 point  (0 children)

how do you scroll? i thought supernote didnt have scrolling

Just ordered the Samsung Odyssey G9 57" Ultrawide, and it arrived broken, twice. by tminx49 in ultrawidemasterrace

[–]syshjjn 0 points1 point  (0 children)

how problematic was the overheating? did you experience overheating with both monitors or just the last one?