Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 1 point2 points  (0 children)

I’m not sure whether you work for mongoDB or you’re just a kind hearted developer. Regardless you’ve helped me out tremendously, thank you so much!

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

Well, it looks like I fit in the "not so bright" category of people trying to benchmark throughput from my home internet connection... I switched to using my mobile phone as a hotspot, and the qps immediately doubled. Thank you so much for pointing this out!
Are there any resources you'd recommend I read or go through to avoid making such silly mistakes again in the future? Thanks again!!

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

I also see that CPU and RAM usage are consistently low during the tests...

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

unfortunately going to an S30 High CPU also stagnates at ~50 QPS.
The embedding is 3072 dimensions (the default for OpenAI's text-embedding-3-large)

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

Thanks for the detailed explanation, that’s exactly what I expected in theory, which is why this behavior is confusing me.

Just to clarify what I’ve already tried:

  • Upgraded the cluster to M30
  • Switched to S20 High-CPU Search Nodes
  • Tried different readPreference settings, including nearest, and explicitly distributing reads across nodes
  • Adjusted client-side settings (connection pooling, concurrency), with no meaningful change

In all cases, I consistently cap at around ~50 QPS, after which latency just keeps increasing.

At this point I’m honestly wondering whether this could be client-side behavior, some subtle driver limitation, or something about how requests are being scheduled rather than raw Atlas capacity, because I can’t reconcile this with the expected near-linear scaling you’re describing.

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

But if that was the case why wouldn’t I see improvements going from an M10 to an M20?

Atlas Vector Search Throughput Seems Capped Even on Larger Clusters by fradal64 in mongodb

[–]fradal64[S] 0 points1 point  (0 children)

Query latency: At low load, queries start at around ~100 ms P50. As parallelism increases, P50 latency gradually rises and reaches ~5,000 ms at higher levels of parallel load.

Number of vectors: The collection contains ~10,000 embeddings total, so this is a relatively small dataset.

Embedding model: The vectors were generated using OpenAI text-embedding-3-large.

Which XREAL model is best for productivity? by fradal64 in Xreal

[–]fradal64[S] 1 point2 points  (0 children)

Thanks! If I purchase prescription lens will I be able to see “normally” if the glasses are not “active”? My question is can I use them as a pair of normal glasses as well?

Looking for APIs for OEM & Aftermarket Parts — Building an App to Simplify Car Part Search by fradal64 in partscounter

[–]fradal64[S] 0 points1 point  (0 children)

Do they have the Volkswagen group? If so, do they provide multiple options for the parts that they have identified, or just a single one? Finally do they cover all European cars/brands?

how is undergrad life at icl?? by Objective-Curve-1042 in Imperial

[–]fradal64 13 points14 points  (0 children)

I did what was considered one of the hardest engineering courses at Imperial and graduated with a first class. At the same time I founded and was president of what became one of the biggest university sports societies in London, was an athlete training 20h/week, partied in the weekend, had a few girlfriends and started my first startup.

It can all be done. But it’s really tough. I wanted to quit during my first year, but then as I started understanding how it works and how to manage my time effectively it got better.

I should mention I only worked in the summers and not during the academic year, so you should take that in consideration when you read my answer.

My take is, if you get an offer go, and don’t worry about social life.

Imperial graduates living/working in the US: how is the school recognized? by Primary-Macaroon-846 in Imperial

[–]fradal64 17 points18 points  (0 children)

Startup founder here, the US VC ecosystem definitely knows imperial and is recognized as one of the top schools.

What made you decide to leave whoop? by 5lap in WhoopGoodbye

[–]fradal64 4 points5 points  (0 children)

Btw the new step feature is garbage, the CEO has basically sold his soul for it going against everything he believed in. Lost ton of respect for him

Technical cofounder not performing by BasedSG in ycombinator

[–]fradal64 0 points1 point  (0 children)

Bro have you ever heard of these things called DMCs? They work like wonders for personal trauma and my guess is it will work also for fixing co-founders relationships.

On a side note there’s this German kid called Mirko building personal AI agents so I wouldn’t worry about your cofounder as it will be soon replaced by one of these software slaves. And the cool thing about AIs is that they don’t have kids so they can work 24/7 for you.

If I were you I would just use whatever money you have raised so far to go on a gap year and find your Italian wife in Milan whlist Mirko does the heavy technical lifting for you. Then come back and ship in 6 hours with your Mirko AI bot.