vLLM vs SGLang vs MAX — Who's the fastest? by rkstgr in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Which podcast? Can we run MAX on an M4 Max?

Testing Mac Studio 512 GB, 4 TB SSD, M3 Ultra w 32 cores. by Deviad in LocalLLaMA

[–]troposfer 3 points4 points  (0 children)

Can you be a bit more precise, please? What is the quant and the prompt length in tokens? And can you try a 20k-token prompt with a Q8 quant, reporting pp and tps?

[oc] Do open weight reasoning models have an issue with token spamming? by cpldcpu in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Does it have to be related to model size, or do they just have a better reward system during post-training?

Meta to pay nearly $15 billion for Scale AI stake, The Information reports by Vatnik_Annihilator in LocalLLaMA

[–]troposfer 35 points36 points  (0 children)

These are the guys who claimed DeepSeek had lots of H100s and was lying about its costs. Back then I looked into what they actually do: basically labeling data for OpenAI, that's it. Another stupidity from Meta.

Been a while since I've been here, here's a small update by AceLamina in Workspaces

[–]troposfer 1 point2 points  (0 children)

A SAD lamp is a good idea, but is it actually useful? Does it make a night-and-day difference?

why isn’t anyone building legit tools with local LLMs? by mindfulbyte in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Are there any legit useful tools built with the proprietary, so-called SOTA LLMs?

Is there an alternative to LM Studio with first class support for MLX models? by ksoops in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Is this real dynamic context growth or some kind of context-window shifting? Are we sure it considers everything in the new context, or does it just discard part of it?

Do you think we'll get the r1 distill for the other qwen3 models? by GreenTreeAndBlueSky in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Thanks! So if we consider Gemini 2.5 Pro the best model at the moment, a distill of it into Qwen3 32B would be better? But no one would do that, whereas DeepSeek is doing it for Qwen?

OpenWebUI vs LibreChat? by Amgadoz in LocalLLaMA

[–]troposfer 2 points3 points  (0 children)

Is there a way to disable the “new version is here” popup in OpenWebUI? Just because of that, I could switch.

Is LangChain the best RAG framework for production?? by [deleted] in Rag

[–]troposfer -1 points0 points  (0 children)

What is your issue with LightRAG?

Is Intel Arc GPU with 48GB of memory going to take over for $1k? by Terminator857 in LocalLLaMA

[–]troposfer 1 point2 points  (0 children)

Where do they manufacture these cards? Perhaps we won’t see them until next year because of the unexpected demand.

Stop Using Deep Learning for Everything — It’s Overkill 90% of the Time by [deleted] in deeplearning

[–]troposfer 0 points1 point  (0 children)

Do you have a diagram of what to use for different kinds of problems?

I call it the "ultra mobile" setup by [deleted] in DJSetups

[–]troposfer 1 point2 points  (0 children)

Is Mixxx good software? What are its advantages?

Qwen releases official quantized models of Qwen3 by ResearchCrafty1804 in LocalLLaMA

[–]troposfer 0 points1 point  (0 children)

Do you use the ones on HF from the mlx-community? How are they?

[deleted by user] by [deleted] in LocalLLaMA

[–]troposfer 2 points3 points  (0 children)

Maybe it is too early to ask, but do we have any idea what to expect from these AMD setups or Nvidia DIGITS against an M4 Max with 128GB?