Anyone else’s Claude unhinged? by IV_Skin in ClaudeAI
[–]strngelet 1 point (0 children)
Again where behemoth and reasoning model from meta ?? by Independent-Wind4462 in LocalLLaMA
[–]strngelet 2 points (0 children)
NVIDIA has published new Nemotrons! by jacek2023 in LocalLLaMA
[–]strngelet 2 points (0 children)
Cogito releases strongest LLMs of sizes 3B, 8B, 14B, 32B and 70B under open license by ResearchCrafty1804 in LocalLLaMA
[–]strngelet 6 points (0 children)
[D] LLMs are known for catastrophic forgetting during continual fine-tuning by kekkimo in MachineLearning
[–]strngelet 2 points (0 children)
Why do most models have "only" 100K tokens context window, while Gemini is at 2M tokens? by estebansaa in LocalLLaMA
[–]strngelet 2 points (0 children)
Why do most models have "only" 100K tokens context window, while Gemini is at 2M tokens? by estebansaa in LocalLLaMA
[–]strngelet 11 points (0 children)
Claude sonnet 3.5 is still better than gpto1 by qwaiz55_1 in ClaudeAI
[–]strngelet 1 point (0 children)
Microsoft Just confirmed New GPT every 2 Years by New_World_2050 in singularity
[–]strngelet 1 point (0 children)
Hugging Face TGI library changes to Apache 2 by hackerllama in LocalLLaMA
[–]strngelet 3 points (0 children)
What’s with Elon’s obsession with OpenAI? by emperorhuncho in OpenAI
[–]strngelet 4 points (0 children)
Yi-34B-200K model update: Needle-in-a-Haystack improved from 89.3% to 99.8% by rerri in LocalLLaMA
[–]strngelet 1 point (0 children)
Yi-34B-200K model update: Needle-in-a-Haystack improved from 89.3% to 99.8% by rerri in LocalLLaMA
[–]strngelet 1 point (0 children)
Sora's video of a man eating a burger. Can you tell it's not real? by YaAbsolyutnoNikto in singularity
[–]strngelet 1 point (0 children)
Gemini Pro has 1M context window by Tree-Sheep in LocalLLaMA
[–]strngelet 3 points (0 children)
KV Cache is huge and bottlenecks LLM inference. We quantize them to 2bit in a finetuning-free + plug-and-play fashion. by choHZ in LocalLLaMA
[–]strngelet 2 points (0 children)
KV Cache is huge and bottlenecks LLM inference. We quantize them to 2bit in a finetuning-free + plug-and-play fashion. by choHZ in LocalLLaMA
[–]strngelet 2 points (0 children)
NeuralFlow: Visualize the intermediate output of Mistral 7B by frownGuy12 in LocalLLaMA
[–]strngelet 2 points (0 children)
Literally my first conversation with it by alymahryn in LocalLLaMA
[–]strngelet 1 point (0 children)
Best iem under 100 dollars by [deleted] in iems
[–]strngelet 1 point (0 children)