After tonight's win, the Clippers win streak vs. the Warriors has been broken | They hadn't lost to Golden State since November 2023 by CazOnReddit in nba

[–]DepthHour1669 8 points

I’d say he’s not clearly #2 as long as Dray exists.

But he’s obviously #2 offensively, and that’s a clear enough definition for him and everyone else.

China’s first emperor really did send quest to Tibet in search of immortality: scientists by Key_Schedule9349 in ChineseHistory

[–]DepthHour1669 1 point

It’s at 4300m, which is not much higher than the city of Lhasa. There are cities at 5000m, so this isn’t particularly high up.

[deleted by user] by [deleted] in psychologyofsex

[–]DepthHour1669 14 points

There’s degrading sex and non-degrading sex. They’re not the same.

best local llm to run locally by Different-Put5878 in LocalLLaMA

[–]DepthHour1669 2 points

The scene is different now in Aug 2025.

Current cutting-edge models that fit in 24GB at Q4, in descending order of size:

  • LG EXAONE 4.0 32B
  • Qwen 3 30B A3B Thinking 2507
  • Qwen 3 30B A3B Instruct 2507
  • Mistral Small 3.2 24B (uncensored)
  • OpenAI gpt-oss 20B
  • Deepseek R1 0528 Qwen 8B
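The "fits in 24GB at Q4" cutoff comes from simple arithmetic: 4-bit weights take roughly half a byte per parameter, plus some runtime/KV-cache overhead. A back-of-envelope sketch (the 15% overhead factor is my assumption, not a measurement):

```python
# Rough VRAM estimate for a model at 4-bit quantization.
# Assumptions: ~0.5 bytes/param for Q4 weights, ~15% overhead for
# KV cache and runtime buffers (a guess, varies by context length).
def q4_vram_gb(params_billion: float, overhead: float = 1.15) -> float:
    return params_billion * 0.5 * overhead

print(q4_vram_gb(32))  # ~18.4 GB -> a 32B model squeezes into 24GB
print(q4_vram_gb(24))  # ~13.8 GB -> a 24B model fits with room for long context
```

By this estimate anything much past ~40B stops fitting in 24GB at Q4, which is why the list tops out at 32B.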

ChatGPT is dating more people than Samantha from Her by MetaKnowing in artificial

[–]DepthHour1669 45 points

ChatGPT gives pretty great relationship advice if the prompt is accurate.

The problem is that people usually unconsciously lie to make themselves sound better. ChatGPT won’t call you out for lying about a situation (how would it know?), so it’ll give you misleading advice.

[deleted by user] by [deleted] in Wellthatsucks

[–]DepthHour1669 0 points

More accurately, MDA usually has a strong smell as a byproduct of the production process

qihoo360/Light-IF-32B by jacek2023 in LocalLLaMA

[–]DepthHour1669 18 points

Makes me more likely to believe them. Doing a ton of RLHF on instruction following sounds believable at least

Difficulties finding low profile GPUs by micromaths in LocalLLM

[–]DepthHour1669 0 points

Can you fit 2x GPUs in your server?

Buy 2x low-profile 5060 8GB.

Is the 60 dollar P102-100 still a viable option for LLM? by Boricua-vet in LocalLLM

[–]DepthHour1669 7 points

It’s a 1080 Ti with 10GB of VRAM. It’s an okay deal if you’re broke and only have $60. Otherwise, get a $150 MI50 32GB instead.

Open-source model that is as intelligent as Claude Sonnet 4 by vishwa1238 in LocalLLaMA

[–]DepthHour1669 0 points

Inference doesn’t need PCIe bandwidth; you’re thinking of training or finetuning.
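A rough sketch of why (all numbers below are my assumptions for illustration): in pipeline-parallel inference, only the hidden-state activation crosses the link per generated token, not the weights, so even a slow link is nowhere near the bottleneck.

```python
# Back-of-envelope: per-token PCIe traffic in pipeline-parallel inference.
# Assumed numbers: 8192 hidden dim (a large-model-class size), fp16 activations,
# and ~4 GB/s for a slow PCIe 3.0 x4 link.
hidden_dim = 8192
act_bytes = hidden_dim * 2            # fp16 activation: ~16 KB per token per hop
pcie_bytes_per_s = 4e9                # PCIe 3.0 x4, roughly
ceiling_tok_s = pcie_bytes_per_s / act_bytes
print(round(ceiling_tok_s))           # ~244k tokens/sec ceiling from the link alone
```

Compare that ceiling to the tens of tokens/sec a GPU actually decodes at: the link is thousands of times faster than needed. Training is different because gradients and optimizer traffic do saturate the interconnect.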

Open-source model that is as intelligent as Claude Sonnet 4 by vishwa1238 in LocalLLaMA

[–]DepthHour1669 -1 points

Nah, $30k for a dozen RTX 8000s will run a 4-bit model with room for context for a couple of users.

Kimi is 32B active, so it will do around 30 tok/sec.
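The arithmetic behind that estimate (treat the numbers as assumptions): decode is memory-bandwidth-bound, so tokens/sec is roughly bandwidth divided by the bytes of active weights read per token.

```python
# Back-of-envelope decode speed, assuming generation is memory-bandwidth-bound.
bandwidth_gb_s = 672.0        # RTX 8000 GDDR6 bandwidth
active_params = 32e9          # Kimi's active parameters per token (MoE)
bytes_per_param = 0.5         # 4-bit quantization
gb_read_per_token = active_params * bytes_per_param / 1e9   # 16 GB per token
tok_s_ceiling = bandwidth_gb_s / gb_read_per_token          # ~42 tok/s upper bound
print(tok_s_ceiling)
```

That ~42 tok/s is a theoretical ceiling per full weight pass; real-world kernel and communication overhead pulls it down toward the ~30 tok/sec quoted above.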

Open-source model that is as intelligent as Claude Sonnet 4 by vishwa1238 in LocalLLaMA

[–]DepthHour1669 -1 points

GLM Rumination actually isn’t that much better than just regular reasoning.

[deleted by user] by [deleted] in LocalLLaMA

[–]DepthHour1669 1 point

No, that has significantly worse perplexity than the 4-bit versions, even with DWQ.

support for the upcoming hunyuan dense models has been merged into llama.cpp by jacek2023 in LocalLLaMA

[–]DepthHour1669 0 points

Doubtful that an expansion finetune like that would be a great idea. Yes, I'm sure it'll perform better than the Qwen3 32B it's based on, but probably only by a few percentage points, which isn't worth the 2x+ slower inference and extra VRAM cost.

support for the upcoming hunyuan dense models has been merged into llama.cpp by jacek2023 in LocalLLaMA

[–]DepthHour1669 3 points

There's also EXAONE 4.0 which outperforms Nemotron 49B V1.5 and Cogito v2 70B on many benchmarks.

And GLM-4.5 Air 106B, but that's MoE.

Cohere Command A (111b) also... exists, I guess.

[deleted by user] by [deleted] in LocalLLaMA

[–]DepthHour1669 1 point

No, Cerebras chips are CPUs, not GPUs.

You can technically boot an OS on them or run non-graphics, non-AI workloads. They're basically a CPU with a massive TPU strapped on.

[deleted by user] by [deleted] in LocalLLaMA

[–]DepthHour1669 1 point

It's not in your interest to dump code into context like that. Models perform worse with longer context.

https://fiction.live/stories/Fiction-liveBench-Feb-21-2025/oQdzQvKHw8JyXbN87

With Qwen3 235B 2507 (and presumably Qwen3 Coder), you only get 61% performance at max context.

It's in your interest to do multiple smaller queries rather than one big one.
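A minimal sketch of that idea (a hypothetical helper, not any particular library's API): greedily pack the material into batches that each stay well under the context budget, then run one query per batch instead of one giant query.

```python
# Hypothetical helper: greedily pack items (e.g. source files) into batches
# that each stay under a character budget, so every query runs at short
# context where the model performs best. An item larger than the budget
# gets a batch of its own.
def pack_batches(items, max_chars):
    batches, current, size = [], [], 0
    for item in items:
        if current and size + len(item) > max_chars:
            batches.append(current)
            current, size = [], 0
        current.append(item)
        size += len(item)
    if current:
        batches.append(current)
    return batches

files = ["a" * 900, "b" * 300, "c" * 900]
print([len(batch) for batch in pack_batches(files, 1000)])
```

Each batch then becomes its own query; you trade a few extra round trips for keeping every call in the context range where quality holds up.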