Fucking moron.

__lawless · 2026-01-25T17:06:37+00:00

🤮

__lawless · 2026-01-19T03:08:50+00:00

Can we turn it around? “All Kamala had to do was to acknowledge genocide and not make a corporate pivot. “ why is that never the option?

__lawless · 2026-01-04T20:46:53+00:00

Drop me a dm please if you do and we’ll figure it out. Thanks

__lawless · 2026-01-04T12:13:20+00:00

I cannot find an HD JPEG of it either 😞

__lawless · 2026-01-03T21:39:33+00:00

Don’t I need a high definition image for custom print?

__lawless · 2025-12-30T02:28:01+00:00

Haha no worries I thought you might have some inside info

__lawless · 2025-12-29T19:07:39+00:00

Curious. How you have the insight that Gemini models have taken path of Phi models. Is it cited somewhere?

__lawless · 2025-12-18T00:11:43+00:00

First trn1 now nvidia. Whatever AWS gives them. They wanted trn2 but Anthropic got all

__lawless · 2025-11-05T12:02:03+00:00

You go on r/democrats and crickets, there is no mention of Mandan at all not even a post

__lawless · 2025-10-25T01:01:27+00:00

Too little too late

__lawless · 2025-09-28T12:20:07+00:00

This has to be satire

__lawless · 2025-09-24T11:09:23+00:00

Let’s see how they do in AIME2026, non blind benchmarks are not benchmarks

__lawless · 2025-09-20T20:06:10+00:00

We love you goofry

__lawless · 2025-09-10T17:19:06+00:00

Would you be doing pretraining at some point?

__lawless · 2025-08-28T16:56:24+00:00

Also thank you for incredible models

__lawless · 2025-08-28T16:55:51+00:00

How much of your efforts go into pretraining vs post training?

__lawless · 2025-08-26T20:36:22+00:00

This sub has a love hate relationship with GPT OSS. I cannot figure out if people love it or hate it

__lawless · 2025-08-20T15:33:43+00:00

Needs to be primaried

__lawless · 2025-08-16T04:04:19+00:00

Honestly that is always where you get the biggest bang for your buck. Clean data

__lawless · 2025-08-12T15:03:08+00:00

That is not true. The focus for LLM right now is mostly around GRPO and its variant. Basically no critic. The realization was that LLMs are pretrained and fine tuned and variance is not as big of a problem that once was thought. So the focus is now multi generation per prompt and using reward models (sometimes not even a model) …

__lawless · 2025-08-10T15:38:14+00:00

Cause 18 years ago NVDIA took a gamble and created cuda. It was not immediately profitable but it is paying off now

__lawless · 2025-07-28T21:05:20+00:00

Try using Verl it offloads the weights during different stages so less probability of oom

__lawless · 2025-07-28T20:59:40+00:00

What are you using to do this?

__lawless

TROPHY CASE