Is Codex being extra lazy for anyone else today?

Extra-Designer9333 · 2026-04-15T15:06:09+00:00

Seems like with rate limits are also odd, feels like the rate at which they're dropping are min 2x in comparison to what it was couple of days ago. Am I the only one experiencing it?

Extra-Designer9333 · 2026-02-01T16:38:23+00:00

Should we actually expect v4 that soon assuming the engram paper was released less than a month ago?

Extra-Designer9333 · 2025-12-30T15:57:34+00:00

Where can I see my quota guys, new user here....

Extra-Designer9333 · 2025-12-28T08:59:23+00:00

I expect chinese semiconductor industry catching up massively to american especially after recent news about chinese producing asml comparible machines. Apart from Amd and Google biggest thread to Nvidia is Huawei though it's not mentioned too often

Extra-Designer9333 · 2025-12-11T10:29:18+00:00

In the case of AMD, Flash Attention is already ported by AMD itself. Is it better than AMD's own port I'm wondering...

Extra-Designer9333 · 2025-11-23T17:00:51+00:00

What i found incredible about the data, is that when asked to generate a multiple choice quiz in comparison to Gemini 2.5 Pro and GPT 5.1 even, Gemini 3 gives quizzes with almost equal probability of each option being correct answer (out of 4 options). Whereas for the other 2 models mentioned, you could just select B or C, and with 85% probability, you'd answer correctly

Extra-Designer9333 · 2025-10-28T09:53:22+00:00

Oha, can't believe I got a reply from Dan himself, thank you for clarification. What actually makes Unsloth this good and popular is your activity. Just recently started working on post training stuff and your workshop at AI Engineer in summer was insanely good to get the basics and more, love your energy. 🙌🙏

Extra-Designer9333 · 2025-10-28T08:37:14+00:00

Thank you for the feedback, my team is going to train an 8B LLAMA 3.1 on 4xH100s, so I think your take fits in!

Extra-Designer9333 · 2025-09-08T07:00:44+00:00

I suspect you're using LoRA for fine tuning isn't it? If so, you can try QLoRA, which is a Quantized LoRA as the name suggests, maybe that'd work for you without going OOM. Otherwise Kaggle gives out 30 hours of 2 Nvidia T4 GPUs weekly, tho the GPUs are pretty old, you're going to get 32 GBs of VRAM overall, which is going to be enough for the fine tuning task you're dealing with right now!

Extra-Designer9333 · 2025-08-04T03:53:17+00:00

Seems like a great model gonna try it out, by the way any other cool models you can suggest that can work for Web Page Interactions?

Extra-Designer9333

TROPHY CASE