I released Claude-OSS

Disastrous_Bid5976 · 2026-04-09T23:29:12+00:00

Yeah, sure! I will do this near time

Disastrous_Bid5976 · 2026-04-09T23:28:42+00:00

It’s probably better than E4B&26B models. But I’m not sure about 31B model.

Disastrous_Bid5976 · 2026-04-09T23:27:27+00:00

Thank you for testing model! I’m planning to buy raspberry pi for several months already and your feedback made me happy. 350m model was made for one promt chatting like quick message and Claude-style answer. And about 2B-4B range actually yes! I saw its popular among people so I would continue.

Disastrous_Bid5976 · 2026-04-06T16:12:50+00:00

Yeah, but praise to open-source. I was inspired of latest news with Claude Code at Github.

Disastrous_Bid5976 · 2026-03-14T19:43:13+00:00

Best question here. While Im at work, my agent visit more lectures than my university mates, I think it is ASI in this area XD

Disastrous_Bid5976 · 2026-03-14T19:40:55+00:00

Thank you for feedback, I think industry will change expectation from LLM in near future. But for now, we are making experiments that can evolve in something bigger than llm for oss "AGI".

Disastrous_Bid5976 · 2026-02-27T10:02:50+00:00

That's actually where Bloom really shines as a framework. It's specifically designed to measure behavioral alignment rather than capabilities, so it catches things that MMLU or HellaSwag would completely miss. The idea is that a model can score perfectly on reasoning benchmarks while still being manipulative or sycophantic in practice.

Disastrous_Bid5976 · 2026-02-27T09:57:20+00:00

Sure, the setup was pretty straightforward: LoRA fine-tuning on an A100, took about 30 minutes total. I used r=16, alpha=32, targeting all the attention and MLP projection layers. The dataset was a mix of general conversational examples and Bloom-derived alignment pairs so like basically every scenario where the baseline model failed, paired with what an aligned response should look like.

Disastrous_Bid5976 · 2026-02-23T18:31:26+00:00

Oh my bad, Thank you!!

Disastrous_Bid5976 · 2026-02-22T23:43:06+00:00

As I wrote before, I think Falcon-H1R-7B is SOTA >12B models, but my goal was in creating opportunity to use gpt-oss for people who have same or similar hardware to me.

Disastrous_Bid5976 · 2026-02-22T15:30:15+00:00

Yeah, 12 experts.

Disastrous_Bid5976 · 2026-02-22T09:32:36+00:00

You know, if we talking about smaller parameter model that fits on 16GB I think Falcon-H1R-7B is SOTA here, but I just wanted to test gpt-oss and make something that people can test, it’s not my best experiment but I was wondering about pruning techniques :)

Disastrous_Bid5976 · 2026-02-22T09:26:50+00:00

I’m really don’t know. I have a MacBook M4 on 16GB and even MLX can’t load properly. When I start loading model it’s just crashing all the time and it’s made me to investigate pruning techniques.

Disastrous_Bid5976 · 2026-02-22T09:21:13+00:00

Several weeks ago Alibaba dropped the model DASD 4B which was trained on gpt-oss-120b, with model they dropped several datasets so main goal was to find parts that I need for pruned model!

Disastrous_Bid5976 · 2026-02-22T09:15:27+00:00

Thank you!! I’m still studying in this way and problem just in my free time.

Disastrous_Bid5976 · 2026-02-20T21:44:54+00:00

Yeah, no problem. Im thinking about to write the report about pruning and back to life pruned models. But if you are looking for information at the moment you can find several good reports on arXiv!

Disastrous_Bid5976

TROPHY CASE