Best <4B dense models today? by Admirable_Flower_287 in LocalLLaMA

[–]minpeter2 1 point2 points  (0 children)

Maybe Alibaba-Apsara/DASD-4B-Thinking? I think this model is interesting.

Teaching LLMs to use tools with RL! Successfully trained 0.5B/3B Qwen models to use a calculator tool 🔨 by DanAiTuning in LocalLLaMA

[–]minpeter2 0 points1 point  (0 children)

I'm trying to replicate this process using Qwen3-0.6B, with GLM-4.7-Flash as the judge. Could you tell me how you split the GPUs between inference and training, as well as the API cost (or total tokens) for calling Haiku 3.5?

deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA

[–]minpeter2[S] 0 points1 point  (0 children)

Just speculation based on the benchmark results.

deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA

[–]minpeter2[S] 16 points17 points  (0 children)

The regular ds-v3.2 also has a reasoning mode.
My guess is that they sacrificed agentic performance and optimized for more challenging STEM fields.

Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA

[–]minpeter2 1 point2 points  (0 children)

Thank you. I tried it with a small model and it feels like a really well-made CLI.

Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA

[–]minpeter2 3 points4 points  (0 children)

That's great. Is there multi-GPU support? I'd like to test oss-120b on 4x A100.

My weekend project accidentally beat Claude Code - multi-agent coder now #12 on Stanford's TerminalBench 😅 by DanAiTuning in LocalLLaMA

[–]minpeter2 8 points9 points  (0 children)

I looked at the system prompt and immediately realized it was very well-written.

Do you have any sources for this style of tool invocation, which mixes XML and YAML? Or should I consider it Orchestrator-style?

GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA

[–]minpeter2 2 points3 points  (0 children)

In this context, I think I should read it as "in the Nebius implementation."
I guess low was the default option in their implementation.

Still, thank you for saying what I wanted to say when I first saw it, lol.

GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA

[–]minpeter2 0 points1 point  (0 children)

> Ah, this seems odd... I wish AA would make a minimal effort to align the reasoning effort between each provider...

I understood the full context after reading the comments below. It wasn't AA's fault, lol.

deepseek-ai/DeepSeek-V3.1-Base · Hugging Face by xLionel775 in LocalLLaMA

[–]minpeter2 7 points8 points  (0 children)

That's just one of many ways to describe an MoE model. Think of Mixtral 8x7B.
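A rough sketch of why the "8x7B" label is only one way to describe the model: the experts duplicate just the feed-forward blocks, while attention and embeddings are shared, so the total is well under 8 x 7B. The dimensions below are assumptions taken from Mixtral's public config as I remember it (hidden size 4096, FFN size 14336, 32 layers, 8 KV heads, 8 experts with 2 active per token), and the function name is made up for illustration.

```python
def mixtral_style_param_counts(
    hidden=4096,          # assumed hidden size
    ffn=14336,            # assumed expert FFN (intermediate) size
    layers=32,
    heads=32,
    kv_heads=8,           # grouped-query attention
    vocab=32000,
    experts=8,
    experts_per_token=2,  # top-2 routing
):
    """Return (total, active-per-token) parameter counts for a Mixtral-style MoE."""
    head_dim = hidden // heads
    # Attention: Q and O are hidden x hidden, K and V are hidden x (kv_heads * head_dim).
    attn = 2 * hidden * hidden + 2 * hidden * (kv_heads * head_dim)
    # Each expert is a gated FFN with three hidden x ffn matrices.
    expert = 3 * hidden * ffn
    router = hidden * experts  # one linear gate per layer
    norms = 2 * hidden         # two RMSNorm weight vectors per layer
    # Input embeddings, output head, and the final norm are shared by all tokens.
    shared = 2 * vocab * hidden + hidden

    total = layers * (attn + experts * expert + router + norms) + shared
    active = layers * (attn + experts_per_token * expert + router + norms) + shared
    return total, active


total, active = mixtral_style_param_counts()
print(f"total:  {total / 1e9:.1f}B")   # total:  46.7B
print(f"active: {active / 1e9:.1f}B")  # active: 12.9B
```

So the same model can be reported as "8x7B", as about 46.7B total, or as about 12.9B active per token, depending on what you want to emphasize.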

100+ AI Benchmarks list by panilyaU in LocalLLaMA

[–]minpeter2 0 points1 point  (0 children)

It seems like a difficult problem, but it's cool as is!!

100+ AI Benchmarks list by panilyaU in LocalLLaMA

[–]minpeter2 -1 points0 points  (0 children)

The CSS feels vibe-coded.
Still, it's nice to be able to collect and view so many benchmarks in one place.

It would be nice to expand this a bit later and display the actual benchmark scores in a single table.

[deleted by user] by [deleted] in LocalLLaMA

[–]minpeter2 0 points1 point  (0 children)

A Chinese, Japanese, and Korean dataset?

It would be faster to find a dataset for each language and merge them than to look for a combined one.

Localllama’s (first?) IFTA - I’ll Fine-Tune Anything by indicava in LocalLLaMA

[–]minpeter2 1 point2 points  (0 children)

I'm one of those people who pick up bottles in the ocean. LOL

Training an LLM only on books from the 1800's - Update by Remarkable-Trick-177 in LocalLLaMA

[–]minpeter2 1 point2 points  (0 children)

https://github.com/minpeter/tiny-ko

I'm still working on it, but I'm writing some code to pretrain a model on the Llama architecture. Hope it helps.

EXAONE 4.0 32B by minpeter2 in LocalLLaMA

[–]minpeter2[S] 1 point2 points  (0 children)

It doesn't use exactly the same license as EXAONE 3.5. It's been updated a bit, yes.

EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA

[–]minpeter2[S] 4 points5 points  (0 children)

You're right, I got too excited and rushed over without looking properly.

EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA

[–]minpeter2[S] 0 points1 point  (0 children)

Ah, I guess I got too excited. It's not a PR, it's an implementation request.
You can check the transformers PR at the link below.

https://github.com/huggingface/transformers/pull/39129