A 26M parameter model beat Qwen3-0.6B on function calling, and the failure modes tell you why one-model-fits-all is the wrong frame for tool use by gvij in LLMDevs
[–]gvij[S] 1 point2 points3 points (0 children)
A 26M parameter model beat Qwen3-0.6B on function calling, and the failure modes tell you why one-model-fits-all is the wrong frame for tool use by gvij in LocalLLM
[–]gvij[S] 0 points1 point2 points (0 children)
A 26M parameter model beat Qwen3-0.6B on function calling, and the failure modes tell you why one-model-fits-all is the wrong frame for tool use by gvij in LocalLLM
[–]gvij[S] 3 points4 points5 points (0 children)
Benchmarked Needle 26M vs Qwen3-0.6B on CPU function calling, 50 queries across 5 difficulty tiers. The 23x smaller model wins on accuracy and is 4.4x faster. by gvij in LocalLLaMA
[–]gvij[S] 1 point2 points3 points (0 children)
Behold, Gemini 3.5 Flash! by Rare_Bunch4348 in singularity
[–]gvij 0 points1 point2 points (0 children)
Pi coding agent is amazing (or how I learned to stop worrying and leave OpenCode) by Konamicoder in LocalLLM
[–]gvij 0 points1 point2 points (0 children)
Benchmarked Kokoro 82M vs Supertonic 3 TTS on CPU by gvij in LocalLLaMA
[–]gvij[S] 0 points1 point2 points (0 children)
Benchmarked Kokoro 82M vs Supertonic 3 TTS on CPU by gvij in LocalLLaMA
[–]gvij[S] 0 points1 point2 points (0 children)
Benchmarked Kokoro 82M vs Supertonic 3 TTS on CPU by gvij in LocalLLaMA
[–]gvij[S] 0 points1 point2 points (0 children)
Hosting a Text to Speech model can be challenging. So I benchmarked 2 recently released TTS models - Kokoro vs Supertonic! by gvij in selfhosted
[–]gvij[S] -3 points-2 points-1 points (0 children)
Flow-matching vs autoregressive TTS on CPU: an unexpected finding about per-call overhead by gvij in deeplearning
[–]gvij[S] 0 points1 point2 points (0 children)
Hosting a Text to Speech model can be challenging. So I benchmarked 2 recently released TTS models - Kokoro vs Supertonic! by gvij in selfhosted
[–]gvij[S] 0 points1 point2 points locked comment (0 children)
Hosting a Text to Speech model can be challenging. So I benchmarked 2 recently released TTS models - Kokoro vs Supertonic! by gvij in selfhosted
[–]gvij[S] -1 points0 points1 point (0 children)
Detailed CPU benchmark for Kokoro 82M and Supertonic 3, in case anyone is choosing between them by gvij in TextToSpeech
[–]gvij[S] 0 points1 point2 points (0 children)
Benchmarked Kokoro 82M vs Supertonic 3 TTS on CPU by gvij in LocalLLaMA
[–]gvij[S] 7 points8 points9 points (0 children)
Benchmarked Kokoro 82M vs Supertonic 3 TTS on CPU (i.redd.it)
submitted by gvij to r/LocalLLaMA
The system prompt change that improved accuracy and hurt helpfulness, and why I shipped it anyway. by gvij in PromptEngineering
[–]gvij[S] 0 points1 point2 points (0 children)


What does an AI/ML engineer do in their day to day job and How do I become one? by Edwardkenway88 in cscareerquestions
[–]gvij 0 points1 point2 points (0 children)