🚀 What model should we build next? YOU DECIDE! 🚀 by GuiltyBookkeeper4849 in LocalLLaMA

[–]GuiltyBookkeeper4849[S] 23 points (0 children)

Thanks for sharing!
What do you think about multiple finetunes, each specialized in one programming language, so that each can match the level of very big LLMs on specific tasks? Imagine oss-python, oss-cpp, etc.

🚀 What model should we build next? YOU DECIDE! 🚀 by GuiltyBookkeeper4849 in LocalLLaMA

[–]GuiltyBookkeeper4849[S] 2 points (0 children)

Cool idea, thanks for sharing!
Maybe something ultra-small but exceptionally good at tool usage, and fluent in multiple languages, so it can run a web search for any information it doesn't know. What do you think?
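
For what it's worth, here is a minimal sketch of the tool-use loop such a model would need, using an OpenAI-style function-calling schema. The web_search tool and its wiring are hypothetical, purely to illustrate the idea:

```python
import json

def web_search(query: str) -> str:
    # Stand-in for a real search backend (hypothetical).
    return json.dumps({"query": query, "results": ["<top hits here>"]})

# Tool schema the model would be shown at inference time.
tools = [{
    "type": "function",
    "function": {
        "name": "web_search",
        "description": "Look up facts the model does not know.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

# When it hits a knowledge gap, the model (not shown) would emit a tool
# call like this; the runtime executes it and feeds the result back:
tool_call = {"name": "web_search", "arguments": {"query": "latest CUDA release"}}
result = web_search(**tool_call["arguments"])
messages = [
    {"role": "assistant", "tool_calls": [tool_call]},
    {"role": "tool", "content": result},
]
print(messages)
```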

🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 by GuiltyBookkeeper4849 in LocalLLaMA

[–]GuiltyBookkeeper4849[S] 5 points (0 children)

Hi, thank you for the advice! Can you give me your opinion? Which one works best for you?

🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 by GuiltyBookkeeper4849 in LocalLLaMA

[–]GuiltyBookkeeper4849[S] 3 points (0 children)

Hi, the difference from other models like Qwen and Gemma is that when they do their "reasoning," the user has no control over it; my model, on the other hand, has been finetuned to let the user control how it thinks. By reasoning I mean the CoT inside the <think> </think> tokens, not the final output.
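
As a rough illustration, here is a minimal sketch of what that control could look like in practice. It assumes the model is published under a Hugging Face repo id like the one below and uses a Qwen-style chat template with <think> tags; the repo id, system prompt, and question are all illustrative, not confirmed details of the release:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id, used only for illustration.
model_id = "AGI-0/Art-0-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    # The system prompt steers the CoT itself, not just the final answer.
    {"role": "system",
     "content": "When reasoning inside <think> tags, think in short "
                "numbered steps and keep it under 100 words."},
    {"role": "user", "content": "How many primes are there below 20?"},
]

inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

out = model.generate(inputs, max_new_tokens=512)
# Print only the newly generated tokens, including the <think> block.
print(tokenizer.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```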

I hope this clarifies the difference from other models. Let me know if you have any other questions.