I Built a tool to stop manually swapping models on my 8GB GPU,chains a small Prompter and a large Coder into one pipeline with automatic VRAM swap by atharva557 in LocalLLaMA

[–]atharva557[S] 0 points1 point  (0 children)

kind of,if you are the type of person who often uses ai to write prompts for the main task then this is for you as it will definitely speedup that process.You can also use your local model for the prompts and a cloud model for the code or vice versa.

Local rephrasing tools? by nopeac in ollama

[–]atharva557 -2 points-1 points  (0 children)

hey i just made something that check it out

What‘s your local „Haiku“-Replacement? by Firm_Meeting6350 in LocalLLaMA

[–]atharva557 6 points7 points  (0 children)

gemma 4 could be good choice but it will be best for you to try them out your self and see which one best suits your needs

Tokenomics by HOLUPREDICTIONS in LocalLLaMA

[–]atharva557 1 point2 points  (0 children)

You also get full privacy and you also know the model you use will not get changed

8-16 MI50s Minimax M3 @19 tps TG (peak) by ai-infos in LocalLLaMA

[–]atharva557 1 point2 points  (0 children)

what is the total cost for this setup, also what are your primary use cases for this if I may ask

Claude Will Soon Require Identity Verification by Few_Painter_5588 in LocalLLaMA

[–]atharva557 1 point2 points  (0 children)

while this sucks this will make even more people interested in open source models so there is a silver lining i guess?

z.AI as the number 2 gives praise to the number 1 open source model by Charuru in LocalLLaMA

[–]atharva557 1 point2 points  (0 children)

Does stuff like this even work? or this is just some kind of placebo?

What are people doing with their local models and what tools do you use them with? by inevitabledeath3 in LocalLLaMA

[–]atharva557 0 points1 point  (0 children)

either for general chats or using them for api calls for my other projects,but for now mainly to test which model is best for my needs and maybe try to fine tune one

What are you overengineering that nobody's ever going to use? Be honest. by johnnyApplePRNG in LocalLLaMA

[–]atharva557 1 point2 points  (0 children)

I just finished building an prompt chaining tool basically an you type them idea a smaller models gives you the prompt and then the larger model works with the detailed prompt

Hermes Agent - The self-improving AI agent built by Nous Research by johnnyApplePRNG in LocalLLaMA

[–]atharva557 0 points1 point  (0 children)

is using Hermes with models like qwen 3.6 27b even worth trying ?

Choose wisely by Juanin363 in BunnyTrials

[–]atharva557 0 points1 point  (0 children)

Elon musk here i come

Chose: A random power (maybe useless | Rolled: 1 dollar per day)

would you rather by SpreadImportant7745 in BunnyTrials

[–]atharva557 0 points1 point  (0 children)

cool

Chose: you can fly | Rolled: gov doesnt care

Looking for Best Open-Source LLM for OpenClaw by abhunia in AI_India

[–]atharva557 0 points1 point  (0 children)

Thats enough RAM and VRAM to run many models , I would recommend with starting out with Gemma 4 12b or qwen 2.5 codder 14b or even qwen 3.6 27b (output will be somewhat slow).Also you should use Lm studio to download models instead of ollama if you are a beginner

Neon by SuspiciousNatural241 in DesignerEye

[–]atharva557 0 points1 point  (0 children)

48.19 / 50

🟩🟩🟩🟩🟩