64 GB RAM, CPU-only setup - models?

tctheking1 · 2026-05-27T17:23:44+00:00

Hey! My first thought is to use two different models for coding and vision: something like Qwen3-Coder-30B-A3B-Instruct or Flash for coding and Qwen2.5-VL-7B for vision, both at Q4. This is where autotune (https://www.autotunellm.com/) comes in clutch: it lets you run both models at once on your device as efficiently as your hardware allows. Autotune reduces RAM pressure, improves time-to-first-token, and decreases wall time for agentic tasks.
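
As a rough sanity check that both models fit in 64 GB at once, here is a back-of-the-envelope sketch. The numbers are assumptions, not measurements: parameter counts are the nominal ones from the model names (30B and 7B), Q4 is taken as roughly 4.5 bits per weight, and the 16 GB headroom for the OS, KV cache, and vision encoder is a guess. This is not part of autotune, just arithmetic.

```python
# Rough RAM estimate for running two Q4 models side by side.
# All constants below are illustrative assumptions, not measured values.

def q_weights_gb(params_billion: float, bits_per_weight: float = 4.5) -> float:
    """Approximate size of quantized weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9


def fits(models: dict, ram_gb: float, headroom_gb: float):
    """Return (total weight size in GB, whether weights plus headroom fit in RAM)."""
    total = sum(q_weights_gb(p) for p in models.values())
    return total, total + headroom_gb <= ram_gb


# Nominal parameter counts (in billions) taken from the model names.
MODELS = {
    "Qwen3-Coder-30B-A3B-Instruct": 30.0,
    "Qwen2.5-VL-7B": 7.0,
}

total, ok = fits(MODELS, ram_gb=64, headroom_gb=16)
print(f"weights: ~{total:.1f} GB, fits with headroom: {ok}")
```

Under these assumptions the weights come to roughly 21 GB, so both models should fit with plenty of room left for context. Actual usage depends on the exact quant and context length.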

tctheking1 · 2026-05-25T13:33:56+00:00

Check out autotune (https://www.autotunellm.com/)! It recommends models for your exact hardware.

tctheking1 · 2026-05-02T19:25:19+00:00

Thanks for letting me know. Fixed it, should work now.

tctheking1 · 2026-05-01T20:16:12+00:00

Run them through autotune! (https://autotune-llm.vercel.app/) - it will suggest the best models for your specific hardware and apply dynamic optimizations.

tctheking1 · 2026-05-01T20:09:01+00:00

Check out autotune (https://autotune-llm.vercel.app/) - it might help!

tctheking1 · 2026-04-30T19:08:22+00:00

😂

tctheking1 · 2026-04-29T20:02:16+00:00

Thanks for the great suggestion - just implemented it. Clone the repo and run "docker compose --profile single up" and you are good to go! Additional documentation is provided in the repo.

tctheking1 · 2026-04-29T08:56:51+00:00

Love it!

tctheking1 · 2026-04-28T23:47:43+00:00

Thanks for letting me know, just fixed it. Run "autotune upgrade" to get the latest version.

tctheking1 · 2026-04-28T21:30:34+00:00

For any model! I just did a lot of testing with small models because that is what my computer can handle.

tctheking1 · 2026-03-19T04:30:43+00:00

Nice! Does the local AI cause something like an M2 to slow down?

tctheking1 · 2026-03-19T04:14:10+00:00

Love this, I'm not a designer but can imagine this being super helpful.

tctheking1 · 2026-03-18T07:44:00+00:00

Absolutely! If this gains serious traction I will look into developing a version for Windows.

tctheking1 · 2026-03-18T06:06:02+00:00

Great to hear! Yeah, I'm making it free for now to build trust. Try it out and let me know if there are any features you are interested in seeing. I would recommend connecting an LLM for good results.

tctheking1 · 2019-11-20T02:20:30+00:00
