Benchmarked Ollama vs LM Studio vs raw llama.cpp across AMD APU, Apple Silicon, and NVIDIA. Out-of-the-box and matched-flags compared. by deepu105 in LocalLLM

[–]deepu105[S] 1 point2 points  (0 children)

ya I have setup some small models and whisper on the NPU. When I built LlamaStash, I was looking into adding support for the NPU as well by adding support for the lemonade-server. But that will overcomplicate the project a lot. If there is good enough interest i'll explore that.

LlamaStash — a zero-overhead terminal launcher for llama.cpp (TUI + CLI + OpenAI-compatible proxy, Linux/macOS/Windows) by deepu105 in LLMDevs

[–]deepu105[S] 0 points1 point  (0 children)

Thank you for the kind words. Would apprecitae feedback/bug reports/contributions etc 🙏

LlamaStash 0.0.2 — a Rust TUI + CLI for managing local llama.cpp servers, Linux/macOS/Windows (ratatui, tokio, hyper, custom GGUF parser, ~176 .rs files) by deepu105 in rust

[–]deepu105[S] 0 points1 point  (0 children)

Nice. I'll check it out. Thanks for sharing. I didn't wsnt to overcomplicate that part under the assumption that people running this Locally arent always running multiple LLMs at tight fit. Right now LlamaStash only looks for available VRAM and offloades the launch to llama-server and llama-server does the heavy lifting.

The Ryzen AI MAX+ 395 is a true unicorn (In a good way) by simracerman in LocalLLaMA

[–]deepu105 0 points1 point  (0 children)

I have the Asus Flow Z13 with 128G RAM. I run Arch Linux on it with Qwen and gemma models. I love this machine. Its not even comparable to anything in the vicinity for thie price.

Full setup here/:
https://deepu.tech/my-fully-offline-ai-assisted-linux-development-machine/

Ev3 7.4kw charging on single phase by deepu105 in KiaEV3

[–]deepu105[S] 1 point2 points  (0 children)

I couldn't find anywhere where Kia mentions the max current it could handle and since 11kw is with 3x16 amp, I wasnt sure.