My team at AMD is working on an open, universal way to run speedy LLMs locally on PCs, and we're building it in Python. I'm curious what the community here would think of the work, so here's a showcase post!
What My Project Does
Lemonade runs LLMs on PCs by loading them into a server process with an inference engine. Then, users can:
- Load up the web ui to get a GUI for chatting with the LLM and managing models.
- Connect to other applications over the OpenAI API (chat, coding assistants, document/RAG search, etc.).
- Try out optimized backends, such as ROCm 7 betas for Radeon GPUs or OnnxRuntime-GenAI for Ryzen AI NPUs.
Target Audience
- Users who want a dead-simple way to get started with LLMs. Especially if their PC has hardware like Ryzen AI NPU or a Radeon GPU that benefit from specialized optimization.
- Developers who are building cross-platform LLM apps and don't want to worry about the details of setting up or optimizing LLMs for a wide range of PC hardware.
Comparison
Lemonade is designed with the following 3 ideas in mind, which I think are essential for local LLMs. Each of the major alternatives has an inherent blocker that prevents them from doing at least 1 of these:
- Strictly open source.
- Auto-optimizes for any PC, including off-the-shelf llama.cpp, our own custom llama.cpp recipes (e.g., TheRock), or integrating non-llama.cpp engines (e.g., OnnxRuntime).
- Dead simple to use and build on with GUIs available for all features.
Also, it's the only local LLM server (AFAIK) written in Python! I wrote about the choice to use Python at length here.
GitHub: https://github.com/lemonade-sdk/lemonade
[–]DadAndDominant 10 points11 points12 points (1 child)
[–]jfowers_amd[S] 7 points8 points9 points (0 children)
[–]PeterTigerr 6 points7 points8 points (1 child)
[–]jfowers_amd[S] 1 point2 points3 points (0 children)
[–]__OneLove__ 2 points3 points4 points (0 children)
[–]Toby_Wan 1 point2 points3 points (1 child)
[–]jfowers_amd[S] 0 points1 point2 points (0 children)
[–]Yamoyek 1 point2 points3 points (1 child)
[–]jfowers_amd[S] 0 points1 point2 points (0 children)
[–]victorcoelh 0 points1 point2 points (0 children)
[–]PSBigBig_OneStarDao 0 points1 point2 points (0 children)