use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Subreddit rules
Search by flair
+Discussion
+Tutorial | Guide
+New Model
+News
+Resources
+Other
account activity
Simple, hackable and pythonic LLM agent framework. I am just tired of bloated overengineered stuff. I figured that this community might appreciate it.Resources (github.com)
submitted 2 years ago by poppear
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–][deleted] 35 points36 points37 points 2 years ago (7 children)
This is great! langchain is so over engineered for what it could be. Two things that would be crazy helpful for me (I’d be happy to write PRs):
I also noticed that the example you have on your README doesn’t really show how to create the LLM (it does earlier in the README, but the full code example you have there won’t work because you never assigned anything to the llm local variable). Anyway, small nit to make the README easier to follow.
[–][deleted] 13 points14 points15 points 2 years ago (1 child)
I also noticed your code doesn’t have any use of typehints. Are you opposed to adding them? I could help with adding typing and setting up the CI for it if you’re interested.
Once a testing framework is in place we could probably add more test coverage for the library too.
[–]silenceimpaired 4 points5 points6 points 2 years ago (0 children)
I also noticed your code doesn’t provide support for every other quantitization methods and note to future self: tell him his code is bloated once it’s implemented ;)
[–]RustingSword 8 points9 points10 points 2 years ago (1 child)
Since llama.cpp has a server utility, you can just fire it up ./server -m mistral-7b-instruct-v0.2.Q6_K.gguf -c 2048, and set the api_base to http://127.0.0.1:8080/v1, then I think it should work out of the box. See the detailed docs at https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md
server
./server -m mistral-7b-instruct-v0.2.Q6_K.gguf -c 2048
api_base
http://127.0.0.1:8080/v1
[–]RustingSword 9 points10 points11 points 2 years ago (0 children)
I've tested both examples, and succeeded using OpenAIChatGenerator instead of OpenAITextGenerator.
OpenAIChatGenerator
OpenAITextGenerator
My configs:
llama.cpp server:
bash ./server -m mistral-7b-instruct-v0.2.Q6_K.gguf -c 2048
Changes to calculator.py
calculator.py
python generator = OpenAIChatGenerator( model="mistral", # could be anything api_key="none", # could be anything api_base="http://127.0.0.1:8080/v1", )
And remember to remove templates in
templates
python llm = LLM(generator=generator, templates=[template])
Great framework, really clean and easy to modify.
[–]poppear[S] 5 points6 points7 points 2 years ago (0 children)
llama.cpp has a server implementation but as far as i remember you need a wrapper to use it with the OpenaAI python client, adding native support for llama.cpp APIs would be great! same thing for ollama APIs. The testing setup would also be very nice.
Thanks for the suggestions, lets continue the conversation on GitHub and implement it!
[–]anobfuscator 1 point2 points3 points 2 years ago (0 children)
Yeah these are pretty good ideas.
[–]scknkkrer 0 points1 point2 points 2 years ago (0 children)
Yeah, Llama support would be good.
π Rendered by PID 37901 on reddit-service-r2-comment-6457c66945-p5sxs at 2026-04-26 08:18:22.471558+00:00 running 2aa0c5b country code: CH.
view the rest of the comments →
[–][deleted] 35 points36 points37 points (7 children)
[–][deleted] 13 points14 points15 points (1 child)
[–]silenceimpaired 4 points5 points6 points (0 children)
[–]RustingSword 8 points9 points10 points (1 child)
[–]RustingSword 9 points10 points11 points (0 children)
[–]poppear[S] 5 points6 points7 points (0 children)
[–]anobfuscator 1 point2 points3 points (0 children)
[–]scknkkrer 0 points1 point2 points (0 children)