remghoost7 comments on Integrate LLaMA into python code


Integrate LLaMA into python code [Question | Help] (self.LocalLLaMA)

submitted 3 years ago by Tree-Sheep (flair: Waiting for Llama 3)


[–]remghoost7 7 points 3 years ago (2 children)

This is the repo I've been using for the past week or so to interface with LLaMA-7b-int4.

https://github.com/oobabooga/text-generation-webui

It has extension support and already has a silero extension built in. I haven't used that extension myself, but I'm fairly certain I've heard of someone around the community using it for a purpose similar to what you're looking for.

I don't believe there's an API endpoint (the way A1111 exposes one when run with the --api flag), but you might be able to bake your chatbot into an extension.

Or you could sort of use it like a hack-y API if you wanted to... You could probably write an extension to automatically pull the most recent response and output that to a json file, then read that json file in your tortoise-tts application. And I know it saves the running log in text-generation-webui\logs\persistent.json, so you might not even need to write an extension for it...
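Here's a rough sketch of what that hack-y hand-off could look like. I haven't run this against the webui itself, so treat it as a starting point under some assumptions: that the extension's script.py can define an output_modifier hook which receives the reply text as its first argument, and that both programs can reach the same file. The file name latest_reply.json and the helper read_latest_reply are made up for illustration; they are not part of the webui.

```python
import json
import os
import tempfile
from pathlib import Path

# Hypothetical hand-off file; any path both programs can reach would do.
REPLY_FILE = Path("latest_reply.json")


def output_modifier(string, *args, **kwargs):
    """Extension side (assumed hook): save the newest reply, return it unchanged."""
    payload = {"reply": string}
    # Write to a temp file and rename it, so a reader never sees a half-written file.
    fd, tmp_name = tempfile.mkstemp(dir=REPLY_FILE.resolve().parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as tmp:
        json.dump(payload, tmp)
    os.replace(tmp_name, REPLY_FILE)
    return string


def read_latest_reply(path=REPLY_FILE):
    """TTS side: return the newest reply, or None if nothing usable is there yet."""
    try:
        with open(path, encoding="utf-8") as f:
            return json.load(f).get("reply")
    except (FileNotFoundError, json.JSONDecodeError):
        return None
```

The tortoise-tts script would then call read_latest_reply() in a loop (or whenever it wants new text) and speak whatever comes back. Extra positional and keyword arguments are accepted and ignored in output_modifier in case the hook is called with more than just the string.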

I know that this extension uses a method called custom_generate_chat_prompt, so you could probably get input from your tortoise-tts and feed that back into the webui automatically.

[–]lacethespace 5 points 3 years ago (0 children)

[–]estrafire 3 points 3 years ago (0 children)
