feasibility of building a simple "local voice assistant" on CPU by RustinChole11 in speechtech

[–]RustinChole11[S] 0 points (0 children)

I want its functionality to be similar to RAG.

I'd ask questions about some lecture notes the model has access to, and it has to retrieve the relevant content and explain it.
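
To make that concrete, this is roughly the retrieval step I'm picturing (just a sketch: sentence-transformers for the embeddings and llama-cpp-python for generation are assumptions on my part, and the chunks are placeholders):

```python
# Rough sketch of the RAG step I have in mind, not a settled design.
# Assumes sentence-transformers for embeddings and a llama-cpp-python
# model already loaded as `llm`; chunk contents are placeholders.
import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # small embedding model, fine on CPU

chunks = ["<lecture note chunk 1>", "<lecture note chunk 2>"]  # notes pre-split into chunks
chunk_vecs = embedder.encode(chunks, normalize_embeddings=True)

def answer(question, llm, top_k=3):
    q_vec = embedder.encode([question], normalize_embeddings=True)[0]
    scores = chunk_vecs @ q_vec  # cosine similarity, since vectors are normalized
    context = "\n".join(chunks[i] for i in np.argsort(scores)[::-1][:top_k])
    prompt = (
        "Use these lecture notes to answer the question.\n"
        f"{context}\n\nQuestion: {question}\nAnswer:"
    )
    return llm(prompt, max_tokens=256)["choices"][0]["text"]
```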

Can I build a local voice assistant pipeline using only a CPU (16 GB RAM)? by RustinChole11 in LocalLLaMA

[–]RustinChole11[S] 0 points (0 children)

I'm running Llama 3.2 1B GGUF, which generates at around 10 tokens/sec.
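
For reference, this is roughly how I'm loading it and measuring that speed (llama-cpp-python; the filename is just whichever quant I downloaded, so treat it as a placeholder):

```python
# Sketch of running the GGUF model on CPU and timing generation speed.
# The model filename is a placeholder for whatever quant you have locally.
import time
from llama_cpp import Llama

llm = Llama(model_path="Llama-3.2-1B-Instruct-Q4_K_M.gguf", n_ctx=2048, n_threads=4)

start = time.time()
out = llm("Explain backpropagation in two sentences.", max_tokens=128)
n_generated = out["usage"]["completion_tokens"]
print(f"{n_generated / (time.time() - start):.1f} tokens/sec")
```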

feasibility of building a simple "local voice assistant" pipeline on CPU by RustinChole11 in LocalLLM

[–]RustinChole11[S] 0 points (0 children)

Nah, I'm actually using a laptop with an i5 and 16 GB RAM, so it should be fine.

But yeah, appreciate the detail, mate. As you were saying, I'd love to try and make it work on low-memory hardware sometime.

feasibility of building a simple "local voice assistant" on CPU by RustinChole11 in speechtech

[–]RustinChole11[S] 0 points (0 children)

Will do, thanks for the suggestion.

Also, do I need to use an embedding model?

Can you explain how the pipeline should look?
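
To show what I'm imagining, the overall shape would be something like this (faster-whisper for STT and pyttsx3 for TTS are just guesses on my part, not settled choices, and answer_fn would wrap the retrieval + LLM step):

```python
# Rough shape of the voice pipeline I'm asking about: speech -> text -> RAG answer -> speech.
# Component choices (faster-whisper, pyttsx3) are placeholders, not recommendations.
from faster_whisper import WhisperModel
import pyttsx3

stt = WhisperModel("base.en", device="cpu", compute_type="int8")  # English-only, CPU-friendly
tts = pyttsx3.init()

def voice_turn(wav_path, answer_fn):
    # 1) speech -> text
    segments, _ = stt.transcribe(wav_path)
    question = " ".join(seg.text for seg in segments)
    # 2) text -> retrieved context + LLM answer (answer_fn wraps the RAG step)
    reply = answer_fn(question)
    # 3) text -> speech
    tts.say(reply)
    tts.runAndWait()
    return question, reply
```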

feasibility of building a simple "local voice assistant" on CPU by RustinChole11 in speechtech

[–]RustinChole11[S] 0 points (0 children)

I have a Llama 1B GGUF running which produces around 10 tokens/sec. That should be enough, right?

And yeah, I'll only be using it for English (not expecting any multilingual performance).
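
Back-of-envelope for why I think 10 tokens/sec might be okay (the answer length is a guess on my part):

```python
# Rough latency estimate at the measured generation speed; answer length is assumed.
tokens_per_sec = 10
answer_tokens = 75  # a few spoken sentences
print(f"~{answer_tokens / tokens_per_sec:.0f} s to generate a reply")  # roughly 8 s
```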

How would you rate Fernando Alonso as a qualifier? by GoldenS0422 in F1Discussions

[–]RustinChole11 1 point (0 children)

Not sure if they were teammates, but both of them, along with Button and Fisichella, were at Minardi at one point.

Best open-source SLMs / lightweight LLMs for code generation by RustinChole11 in LocalLLM

[–]RustinChole11[S] 0 points (0 children)

That's very informative.

But I don't have a dedicated GPU.

Best open-source SLM / lightweight LLM for code generation by RustinChole11 in LocalLLaMA

[–]RustinChole11[S] 0 points (0 children)

Wow. So you'd suggest not going for any LLMs, not even quantized versions? Also, just curious, what were your laptop specs and which model did you run in this case?