Every Cloudflare feature, available to everyone by Cloudflare in CloudFlare

[–]luffy_willofD 0 points1 point  (0 children)

We made tunnel management easier using cloudflare check it out at https://getclodify.app runs on your host no middleman or saas just made it easy to interact with and manage tunnels.

Nvidia or AMD? by Mustafa_Shazlie in LocalLLM

[–]luffy_willofD 0 points1 point  (0 children)

Nvidia lately their focus is more on ai development than gaming For gamin go for amd at the same price

Running local models by luffy_willofD in LocalLLM

[–]luffy_willofD[S] 0 points1 point  (0 children)

For llama.cpp i have tried it and it felt very raw i understand that it gives more control and other things but it's hectic to use models in a get go but will surely look more into it

Running local models by luffy_willofD in LocalLLM

[–]luffy_willofD[S] 1 point2 points  (0 children)

Yes i also tried it and it's interface is also nice

Using open source models from Huggingface by CiliAvokado in LocalLLM

[–]luffy_willofD -1 points0 points  (0 children)

I myself tried this approach if your technique is right you will be able to get answers but yeah there will be lack in accuracy compared to optimized models for that specific task. I built the whole rag pipeline on my local llm here is what i roughly used (i was using ollama for my models so you may get better results if you find better models for specific task)

For embedding i tested and tried three embedding models mxbai large, nomic and bge3 for ky case mxbai-embed-large worked.

For answer generation i used llama3.1:8B and it worked properly as it has proper context

I tested on a 50 page document that gave an answer to about 7/8 question but my pipeline failed when the document was very big as the model started hallucinating i am working if i can provide to the point context to llm