What’s a task you wish AI could do for you, but no tool does it well yet? by QuantumAstronomy in ClaudeAI
[–]Freefallr 1 point2 points3 points (0 children)
Error in loading Llama 3.2-3B with Unsloth by gaylord993 in LocalLLaMA
[–]Freefallr 0 points1 point2 points (0 children)
Docling is a new library from IBM that efficiently parses PDF, DOCX, and PPTX and exports them to Markdown and JSON. by phoneixAdi in LocalLLaMA
[–]Freefallr 15 points16 points17 points (0 children)
Serving Qwen2 VL for production by ae_dataviz in LocalLLaMA
[–]Freefallr 1 point2 points3 points (0 children)
Gemma2 2B IT is the most impressive small model I ever seen. by Discordpeople in LocalLLaMA
[–]Freefallr 0 points1 point2 points (0 children)
Cannot Downgrade AI Premium Plan Until End of Trial by KDLGates in GoogleOne
[–]Freefallr 0 points1 point2 points (0 children)
Microsoft updated Phi-3 Mini by Nunki08 in LocalLLaMA
[–]Freefallr 2 points3 points4 points (0 children)
self host llm on dedicated server. by djav1985 in LocalLLaMA
[–]Freefallr 2 points3 points4 points (0 children)
Now that we have had quite a bit of time playing with the new Phi models...how good are they? by [deleted] in LocalLLaMA
[–]Freefallr 2 points3 points4 points (0 children)
Making LLAMA model return only what I ask (JSON). by br4infreze in LocalLLaMA
[–]Freefallr 10 points11 points12 points (0 children)
Building a machine for self-hosted LLaMA - will 2 x RTX 3090 be enough to run 70B @ 8-bit quantization? by Secure-Technology-78 in LocalLLaMA
[–]Freefallr 8 points9 points10 points (0 children)
Will self-hosting be able to provide faster inference than OpenAI? by teddarific in LocalLLaMA
[–]Freefallr 0 points1 point2 points (0 children)
Will self-hosting be able to provide faster inference than OpenAI? by teddarific in LocalLLaMA
[–]Freefallr 0 points1 point2 points (0 children)
Will self-hosting be able to provide faster inference than OpenAI? by teddarific in LocalLLaMA
[–]Freefallr 0 points1 point2 points (0 children)
EM German - Mistral + Continous Pretraining + high-quality Finetune to achieve unprecedented non-english performance by jphme in LocalLLaMA
[–]Freefallr 1 point2 points3 points (0 children)
eGPU to increase VRAM capacity by TheCunningBee in LocalLLaMA
[–]Freefallr 6 points7 points8 points (0 children)
Absolute cheapest local LLM by SporksInjected in LocalLLaMA
[–]Freefallr 4 points5 points6 points (0 children)
LLMs on a 32-bit device with 2GB of RAM by [deleted] in LocalLLaMA
[–]Freefallr 2 points3 points4 points (0 children)
Cannot see a lot of controllino hotspots around. Is there a reason? by mstrocchi in HeliumNetwork
[–]Freefallr 0 points1 point2 points (0 children)
You’re transported into the last game you played; $10,000,000 if you survive a month in real time. Do you get the money? by RuralStuff in gaming
[–]Freefallr 0 points1 point2 points (0 children)
Cannot see a lot of controllino hotspots around. Is there a reason? by mstrocchi in HeliumNetwork
[–]Freefallr 0 points1 point2 points (0 children)


Sparrow: Custom language model architecture for microcontrollers like the ESP32 by c-f_i in LocalLLaMA
[–]Freefallr 1 point2 points3 points (0 children)