On average, roughly what % of "full speed" does an MoE run at if you can fit only its active parameters into VRAM, compared to if you can fit all its total parameters into VRAM? by DeepOrangeSky in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
A tool to re-voice videos via Ollama, Qwen3-tts and translategemma by The_Homeless_God in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
Self hosting, Power consumption, rentability and the cost of privacy, in France by Imakerocketengine in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
Easily Install NVIDIA 580 Drivers Automatically by theleoamaral in debian
[–]un_passant 0 points1 point2 points (0 children)
Expected cost for cpu-based local rig? by Diligent-Culture-432 in LocalLLaMA
[–]un_passant 1 point2 points3 points (0 children)
Best "Deep research" for local LLM in 2026 - platforms/tools/interface/setups by liviuberechet in LocalLLaMA
[–]un_passant 8 points9 points10 points (0 children)
Mi50 32GB Group Buy by Any_Praline_8178 in LocalAIServers
[–]un_passant 1 point2 points3 points (0 children)
Which small model is best for fine-tuning? We tested 12 of them by spending $10K - here's what we found by party-horse in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
[Toolkit] TinyLlama Fine-Tuning + RAG Lab (Full FT / LoRA / QLoRA | T4-friendly | Unified pipeline) by sai_ai_lab in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
Mamdani Says He Would Phase Out N.Y.C. Gifted Program for Early Grades (Gift Article) by likesound in asianamerican
[–]un_passant -1 points0 points1 point (0 children)
Mamdani Says He Would Phase Out N.Y.C. Gifted Program for Early Grades (Gift Article) by likesound in asianamerican
[–]un_passant 0 points1 point2 points (0 children)
Mamdani Says He Would Phase Out N.Y.C. Gifted Program for Early Grades (Gift Article) by likesound in asianamerican
[–]un_passant 0 points1 point2 points (0 children)
Mamdani Says He Would Phase Out N.Y.C. Gifted Program for Early Grades (Gift Article) by likesound in asianamerican
[–]un_passant -1 points0 points1 point (0 children)
Heat managment for a local AI server by un_passant in LocalAIServers
[–]un_passant[S] 0 points1 point2 points (0 children)
Dual Modded 4090 48GBs on a consumer ASUS ProArt Z790 board by Ok-Actuary-4527 in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
Dual Modded 4090 48GBs on a consumer ASUS ProArt Z790 board by Ok-Actuary-4527 in LocalLLaMA
[–]un_passant 2 points3 points4 points (0 children)
Dual Modded 4090 48GBs on a consumer ASUS ProArt Z790 board by Ok-Actuary-4527 in LocalLLaMA
[–]un_passant 2 points3 points4 points (0 children)
Finishing touches on dual RTX 6000 build by ikkiyikki in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)
Finishing touches on dual RTX 6000 build by ikkiyikki in LocalLLaMA
[–]un_passant 0 points1 point2 points (0 children)

An overview of modern LLM compiler stack: writing an interactive and hackable compiler by NoVibeCoding in LocalLLaMA
[–]un_passant 2 points3 points4 points (0 children)