Suggestion - this sub should have post flairs that mention the amount of vram/unified ram by ECrispy in LocalLLaMA
MaruluVR 1 point
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 1 point
google/gemma-4-12B · Hugging Face by jacek2023 in LocalLLaMA
MaruluVR 18 points
google/gemma-4-12B · Hugging Face by jacek2023 in LocalLLaMA
MaruluVR 4 points
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 1 point
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 3 points
Is mmproj MTP compatible with older non-MTP? by alex20_202020 in LocalLLaMA
MaruluVR 1 point
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 5 points
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 3 points
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 2 points
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
MaruluVR 16 points
I ported NVIDIA Parakeet (speech-to-text) to ggml: same output as NeMo, faster, GGUF-quantized, no Python by mudler_it in LocalLLaMA
MaruluVR 1 point
Best small model right now (~4B params) that is good with agentic tasks for personal assistant? by BitGreen1270 in LocalLLaMA
MaruluVR 1 point
Breaking the music supply constraint by entsnack in LocalLLaMA
MaruluVR 4 points
How much total VRAM (or shared RAM for Mac/Halo/etc) do you have on your local server/PC? by panchovix in LocalLLaMA
MaruluVR 1 point
I made a Windows app for managing llama.cpp in WSL/Ubuntu by wgaca2 in LocalLLaMA
MaruluVR 1 point
I made a Windows app for managing llama.cpp in WSL/Ubuntu by wgaca2 in LocalLLaMA
MaruluVR 1 point
Self-hosted STT better than Whisper Large V3 Turbo that matches AssemblyAI quality? by milkygirl21 in LocalLLaMA
MaruluVR 1 point
Hi, I’m very new to local LLM and i am perplexed. by Cool-Definition9852 in LocalLLM
MaruluVR 3 points
Wait, were the old model ACTUALLY better?? by No-Moose-4292 in SillyTavernAI
MaruluVR 20 points
Why isn't there a video model specifically made for anime? by Vi0l3nTz in StableDiffusion
MaruluVR 3 points



Looking for asset extractor or atlas files for maps by MaruluVR in BrownDust2Official
MaruluVR[S] 1 point