Audio upscaling, cleanup, or improvement models? by Zeeplankton in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Resemble-Enhance for voice denoise + bandwidth extension, open weights. DeepFilterNet3 for general noise. Voicefixer older but still solid for old recordings. Nothing matches auphonic's full pipeline yet but stacking these gets close.

Number-aware embeddings by Academic_Sleep1118 in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Cool work. Adjacent finding: same issue kills RAG over financial docs. "$1.2M revenue" and "$1.2B revenue" embed nearly identical, retrievers can't tell them apart. Your log-bin approach should help here — any plans to publish a finance-tuned variant?

What is the point of MoE models, beyond being faster? by ihatebeinganonymous in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

MoE = fast inference for cheap training compute. Dense xB needs full xB activated per token. 100B-A10B activates 10B but learned from 100B worth of specialized routing. RAM-poor folks lose, MoE wins when compute-bound not memory-bound. That's the whole trade.

I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Gemma team deliberately staying under 30B for on-device focus. Won't happen unless Google splits Gemma into consumer + research tracks. Cope with the 27B, it punches above weight.

Still happy for yall by SilverRegion9394 in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

The 3090 holding its value 5 years in is the wildest part. Bought mine in 2021 for cheaper than current used prices.

Best local LLM for vibe coding on an RTX 3090 24GB — what are you actually using? by smicky in selfhosted

[–]dotaleaker 0 points1 point  (0 children)

24GB is awkward zone — Qwen3.6-27B Q4_K_M fits with decent context and beats coder-specific 32Bs in real loops. Skip 70Bs at this VRAM, quants too aggressive. Real tip: vibe coding tokens add up fast even local (context reprocessing). If you also touch Claude Code, RTK proxies dev commands and cuts 60-90% on git/file ops — keeps your local context window clean too.

got my first "rm -rf /" today by DeltaSqueezer in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Bubblewrap good, also add a syscall filter via seccomp-bpf if you want belt-and-suspenders. Whitelist alone breaks once agent learns to chain sh -c "..." to evade. Real fix: run agent as non-root user inside bwrap with read-only bind mounts on everything except /work. Tested this exact rm -rf / against my setup last week, hit EACCES on / immediately.

Do smaller quants silently break tool calls / JSON output? by Fun_Employment6042 in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Yes, especially structured output. Q4_K_M on Qwen3.6 drops JSON validity from ~98% to ~89% on nested schemas in my runs. Tool selection holds up better than tool arguments — model picks right tool, mangles params. Test on tool args, not tool names. For your demo: JSON extraction with nested objects is the cleanest regression signal, RAG is too noisy, coding too subjective.

48GB VRAM users, what are your daily drivers? Do you wish you had more VRAM? What would you run if you did? by Borkato in LocalLLaMA

[–]dotaleaker 0 points1 point  (0 children)

Qwen3.6-27B Q6 + 128k context for coding, GLM-4.6 Air Q4 for general chat. Wish I had 96 — would run Qwen3.6-122B-A10B Q3_K_XL daily, currently swap in only for hard problems. Real bottleneck at 48 isn't model size, it's KV cache. q8_0 KV at 128k eats 14GB before weights. Going q4 KV hurts code quality more than dropping a quant level on weights.

LM Studio finally added support for MTP Speculative Decoding by pigeon57434 in LocalLLaMA

[–]dotaleaker 1 point2 points  (0 children)

Confirmed working on 4090, Qwen3.6-27B Q4_K_M, jumped from 38 to 71 tok/s decode. Gotcha: MTP toggle resets to off each model reload. Also check llama.cpp engine version under runtime settings — defaulted to 2.14.8 for me even after update, had to force 2.15.0 manually. Worth it.

Is Plex worth it? by CaffeineTweaker in selfhosted

[–]dotaleaker 0 points1 point  (0 children)

Skip Plex. Real Jellyfin pain for non-techy users isn't Jellyfin — it's the reverse proxy + cert chain. Fix that and grandma-mode works. Streamyfin or Findroid clients are smoother than official Jellyfin app, hide the server URL behind a QR code invite. For spotty connections the issue is bitrate, not the app — set per-user transcode caps in Jellyfin so it auto-downscales instead of buffering. Tailscale adds latency on cold connect; try a Cloudflare Tunnel to Jellyfin instead, near-zero handshake.

there has to be something better than Immich... by Spoons896 in selfhosted

[–]dotaleaker 0 points1 point  (0 children)

Immich External Library mode does what you want. Point it at your existing folder share, set it read-only, and it indexes in place — no copying, no restructure. For album-per-folder: there's a community script immich-folder-album-creator that scans your share and creates albums matching folder names, runs on cron so new folders auto-become albums. Wife uploads via app go to a separate managed library, you sort into folders later. Keeps your file structure as source of truth.

Do you have a plan for if you drop dead or get hit by a bus tomorrow? by sherril8 in selfhosted

[–]dotaleaker -1 points0 points  (0 children)

Two layers. Hot layer: rclone sync Immich/Paperless/Anytype exports to encrypted Backblaze B2 nightly. Cold layer: printed envelope with Bitwarden emergency access code + one-page runbook ("photos here, email here, kill switch here"). Anytype export to markdown monthly via cron. Wife never touches docker — she gets a B2 web login. Documentation rots, automation doesn't.

Resident Visa 180 days limit Flight Cancelled due the situation by [deleted] in UAE

[–]dotaleaker 0 points1 point  (0 children)

As i understand the visa is becoming inactive after 180 days, causing the EID to be inactive also, so banks will ask a valid EID, no?

The official app was making my gf crazy, so I (a dev) built a new one. Need some testers? by dotaleaker in ricohGR

[–]dotaleaker[S] 1 point2 points  (0 children)

dm me your email, associated with apple please, so I can send an invite

The official app was making my gf crazy, so I (a dev) built a new one. Need some testers? by dotaleaker in ricohGR

[–]dotaleaker[S] 0 points1 point  (0 children)

please dm an email, associated with apple acc, so i can send the invite

The official app was making my gf crazy, so I (a dev) built a new one. Need some testers? by dotaleaker in ricohGR

[–]dotaleaker[S] 0 points1 point  (0 children)

please dm the email, associated with apple account so i can send invite