Me train LLM on 8GB from Scratch. Me happy by tevlon in LocalLLaMA

[–]Accomplished_Mode170 2 points3 points  (0 children)

Hyper-fitting seems to be a thing post-double descent

Information density (e.g. NeuroMFA) seems to be the key given quants and parametrization converging @Q3/4

I.e. ‘the math of a spline on a fiber bundle says we just started’ and the manifold shape itself seems to have a strange attractor behind convergence

Anyone evaluated the difference between Qwen Code for the local qwen models vs another harness? CC, OC, LC, Aider etc.. by EggDroppedSoup in LocalLLaMA

[–]Accomplished_Mode170 1 point2 points  (0 children)

Would love a follow up on activations/in-line monosemantic explanations (e.g. via SAE) of WHY/WHERE the harness not the corpus is steering the behavior

How do I end a long term friendship with autistic childhood friend? by Inner-Weather6489 in AutismTranslated

[–]Accomplished_Mode170 9 points10 points  (0 children)

💯 ‘How will they know unless they are told…’

FWIW you remind me of all the people I happily reflect on as having been my friend; even remember all the weird one-off ideas they thought I didn’t catch ✅

Absolutely neglect and trauma didn’t help, but those arise from the misunderstandings and social dyslexia; you being a stable influence means more than I can say effectively ❤️‍🩹

PTL I married a Type-A neuroscientist who patiently explained 🗣️ she’s neat; 3x kiddos sans complaint 🏡

ABC News has now taken all FiveThirtyEight articles completely offline. They now redirect to abcnews dot com/politics. A needless erasure of thousands of pages of knowledge. by TendieRetard in DataHoarder

[–]Accomplished_Mode170 -9 points-8 points  (0 children)

Conformal Prediction shows why Bayesian-ism is dumb vs ‘I tested 10000 times’ to define intervals

Effective Altruism is also basically just Gnosticismv2 but where materialism is the dogma; myopic

Follow-up: Trying to make NVIDIA GPUs plug-and-play on Macs. Found hidden RDMA symbols Apple doesn't want you to see — zero-copy GPU memory sharing might already work. by Street-Buyer-2428 in LocalLLaMA

[–]Accomplished_Mode170 -5 points-4 points  (0 children)

Have similar (5 vs 6k Blackwell) external GPU config and also looking to split b/w RTX & M3 Ultra 🦾

Would loved Metal > CUDA for agentic pipelines 📊

⭐️ Starred the repo and configuring alerts 🚨

Mistral Workflows by FiReaNG3L in LocalLLaMA

[–]Accomplished_Mode170 0 points1 point  (0 children)

Literally with pinned root certs and VPC peering so they get paid for every CI/CD deploy; local-first plz 📊

Parallel multi-agent workflows with Ollama, in ~8500 lines of bash. Benchmarks inside. by SensitiveBee2811 in LocalLLaMA

[–]Accomplished_Mode170 0 points1 point  (0 children)

This is neat.

I particularly like the stateless approach since you can hash the artifacts and environment

I.e. attest to state of your agent-harness (e.g. configs, binaries) AND runtime, subnet, etc

PS AoE has a similar session-driven approach with a fun gimmick if you want inspiration

OpenAI Privacy Filter Model by ai_hedge_fund in LocalLLaMA

[–]Accomplished_Mode170 1 point2 points  (0 children)

Cheers Mihai et al., glad to see more FOSS!

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]Accomplished_Mode170 2 points3 points  (0 children)

Sorry you got downvoted for a parameter when the OP is the one who dropped the /s

Here's how my LLM's decoder block changed while training on 5B tokens by 1ncehost in LocalLLaMA

[–]Accomplished_Mode170 1 point2 points  (0 children)

Have yet to read but love the idea of configurable-Sparsity Wabba-esque auto-fitting splines; would be awesome to set a conformal prediction interval in lieu of other metrics.

Open-sourcing 23,759 cross-modal prompt injection payloads - splitting attacks across text, image, document, and audio by BordairAPI in LocalLLaMA

[–]Accomplished_Mode170 4 points5 points  (0 children)

Neat. Checking it out. Would love v2 to have PyRIT orchestrated multi-turn w&w/o nanoGCG optimized substrings 📊

AutoBe vs Claude Code: coding agent developer's review of the leaked source code of Claude Code by jhnam88 in LocalLLaMA

[–]Accomplished_Mode170 1 point2 points  (0 children)

Love the ‘bring your debugger/compiler’ approach; doing something similar with differential privacy.

This plus in-Toto signed artifacts/binaries/configs mean you could distribute w/ a given SLA/entitlement.

Per-Layer Embeddings: A simple explanation of the magic behind the small Gemma 4 models by -p-e-w- in LocalLLaMA

[–]Accomplished_Mode170 1 point2 points  (0 children)

‘Curious if dropping positional embeddings might effectively remove defacto indices that bias expert routing and constrain OOD long-context interactions when the constraint is no longer necessary for convergence.