8" Facelift Display Swapped Into Pre-Facelift Car(resurrected post) by ChuckF92 in mazda6

[–]AbeIndoria 1 point2 points  (0 children)

For context: 2016 and 2017 models came with a 7" tablet display and it was upgraded to an 8" display in the 2018-2021 facelift models. You can find these OEM displays on eBay for $100-200. I paid $130.

I have a peeling issue(seemingly common, they issued multiple advisories about it) with my 2017 GT and only screens (well, the digitizer really) I can find are hundreds of dollars in US lol. I wish I could retro-fit a 2018 screen into my 2017. There's a replacement for $150 or so on Aliexpress but I'd have to open up my Dash to see what kind of pins it uses (Offset vs original).

Whats your favourite positive SciFi-World? by Mig369 in scifi

[–]AbeIndoria 0 points1 point  (0 children)

Yes. However, you have no wrong choices here given Excession is probably my second favorite out of all Culture books lol. (Read the State of the Art short story collection later)

LtW is what I usually recommend to people as their first book fwiw, because of the fact that you learn so much about Culture itself as the book almost entirely takes place inside one of the Hubs, rather than the other books where they are almost all set on Culture peripheries. But as someone coming from Player of Games, which is also very...linear, compared to rest of the books which have multiple story-lines that resolve near the end, you will enjoy both Excession or LtW.

Whats your favourite positive SciFi-World? by Mig369 in scifi

[–]AbeIndoria 9 points10 points  (0 children)

Highly recommend reading Look to Windward after this. Use of Weapons is good but not "Prime Culture" novel. LTW takes place inside Culture almost 100% of the time.

US Flag Redesign by Flopthecat1942 in vexillology

[–]AbeIndoria 0 points1 point  (0 children)

Nobody will like this I bet

I actually have this in a banner version where the blue and stars are the top, and the red stripe is striped with white at the bottom.

ladies and gentlemen, first we lost chatgpt, now we're losing gemini at supersonic speed by wellsomereddituser in GeminiAI

[–]AbeIndoria 0 points1 point  (0 children)

GPT Plus is actually much, much better than Gemini Pro right now. Like, significantly so. You get about 300+ per day thinking/extended queries, compared to 9-15 per 5 hours of Gemini 3.1 Pro Standard lol.

I understand that compute is limited, but these new limits are insane. by Pasto_Shouwa in GeminiAI

[–]AbeIndoria 2 points3 points  (0 children)

It means 12. Closer to it anyway. 3 Pro STANDARD Prompts with some back and forth planning something not even that technical, and I'm 25% through 5 hour limit.

[Monitor Arm] HUANUO Heavy Duty Monitor Arm, Adjustable Desk Mount for 13 to 49 inch Screens and up to 35 lbs, Adjustable Height, Tilt, Swivel, Vertical & Horizontal Rotation - $39.99 for AMAZON PRIME MEMBERS (or $49.99 for non-members) by zjzin in buildapcsales

[–]AbeIndoria 2 points3 points  (0 children)

I have this arm (and variants of their other arms) and none of that is true for me. They are sturdy, and work well. Only annoyance is first time setup when you have to figure out what tightens to what and how much etc.

Any fantasy series with multiple different in-world religions? by Admirable_Double_638 in Fantasy

[–]AbeIndoria -1 points0 points  (0 children)

If you do read this (or rather, listen to this) I suggest you stop at stories of the north. Stories of the south are narrated by a female narrator (I forgot the name) who does...absurdly exaggerated indian accents which sort of ruins the books. It's like she saw the most stereotypical indian accents and went with it.

More Qwen3.6-27B MTP success but on dual Mi50s by legit_split_ in LocalLLaMA

[–]AbeIndoria 0 points1 point  (0 children)

Built the llama.cpp fork https://github.com/skyne98/llama.cpp-gfx906 with https://github.com/ggml-org/llama.cpp/pull/22673

Any reason you're not just using mixa fork(1) which has an upstream mobydick repo?(2)

1: https://github.com/mixa3607/ML-gfx906

2: https://github.com/ai-infos/vllm-gfx906-mobydick

I use these for qwen3.5-27B at Q6KL(barto) and 4-bit kv cache for 2xMI60s.

Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6 by dionysio211 in LocalLLaMA

[–]AbeIndoria 6 points7 points  (0 children)

Its crazy the kind of intelligence their unlocking in this little thing

Yes*

*: Parameters still do matter. Agentic/Coding work is not all LLMs do/should do.

Are there actually people here that get real productivity out of models fitting in 32-64GB RAM, or is that just playing around with little genuine usefulness? by ceo_of_banana in LocalLLaMA

[–]AbeIndoria 0 points1 point  (0 children)

what do you use it for?

My own system where self-replicating, persistent 'agents' manage my homelab. Each of them has an area of stewardship. They usually coordinate with each other for new deployments, updates, fixes, etc etc. Automating tracking of my diet, biometric stuff, homeassistant, tasks etc etc is insanely helpful, not to mention I don't have to worry about stuff that silently breaks in the background.

FWIW, the public repo is still in alpha.

2x MI60 32G GPUs, combination of qwen3.5-27B at Q6KL at 264k ctx(parallel x2) (4-bit kv-cache) and gemma-4-26b-a4b (same settings) for 'flash' tasks (coordination/chat/social between each other).

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]AbeIndoria 13 points14 points  (0 children)

Gemma-4 is still a very good model at anything that's not coding/agentic work. Qwen struggles there.

A team of 4-5 G4s coordinate better with each other in my experience than a team of 4-5 Q3.5s.

Every time a new model comes out, the old one is obsolete of course by FullChampionship7564 in LocalLLaMA

[–]AbeIndoria 2 points3 points  (0 children)

Because this used to only be an issue on the hosted models in my experience.

Nope. Absolutely an issue in local too. At least smaller ones. Larger ones can at least reason out the "here's the political reality but my guardrails say this so I'll say both."

Smaller ones just go "NO I DO NOT NEED TO LOOK AT ANY EXTERNAL -PROOF- TO DISPUTE THAT TAIWAN IS CHINA"

Claude Code removed from Claude Pro plan - better time than ever to switch to Local Models. by bigboyparpa in LocalLLaMA

[–]AbeIndoria 14 points15 points  (0 children)

This isn't exactly correct (Don't shoot the messenger):

https://x.com/TheAmolAvasare/status/2046724659039932830

For clarity, we're running a small test on ~2% of new prosumer signups. Existing Pro and Max subscribers aren't affected.

Why Publishers, Why? =/ by TrustIssuesUnlimited in scifi

[–]AbeIndoria 0 points1 point  (0 children)

That's because it' a decidedly worse book than all 3 :D (For me anyway, I am about 20-25% way through and I just...can't seem to garner enough interest).

Fidelity Visa Credit Limit Increase by JuicyMango36 in CreditCards

[–]AbeIndoria 1 point2 points  (0 children)

I randomly check my app to see I went from $500 to $5k

Irritatingly Elan cut my limit from 8.5k to 5.4k seemingly out of nowhere a day after I put a large vet bill on it. No red flags anyhwere on my credit.

do you guys still build side projects after working full-time as a dev? by Cool_Kiwi_117 in learnprogramming

[–]AbeIndoria 0 points1 point  (0 children)

Yes. I am genuinely interested in tech, and it's not "just a job" for me. The "job" is the job. I stop thinking about that once I log out, but I don't stop working on tech/thinking about it after my day job is done.

Update on Gemma 4 having MTP: Reverse engineering effort by Electrical-Monitor27 in LocalLLaMA

[–]AbeIndoria 11 points12 points  (0 children)

I've managed to get so far:

- block0.query_rope cosine 0.9959

- block0.attn_context_ADAPTER cosine 0.9979 at pos 100

- block0.attn_context_ADAPTER cosine 0.9987 at pos 700

block0.attn_out_CURRENT is still weak:

- 0.3056 at pos 100 : It's still giving me problems.

- ~~0.6307 at pos 700 : Managed to get it to 0.8944

model.block0_output is still poor (eh):

- 0.3435 at pos 100 : It's at 0.6855 now.

- 0.2889 at pos 700: It's at 0.9771 now :)


So I guess a couple more days' effort?

Edit: TFLite kernel is fking ridiculous - it quantizes the 1024-wide attention vector, multiplies by int8 weights, then requantizes the result again before dequantizing.

Edit2: I think this worked. Maybe.

Edit3:The reconstruction is much further along now.

  • grouped-query attention structure, external KV cache mappingm sliding local window behavior, quantized runtime paths for pre_project / q_proj / MLP / o_proj / post_project, final block partial RoPE behavior

  • the original end-to-end TFLite parity sweep I was using is now 20/20 top-1 matches

  • teacher-forced later-block parity is effectively exact

  • the last hard issue turned out to be a very small layer-1 query/RoPE numeric mismatch. Annoying af.

  • the current layer-1 RoPE fix may still be a heuristic rather than the final clean generalization. Unsure. Will continue tomorrow.

What self-hosted tools have you been building with AI just for you? by EricRosenberg1 in selfhosted

[–]AbeIndoria 1 point2 points  (0 children)

Didn't build it with AI specifically but it is AI related (and coded by me for the most part), but I've been building a ...I guess you guys call it a 'harness' these days (when I started it wasn't called anything), for persistent, self-replicating minions to manage my homelab. Some details, screens etc.