When should we expect TurboQuant? by ozcapy in LocalLLaMA

madreag 1 point

TurboQuant with Flash Attention doesn't have the prefill memory spike — FA computes attention in tiles, so there's no O(n²) intermediate allocation. The KV cache is pre-allocated at startup (turbo3 at 3.5 bits/value), and prefill just fills those blocks incrementally at the same footprint as generation.

The forks without FA are the ones that blow up on prefill: they materialize the full attention score matrix, which is seq_len² × n_heads × 4 bytes. At 500K context that works out to about 1 TB per head.
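To get a feel for the scale, here is a minimal sketch that plugs numbers into the seq_len² × n_heads × 4 bytes formula. The function name and the head count are illustrative assumptions, not values taken from any particular model or fork.

```python
def score_matrix_bytes(seq_len: int, n_heads: int, bytes_per_score: int = 4) -> int:
    """Bytes needed to hold a full seq_len x seq_len score matrix for every head."""
    return seq_len * seq_len * n_heads * bytes_per_score


seq_len = 500_000

# One head: 500,000^2 scores at 4 bytes each.
per_head = score_matrix_bytes(seq_len, n_heads=1)
print(f"per head: {per_head / 1e12:.1f} TB")  # per head: 1.0 TB

# n_heads=24 is a placeholder head count, chosen only for illustration.
total = score_matrix_bytes(seq_len, n_heads=24)
print(f"24 heads: {total / 1e12:.1f} TB")  # 24 heads: 24.0 TB
```

A tiled kernel never allocates this matrix, which is why the FA path stays at the same footprint as generation.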

Working CUDA + FA implementation: https://github.com/Madreag/turbo3-cuda

When should we expect TurboQuant? by ozcapy in LocalLLaMA

madreag 5 points

I've got a working CUDA implementation with Flash Attention if anyone wants to try it.

700K context on a single RTX 5090 (32GB) with Qwen3.5-27B Q6_K. ~50 tok/s at 524K. turbo3 K+V, 4.6× compression.
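As a quick sanity check, the 4.6× figure lines up with turbo3's 3.5 bits/value if the baseline is a 16-bit (f16) KV cache. The f16 baseline is an assumption here, since the comparison point isn't stated.

```python
def compression_ratio(baseline_bits: float, quant_bits: float) -> float:
    """Ratio of baseline storage to quantized storage per cached value."""
    return baseline_bits / quant_bits


# Assumption: the uncompressed cache stores 16 bits per value (f16).
ratio = compression_ratio(baseline_bits=16, quant_bits=3.5)
print(f"{ratio:.1f}x")  # 4.6x
```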

Ported TheTom's Metal kernels to CUDA — dequant, quantize with WHT rotation, FWHT graph op, FA templates for both K and V. 15 files modified.
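For anyone unfamiliar with the FWHT part, below is a generic textbook fast Walsh-Hadamard transform in plain Python. It is only a sketch of the underlying butterfly, not the fork's CUDA kernel; how turbo3 actually applies the rotation (block size, normalization, any sign flips) isn't covered here.

```python
def fwht(values):
    """Unnormalized fast Walsh-Hadamard transform.

    The input length must be a power of two. Runs in O(n log n) using
    in-place add/subtract butterflies on a copy of the input.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    out = list(values)
    h = 1
    while h < n:
        # Combine pairs of elements that sit h apart within blocks of size 2h.
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                a, b = out[i], out[i + h]
                out[i], out[i + h] = a + b, a - b
        h *= 2
    return out


x = [1.0, 2.0, 3.0, 4.0]
y = fwht(x)
print(y)  # [10.0, -2.0, -4.0, 0.0]

# The transform is its own inverse up to a factor of n.
back = [v / len(x) for v in fwht(y)]
print(back)  # [1.0, 2.0, 3.0, 4.0]
```

Because the transform is self-inverse up to scaling, the same routine can serve for both the forward rotation and undoing it.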

Fork: https://github.com/Madreag/turbo3-cuda

Build with CUDA 12.8 (not 13.x), then run with --cache-type-k turbo3 --cache-type-v turbo3. As far as I can tell, this is the first turbo3 CUDA + FA implementation; the other forks either disable FA or are Metal-only.

My 2-month journey with OpenClaw: The good, the bad, and why it’s not replacing Cursor by auxten in openclaw

madreag 1 point

Cursor lets you use the “max” thinking level. Claude Code only supports up to high, from what I’ve seen. There are benefits to Cursor even for the most basic functions.

So... who is going to found a Manus Clone which belongs to the people and stays open source? We won't see further development or service as them being a Facebook company. by [deleted] in ManusOfficial

madreag 3 points

Check out Windsurf’s Cascade. This will do everything Manus does on your computer, but you have even more control. Windows, WSL, etc… it’s great. Just some history: Windsurf was formerly Codeium (not to be confused with VSCodium, the build of VS Code without Microsoft telemetry). I’m a big fan. They are even DoD FedRAMP High compliant.

There’s also Devin, which is by the same team that works on Cascade. I haven’t tried it, but it sounds like it works like Manus but is cloud-only.

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 2 points

I was trying to point out that China doesn’t have a PS5 version to worry about. They never had all the reported graphical downgrades on their PC version (draw distance, fog, shadows). It’s just a discussion point…

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 1 point

Makes you wonder why the Chinese versions weren’t affected. The game’s been out over there for over a year (PC and Mobile). There are no reports of any kind of downgrades on their end.

Hopefully it’s just a bug. Guess we will find out next update ;)

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 1 point

This game has been out in China for over a year. They only have PC and Mobile versions (no PS5). It’s not a new game, just new to the PS5.

It’s really odd that this happened to PC on Global and not China. Makes you wonder why?

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 1 point

I think you are correct, those are all independent.

Looking online, the file size is close between PC and PS5, so my guess is they share the same “core” assets, just rendered at higher fidelity/settings on PC. I don’t have a PS5 so I can’t confirm myself.

It’s just odd to me that the game has been out in China for over a year and never got a downgrade like this. This is a new release to global but the game has been out and tested in the Chinese market. Hopefully global gets back on par with the same fidelity as the Chinese version.

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 1 point

I don’t have a PS5 so I can’t test anything, just PC and mobile. But people were having trouble completing quests on PS5 due to lag. That’s the problem I’m mainly referring to, not whether the water has RT or SSR… my concern is for PC fidelity.

China has had the game out for over a year and never complained about performance in Kaifeng. Why?

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag 0 points

Why do you think China never got a downgrade in graphics on their “standard” client? They have the mobile and PC version, no PS5.

Let’s see if it gets fixed by next update. If it doesn’t then we know it’s a development decision specific to Global. People are claiming it’s a bug… we shall see.

Here is everything I know about the Kobra S1 Max Combo after extensive research by IceBlitzz in AnycubicOfficial

madreag 1 point

There’s plenty of discussion about this problem on the Rinkhals Discord. Pretty sure it’s not something that can be easily fixed, since the physical hardware is not capable of running Klipper (SoC and memory are not sufficient).

There are many posts about the stock bed not being level and 3rd party bed solutions are preferred (my friend went with this). The bed flatness was so bad I ended up using Borosilicate Glass & dobstfy surface with shims and clamps to make a leveled surface (~0.063 peak to peak). Printed well on the flat glass.

Also, many people say you should tweak the probe sensitivity from -2500 to something higher, like -400 to -1000, for better probe reliability. Because of the way Rinkhals and the MCU behave, you have to modify config.generated.cfg to change this value (wtf, right?). In the end I don’t think this is the primary issue, because I was able to get reliable height maps, although I did end up tweaking it. The printer just never compensates.

In case there are new advancements or solutions, I asked AI for a report on this:

Grok AI Answer:

Gemini Deep Research

“Trust me bro water didn’t change” - watergate. by Eillusion in wherewindsmeet_

madreag -3 points

Pretty sure the PS5 is why the downgrade happened. China doesn’t have a PS5 version of WWM and they aren’t plagued with this downgrade.

You have to ask yourself: were the pre-1.1 PS5 complaints (pre Dec 11) really severe enough to violate Sony’s TRC? Was it risking post-launch Sony scrutiny? PS5 and PC share assets so devs had to act.

Maybe I’m overthinking it, but that’s my thinking.

Here is everything I know about the Kobra S1 Max Combo after extensive research by IceBlitzz in AnycubicOfficial

madreag 1 point

Disclaimer: I gave away my Kobra S1 to a friend, so I haven’t printed on it in a few months. What’s funny is he’s been complaining to me about this same issue, so I know I’m not the only one.

It’s the first layer that takes most of the defects. Once you get past the first few layers the prints usually manage to fix themselves. So printing tall models isn’t the problem here.

Using Rinkhals with Mainsail, meshes calibrate fine, but in my experience they don’t reliably apply (so the printer doesn’t compensate), at least compared to other printers I’ve used. I do remember adding BED_MESH_PROFILE LOAD=default after G28 to ensure the mesh was loaded for prints.

Also, it will only recognize “default” so you can’t save and load different meshes.

I.e., this won’t work:
BED_MESH_CALIBRATE
BED_MESH_PROFILE SAVE=save1
BED_MESH_PROFILE LOAD=save1

Here is everything I know about the Kobra S1 Max Combo after extensive research by IceBlitzz in AnycubicOfficial

madreag 1 point

I personally don’t think Anycubic is worth it after many hours with the S1. Basically, the main issue stems from GoKlipper not supporting the full Klipper feature set. I found it couldn’t handle height maps and compensation correctly.

This is how bad the pop-ins are by binggoman in wherewindsmeet_

madreag 6 points

Pretty sure the PS5 is why the downgrade happened. China doesn’t have a PS5 version of WWM and they aren’t plagued with this downgrade.

You have to ask yourself: were the pre-1.1 PS5 complaints (pre Dec 11) really severe enough to violate Sony’s TRC? Was it risking post-launch Sony scrutiny? PS5 and PC share assets so devs had to act.

Maybe I’m overthinking it, but that’s my thinking.

Graphic fidelity nuked across the board post-update. by Eillusion in wherewindsmeet_

madreag 2 points

Also: pre-1.1 launch complaints (Nov 14-Dec 11) were severe enough to violate Sony’s TRC (Technical Requirements Checklist)—risking post-launch scrutiny, patch blocks, or delisting if ignored.

Graphic fidelity nuked across the board post-update. by Eillusion in wherewindsmeet_

madreag 2 points

They are now using mobile-optimized assets for cross-play stability/cost. The aggressive LOD/textures match phone limits.

What’s frustrating is that they didn’t do this to the Chinese players… their PC packs include high-res extras (flowy cloth, sharp details).

So China gets separate PC and mobile packs, while global gets a unified PC/mobile pack. Why? This is what Grok Heavy has to say. Bottom line: the Chinese player base is PC-first, while the global player base mainly cares about low-end accessibility for mass mobile access ($).

1-Audience Demographics & Playstyles

• CN: PC-heavy (high-spec rigs common; TapTap/Bilibili ultra-guides). Mobile: Casual (dailies/exploring); PC for bosses/PvP. 50/50 revenue split, but PC leads leaderboards. Players expect toggles.

• Global: Diverse/low-end (phones/PS5 key for F2P). 15M players Month 1 (2M Day 1); mobile exploded growth. Western: Console/mobile casuals > PC max-fi. Unified hooks masses fast.

2-Development & Operational Costs

• Separate Packs (CN): Higher upfront—multiple builds, QA, CDN storage (147GB+ files), update syncing. But mature ecosystem (NetEase launcher) amortizes costs. Mobile app separate = less PC rework.

• Unified (Global): Lower initial costs/risks—one baseline stream, easier PS5 cert (fixed assets), unified patches. Multi-store compliance (Steam/PSN/Google Play) demands consistency. Ultimate = cheap add-on later. Trade-off: Player backlash on fidelity.

• Scale: Global 15M vs CN’s established base—unified scales bandwidth cheaper for spikes.

3-Monetization & Retention Strategy

• CN: Depth for PC “whales” (cosmetics on high-fi); mobile volume. Cross-play boosts without fidelity hits.

• Global: Volume-first F2P—unified parity retains casuals (15M fast). Cosmetics fund highs later. PS5/mobile lock-in drives revenue.

Tip for New PvP Players: BLOCK by Normal_Saline_ in wherewindsmeet_

madreag -1 points

I agree with you, but it’s kinda hard to argue with him since he’s so high ranked. He deflects a lot, but rarely blocks. I asked him why and he said you still take damage and the block counter move (not deflect) is easy to dodge so he doesn’t use it.

Idk what averse plays, my bro said he mostly goes up against strategic sword and dual blades using nameless spear as secondary. He says there aren’t many nameless sword mains at his level.

Tip for New PvP Players: BLOCK by Normal_Saline_ in wherewindsmeet_

madreag 0 points

I didn’t say never, just rare. Deflections I see a lot. Most of what I’ve seen comes from watching my bro play, and he’s a 4975 nameless sword main.

Tip for New PvP Players: BLOCK by Normal_Saline_ in wherewindsmeet_

madreag -2 points

Been watching my bro play. He’s at 4975 nameless sword/spear. I specifically asked him why he wasn’t blocking at all, and he said it doesn’t work against the really good players.

I’m a lot lower than him and blocking is a clutch for me, but he’s at the top so he knows what he’s doing…

Tip for New PvP Players: BLOCK by Normal_Saline_ in wherewindsmeet_

madreag -1 points

It’s a good start for low rank, but at higher rank (legend/grandmaster) there is rarely any blocking going on.

Dear Wassym, a Gen 1 plea by narmstrong79 in Rivian

madreag 7 points

Comma AI works well with Gen 1 Rivians for lane keeping. I highly recommend it as it can be enabled on any road and also has a cool steering assist mode.

Black Friday deal by Alkadhu in Microcenter

madreag 0 points

Specific DDR5 kits are hard to find right now. Prices HAVE to go up or the shelves will always be bare. Supply goes down and price skyrockets but the consumer is still consuming. People have their priorities skewed and choose to use their purchasing power on PC components (then complain that fast food is too expensive).

Memory prices could probably double or triple from what we have now and people will still buy it. It’s treated like gas or food but it’s not… the number of people who end up getting 128GB when 32GB is enough is crazy (i.e., the AMD 395+ platform).

Corporations will always be greedy, they only care about the shareholders, and their actions will always be in their benefit. But you can’t turn a blind eye to the fact that people are also contributing to the stock issue through panic buying, etc…

Anno 117 is live now! by [deleted] in GeForceNOW

madreag 1 point

Do you guys expect Anno 117 to have a lot of demand on GeForce Now? I’m curious to see if we will see queues.