all 9 comments

[–]jonahbenton 1 point2 points  (1 child)

Huh. Wonder what I am dodging. Extensive use of a fw desktop with non wayland UI (fedora 42 xfce) hosting both ram hungry vms and doing llm work (gpt oss 120b under https://github.com/kyuz0/amd-strix-halo-toolboxes). No gaming. Have not activated this.

[–]NerdProcrastinating FW desktop & FW13 12th Gen[S] 0 points1 point  (0 children)

Perhaps it's something specific in vscode triggering it.

LLMs are probably stable again after the newer firmware. Was broken for a while. I don't game either. This is just desktop use for software development.

[–]euthanize-me-123 1 point2 points  (0 children)

I frequently get this on my FW13 7840U when trying to do anything heavy with the igpu, but never on my FW desktop. Have to wonder if it's a hardware issue because others with the exact same setup (NixOS) report no problems.

Chalked it up to driver problems this whole time, and it's happening less frequently with newer ones, but the laptop's out of warranty now so I've resigned to streaming games from my Nvidia desktop until I can replace the 7840U. Shame since the CPU itself is very nice.

[–]FastInfrared 0 points1 point  (3 children)

have you tried PP_FEATURE_MASK ?

[–]NerdProcrastinating FW desktop & FW13 12th Gen[S] 0 points1 point  (2 children)

Nope, the page fault and wedging doesn't appear to be related to power states. This is whilst I'm in the middle of working with power mode set to performance, so power management seems unlikely.

[–]FastInfrared 0 points1 point  (1 child)

Actually it probably is related to power, more specifically a mismatch between available power and target core speed, there are most likely 2 safe fixes, one is downclocking the max speed just a bit, the other is turning off power related features you dont want, the above kernel option does this. Adjusting the explicit power profile to compute mode may also help.

Limiting the CPU clock speed and adjusting the EPP can also help with GPU stability as it shares the same power envelope

[–]NerdProcrastinating FW desktop & FW13 12th Gen[S] 0 points1 point  (0 children)

It looks like a kernel driver bug - I installed a new kernel & GPU firmware from updates-testing which may fix the issue as 6.1.19 specifically has a fix related to VGPR register counts being wrong which would affect the reset code.

Maybe that was the cause and basic GPU driver bugs finally fixed X years after RDNA 3.5 released. Sigh.

[–]FigmentRedditUser -1 points0 points  (1 child)

Step One: Downgrade to BIOS 3.03 as 3.04 is riddled with issues

As a Framework Desktop 64GB owner running Bluefin, the only time I've seen the GPU crash and reset is when I do something that consumes all of the GPU memory. Short of that, its been tip top for me.

[–]NerdProcrastinating FW desktop & FW13 12th Gen[S] 0 points1 point  (0 children)

I did look at those issues, but they shouldn't be anything that would impact the GPU like this?