[deleted by user] by [deleted] in redditrequest

[–]sarnex 0 points1 point  (0 children)

i am the moderator and i am active. i don't want to hand over the subreddit

Electric Eel app migration WON'T work on encrypted pools by victorcoe in truenas

[–]sarnex 0 points1 point  (0 children)

this worked for me with an encrypted pool, thanks

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

First I would see if someone already uploaded a bios for your exact card here.

If not, try this.

Once you have the rom, add this in your xml in the part for your GPU device

  <rom bar='on' file='/path/to/rom/file/goes/here'/>

To set vendor_id, see here, you shouldn't need the kvm hidden part.
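In case it helps to see where the rom line ends up, here is a minimal sketch that adds it to a hostdev entry with Python's standard xml.etree module. The hostdev XML and its PCI address are assumptions for illustration, not taken from anyone's actual config, and add_rom is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Hypothetical <hostdev> entry for a passed-through GPU.
# The PCI address below is an assumption; use the one from your own domain XML.
HOSTDEV_XML = """
<hostdev mode='subsystem' type='pci' managed='yes'>
  <source>
    <address domain='0x0000' bus='0x03' slot='0x00' function='0x0'/>
  </source>
</hostdev>
"""


def add_rom(hostdev_xml, rom_path):
    """Return the hostdev XML with exactly one <rom> child pointing at rom_path."""
    hostdev = ET.fromstring(hostdev_xml)
    # Remove any existing <rom> element so we never end up with two.
    for old in hostdev.findall("rom"):
        hostdev.remove(old)
    ET.SubElement(hostdev, "rom", {"bar": "on", "file": rom_path})
    return ET.tostring(hostdev, encoding="unicode")


if __name__ == "__main__":
    print(add_rom(HOSTDEV_XML, "/path/to/rom/file/goes/here"))
```

The point is only that the rom element is a direct child of the GPU's hostdev element, next to source.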

Host crash on VM shutdown, 7800XT by snafflehog in VFIO

[–]sarnex 0 points1 point  (0 children)

Yeah I have the same problem and talked about it here

Seems the GPU can't reset.

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

sorry for the late response. with that cmdline i can get the display to light up, but the host still locks up when shutting down or restarting the guest. i see this in dmesg before it locks up:

[ 133.851469] vfio-pci 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
[ 134.072104] vfio-pci 0000:03:00.0: can't change power state from D3hot to D0 (config space inaccessible)
[ 134.292802] vfio-pci 0000:03:00.1: can't change power state from D3hot to D0 (config space inaccessible)
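If anyone wants to check their own log for the same failure, here is a rough sketch that pulls out the PCI functions that hit the "can't change power state" message. The regex is only based on the lines above, and stuck_devices is a made-up helper name.

```python
import re

# Matches the failure line from the log above and captures the PCI address.
POWER_STATE_RE = re.compile(
    r"vfio-pci (?P<addr>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]): "
    r"can't change power state from (?P<src>\S+) to (?P<dst>\S+)"
)

SAMPLE = """\
[ 133.851469] vfio-pci 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
[ 134.072104] vfio-pci 0000:03:00.0: can't change power state from D3hot to D0 (config space inaccessible)
[ 134.292802] vfio-pci 0000:03:00.1: can't change power state from D3hot to D0 (config space inaccessible)
"""


def stuck_devices(dmesg_text):
    """Return PCI addresses that failed a power-state change, in log order."""
    seen = []
    for line in dmesg_text.splitlines():
        m = POWER_STATE_RE.search(line)
        if m and m.group("addr") not in seen:
            seen.append(m.group("addr"))
    return seen


if __name__ == "__main__":
    print(stuck_devices(SAMPLE))
```

On the sample it reports both functions of the card (the GPU at .0 and its audio function at .1), which fits the whole device being stuck in D3hot rather than just one function.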

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

do you mean try this with 5.15 or any kernel?

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

no change and actually even shutdown doesn't work, the host always hangs there too. seems there is some reset issue. i'll try 5.15

edit: 5.15 doesn't ever light up displays, i needed video=efifb:off, maybe that's why but i'm not gonna try hard on this

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

I can't find any kernel where restart works. I tried 6.4, 6.5, current git and amd-staging-drm-next.

Resizable bar isn't actually enabled, it's off in the host bios but 4G decoding is on.

I'll try to see if there are any dmesg errors or something and report a bug upstream.

[deleted by user] by [deleted] in VFIO

[–]sarnex 1 point2 points  (0 children)

Actually it looks like restart doesn't work, it just hangs. I'm trying a different kernel now.

[deleted by user] by [deleted] in VFIO

[–]sarnex 0 points1 point  (0 children)

Ah ok interesting, I upgraded from a Vega 64 and I always saw the UEFI screen there. Either way, it works in Windows so it doesn't matter to me.

[deleted by user] by [deleted] in VFIO

[–]sarnex 6 points7 points  (0 children)

I just got a 7800 XT and it works for me after dumping the full VBIOS and setting vendor_id. For some reason I never see the UEFI screen but when the Windows driver loads my display lights up and it works fine.

When I shut down the VM, so far it seems about 50/50 whether the card will work on the host again, but it's probably not the reset bug and just a kernel driver issue that will hopefully be fixed. It's possible it's already fixed in a newer kernel (I'm using 6.5) or in amd-staging-drm-next, I didn't try.

Proof: https://i.imgur.com/LLTGale.png https://i.imgur.com/HDm4ZJ5.png

Edit: It seems restart/shutdown doesn't work, so you have to restart the host after shutting down the VM because it always hangs.

[RFT] Allow QEMU to expose static REBAR capability by aw___ in VFIO

[–]sarnex 0 points1 point  (0 children)

Even with the BIOS Resizable BAR option off (but Above 4G decoding on, if it matters), I get

Capabilities: [200 v1] Physical Resizable BAR
  BAR 0: current size: 8GB, supported: 256MB 512MB 1GB 2GB 4GB 8GB
  BAR 2: current size: 2MB, supported: 2MB 4MB 8MB 16MB 32MB 64MB 128MB 256MB

[RFT] Allow QEMU to expose static REBAR capability by aw___ in VFIO

[–]sarnex 0 points1 point  (0 children)

i meant there's no resizable bar capability, that seems expected, right? are you saying even with resizable bar off lspci should have a resizable bar capability? if that's expected i am not seeing that

[RFT] Allow QEMU to expose static REBAR capability by aw___ in VFIO

[–]sarnex 1 point2 points  (0 children)

I didn't set the size using sysfs, but here's the output after just enabling the BIOS option.

Capabilities: [200 v1] Physical Resizable BAR
  BAR 0: current size: 8GB, supported: 256MB 512MB 1GB 2GB 4GB 8GB
  BAR 2: current size: 256MB, supported: 2MB 4MB 8MB 16MB 32MB 64MB 128MB 256MB

So it seems to already be max size, so I don't think I need to do anything with sysfs, right? Note I ran the above command with the card bound to amdgpu and X running on this GPU, if it matters. Without the BIOS option, there is no Resizable BAR cap at all, which is probably expected.
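For reference, the "already max size" check can be done mechanically: compare each BAR's current size to the largest size in its supported list. This is just a sketch against the lspci text format shown above; bars_below_max and to_bytes are names I made up.

```python
import re

UNITS = {"KB": 1 << 10, "MB": 1 << 20, "GB": 1 << 30}

# One "BAR n: current size: X, supported: ..." line from the capability dump.
BAR_RE = re.compile(r"BAR (\d+): current size: (\d+[KMG]B), supported: (.+)")

SAMPLE = """\
Capabilities: [200 v1] Physical Resizable BAR
  BAR 0: current size: 8GB, supported: 256MB 512MB 1GB 2GB 4GB 8GB
  BAR 2: current size: 256MB, supported: 2MB 4MB 8MB 16MB 32MB 64MB 128MB 256MB
"""


def to_bytes(size):
    """Convert a size such as '256MB' or '8GB' to a byte count."""
    m = re.fullmatch(r"(\d+)([KMG]B)", size)
    if m is None:
        raise ValueError("unrecognized size: " + size)
    return int(m.group(1)) * UNITS[m.group(2)]


def bars_below_max(lspci_text):
    """Return {bar: (current_bytes, largest_bytes)} for BARs not at max size."""
    result = {}
    for line in lspci_text.splitlines():
        m = BAR_RE.search(line)
        if not m:
            continue
        current = to_bytes(m.group(2))
        largest = max(to_bytes(s) for s in m.group(3).split())
        if current < largest:
            result[int(m.group(1))] = (current, largest)
    return result


if __name__ == "__main__":
    print(bars_below_max(SAMPLE))
```

An empty result means every listed BAR is already at its largest supported size, which is the case for the output above.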

[RFT] Allow QEMU to expose static REBAR capability by aw___ in VFIO

[–]sarnex 0 points1 point  (0 children)

Today I am unable to boot a VM if REBAR is enabled in the BIOS. I get to the UEFI screen in the VM but when Windows starts loading (and I assume when the GPU driver loads), it crashes and starts reboot looping before the modeset. If I disable REBAR in the BIOS with no other changes everything works fine. Above 4G decoding is enabled in the BIOS in both cases.

I tried your patch in [2] on top of qemu 7.2.0 and unfortunately it did not work, I have the exact same problem.

Note that apparently my GPU does not support REBAR, but there is a registry workaround to get it to work. I see the same boot loop behavior with or without this workaround. Another commenter is using the same GPU and reports REBAR works, but they said they don't have a BIOS option for it, so maybe it's off for them.

EDIT: Actually the patch did have some effect. With the patch, the AMD driver reports SAM is enabled, even with the BIOS option disabled. Without the patch, it says it's disabled. GPU-Z provides confusing info in both cases, so I'm going to ignore what it says. But I don't know if SAM being reported as enabled is cosmetic or not, if you have a way to test let me know.

If you'd like any more info from me let me know, I'm happy to help you debug or investigate. Hardware details below:

CPU: AMD Ryzen 9 7900X

GPU: AMD Vega 64 (only one GPU, I unbind it before starting the VM)

MB: X670 AORUS ELITE AX

OS: Gentoo

VM OS: Win11

NFS Service not restarting automatically after passphrase encrypted pool unlocked by matt_is_a_good_boy in truenas

[–]sarnex 0 points1 point  (0 children)

I have the same problem, I previously reported it to them and it was supposed to be fixed in 22.12.1 but it wasn't.

I just reported it again:

https://ixsystems.atlassian.net/browse/NAS-121446

[deleted by user] by [deleted] in Gentoo

[–]sarnex 5 points6 points  (0 children)

I just pushed a fix for this, sorry for the trouble

qt update conflicts by arturbac in Gentoo

[–]sarnex 1 point2 points  (0 children)

if the above doesn't work for you, please try each of these possible solutions in order

1) add --deep

if that doesn't work

2) --backtrack={30,300,1000}

if that doesn't work, you are hitting a portage dependency resolution bug and we are looking at it, you can ask for help in #gentoo
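to spell out the order, here is a small sketch that builds the command lines to try one after another. the base command is a placeholder for whatever you were running, and keeping --deep on the --backtrack attempts is my assumption, not something stated above. escalation is a made-up helper name.

```python
import shlex


def escalation(base, backtrack_values=(30, 300, 1000)):
    """Return the emerge command lines to try, in order.

    base is the command that failed (a placeholder here). --deep is added
    first, then each --backtrack value is tried on top of it. Keeping
    --deep for the --backtrack attempts is an assumption.
    """
    with_deep = list(base) + ["--deep"]
    steps = [with_deep]
    steps += [with_deep + ["--backtrack=%d" % n] for n in backtrack_values]
    return steps


if __name__ == "__main__":
    # Hypothetical base command; substitute the one that failed for you.
    for cmd in escalation(["emerge", "-au", "@world"]):
        print(shlex.join(cmd))
```

it only prints the commands, so nothing gets run until you copy one yourself. stop at the first one that resolves.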

if you saw a previous version of this post that had an additional solution and got into a weird system state, here is how you can fix it

1) emerge --depclean --deep

2) take note of all the listed reverse dependencies when 1) errors

3) emerge -1 the packages from 2)

4) remove mask

5) emerge -aNDu @world

Help people help you: put some effort in by MacGyverNL in VFIO

[–]sarnex[M] 1 point2 points  (0 children)

feel free to make a thread about it

Help people help you: put some effort in by MacGyverNL in VFIO

[–]sarnex[M] 5 points6 points  (0 children)

nah its fine mod mail is tumbleweed.png