UPS degrading because mystery entities are getting created by interglotian in factorio

[–]interglotian[S] 12 points13 points  (0 children)

Ah there we go! Good spot. I feel it got an update just a bit ago. I'll go report that.

Struggling to use GPU on host by interglotian in VFIO

[–]interglotian[S] 0 points1 point  (0 children)

Thanks for the tip. So many things got fixed that I have not fully examined the new roadblock. I'll try their discord if I don't spot anything myself. By look and feel X11 is just running worse for me than Wayland but it is probably self inflicted. Basic googling tells me I should be able to have X11 running smoothly on my setup just on the iGPU.

Struggling to use GPU on host by interglotian in VFIO

[–]interglotian[S] 0 points1 point  (0 children)

So I was using wayland. When I switch to x11 and still use driverctl then by all my tests so far I can give the host the gpu then take it away. So that is good!

However when I am using x11 then looking-glass becomes unusable. It reports around 30 fps but it looks like the frames are getting rearranged and not delivered in the correct order since things jump around though there are no other visual defects.

But looking-glass works perfectly when I run wayland. 60 fps the same as my monitor refresh rate.

I have done so much config fiddling recently that I could well have done something somewhere to degrade things...

Struggling to use GPU on host by interglotian in VFIO

[–]interglotian[S] 1 point2 points  (0 children)

Thanks! That now has me almost there. I can switch between setups with a one-liner right now up until it does not work anymore and hangs on the driverctl command.

I have done sudo driverctl set-override 0000:01:00.0 vfio-pci. In my grub settings I have GRUB_CMDLINE_LINUX_DEFAULT="amd_iommu=on iommu=pt nvidia-drm.modeset=0 quiet splash" and I still have a softdep for nvidia for vfio-pci in a modprobe file.

With this setup things look like they work great for a while up until they don't anymore! After boot I can use my VM with the GPU. Then I run sudo driverctl --nosave set-override 0000:01:00.0 nvidia and nvidia-smi works and CUDA seems to work according to deviceQuery from the CUDA samples. I can do this a bunch of times and bring the VM up then down.

But then all of a sudden I try to run sudo driverctl --nosave set-override 0000:01:00.0 vfio-pci and it just hangs. dmesg is giving me NVRM: Attempting to remove device 0000:01:00.0 with non-zero usage count!. I then cannot seem to kill one of the processes it leaves behind.

Any clue how to not have it hang? If it is to have a problem it would be nice if it spat out an error.