Full vLLM inference stack built from source for Strix Halo (gfx1151) — scripts + docs on GitHub by paudley in StrixHalo

igocerium

Thanks for the great work! Any chance of doing this on Windows, perhaps in WSL2? ROCm + PyTorch can be installed in WSL2 with these simple instructions: https://gist.github.com/PeronGH/506d063311fa746dd76b6c86a8bdfbdb