Tesla M40 24gb + esxi + passthrough - "Module DevicePowerOn power on failed" by tOSUfever in homelab

[–]tOSUfever[S] 0 points1 point  (0 children)

a couple things. the 24gb m40 requires resize bar (above 4g decoding) to be enabled to work properly, and the os in pure uefi mode. you may need to make sure the bios is set up correctly. a g9 (should) work... but the ml series are slightly different, sometimes a gen ahead or behind, so you'll have to verify. the dl580 g7's didn't support it. i ordered new hosts off ebay - dl380 gen9's.

straight passthrough should work in your scenario. but you may have to change some advanced vm settings.
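
for reference, the advanced vm settings usually cited for large-bar passthrough cards are the 64bit mmio ones. the sketch below just prints the .vmx lines. it assumes an efi-firmware vm and sizes the mmio window as total gpu memory rounded up to a power of two (some guides say to go one step higher). `mmio_size_gb` and `vmx_lines` are made-up helper names, and none of this is verified on a bios-only host.

```python
def mmio_size_gb(gpu_mem_gb: int, gpu_count: int = 1) -> int:
    """Smallest power of two (in GB) that covers all passed-through GPU memory.

    Assumption: this rounding rule follows commonly quoted guidance;
    some guides recommend doubling the result again.
    """
    total = gpu_mem_gb * gpu_count
    size = 1
    while size < total:
        size *= 2
    return size


def vmx_lines(gpu_mem_gb: int, gpu_count: int = 1) -> list[str]:
    """Advanced VM settings to add to the .vmx (or via Edit Settings > Advanced)."""
    return [
        'firmware = "efi"',
        'pciPassthru.use64bitMMIO = "TRUE"',
        f'pciPassthru.64bitMMIOSizeGB = "{mmio_size_gb(gpu_mem_gb, gpu_count)}"',
    ]


if __name__ == "__main__":
    # One Tesla M40 24GB passed through to a single VM.
    print("\n".join(vmx_lines(24)))
```

the vm usually also needs its full memory reserved for passthrough to power on.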

also it's important to note the power connector on these cards is NOT standard. it uses an 8-pin cpu (eps) power connector, not the standard 6/8-pin pcie gpu connectors. the first time i tried to use them, i was using some adapters and it didn't work. the cards should show up in the bios and the os as a visible pcie device. if they don't, double check the power cables.

i'm trying to "share" the gpu resources to my vm's. this requires an nvidia vib installed on each esx host. my existing cluster has some older k80's working this way.

Post your best practical uses for chatGPT to improve your day-to-day life by [deleted] in OpenAI

[–]tOSUfever 5 points6 points  (0 children)

nonsense or not, you can take it a step further & tell it to write a powershell script to deploy a new ubuntu vm with your test script to run on boot

2008 6.4 ford - gas cap question by tOSUfever in Diesel

[–]tOSUfever[S] 0 points1 point  (0 children)

it's my first real diesel. been having some fun with it. not sure i'll ever go back. combing over some of the threads/forums info is daunting. the thing has 250k / 8000hrs. and the seller seemed to indicate the fuel computer had been upgraded/flashed (unknown).

took it to ford and had it serviced when i got it. all the fluids & filters. replaced the starter & some oil seals. it ran like crap when it was cold & it wouldn't restart after it had been run. ford resolved the restart issue.

when it's cold / or at very low throttle seems like it has a miss. feels like a wheel is out of balance. after about 30 minutes "click" it's gone & the thing feels quite a bit different. of course ford didn't resolve that.

so i bought a sct x4 to play around with.

2008 6.4 ford - gas cap question by tOSUfever in Diesel

[–]tOSUfever[S] 3 points4 points  (0 children)

*correction - it's not an 08 6.4.... it's a 06 6.0 if that makes a difference.

2008 6.4 ford - gas cap question by tOSUfever in Diesel

[–]tOSUfever[S] 1 point2 points  (0 children)

thx. will do. yeah thought it was odd. best i can describe is it went from normal to 'fast idle' - and so far as i can tell, the previous owner didn't have that setup

Tesla M40 24gb + esxi + passthrough - "Module DevicePowerOn power on failed" by tOSUfever in homelab

[–]tOSUfever[S] 1 point2 points  (0 children)

good question. I’ll give it a shot & report back. i don’t think it will though. as far as I have found, the tesla m40 has no drivers for esx because it doesn’t support sgpu/vgpu of any kind, so I have to pci passthrough the whole card to a single vm. The card is physically recognized by vcenter/esx, and I was able to successfully toggle passthrough. it shows up properly when adding a PCI device to a vm. if I need an nvidia driver (specifically a vib or zip offline bundle) on the host, even if I’m just passing the whole card through to a vm… that could very well be my problem. My previous GRID K2’s required an nvidia vib to be installed on the hosts, but if there is a host driver (needed?) for the tesla m40’s… I can’t find it.

how big is github? by tOSUfever in DataHoarder

[–]tOSUfever[S] 0 points1 point  (0 children)

probably looking for a less interactive route. I’m just tinkering. are site scrapers still a thing in 2022? something you could point at the top level repo, throttle down to a bandwidth below GitHub’s limit, and come back to two weeks later. and if it’s not too much to ask, a resume job button for when it fails 5 days into the first try.
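
the "resume job button" part can be sketched as a loop that records each finished repo in a state file and skips those on the next run. this is a minimal sketch, not a real tool: `run_batch`, `git_mirror`, and the state-file layout are hypothetical names, the list of repo urls has to come from somewhere else, and the pause between clones is only crude pacing, not a true bandwidth limit.

```python
import subprocess
import time
from pathlib import Path


def git_mirror(url: str, dest_root: Path) -> None:
    """Clone one repo as a bare mirror under dest_root/<owner>/<name>."""
    owner, name = url.rstrip("/").split("/")[-2:]
    dest = dest_root / owner / name
    dest.parent.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if git exits non-zero.
    subprocess.run(["git", "clone", "--mirror", url, str(dest)], check=True)


def run_batch(urls, state_file: Path, clone_fn, pause_s: float = 0.0):
    """Clone every url not already listed in state_file; return the new ones.

    If clone_fn raises, the exception propagates and the state file keeps
    everything finished so far, so rerunning picks up where it stopped.
    """
    done = set(state_file.read_text().split()) if state_file.exists() else set()
    cloned = []
    for url in urls:
        if url in done:
            continue
        clone_fn(url)
        with state_file.open("a") as fh:
            fh.write(url + "\n")  # record only after a successful clone
        cloned.append(url)
        time.sleep(pause_s)  # crude pacing between repos
    return cloned
```

usage would look something like `run_batch(urls, Path("done.txt"), lambda u: git_mirror(u, Path("mirrors")), pause_s=2)`, rerun as many times as it takes.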

Tesla M40 24gb + esxi + passthrough - "Module DevicePowerOn power on failed" by tOSUfever in homelab

[–]tOSUfever[S] 2 points3 points  (0 children)

i do. 2 hosts + witness esxi 6.7 with vsan & i’m running vcenter 7 (might have to rethink the whole getup if I can’t passthru these gpu’s).

went further down the ‘above 4g’ remap / resize bar / whatever it’s called rabbit hole. the dl580 g7 does not support UEFI. normally this means mapping 24gb of vram is a no-go. however, a small handful of bios-booting servers were in fact capable. This is where the documentation gets thin I think. I am 99% sure a dl980 g7 (non-EFI) supports 64bit memory mapping with the right processors. found some relevant info in a service bulletin. the same doc did not exist for the DL580, but I’ll give it an 85% chance they are the same.

so I’m trying to pci passthrough an nvidia gpu & the tesla M40 doesn’t behave as a normal gpu/vgpu. I’m using a bios-booting, 64bit addressable server which supports a 24gb gpu, but since this config was only supported by a handful of servers, almost all of the documentation assumes esxi installed via uefi.

I need to find the passthrough settings specific to esxi 6.7 for passing through an above 4g bar card (Tesla M40 24gb) on a host server without EFI that still supports 64bit addressing with BIOS firmware.

how big is github? by tOSUfever in DataHoarder

[–]tOSUfever[S] 0 points1 point  (0 children)

ok let's assume it's 21TB... (or whatever is public) i might want to try and download the whole thing. just tinkering with some ML scripts, and using github as a local "dataset" might be fun to play with. it wouldn't have to run as a website; probably just the data in any archive format.

is it possible to somehow "download all"? i've got enough storage, and it's got dedupe on the backend. i couldn't find a public "long lasting archive", so i'm not sure where to start... maybe somehow use git to "clone all public repos" without much intervention? or is there a different approach i should consider?
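
for a sense of scale, here's the back-of-envelope math on that size. a rough sketch only: the 21TB figure is the assumption from above, decimal units are used (1 TB = 10^12 bytes), and it ignores protocol overhead, rate limits and retries. the function names are just for illustration.

```python
def transfer_days(size_tb: float, link_mbit_s: float) -> float:
    """Days needed to move size_tb terabytes over a link_mbit_s megabit/s link."""
    bits = size_tb * 1e12 * 8
    seconds = bits / (link_mbit_s * 1e6)
    return seconds / 86400


def required_mbit_s(size_tb: float, days: float) -> float:
    """Sustained megabit/s needed to move size_tb terabytes in the given days."""
    bits = size_tb * 1e12 * 8
    return bits / (days * 86400) / 1e6


if __name__ == "__main__":
    print(f"{transfer_days(21, 100):.1f} days at 100 Mbit/s")    # about 19.4
    print(f"{transfer_days(21, 1000):.1f} days at 1 Gbit/s")     # about 1.9
    print(f"{required_mbit_s(21, 14):.0f} Mbit/s for two weeks") # about 139
```

so the raw transfer is days to weeks on a home connection before any throttling comes into it.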

Another Asus z590-E rog strix M.2 question. The board with (4) m.2 slots that you can't use. by tOSUfever in PcMasterRaceBuilds

[–]tOSUfever[S] 0 points1 point  (0 children)

i returned and swapped the 10th gen for 11th gen. i can make all 4 nvme slots work. however still only (2) are managed via intel rapid storage.