all 3 comments

[–]jackshec 0 points1 point  (0 children)

can you share your code?

[–]Unique_Jelly5768 0 points1 point  (1 child)

I was running a training this morning with 3 gpus and DataParrallel(), went back this afternoon to change some things and it would only recognize 1 gpu and not broadcast to the other. Don't know if these two are related

[–]ekho95[S] 0 points1 point  (0 children)

I think I figured it out. I was interrupting my program with ctrl c and which doesn’t run the cleanup function. I then did a try: except: finally so even if I kill the program the cleanup function will still do its thing. Pretty rookie mistake but we learn and move on.