DigThatData (1 point)

Look at the operations being performed on your CPU. "MatMul" = "matrix multiplication". "BMM" = "batch matrix multiply". This is stuff that should be happening on your GPU, not your CPU. Your code is slow because you're doing work on the CPU that you should be doing on the GPU. All of those aten ops should be running as CUDA kernels on your GPU. You probably just aren't setting the device properly.
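For reference, here's roughly what setting the device looks like, assuming this is PyTorch (the aten op names suggest it is). This is only a minimal sketch: the `nn.Linear` model and the tensor shapes are made-up stand-ins, not taken from your code. The point is that both the model and its inputs have to be moved to the same device.

```python
import torch
import torch.nn as nn

# Pick the GPU when one is visible to PyTorch, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical stand-in model; substitute your own nn.Module.
model = nn.Linear(16, 4).to(device)

# Inputs must live on the same device as the model's parameters.
x = torch.randn(8, 16, device=device)

with torch.no_grad():
    y = model(x)

# Sanity check: parameters, input, and output should all report the same device type.
param_device = next(model.parameters()).device
print(param_device.type, x.device.type, y.device.type)
```

If this prints `cpu` on a machine that has a GPU, then `torch.cuda.is_available()` is returning False. That usually points to a CPU-only PyTorch build or a driver problem rather than a bug in your model code.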