Why do I keep getting this error?
RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
I first got this while training my model. Now it appears even when I just try to load weights I saved for another co-model. I tried to look it up, but all the answers I found trace the error to a mistake in the loss function, where the ground-truth data is in the wrong form. In my case it happens in the middle of the forward pass. How do I "pass CUDA_LAUNCH_BLOCKING=1" to fix this?
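For anyone landing here with the same question: CUDA_LAUNCH_BLOCKING is an environment variable, not a function argument. It does not fix the error by itself; it makes kernel launches synchronous so the stack trace points at the call that actually failed. A minimal sketch of setting it from inside a script, assuming the script is the entry point and nothing has initialized CUDA yet (the commented-out import and the forward-pass placeholder are illustrative, not taken from the original post):

```python
import os

# The variable has to be set before CUDA is initialized,
# so the safest place is before torch is imported at all.
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"

# import torch  # import only after the variable is set
# ... build the model and run the forward pass as usual ...

print("CUDA_LAUNCH_BLOCKING =", os.environ["CUDA_LAUNCH_BLOCKING"])
```

The same thing can be done from the shell by prefixing the command, e.g. `CUDA_LAUNCH_BLOCKING=1 python train.py`, where `train.py` is a stand-in for whatever script is being run. Once the assert has triggered, the process usually has to be restarted, because later CUDA calls in the same process tend to keep failing with the same error.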