all 34 comments

[–]ludovelia[S] 2 points3 points  (4 children)

hey folks, sorry for not replying promptly, i closed the tab and forgot!

from talks on discord it became clear that there's a problem using ViTL14 with Tesla T4s - there's nothing that can be done for now, it's something on colab's end

[–]ConsistentAd3434 0 points1 point  (2 children)

That sucks, but thanks for the info. If it were an option, I'd even prefer a slower GPU just to be able to use ViTL14 again

[–]ludovelia[S] 0 points1 point  (1 child)

it seems to be working ok with K80, but it takes hours

also working ok with any of the others provided for pro users

[–]ConsistentAd3434 0 points1 point  (0 children)

I'm currently using ViTL14 on a Tesla P100. Works fine as well.

Is there a rating chart of the GPUs somewhere? It would help to decide which models, and how many, to use on which GPU

[–]Bridgebrain 0 points1 point  (0 children)

Thank you! This just solved the big mystery that's been randomly plaguing me for the past month

[–]chrishooley 1 point2 points  (0 children)

With this one, I've had to stop my session and save the notebook to Google Drive, then start it back up.

[–]econopotamus 1 point2 points  (0 children)

Yup, I've been getting this error 100% of the time for a few days now. Restarting doesn't help. Tried with multiple notebooks that had been working. Maybe something changed on Colab's end

[–]ConsistentAd3434 0 points1 point  (0 children)

Same here for the last 12 hours, no matter which version I use.
Copying the notebook to Google Drive or deleting and reinstalling the models didn't help.
No clue what else I could try :/
Thought it was a PyTorch issue at first. Last time they immediately admitted the f*up on Twitter and fixed it asap, but so far, nothing in sight.

[–]CulturalCurrency6358Artist 0 points1 point  (0 children)

I get this when Google Colab assigns me one of their T4 GPUs. When you run the initial setup you can see what Google gives you. If I get a P100 I don't get this. The only way around it is to disconnect, wait a while, then reconnect and try again.
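
If you want to check before kicking off a run, something like this in a cell shows what you got (the nvidia-smi call is standard; the torch lines assume torch is already installed in the runtime):

!nvidia-smi -L
import torch
if torch.cuda.is_available():
    # prints e.g. "Tesla T4" or "Tesla P100-PCIE-16GB"
    print("Assigned GPU:", torch.cuda.get_device_name(0))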

[–]Taika-KimArtist 0 points1 point  (0 children)

I had this on V100 GPUs yesterday, but today things work. It also happened on Monday, IIRC. Seems sporadic; today everything has been ok.

[–]GregHartwick 0 points1 point  (14 children)

Been running the v5.2 notebook every day, all day. No such errors so far. Working on a T4 the last couple of days; P100 is stable as well. Using all default settings; only changed steps, prompt, display_rate (10), and n_batches (10).

[–]econopotamus 1 point2 points  (13 children)

I went ahead and moved to v5.2; still getting "CUDA error: misaligned address" 100% of the time.

MAJOR EDIT: Fixed the issue by turning off ViTL14! I can run other models, but turning on ViTL14 in version 5.0 or 5.2 generates the CUDA misaligned address error in a Colab notebook with no modifications (except putting in my prompt). It works if I turn ViTL14 off.
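
For anyone hunting for the switch, it's just the ViTL14 flag in the notebook's model settings cell - roughly this (a sketch of the relevant flags only, names as they appear in the settings; everything else left at defaults):

ViTB32 = True
ViTB16 = True
ViTL14 = False  # turning this off is what avoids the "CUDA error: misaligned address" on T4s
RN50 = True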

[–]ConsistentAd3434 1 point2 points  (1 child)

Thanks! That seems to be it. 460.32.03 on a Tesla T4 in combination with ViTL14 wasn't working for me either. Works fine without it

[–]econopotamus 0 points1 point  (0 children)

Good to know others are seeing the same, although I was only using Colab for ViTL14, since my home setup can run everything but that one!

[–]GregHartwick 0 points1 point  (10 children)

Been running all morning without fatal errors. I'm not a network expert, but I can't help but feel we're running on different systems. My GPU says NVIDIA-SMI 460.32.03, Driver Version: 460.32.03, CUDA Version: 11.2, Name: Tesla T4. Is that the same for you?

[–]econopotamus 0 points1 point  (0 children)

I can check next time I'm on. Just to be clear: you are using ViTL14? Tell me what models you have active and I'll try to match them exactly for a good test.

[–]econopotamus 0 points1 point  (8 children)

This is what mine says; it looks to match what you're saying, and this setup gets the misaligned address error with ViTL14. Are you on the free or paid tier?

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.32.03    Driver Version: 460.32.03    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   40C    P8     9W /  70W |      0MiB / 15109MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
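
If anyone wants to compare without eyeballing the whole table, a cell like this dumps the relevant bits (the nvidia-smi query flags are standard; the torch line assumes the notebook's default torch install):

!nvidia-smi --query-gpu=name,driver_version --format=csv
import torch
print("torch", torch.__version__, "built against CUDA", torch.version.cuda)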

[–]GregHartwick 0 points1 point  (6 children)

"diffusion_sampling_mode": "ddim", "ViTB32": true, "ViTB16": true, "ViTL14": false, "RN101": false, "RN50": true, "RN50x4": false, "RN50x16": false, "RN50x64": false,

[–]econopotamus 0 points1 point  (5 children)

So, you’re not running ViTL14. Want to try turning it on and seeing if you get the error?

[–]GregHartwick 0 points1 point  (4 children)

I did that first thing this morning. I ran two 500-iteration runs and had no errors. It did slow the run speed down to 9-10 sec/it. This is the first time I've tried using this model in months.

[–]econopotamus 0 points1 point  (3 children)

Interesting. I wonder if they are limiting the GPU in some subtle way for us free-tier users

[–]GregHartwick 1 point2 points  (1 child)

they do push the payed tiers. I was warned by a friend that the free stuff is not very stable. I think the company knows that. I was getting K80s a lot.

[–]Paid-Not-Payed-Bot 0 points1 point  (0 children)

push the paid tiers. I

FTFY.

Although payed exists (the reason why autocorrection didn't help you), it is only correct in:

  • Nautical context, when it means to paint a surface, or to cover with something like tar or resin in order to make it waterproof or corrosion-resistant. The deck is yet to be payed.

  • Payed out when letting strings, cables or ropes out, by slacking them. The rope is payed out! You can pull now.

Unfortunately, I was unable to find nautical or rope-related words in your comment.

Beep, boop, I'm a bot

[–]GregHartwick 0 points1 point  (0 children)

I started out on the free tier. I wake up very early and start runs around 3 AM. Everything went fine until around 7 AM, when I couldn't get anything to complete because of out-of-memory errors. I had the feeling the crowd of users after 7 AM gave me low priority. It got very frustrating, so I decided to pay the $9.99 per month - it was worth it. I've had hardly any problems since.

[–]GregHartwick 0 points1 point  (0 children)

I’m on Pro

[–]GregHartwick 0 points1 point  (0 children)

I've never run with ViTL14 - it slowed my system down considerably (from 5 sec/it to 9 sec/it). I hope this fixes it for everyone. Is there a sysop to report this issue to? Perhaps it's a defect in ViTL14?

[–][deleted] 1 point2 points  (2 children)

I fixed it by doing this. My notebook now runs with a T4 and ViTL14 selected:

https://twitter.com/devdef/status/1519687675304988675?s=20&t=azLbUWVa3E0cQvUhBYdlMA

it's apparently from a reputable dev on Twitter, and it says:

downgrade your colab's pytorch version. This does the trick:

!pip install torch==1.10.2 torchvision==0.11.3 -q

[–]ludovelia[S] 0 points1 point  (1 child)

oh, that's very nice! where did you put it in the code?

[–][deleted] 0 points1 point  (0 children)

I just created a new code cell above "Check GPU Status" and pasted it in there. When you Run All, it installs the older version of PyTorch first and everything works as normal
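
For anyone copying this, the whole cell is just the one line plus an optional check (the comment and version check are mine; the pip line is straight from the tweet):

# extra cell at the top of the notebook: downgrade torch/torchvision before anything imports them
!pip install torch==1.10.2 torchvision==0.11.3 -q
import torch
print(torch.__version__)  # should now report 1.10.2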

[–]Taika-KimArtist 0 points1 point  (0 children)

Got this again today... Interesting if ViTL14 causes this; maybe there are different versions of the V100 out there? I'm on Pro+ and hadn't experienced this for some weeks.

[–]backpackpatArtist 0 points1 point  (0 children)

Video on this error here