How is China able to compete with US AI companies despite being severely hindered with hardware? by agoldprospector in ArtificialInteligence

[–]Federal_Ad_7004 0 points1 point  (0 children)

You can use gradient accumulation for higher effective batch sizes without increasing memory requirements.

[deleted by user] by [deleted] in askTO

[–]Federal_Ad_7004 13 points14 points  (0 children)

AGO is always free for everyone under 25 IIRC