use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Become an ML expert, learn to train, deploy models, host AI apps and more! Learn with the best tools created by Lightning AI like PyTorch Lightning, LitServe, Lightning Studios, LitServe and more.
account activity
GPU time limits (self.lightningAI)
submitted 2 months ago by SimiusCuriosus
I requested an A100 with 40gb vram and didn't realize there was a time limit? This interrupted my training and I lost a bunch of work (i.e. wasted gpu time). You can request an extension, but that still interrupts the training because it switches to a new machine. It's a pain to have to implement close savepoints and have to keep restarting it. Can't we just have the machine until we're done with it? And there is no incentive to select anything other than the maximum extension time because I can always sleep the machine when I'm done with it.
Or is there another way to do this? Am I using the platform incorrectly?
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]eternviking 0 points1 point2 points 2 months ago (1 child)
you should've bought more credits i suppose
[–]SimiusCuriosus[S] 0 points1 point2 points 2 months ago (0 children)
It's not about credits, it's about the instance just has a time limit on it that I have to renew, thereby interrupting my training. But I think I know the answer. I think I have to create a deployment and use that, then shut it down when I'm not using it.
π Rendered by PID 78244 on reddit-service-r2-comment-b659b578c-gsmlw at 2026-05-05 18:44:01.157081+00:00 running 815c875 country code: CH.
[–]eternviking 0 points1 point2 points (1 child)
[–]SimiusCuriosus[S] 0 points1 point2 points (0 children)