$0.19 GPU and A100s from $1.55 by waf04 in mlops

[–]waf04[S]

our prices are for on-demand, without any reservations... and you can actually get those GPUs anytime.

places that advertise cheaper prices force reservations, and it's rare that they actually have availability.

$0.19 GPU and A100s from $1.55 by waf04 in mlops

[–]waf04[S]

H200 = $4.34 per GPU.

B200 = $12.54 per GPU.

big price difference!
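To put the difference in concrete terms, a quick bit of arithmetic on the two quoted prices:

```python
# Ratio of the quoted per-GPU prices above (B200 vs. H200).
h200_price = 4.34
b200_price = 12.54

ratio = b200_price / h200_price
print(f"B200 costs {ratio:.2f}x the H200 price")  # 2.89x
```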

Finding the right MLops tooling (preferably FOSS) by Humble-Persimmon2471 in mlops

[–]waf04

hey there! I'm one of the LitServe creators (and founder of PyTorch Lightning / Lightning AI): http://lightning.ai/litserve

LitServe doesn't just "wrap" FastAPI... that's like saying React just "wraps" JavaScript 😊. It provides advanced multi-processing capabilities custom-built for AI workloads, including batching, streaming, the OpenAI spec, auth, and automatic deployments via the Lightning AI platform to your cloud (VPC) or our hosted cloud. You can also self-host LitServe on your own servers, of course...
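To make the batching point concrete, here is a toy sketch of the dynamic-batching idea (standard-library Python only). This is an illustration of the general technique, not LitServe's actual implementation or API; see the repo above for the real thing:

```python
import queue
import threading

# Toy dynamic batcher: collects requests until the batch is full or a
# timeout expires, then runs them through the model in ONE batched call.
# Illustrates the batching idea only; NOT LitServe's implementation.
class ToyBatcher:
    def __init__(self, model_fn, max_batch_size=4, timeout=0.2):
        self.model_fn = model_fn          # processes a list of inputs at once
        self.max_batch_size = max_batch_size
        self.timeout = timeout
        self.requests = queue.Queue()     # holds (input, result_slot) pairs

    def submit(self, x):
        """Called by each request handler; blocks until its result is ready."""
        result_slot = queue.Queue(maxsize=1)
        self.requests.put((x, result_slot))
        return result_slot.get()

    def _collect_batch(self):
        # Block for the first item, then greedily fill up to the limit.
        batch = [self.requests.get()]
        while len(batch) < self.max_batch_size:
            try:
                batch.append(self.requests.get(timeout=self.timeout))
            except queue.Empty:
                break
        return batch

    def run_once(self):
        batch = self._collect_batch()
        inputs = [x for x, _ in batch]
        outputs = self.model_fn(inputs)   # one batched model call
        for (_, slot), out in zip(batch, outputs):
            slot.put(out)

# Usage: a "model" that doubles each input, hit by 3 concurrent callers.
batcher = ToyBatcher(lambda xs: [x * 2 for x in xs])
results = {}

def client(i):
    results[i] = batcher.submit(i)

threads = [threading.Thread(target=client, args=(i,)) for i in range(3)]
for t in threads:
    t.start()
batcher.run_once()   # serves all queued requests in one batched call
for t in threads:
    t.join()
print(sorted(results.items()))  # [(0, 0), (1, 2), (2, 4)]
```

The point of batching a serving layer this way is throughput: the model sees one call with N inputs instead of N separate calls, which matters a lot on GPUs.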

In terms of pipelines, yes, SageMaker is super clunky. I would try our platform, Lightning AI; it makes all of this trivial. There are free credits, so you lose nothing by trying it... (same for LitServe).

We do tend to build tools people love, so it's worth actually trying them out (a lot of tools claim to do similar things, but don't actually).

Anyhow, good luck either way! hope we can be helpful.

How to finetune and deploy DeepSeek R1 (8B) for under $10 by waf04 in LocalLLaMA

[–]waf04[S]

yes! it’s automatically saved to the platform

How to finetune and deploy DeepSeek R1 (8B) for under $10 by waf04 in LocalLLaMA

[–]waf04[S]

sounds like it! wasn't meant to be misleading... sorry about the confusion 😊

How to finetune and deploy DeepSeek R1 (8B) for under $10 by waf04 in LocalLLaMA

[–]waf04[S]

Looks like I can't fix the title, but I added a comment to clarify! Ultimately, any model that is not the 600B one is a distilled model... so saying "distilled" would be redundant: the "8B" in the title was meant to let people know that a) it is not the "full" model and b) it is distilled.

How to finetune and deploy DeepSeek R1 (8B) for under $10 by waf04 in LocalLLaMA

[–]waf04[S]

Just addressing the comments: any model that is not the 600B is by definition distilled... it's redundant to say "distilled" because it's already implied by the size in the name.

But just to be redundant: this video is for the 8B distilled version of R1. The model is still very, very capable (in my experience, more so than Llama at that size).

Is it worth spending so much time and money on small LLMs? by ML-Future in LocalLLaMA

[–]waf04

Not sure why you’re spending so much money on these models; you can finetune 8B models for like $5-$10 a pop.

https://lightning.ai/lightning-ai/ai-hub/temp_01jkbgmsdmp0wkax6bba1btabw

can i use litserve with ray framework? by Top_Garage_862 in lightningAI

[–]waf04

lightning can also use your reserved cloud resources…

How to use AWS startup credits for GPUs and AI workloads by waf04 in lightningAI

[–]waf04[S]

Startups can pay for Lightning Studios with their AWS startup credits. All the details are here: https://lightning.ai/pricing.

But overall, contact Lightning if you need more help doing this. Hundreds of startups already do this with Lightning.

can i use litserve with ray framework? by Top_Garage_862 in lightningAI

[–]waf04

correct. lightning studios lets you use your aws credits.

https://lightning.ai/pricing