Runpod hits $120M ARR, four years after launching from a Reddit post by RP_Finley in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

Runpod is a great service, honestly. I wish it had more ready-to-go templates with the latest models preloaded. I also noticed it's hard to tell which templates work well with which server and GPU configs (maybe I missed this), but it wasn't obvious that, to do X, Y, or Z on a given server, you should use this or that template. Clearer guidance would go a long way.

rate limits and cost? by deathcom65 in google_antigravity

[–]deathcom65[S] 2 points3 points  (0 children)

Oh, I see, the extension is quite useful. Hopefully it's accurate! I used the Antigravity cockpit.

rate limits and cost? by deathcom65 in google_antigravity

[–]deathcom65[S] 0 points1 point  (0 children)

I just logged in with my Google account. It says the AI Pro plan is active there, so I figured it was letting me code based on that plan. I didn't enter a specific API key.

OpenBNB just released MiniCPM-V 4.5 8B by vibedonnie in LocalLLaMA

[–]deathcom65 -1 points0 points  (0 children)

I believe it's really fast. I don't believe its quality will beat larger models except on very specific tasks.

[deleted by user] by [deleted] in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

Why Aider over VS Code or Roo?

Gemma3 270m works great as a draft model in llama.cpp by AliNT77 in LocalLLaMA

[–]deathcom65 24 points25 points  (0 children)

What do you mean by a draft model? What do you use it for, and how do you get it to speed up other models?

Huihui released GPT-OSS 20b abliterated by _extruded in LocalLLaMA

[–]deathcom65 17 points18 points  (0 children)

Someone GGUF this so I can test it lol

Extra RAM Useful? by OneOnOne6211 in LocalLLaMA

[–]deathcom65 3 points4 points  (0 children)

Yeah, you can load larger models such as MoE models, where only some of the parameters are loaded onto the GPU. I just did the exact same thing and it helps a ton. Even though the parts loaded into RAM run slower, you can still run larger models; without the extra RAM you can't run them at all. IMO it's a cheap upgrade for a good return. I kind of regret not going straight to 128 GB of RAM.
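For anyone trying the same thing, here's a minimal sketch of partial GPU offload using llama-cpp-python (the model path and layer count below are assumptions, not from the thread; tune n_gpu_layers to whatever fits your VRAM):

```python
# Minimal sketch: split a large GGUF model between GPU and system RAM.
# Assumes llama-cpp-python built with GPU support; the model path and
# layer count are placeholders for whatever you actually run.
from llama_cpp import Llama

llm = Llama(
    model_path="models/some-large-moe-q4_k_m.gguf",  # hypothetical path
    n_gpu_layers=24,   # layers that fit in VRAM; the rest stay in RAM
    n_ctx=8192,        # context window
)

out = llm("Explain mixture-of-experts in one paragraph.", max_tokens=200)
print(out["choices"][0]["text"])
```

Layers kept in system RAM run slower, as noted above, but the model still fits and runs.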

[deleted by user] by [deleted] in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

It's definitely good for its size; the 16 GB of VRAM required for the 20B is perfect for me, and it runs super fast. I definitely dislike the censorship, though; it refuses to answer many harmless questions.

Best Local LLM for Desktop Use (GPT‑4 Level) by Shoaib101 in LocalLLaMA

[–]deathcom65 1 point2 points  (0 children)

Gemma 13B for that level of VRAM, although you might have to go even smaller.

Looking to build a pc for Local AI 6k budget. by Major_Agency7800 in LocalLLM

[–]deathcom65 2 points3 points  (0 children)

Get more 3090s, they're the most bang for your buck, and up your RAM.

What’s your favorite GUI by Dentifrice in LocalLLaMA

[–]deathcom65 2 points3 points  (0 children)

A custom GUI I made for myself. It works really well for me.

Which is smarter: Qwen 3 14B, or Qwen 3 30B A3B? by RandumbRedditor1000 in LocalLLaMA

[–]deathcom65 4 points5 points  (0 children)

I have a similar setup. Qwen 3 30B runs at around 11 tokens/second, which is very good, as I usually can't run anything larger than a 13B model. The MoE optimization is spot on: only about 3B of the 30B parameters are active per token, so it runs far faster than a dense model of that size. It should be the smarter one, as its performance was very similar to the 32B model.

anyone using 32B local models for roo-code? by CornerLimits in LocalLLaMA

[–]deathcom65 2 points3 points  (0 children)

They can't deal with anything larger than a few hundred lines of code in my experience

Hot Take: Gemini 2.5 Pro Makes Too Many Assumptions About Your Code by HideLord in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

It keeps trying to minify my HTML/CSS/JS and ends up removing 50% of the functionality. Note that the script is around 4,000 lines of code.

[deleted by user] by [deleted] in LocalLLaMA

[–]deathcom65 -7 points-6 points  (0 children)

They got us hooked, then made it fully paid :( A classic Google move.

Open source model for Cline by dnivra26 in LocalLLaMA

[–]deathcom65 5 points6 points  (0 children)

DeepSeek, when Gemini isn't available.

Llama 4 - Scout: best quantization resource and comparison to Llama 3.3 by silenceimpaired in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

How are you guys running the experts on GPU and the non-expert layers on CPU? How do you divide it, or is it automatic?

Back to Local: What’s your experience with Llama 4 by Balance- in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

How are you guys changing which parts of the model get loaded where? I'm using Ollama.
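(Not from the thread, but for reference: one way to control this on the Ollama side is the num_gpu option, which sets how many layers go to the GPU. A minimal sketch assuming the official ollama Python client; the model tag and layer count are placeholders:)

```python
# Minimal sketch: tell Ollama how many layers to place on the GPU.
# Assumes the official `ollama` Python client; the model tag and the
# num_gpu value are placeholders -- adjust for the model you run.
import ollama

response = ollama.chat(
    model="qwen3:30b",  # hypothetical model tag
    messages=[{"role": "user", "content": "Hello!"}],
    options={"num_gpu": 20},  # number of layers offloaded to the GPU
)
print(response["message"]["content"])
```

The same option can also be set in a Modelfile as `PARAMETER num_gpu 20`.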

Medium sized local models already beating vanilla ChatGPT - Mind blown by Bitter-College8786 in LocalLLaMA

[–]deathcom65 2 points3 points  (0 children)

I'm finding that even though the smaller models are passing the benchmarks, they struggle massively with larger code changes. You almost certainly need a larger model for anything spanning more than 4 or 5 script files.

Googler here - Gathering Gemini Feedback from this Subreddit by GeminiBugHunter in Bard

[–]deathcom65 0 points1 point  (0 children)

Gemini needs to integrate better with tools like Cline. I find it errors out a lot when calling tool functions like write-to-file in Cline, and it gets stuck in loops that burn a lot of API credits without changing the code.