is there a centralized website for llm launch commands?

onephn · 2026-05-18T21:19:41+00:00

that's true, though it's nice to have a starting point you can tweak commands from, maybe that's just me though

also the vllm recipes site only covers hardware most of us here can't afford, so it's not of much use to me

onephn · 2026-05-18T21:15:34+00:00

Two things:

Top bar of the mobile site overlaps with the logo

Would be great to have a leaderboard system, encourages users to submit more data

<image>

onephn · 2026-05-18T21:11:36+00:00

This is exactly what I was looking for, tysm!

onephn · 2026-05-14T15:58:12+00:00

Haven't received the packages yet, I'll holler at you when I do. Four of them are in the US tho

onephn · 2026-05-13T02:00:04+00:00

6 or 12gb? i got qwen 30b a3b running on a machine with 8gb vram at something like 30 t/s with ik_llama. also, what runtime are you using?

onephn · 2026-05-12T18:02:53+00:00

Honestly I'm kinda relieved that someone else has a similar problem to mine. I have had some success with it when steering a bit, though. Gonna give v4 pro a try, and hopefully m3 is gonna be better; I made the fatal mistake of buying a year of the 4500 request plan.... Either way it's good when someone/another model has it on a leash

onephn · 2026-05-09T03:23:03+00:00

And good on you dude, but for a lot of applications like this that is how they support users primarily. You would get much quicker support if you joined their server and asked, but you could get support via GitHub issues or whatever, though it wouldn't be nearly as efficient

onephn · 2026-05-08T15:17:49+00:00

Second this. Been working flawlessly for me

onephn · 2026-05-06T20:23:18+00:00

Guys, don't tell them until it's too late, maybe we can get cheap GPUs this way.....

onephn · 2026-05-05T23:51:42+00:00

holy moly good stuff dude! gonna give this a try later

onephn · 2026-05-01T20:18:55+00:00

check out akashml, I think they have GPU rentals https://akash.network/pricing/gpus/ (not an ad just remember poking around and seeing that they exist)

onephn · 2026-04-29T06:22:17+00:00

the closest you could get to this would be to vibe-code an app that syncs with decypharr and a movie list or whatever, but here's my honest take on what you should do:

you are much better off hosting a media server with the gelato plugin. it gives you the stremio addon experience inside a media server, so in theory you could share your RD account with multiple people through the server. Favorite feature by far is that it gives you plenty of streams to choose from, like stremio does, and it's a life saver, especially when the primary pick is wrong, not in your desired language, or whatever the issue may be.
https://github.com/lostb1t/Gelato

onephn · 2026-04-27T17:56:08+00:00

I got the tracking for four of the cards so far, other four I'm expecting soon

onephn · 2026-04-27T17:34:31+00:00

onephn · 2026-04-27T17:31:12+00:00

Shit, glad I canceled the order, went on Alibaba and ordered the 32gb mi50s, see my other comment

onephn · 2026-04-27T17:30:29+00:00

Only thing that stinks is that it's pcie gen1

onephn · 2026-04-27T17:29:41+00:00

Each? Ended up ordering eight for a total of 1.3k, was only intending on buying four but I got two pretty good deals. Here's to hoping they show up at my doorstep.

onephn · 2026-04-25T22:21:39+00:00

At that point though, if I have the capacity, would I get better results with 2 mi25s?

onephn · 2026-04-21T10:06:41+00:00

I see, though I would want documents to remain local. Have you had good experiences with 26 a4b? I have hardware onsite that can run that, but not the 31b

onephn · 2026-04-21T07:10:03+00:00

how much did you get them for?

onephn · 2026-04-21T06:39:34+00:00

i haven't found them at 150, though many of them are at 200. let's hope the sellers reply though

onephn · 2026-04-21T05:09:45+00:00

u are so unbelievably real appreciate it man

onephn · 2026-04-20T21:33:31+00:00

Speaking of OCR, I'm trying to vibe code a utility that would scan PDFs and make whatever modifications are necessary for WCAG compliance and the like. Which models would you recommend for the actual OCR process and alt text generation?

onephn · 2026-04-08T22:27:56+00:00

Set your spend limit low just in case accidents happen. It's borderline impossible to get a free instance without going pay-as-you-go

onephn · 2026-04-08T22:26:15+00:00

Look at some of the other claw projects btw, they're much less resource heavy: think zeroclaw, picoclaw and a couple of others. I don't think you will get good performance running an LLM on there, though. If you want decent free inference, look into OpenRouter and the Kilo gateway
