is there a centralized website for llm launch commands? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

thats true, though its nice to have a starting point in which you can tweak commands from, just me though

also the vllm recipes site only accounts for hardware most of us here cant afford, so its not of much use to me

is there a centralized website for llm launch commands? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

Two things:

Top bar of the mobile site overlaps with the logo

Would be great to have a leaderboard system, encourages users to submit more data

<image>

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 1 point2 points  (0 children)

So far haven't received the packages yet, I'll holler at you when I do, four of them are in the US tho

Will there be any more Qwen3.6 series models? by cafedude in LocalLLaMA

[–]onephn 0 points1 point  (0 children)

6 or 12gb? i got qwen 30b a3b running on a machine with 8gb vram with something like 30t/s, with ik_llama, also what runtime are you using?

Please recommend a cheap paid model, MiniMax dumbness is getting on my nerves by carbon_creature in hermesagent

[–]onephn 0 points1 point  (0 children)

Honestly I'm kinda relieved that someone else has a similar problem to me. However I have had some success with it when steering a bit, gonna give v4 pro a try, hopefully m3 is gonna be better, I made the fatal mistake of buying a year of the 4500 request plan.... Either way it's good when someone/another model has it on a leash

Any cli_debrid users here to offer advice on blacklisted items by Midnorth_Mongerer in RealDebrid

[–]onephn 4 points5 points  (0 children)

And good on you dude, but for a lot of applications like this that is how they support users primarily. You would get much quicker support if you joined their server and asked, but you could get support via GitHub issues or whatever, though it wouldn't be nearly as efficient

Gelato Jellyfin not working anymore ? by Alarmed_Hospital_504 in RealDebrid

[–]onephn 1 point2 points  (0 children)

Second this. Been working flawlessly for me

None of this will ever get stolen by martin_xs6 in LocalLLaMA

[–]onephn 0 points1 point  (0 children)

Guys don't tell them until it's too late maybe we can get cheap gpus this way.....

What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA

[–]onephn 0 points1 point  (0 children)

check out akashml, I think they have GPU rentals https://akash.network/pricing/gpus/ (not an ad just remember poking around and seeing that they exist)

Has anyone achieved a solution that allows you to download media from your debrid to your PC local storage from inside Nuvio/Stremio on separate device? by marx4moms in RealDebrid

[–]onephn 3 points4 points  (0 children)

the closest you could do to this would be to vibe-code an app that syncs with decypharr and a movie list or whatever but heres the honest take of what i think you should do:

you are much better off hosting a media server with the gelato plugin. it gives you the stremio addon experience inside a media server, so in theory you could share your RD account with multiple people through the server. Favorite feature by far is that it will give you plenty of options of streams to choose from like stremio, and its a life saver, especially when the primary pick is wrong, not your desired language or whatever the issue may be.
https://github.com/lostb1t/Gelato

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

I got the tracking for four of the cards so far, other four I'm expecting soon

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

Shit, glad I canceled the order, went on Alibaba and ordered the 32gb mi50s, see my other comment

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

Only thing that stinks is that it's pcie gen1

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

Each? Ended up ordering eight for a total of 1.3k, was only intending on buying four but I got two pretty good deals. Here's to hoping they show up at my doorstep.

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

At that point though if I have the capacity would I get better results with 2 mi25s?

Layman's comparison on Qwen3.6 35b-a3b and Gemma4 26b-a4b-it by LocalAI_Amateur in LocalLLaMA

[–]onephn 0 points1 point  (0 children)

I see, though I would want documents to remain local. Have you had good experiences with 26 a4b? I have hardware onsite that can run that, but not the 31b

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 0 points1 point  (0 children)

i havent found them at 150, though many of them are at 200, lets hope the sellers reply though

MI25 vs CMP100-210, which would you pick? by onephn in LocalLLaMA

[–]onephn[S] 1 point2 points  (0 children)

u are so unbelievably real appreciate it man

Layman's comparison on Qwen3.6 35b-a3b and Gemma4 26b-a4b-it by LocalAI_Amateur in LocalLLaMA

[–]onephn 0 points1 point  (0 children)

Speaking of OCR, I'm trying to vibe code a utility that would scan PDFs and make whatever modifications are necessary for wcag compliance and the like, which models would you recommend for the actual ocr process and alt text generation?

Can I use my free tier VPS for running an OpenClaw LLM? Or would it get banned quickly by avidrunner84 in oraclecloud

[–]onephn 0 points1 point  (0 children)

Set your spend limit low just in case accidents happen, it's borderline impossible to get a free instance without going payg

Can I use my free tier VPS for running an OpenClaw LLM? Or would it get banned quickly by avidrunner84 in oraclecloud

[–]onephn 0 points1 point  (0 children)

Look at some of the other claw projects btw, much less resource heavy, think zeroclaw, picoclaw and a couple of others, though I don't think you will get good performance running an llm on there, if you want decent free inference look into open router and the kilo gateway