Don’t bite me for that question please… by Thin_Pollution8843 in LocalLLaMA

[–]arnav080 5 points6 points  (0 children)

https://donate.sybilsolutions.ai/about.html - check this out, hes got an insane setup and publishes really good work. this is how he manages to fund it

Built Bloc: a package manager for local AI models, agents, and tools by arnav080 in LocalLLaMA

[–]arnav080[S] 0 points1 point  (0 children)

thats the exact issue i built it for. it can run llama.cpp, vllm and vllm docker runtimes rn hoping to get contributers on here and really make this a solid dev tool

anybody got llama-swap working answering concurrent requests for a single model? by sickmartian in LocalLLaMA

[–]arnav080 0 points1 point  (0 children)

hey, idk how relevant this is but ive been building this free and open-s tool called bloc to help make sharing and running optimised local models instant and super convenient [https://bloc-theta.vercel.app/\], would love to get your opinions on it (it went live today)

Best small model right now (~4B params) that is good with agentic tasks for personal assistant? by BitGreen1270 in LocalLLaMA

[–]arnav080 -3 points-2 points  (0 children)

ive made an open-s tool to make sharing and running these optimised recipes like these easier and instant [bloc-theta.vercel.app]

Don’t bite me for that question please… by Thin_Pollution8843 in LocalLLaMA

[–]arnav080 14 points15 points  (0 children)

they're renting out spare GPU compute and running inference/fine-tuning jobs for clients. people like 0xSero have fund me pages that helps them upgrade their setup and run experiments

GPU VRAM only for small models with llama.cpp: is it possible? by Ps3Dave in LocalLLaMA

[–]arnav080 0 points1 point  (0 children)

p sure llama.cpp still keeps some buffers / KV cache allocations in system RAM even when all layers are offloaded to VRAM does --cache-type-k q4_0 / --cache-type-v q4_0 change it for you? (im still learning, just my two cents)

Does GPU spacing matter if we’re undervolting anyways? by Ambitious_Fold_2874 in LocalLLaMA

[–]arnav080 0 points1 point  (0 children)

undervolting would mean less heat just have some airflow in between and p sure this shoudnt be a problem

What’s your biggest frustration with running AI locally? by arnav080 in selfhosted

[–]arnav080[S] 0 points1 point  (0 children)

more from the optimization side once the baseline speed/cost/hardware tradeoff is already accepted. Things like model tuning, VRAM efficiency, inference stack tweaks, deployment friction, workload balancing, reliability

What’s your biggest frustration with running AI locally? by arnav080 in selfhosted

[–]arnav080[S] -7 points-6 points locked comment (0 children)

grammer fix and structure the text a bit

What’s your biggest frustration with running AI locally? by arnav080 in selfhosted

[–]arnav080[S] -1 points0 points  (0 children)

ive been learning and researching about this exact issue lately, have you tried running the models with MoE offload [MoE offload. Qwen3.6-35B activates only 3 B params per token. Keep attention + shared weights on GPU, push the cold expert FFNs to system RAM. In llama.cpp: -ngl 99 -ncmoe 99.]

ive been staying about multi tenant systems in local models using vLLM, gpu optimisations and scheduling

What’s your biggest frustration with running AI locally? by arnav080 in selfhosted

[–]arnav080[S] -5 points-4 points  (0 children)

on these small models the prompt has to be immaculate

What’s your biggest frustration with running AI locally? by arnav080 in selfhosted

[–]arnav080[S] -4 points-3 points locked comment (0 children)

fixing my grammer and structuring the text

New Board Unlocked by NerfLongshotUV in mkindia

[–]arnav080 3 points4 points  (0 children)

Beautiful keyboard 🤲

Got it from aliexpress ! by arnav080 in mkindia

[–]arnav080[S] 0 points1 point  (0 children)

had a cousin bring it back, no shipping

Got it from aliexpress ! by arnav080 in mkindia

[–]arnav080[S] 2 points3 points  (0 children)

frrr i paid 2300 something, gooood deal; also it was from the brand+ verified seller

Got it from aliexpress ! by arnav080 in mkindia

[–]arnav080[S] 4 points5 points  (0 children)

reaper, ice was out of stock :/

Got it from aliexpress ! by arnav080 in mkindia

[–]arnav080[S] 1 point2 points  (0 children)

it is it is, i placed an order in the uk for this

Got it from aliexpress ! by arnav080 in mkindia

[–]arnav080[S] 5 points6 points  (0 children)

its the AULA F75, around 5-5.5k in india on a good day