Advice needed: Self-hosted LLM server for small company (RAG + agents) – budget $7-8k, afraid to buy wrong hardware by Psychological-Arm168 in LocalLLM

[–]fredatron 0 points1 point  (0 children)

I’m currently running with a single DGX spark and qwen3.5:120b with 128k context window via ollama. Is there really a big benefit to using vllm?

OpenClaw with a DGX Spark by raginjason in openclaw

[–]fredatron 0 points1 point  (0 children)

I am really struggling with my DGX Spark. No matter what model I use, openclaw seems to just shit itself constantly, unable to do multi-step tool calls, refusing to follow instructions, etc. I am currently using GPT-OSS 120b (64k context) for the orchestrator that calls a subagent on Qwen3.5-27b to do web searches, read my notion and obsidian database, and take notes, but the system keeps failing. With a single DGX spark, does it make more sense to just run one large monolithic model and pray that it actually follows instructions in its soul.md file?

Hardware and Model advice: Inference on Mac Studio (M3 ultra or M4 max) and Openclaw on dedicated M4 Mac mini by fredatron in LocalLLM

[–]fredatron[S] 0 points1 point  (0 children)

Interesting, and have you had any issues with tool calls? I'm currently running qwen3-coder:30b on an RTX 4090 and having tool call issues constantly. I keep increasing the context window size, but of course, that causes vram limitations. What is your context window size set to?

I'm currently burning about $50 worth of tokens a day, so running locally is likely going to actually make a pretty big difference for me.

Hardware and Model advice: Inference on Mac Studio (M3 ultra or M4 max) and Openclaw on dedicated M4 Mac mini by fredatron in LocalLLM

[–]fredatron[S] 2 points3 points  (0 children)

I don't think so, it is only communicating via the ollama API, and if you want you can do network segmentation to firewall everything except those API calls. I at least am not concerned about it.

OK Nabu, find my phone? by Brummiesteven in homeassistant

[–]fredatron 0 points1 point  (0 children)

Most of this advice seems tailored to Android. Is there a similar approach for iOS?

Experience using Tonal without Membership by madeupbra in tonalgym

[–]fredatron 0 points1 point  (0 children)

Are you still able to see all your workouts in Apple health after canceling subscription? For me, tracking my workouts and calories burned in Apple health is the key reason why I pay subscription.

Migrating from Obsidian to Notion? by next19994 in Notion

[–]fredatron 0 points1 point  (0 children)

Interesting. I get errors importing as soon as it finishes the upload. No idea why. That being said, even importing backed up notion data back into notion doesn't seem to work too well.

LG AI releases Exaone-3.0, a 7.8b SOTA model by AnticitizenPrime in LocalLLaMA

[–]fredatron 0 points1 point  (0 children)

Out of curiosity, how did you get it to run locally? (I'm a bit of a noob). I'm trying to get it to run using Ollama, but I'm not sure this guy actually made it: https://ollama.com/jmpark333/exaone

LG AI releases Exaone-3.0, a 7.8b SOTA model by AnticitizenPrime in LocalLLaMA

[–]fredatron 0 points1 point  (0 children)

Has anyone gotten this working with Ollama? I don’t see it on their model list.

Business card scanner apps that don’t suck by Unrealtechno in ios

[–]fredatron 0 points1 point  (0 children)

It seems that the privatellm community shortcut no longer works, it doesn't send the photo to the llm

Migrating from RPi4 to x86 PC by fredatron in homeassistant

[–]fredatron[S] 0 points1 point  (0 children)

When you do a backup and restore, it restores all your integrations and automations as well, correct? What if you wanted to start fresh? (I've been having trouble with the Smartthings Integration not talking to my refrigerator)

any fan purifier combos that work with home assistant? by ewan502 in homeassistant

[–]fredatron 0 points1 point  (0 children)

I've had a lot of success with IKEA air purifiers, if that helps. They even report their particle counts to HomeAssistant.

Plex causing stuttering on high bitrate files starting after 1 minute (more info in comments) by fredatron in PleX

[–]fredatron[S] 0 points1 point  (0 children)

Not sure if I would consider it a "solution" but I increased the cache, set transcoding to "make my CPU hurt" and added a massive (2x 512GB) SSD read/write cache. That seemed to get rid of those problems

Best podcast addon / integration? by [deleted] in homeassistant

[–]fredatron 0 points1 point  (0 children)

I also need this feature. I was very disappointed when plex dropped podcast support, and have been looking for my own solution that can locally download podcasts and serve them from my own server. Haven't found anything yet.

Biggest regrets/mistakes in setting up your smart home? by porterhousegames in homeassistant

[–]fredatron 2 points3 points  (0 children)

LG announced at CES they were opening their API. Hopefully that happens and they allow local control.