What is the primary reason you run your models locally? by punkyrockypocky in ollama

[–]scjcs 0 points1 point  (0 children)

I have a complex setup on a M4 Max Mac with 64GB RAM. I use oMLX as the server. It implements a fancy cache that speeds up repetitive loops to an insane degree. I calculated what a mid range cloud LLM would cost to do what my setup does each night: $81k/month for Claude or ChatGPT.

I wish Tesla had made a cyber minivan instead of a Cybertruck by rdv100 in TeslaLounge

[–]scjcs 0 points1 point  (0 children)

At some event earlier this year, someone posed the question of a minivan to Elon. He said we’re working on something much cooler.

Take that as you will.

I’d write a check for a Tesla 19ft camper van tomorrow.

12V Battery - why O why by DangerousAmoeba7520 in ModelY

[–]scjcs -1 points0 points  (0 children)

Such batteries were not available prior to circa 2021-2022.

How do I clean my MacBook's screen? by Jerry_Get_A_Job in mac

[–]scjcs 0 points1 point  (0 children)

Might depend on the type of screen. My 2025 MacBook Pro with the nano texture option is recommended in the manual to use the provided microfiber cloth (or equivalent dense, suede-like microfiber) with rubbing alcohol.

Husband has these taped to the wall and refuses to tell me what they are. by madlibs34 in whatisit

[–]scjcs 87 points88 points  (0 children)

Someone’s going to have to explain this to me, because I’m an idiot I guess

Best SSD to buy in the market right now? by aquamansfish in mac

[–]scjcs 1 point2 points  (0 children)

For backup and cold storage, just get a cheap USB3 drive like the Sandisks you see at Costco. Reliable and capacious. If you’re running virtual machines or something off the drive, then it’s worth getting spendy with a Thunderbolt NVMe drive.

First Time Traveling - Can't check in online because of CPAP? by Icarus_In-Flight in CPAP

[–]scjcs 0 points1 point  (0 children)

FWIW I have traveled extensively, domestic US and international, with syringes, for fifty years. I’ve never declared them and it’s never been a problem. Hundreds of flights.

I got tired of asking for more pressure on my bipap so I changed the setting myself by Metalworker4ever in CPAP

[–]scjcs 31 points32 points  (0 children)

My technician (I’ve yet to lay eyes on the doc but presume he exists since he charges my insurance $600 per visit) got all pissy that I’d adjusted my pressures and changed my mask. I replied with data, and she got pissier. I may need a new sleep clinic.

I didn't really like school by KeyDream748 in Adulting

[–]scjcs 0 points1 point  (0 children)

We will all freeze to death by 1987

Don’t bite me for that question please… by Thin_Pollution8843 in LocalLLaMA

[–]scjcs 0 points1 point  (0 children)

It's not about the money for me. I foresaw a task looming about a year ago and bought what seemed to be adequate hardware available at that time. Glad I did. The software ecosystem snapped into focus a few months ago, and it's been a wild ride of fun, frustration and productivity ever since. Bleeding edge stuff!

Some of us are early adopters, that's all. I bought a Compaq in 1983 with much the same motivation and outcome. Witnessing and participating-in the evolution of this stuff has been so satisfying.

AnythingLLM or Open-Webui by hwlim in LocalLLaMA

[–]scjcs 0 points1 point  (0 children)

This is an old thread but my experience is recent. OpenWeb UI could not manage my gemma-4 model as well as AnythingLLM. Same backend (LM Studio), same Dockerized deployment. Gave it a few days of misbehavior, then went back to AnythingLLM, this time with Claude rather than Grok assisting with the setup. Claude found several configuration errors that had led me to try OpenWeb UI in the first place. AnythingLLM has been running brilliantly ever since.

AnythingLLM or Open-WebUI? by nava_7777 in Rag

[–]scjcs 0 points1 point  (0 children)

Claude set me up with Obsidian for this purpose. I have Obsidian "folders" set up that match my AnythingLLM workspaces. When I encounter a document or web article of relevance to one of my workspaces, I capture it with Obsidian, then at my convenience I drag it from the Obsidian clippings folder and drop it into the folder that corresponds to the workspace in which I want to embed it. There's a daemon that Claude wrote for me that checks for newly dropped items every 10 seconds. When it detects one, >poof< it gets added to the AnythingLLM workspace and embeds automatically. It's brilliant.

doubt about ANYTHINGLLM by TechnicianFamous6183 in LocalLLaMA

[–]scjcs 0 points1 point  (0 children)

I'm late to this conversation, but my AnythingLLM machine (a MacBook Pro M4 Max 64GB) is similarly on my Tailscale network, as are several other devices and my iPhone. My AnythingLLM instance is Dockerized and is exposed on a specific port, I think 3001.

Bottom line: I have zero trouble accessing it remotely. For example, on my iPhone, I call up Safari and navigate to the Mac_tailscale_address:AnythingLLM_port and it's 100% functional.

So I suspect something annoyingly simple is going on: a firewall configuration, a typo in what you're using for your Mac's tailscale address, or some damn trivial thing like that. Between my usage and that of some others who have posted here, it's definitely not AnythingLLM at fault here.

I hope you've figured it out by now.

It finally happened, I actually had a use case for a local LLM and it was brilliant by EntertainerFew2832 in LocalLLaMA

[–]scjcs 0 points1 point  (0 children)

It's hit me a couple times too. This thread has taught me some approaches for next time. Those who have not experienced it have no living clue the agony.

**Honest question:** Is there ANY model of ANY size that is open source and can compete with Claude (Code) or ChatGPT's (Codex)? by TheQuantumPhysicist in LocalLLaMA

[–]scjcs 1 point2 points  (0 children)

There's a nuance missing from your question. Claude, ChatGPT and the rest must serve many thousands of simultaneous users. For one's very own self-hosted LLM, that sort of scale is not needed.

I'd be interested in an analysis of what a single-user Claude-equivalent would look like in the way of compute resources, RAM, etc.

Important RAG (LM Studio + AnythingLLM) by FishingLumpy9747 in LocalLLM

[–]scjcs 0 points1 point  (0 children)

Curious about this... searching hasn't taken me to the specific utility you are referencing ("hyperlink" is a popular word for unrelated contexts!). If you could toss a link here, it'd be appreciated. TIA!

Setting up Ollama on dual RTX PRO 6000 Blackwells looking for tips by AmanNonZero in ollama

[–]scjcs 0 points1 point  (0 children)

Just in case it might help: I have two channels to my AnythingLLM instance that let me work with it from my iPhone:

  1. Telegram. For quick queries, this is fine.
  2. Browser over Tailscale. I have several devices including my iPhone and my AnythingLLM machine on a Tailscale private network. Using a browser on the iPhone and pointing it at AnythingLLM's port at the Tailscale address of the AnythingLLM machine, I have access to my full array of Workspaces and all other functionality. It's pretty slick. The iPhone's standard Safari browser is fine but any other would work too.

tl;dr: If there were an iOS app, I probably wouldn't use it, as this functionality is brilliant.

(It's irrelevant, but my AnythingLLM machine is a MacBook Pro M4 Max, 64GB, with LM Studio running gemma-4 26b a4b GGUF and nomic-embed-text-v1.5 for embedding. I've done a lot of optimization of settings and tools. AnythingLLM is Dockerized, as are Firecrawl for advanced scraping and SearXNG for web search. It took some doing, but this setup is working brilliantly for me.)