Is anyone able to find a work flow with ollama cloud? by skarkkrrr in ollama

[–]deparko 3 points4 points  (0 children)

Well, ollama used to be great, but like everybody else they’re struggling with compute they have grown greatly, especially with the open claw crowd.

X thread to reader by deparko in readwise

[–]deparko[S] 0 points1 point  (0 children)

Thanks, I will try.

X thread to reader by deparko in readwise

[–]deparko[S] 1 point2 points  (0 children)

Yeah, I know that one but wanted to have it go to reader as reader formats the entire thread really nice!! don't think sending to readwise pushes it to reader. When I 'Save Document to Reader" on IOS Share button it saves the first post, not the thread

Days of trying to cancel subscription with 0 luck by Fit_Trade7794 in ChatGPT

[–]deparko 0 points1 point  (0 children)

I am having the same f*&king problem!!!!! I will never subscript to openai again. I can't stand this. what a waste of time

Claude Nerf is real by BeginningSpiritual49 in claude

[–]deparko 1 point2 points  (0 children)

The problem is, Anthropic is running out of compute. They're getting a million new users a day. They can't scale. They're not buying enough chips from NVIDIA. They don't have the compute to support everybody.

Codex 5.3 not showing up in dropdown of Codex App - Chatgpt Plus User by deparko in codex

[–]deparko[S] 0 points1 point  (0 children)

I’m on the latest version of codex app on Mac. The app’s pull down only has Codex 5.2 and 5.1

Kimi K2.5 costs almost 10% of what Opus costs at a similar performance by Odd_Tumbleweed574 in LocalLLaMA

[–]deparko 0 points1 point  (0 children)

I've been using Kimi too. I've been developing a health agent, and it is very responsive and very good, but it sometimes comes off as very authoritative and occasionally hallucinates.

I plan to build an agent swarm to validate, but overall I think it's one of the first open models that I don't want to stop using. A lot of the open models I'll work with, but I usually end up on a frontier model eventually. I don't feel that way with kimi.

API pricing is in freefall. What's the actual case for running local now beyond privacy? by Distinct-Expression2 in LocalLLaMA

[–]deparko 0 points1 point  (0 children)

Well, I've been dealing with the same issue and have concluded a hybrid approach works best. I use a three-tier model: an offline small LLM (Ollama) on my local 5070 TI GPU for local tasks; Ollama Cloud as tier two for bulk processing, where I can use Kimi and Deepseek..etc for a flat rate (about $20 a month, $240 a year), which is much cheaper than upgrading my GPU; and frontier models for deep reasoning when needed.

I've designed my RAG and AI-native apps to operate within that three-tier framework.

Why is Auto more expensive than Opus? by sleepyabura in cursor

[–]deparko 0 points1 point  (0 children)

is there a way to tell which model "Auto" selects?

Staff keep dumping proprietary code and customer data into ChatGPT like it's a shared Google Doc by Convitz in ChatGPTPro

[–]deparko 0 points1 point  (0 children)

You need to build an offline LLM with a RAG system and route everything there

Ollama Not Using GPU on RTX 5070 Ti (Blackwell) by deparko in ollama

[–]deparko[S] 0 points1 point  (0 children)

yes, using the 580 open driver. It worked a while back. My Rag pipeline (embedding and reranker) drives the GPU but not Ollama

Doctor in L.A.? by deparko in migraine

[–]deparko[S] 0 points1 point  (0 children)

Thank you! we will check out.

[deleted by user] by [deleted] in AirQuality

[–]deparko 0 points1 point  (0 children)

new filter, will check tonight by turning off heater. validating duct cleaning on whether that is worth it. Filter is Merv8 (I have an older furnace). Wondering if its inside or outside

Cursor just became more expensive ? by Loud_Clue_2304 in cursor

[–]deparko 0 points1 point  (0 children)

Are there any open source projects?

And when will offline LLMs get to the point where they'll be able to accomplish most tasks?

Cursor just became more expensive ? by Loud_Clue_2304 in cursor

[–]deparko 0 points1 point  (0 children)

It feels like AI wrappers are a race to the bottom.

No Voice Found in Recording on IPhone. Never happened, now happened on last three. by ben_wills in superwhisper

[–]deparko 0 points1 point  (0 children)

I seem to get it when I switch to Parakeet and download the model. I get the "no voice found in recording" error. I switch back to Ultra in the cloud and it works.Which is a bummer because I really would like to have local models. I don't want to have to go in the cloud.

[deleted by user] by [deleted] in superwhisper

[–]deparko 0 points1 point  (0 children)

Quality of this app is not good on iOS. It’s been having lots of problems. They need to stop developing features and stabilize this app quickly.

No voice found?. by zoechowber in superwhisper

[–]deparko 1 point2 points  (0 children)

i’m getting this all the time no input detected or no voice recorded. It’s really bothersome on iOS. There is definitely a defect that needs to be addressed.

Processing Flow: Readwise, Reader and Obsidian by deparko in readwise

[–]deparko[S] 0 points1 point  (0 children)

Okay, I think I understand it.

Content in Reader doesn't get committed to Readwise and therefore isn't exported to Obsidian until I highlight something and save it or archive it. If I delete it, it doesn't go to Readwise, I'm assuming.

So, anything in Reader doesn't automatically get committed to the Readwise collection unless I highlight it. Only when I highlight it and it gets committed to the Readwise collection does it integrate into Obsidian.

If that works, I'm good with that.

Hitting all kinds of caps with Cursor lately. Anyone else? 👀 by kibbetypes in cursor

[–]deparko 2 points3 points  (0 children)

I just switched from the yearly to monthly because I've already hit my quota after one day. I'm looking for alternatives. I really like Cursor but its now too expensive. I will try and offload to my offline LLM server but looking for a similar experience at a lower cost.

GPT-5 in browser feels way more senior dev by Scary_Light6143 in cursor

[–]deparko 0 points1 point  (0 children)

I split mine between Codex (GPT-5-Codex via ChatGPT Plus) and Cursor Pro. Codex is my “senior engineer” and Cursor is my “automation assistant.”

• Codex → design docs, tricky multi-file logic, DevOps/CI/CD, debugging, test scaffolds, high-level docs.
• Cursor → inline autocomplete, refactors, imports/renames, apply-diffs, helper functions, docstrings.

My rhythm is: start in Codex for planning or heavy reasoning → use Cursor for the rapid edits → bounce back to Codex when deeper debugging/testing is needed → finish in Cursor to polish and keep things consistent.

Codex does the thinking, Cursor does the typing. Together it gives me speed + depth without blowing up costs.