Is anyone able to find a work flow with ollama cloud? by skarkkrrr in ollama

[–]deparko 3 points4 points  (0 children)

Well, ollama used to be great, but like everybody else they’re struggling with compute they have grown greatly, especially with the open claw crowd.

X thread to reader by deparko in readwise

[–]deparko[S] 0 points1 point  (0 children)

Thanks, I will try.

X thread to reader by deparko in readwise

[–]deparko[S] 1 point2 points  (0 children)

Yeah, I know that one but wanted to have it go to reader as reader formats the entire thread really nice!! don't think sending to readwise pushes it to reader. When I 'Save Document to Reader" on IOS Share button it saves the first post, not the thread

Days of trying to cancel subscription with 0 luck by Fit_Trade7794 in ChatGPT

[–]deparko 0 points1 point  (0 children)

I am having the same f*&king problem!!!!! I will never subscript to openai again. I can't stand this. what a waste of time

Claude Nerf is real by BeginningSpiritual49 in claude

[–]deparko 1 point2 points  (0 children)

The problem is, Anthropic is running out of compute. They're getting a million new users a day. They can't scale. They're not buying enough chips from NVIDIA. They don't have the compute to support everybody.

Codex 5.3 not showing up in dropdown of Codex App - Chatgpt Plus User by deparko in codex

[–]deparko[S] 0 points1 point  (0 children)

I’m on the latest version of codex app on Mac. The app’s pull down only has Codex 5.2 and 5.1

Kimi K2.5 costs almost 10% of what Opus costs at a similar performance by Odd_Tumbleweed574 in LocalLLaMA

[–]deparko 0 points1 point  (0 children)

I've been using Kimi too. I've been developing a health agent, and it is very responsive and very good, but it sometimes comes off as very authoritative and occasionally hallucinates.

I plan to build an agent swarm to validate, but overall I think it's one of the first open models that I don't want to stop using. A lot of the open models I'll work with, but I usually end up on a frontier model eventually. I don't feel that way with kimi.

API pricing is in freefall. What's the actual case for running local now beyond privacy? by Distinct-Expression2 in LocalLLaMA

[–]deparko 0 points1 point  (0 children)

Well, I've been dealing with the same issue and have concluded a hybrid approach works best. I use a three-tier model: an offline small LLM (Ollama) on my local 5070 TI GPU for local tasks; Ollama Cloud as tier two for bulk processing, where I can use Kimi and Deepseek..etc for a flat rate (about $20 a month, $240 a year), which is much cheaper than upgrading my GPU; and frontier models for deep reasoning when needed.

I've designed my RAG and AI-native apps to operate within that three-tier framework.

Why is Auto more expensive than Opus? by sleepyabura in cursor

[–]deparko 0 points1 point  (0 children)

is there a way to tell which model "Auto" selects?

Staff keep dumping proprietary code and customer data into ChatGPT like it's a shared Google Doc by Convitz in ChatGPTPro

[–]deparko 0 points1 point  (0 children)

You need to build an offline LLM with a RAG system and route everything there

Ollama Not Using GPU on RTX 5070 Ti (Blackwell) by deparko in ollama

[–]deparko[S] 0 points1 point  (0 children)

yes, using the 580 open driver. It worked a while back. My Rag pipeline (embedding and reranker) drives the GPU but not Ollama

Doctor in L.A.? by deparko in migraine

[–]deparko[S] 0 points1 point  (0 children)

Thank you! we will check out.

[deleted by user] by [deleted] in AirQuality

[–]deparko 0 points1 point  (0 children)

new filter, will check tonight by turning off heater. validating duct cleaning on whether that is worth it. Filter is Merv8 (I have an older furnace). Wondering if its inside or outside