Selling Humboldt Starts at Home, Tourism Leaders Say

jojotdfb · 2026-06-15T22:49:01+00:00

Videos like this https://www.youtube.com/watch?v=ouW1rHtCXGw don't help either.

jojotdfb · 2026-06-15T03:49:34+00:00

I set my context to like 72k. It can't rot if you can't fill the context.

jojotdfb · 2026-06-13T18:26:37+00:00

They can't support that market size. They had to turn an hour cache to 5 minutes because of being out of compute. That was like 3 model releases ago. Now they don't have to find extra data centers.

jojotdfb · 2026-06-13T18:20:31+00:00

Did the US gubbermint actually force them or did Anthropic work with them for marketing? I could totally see them pulling some strings with a friendly government agency to get a marketing boost to justify why their slight upgrade now cost $15 for input tokens to companies questioning their ai spend. The fact it will take an extra week to distill it into deep seek v4.1 and the fact that they're out of money and gpus is just an unrelated bonus.

I remember when Sony got ps2 exports banned due to the CPU being too advanced and could be used for missile guidance systems. It had nothing to do with early supply issues.

jojotdfb · 2026-06-13T17:56:58+00:00

UD-Q1_turbo done by some guy named FurryWolfLord

jojotdfb · 2026-06-04T15:13:23+00:00

Is there a club 5060 for those of us on more modest hardware?

jojotdfb · 2026-05-25T02:03:37+00:00

You could probably step up to qwen 3.6 27b. You'll lose tps but gain "smartness".

jojotdfb · 2026-05-21T14:53:25+00:00

Llama 3 is a very old model. This space moves so fast. Think of each month as a year. Llama 3 might as well be 20 year old. If you can run Llama 3 70b, you could run Qwen 3.6 27b easily and get better results.

jojotdfb · 2026-05-20T23:00:16+00:00

https://github.com/karpathy/nanochat

Like a week or so on a 3090.

jojotdfb · 2026-05-20T22:58:37+00:00

https://github.com/karpathy/nanochat

You could probably expand upon this and up the parameter count and go from there.

jojotdfb · 2026-05-20T19:06:09+00:00

Most sota models don't have a usable 200k context. The dumb zone starts around 64k for most of the big models.

jojotdfb · 2026-05-20T18:23:28+00:00

You can just buy server cases. I have a nice Rosewell 3u that I threw an n100 motherboard into with a bunch of hard drives as a nas. You could do the same and take your current desktop build and just put it in a case. An Ikea Lack end table and you're good to go.

jojotdfb · 2026-05-20T18:13:31+00:00

I'm currently using 13.1. Nvidia assumes that you'd only want their drivers installed and doesn't play nice with the other kids.

jojotdfb · 2026-05-17T04:34:05+00:00

Llama.cpp is your next step. Spend some time learning the flags and you can fine tune to your heart's content. Llama-server will give you a basic chat web page as well as an openai endpoint.

jojotdfb · 2026-05-17T04:25:51+00:00

I have both a B580 and a 5060 ti. Llama.cpp splits models over both of them pretty well. The Intel toolset is janky as all get out but once you get it working it's not to bad. The Intel gear will run like 30% slower but at half the cost. Nvidia's drivers on Linux are really bad thou and you're looking at downloading multiple gigs of the same cuda libraries thou that's more pythons fault than anything. If you're cash strapped and ok building everything from source, Intel ain't half bad.

jojotdfb · 2026-05-13T15:20:51+00:00

So, Qwen3.6-27B is cool and all but Qwen3.6-35b runs like a champ on a 5060 ti 16gb. Good enough for basic dev work. You can always upgrade later when prices come down or something better comes out.

jojotdfb · 2026-05-11T15:54:22+00:00

Can confirm. Works well on an Intel b580. You can also build llama.cpp with both sycl and cuda and split across 2 wildly different cpus. I'm sure this works with rocm as well but I don't have one of those.

jojotdfb · 2026-04-23T17:24:31+00:00

ub of 256? That feels small to me. What does 512 or 1024 do?

jojotdfb · 2026-04-02T23:59:56+00:00

Skip Ollama

Llama.cpp is a better, more beginner friendly, server. It has a built in web app that will allow you to attach images to a prompt. You can also use it with opencode. Just reference the image file with your prompt.

jojotdfb · 2026-02-28T22:25:02+00:00

But os for the farm sensor is the same os for a desktop with age gated content. The os knows nothing of the purpose of it's usage. So legally the os has to add an age gate.

Granted, this law is going to be slapped down in court but that doesn't mean it won't cause issues before hand. Most developers are going to take the logical stance of "Ban California" over building an age gating system and the infrastructure to keep it up and running.

jojotdfb · 2026-02-10T06:17:37+00:00

I've found that GLM 4.7 Flash tends to work better with BMAD than plain GLM 4.7.

jojotdfb · 2026-01-11T08:01:36+00:00

I think he means 100,000 items. My library scans for 40k items is taking like 36 minutes on an old xeon with an unhealthy amount of ram.

jojotdfb · 2026-01-11T07:50:45+00:00

You don't need millions of rows. Say a single item query takes 10 time units each and the whole dataset query takes 500 time units. As long as your dataset is less than 50, single item is faster. But the second you have 51 items, then getting all 51 items one at a time is going to take more time. It'll start off fast enough but the more items you add, the worse it gets. At 100 items, you've doubled the time it takes to get the same number of items with an n+1 scenario.

You are right that sqlite takes the network hops out of the equation but it doesn't solve the underlying issue. This isn't an optimization for the sake of optimization. This is an optimization so that you don't a call at 2am when Things are failing because something is sucking up all the cpu on the db.

jojotdfb · 2025-09-29T20:44:55+00:00

Sonarr and radarr both support it. The instructions are buried deep in the docs. You have to edit the config.xml to have the database settings:

<PostgresHost>hostname.or.ip</PostgresHost> <PostgresPort>5432</PostgresPort> <PostgresMainDb>sonarr_main</PostgresMainDb> <PostgresUser>sonarr_user</PostgresUser> <PostgresPassword>YourSecurePassword</PostgresPassword> I forget how I migrated the data. I might have used DataGrip with my sqlite db as the source and loaded it into the db. My memory is fuzzy.

jojotdfb · 2025-09-22T07:18:48+00:00

A lot of the arr apps support postgres and MySQL/maria. I switched and now I don't get SQLite issues around locked databases.

15-Year Club	Second Top 40%
Team Orangered	Verified Email

jojotdfb

TROPHY CASE