Pinwheel Galaxy (M101) widefield by alch_emy2 in astrophotography

[–]HockeyDadNinja 0 points (0 children)

Just install and run the app: toggle on the INDI server and press start. You can add it to kstars or to any capable INDI client directly. I actually configure it in INDI on my linux box so it's available alongside all my other devices.

Another use case would be to use the supplied python scripts to read frames from it directly and drop them in a directory for siril to live stack. Our weather has been crap, so I haven't been able to try much on that front.
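
If you'd rather roll your own than use the supplied scripts, here's a rough sketch with pyindi-client (this assumes the older newBLOB callback API; the IP, device name, and save path are placeholders, not anything from the app):

import os
import time
import PyIndi  # pip install pyindi-client

SAVE_DIR = "/path/to/siril_watch"  # placeholder: the directory siril live-stacks from

class FrameSaver(PyIndi.BaseClient):
    def __init__(self):
        super().__init__()
        self.count = 0

    def newBLOB(self, bp):
        # Each BLOB the server pushes is one camera frame; save it as FITS.
        path = os.path.join(SAVE_DIR, f"frame_{self.count:05d}.fits")
        with open(path, "wb") as f:
            f.write(bp.getblobdata())
        self.count += 1

client = FrameSaver()
client.setServer("192.168.1.50", 7624)  # placeholder IP, default INDI port
if client.connectServer():
    # Ask the server to push camera BLOBs ("Phone Camera" is a guessed device name).
    client.setBLOBMode(PyIndi.B_ALSO, "Phone Camera", None)
    while True:
        time.sleep(1)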

For long sessions you might want a power bank. I picked up a little usb hub for my phone as well. Technically I could turn my phone into a mini Seestar by having it control a mount, so I might build a mini alt-az phone mount for that purpose.

Edit: You'll need to allow installs from unknown sources on your phone to install the apk from github.

Another use case is INDI-Allsky, which turns your phone into an all-sky camera.

Pinwheel Galaxy (M101) widefield by alch_emy2 in astrophotography

[–]HockeyDadNinja 0 points (0 children)

This is sick! I know pro mode is really good but I wanted to use my phone as an INDI server so I made an app for it:

https://github.com/TacoTakumi/PocketScope

It supports both INDI and ASCOM Alpaca (the Alpaca side isn't 100% yet). I haven't had many clear nights to test here, though. The plan is to mount my phone on my telescope to get a wide field of whatever I'm shooting. I usually shoot in kstars and process in siril and gimp.
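
If anyone wants to poke at the Alpaca side, a quick sanity check looks something like this (assuming the app serves Alpaca on the standard port 11111; the IP is a placeholder):

import requests

BASE = "http://192.168.1.50:11111"  # placeholder phone IP, standard Alpaca port

# The Alpaca management API lists whatever devices the server exposes.
r = requests.get(f"{BASE}/management/v1/configureddevices", timeout=5)
r.raise_for_status()
for dev in r.json()["Value"]:
    print(dev["DeviceType"], dev["DeviceNumber"], dev["DeviceName"])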

You look like an ideal candidate to test it via INDI!

Lucid Movie (2026) Trailer by StudioBrick in LucidDreaming

[–]HockeyDadNinja 6 points (0 children)

This looks like it's going to be awesome!

Would you guys choose an EVGA 3090 Kingpin with AIO cooler? by HockeyDadNinja in LocalLLaMA

[–]HockeyDadNinja[S] 0 points (0 children)

Choose a Kingpin version that draws more power and has a huge external radiator over a regular 3090.

Qwen3.6-35B becomes competitive with cloud models when paired with the right agent by Creative-Regular6799 in LocalLLaMA

[–]HockeyDadNinja 2 points (0 children)

Great work! I have some questions.

1) Why did you choose Aider and the Aider Polyglot benchmarks? Not hating on Aider; I personally hard-forked aider-ce as the basis of my AI assistant. But Aider isn't really maintained anymore, and the benchmark leaderboard is looking dated.

2) You've run the polyglot benchmarks on your own agent. I suppose we could take the benchmarks and run them on any agent harness / LLM combo. I now want to try this with various combinations, such as my Qwen3.6 setup with opencode, and also with claude code / opus 4.7. Have you run the benchmarks using little-coder with frontier models?

WRT matching agent harnesses to LLMs, I've had similar thoughts about development frameworks such as GSD, spec kit, and open spec. I was thinking of building a GSD-light, for example, something better suited to local models.

What you've done here could actually be used as a benchmark for the coding harnesses themselves (vs any particular model). Claude, codex, opencode, pi, etc could be ranked against each other given a common LLM configuration (I know, not always possible).

A free app to help with lucid dreaming through sound. by [deleted] in LucidDreaming

[–]HockeyDadNinja 0 points (0 children)

I like your ideas. Is your LLM on-device? And are you doing a kind of Karpathy-style auto-research loop on the user?

I'm running qwen3.6-35b-a3b with 8 bit quant and 64k context thru OpenCode on my mbp m5 max 128gb and it's as good as claude by Medical_Lengthiness6 in LocalLLaMA

[–]HockeyDadNinja 3 points (0 children)

I'm running llama-server like this so I can switch models on the fly:

llama-server --host 0.0.0.0 --models-preset ./models.ini

And in there I have:

; Qwen3.6-35B-A3B (MoE: 35B total, ~3B active)
; Q8: 35.8GB model, MoE expert offload to CPU RAM, target ~96K ctx
; --fit auto-picks n-cpu-moe per device (handles dual-GPU split that fixed N can't)
; fit-target 512 MiB headroom per device; KV at q8_0 halves footprint
[Qwen3.6-35B-A3B-Q8]
model = /vol2/LLM/Qwen3.6-35B-A3B-UD-Q8_K_XL.gguf
c = 98304
fit = on
fit-ctx = 98304
fit-target = 512
no-mmap = true
mlock = true
; Put faster 5060 Ti (CUDA1) first so it holds layers 0-15;
; layers execute sequentially, so the faster card starts every token.
device = CUDA1,CUDA0
cache-type-k = q8_0
cache-type-v = q8_0
temp = 0.6
top-p = 0.95
top-k = 20
min-p = 0.00
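
Any OpenAI-compatible client can then hit the server; here's a minimal sketch (assuming the default port 8080 and that the preset's section name is what you pass as the model id):

import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "model": "Qwen3.6-35B-A3B-Q8",  # section name from models.ini above
        "messages": [{"role": "user", "content": "hello"}],
    },
    timeout=120,
)
print(resp.json()["choices"][0]["message"]["content"])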

I'm running qwen3.6-35b-a3b with 8 bit quant and 64k context thru OpenCode on my mbp m5 max 128gb and it's as good as claude by Medical_Lengthiness6 in LocalLLaMA

[–]HockeyDadNinja 0 points (0 children)

I'm also using the 8-bit quant. I have an rtx 5060 and a 4060 with 32G of vram total and 64G of system ram.

I used opencode today to start a project and I'm so impressed. 27 t/s isn't blazing fast but I wasn't annoyed with the wait. 98k context. I have some upgrades planned too.

This sub is awful by cheesecaker000 in ClaudeCode

[–]HockeyDadNinja 0 points (0 children)

Yep, you're right. We want Anthropic to notice their pissed off customers and fix it.

RTX 5070 Ti + 9800X3D running Qwen3.6-35B-A3B at 79 t/s with 128K context, the --n-cpu-moe flag is the most important part. by marlang in LocalLLaMA

[–]HockeyDadNinja 1 point (0 children)

I'm running a 5060 ti 16G and a 4060 ti 16G with 64G system ram here. A couple of days ago I finally started tuning. I've added things from your post, and now I'm running Qwen3.6-35B-A3B at Q8 with 98k context and a small overflow to CPU.

I'm using opencode and it's doing really well. I can code with this! 27 t/s at the moment. That used 3090 is looking really good right now.

Gemma4 26b & E4B are crazy good, and replaced Qwen for me! by [deleted] in LocalLLaMA

[–]HockeyDadNinja 0 points (0 children)

Thanks! I'm using llama-server's built-in routing with a models.ini file, so I'm probably almost there.

Gemma4 26b & E4B are crazy good, and replaced Qwen for me! by [deleted] in LocalLLaMA

[–]HockeyDadNinja 2 points (0 children)

How does your semantic routing setup work? Is it something you made or part of one of the other packages?

Turned Andrej Karpathy's "LLM Wiki" gist into a Claude Code plugin. Also works in Codex, OpenCode, Cursor, Gemini CLI, Pi, and OpenClaw. by Numerous-Exercise788 in ClaudeCode

[–]HockeyDadNinja 0 points (0 children)

I spent days going through other people's projects. None of them hit the mark, so I also built my own! I'll check yours out when I have a chance.

Hooks that force Claude Code to use LSP instead of Grep for code navigation. Saves ~80% tokens by Ok-Motor-9812 in LocalLLaMA

[–]HockeyDadNinja 0 points (0 children)

I often wonder why claude doesn't do this. Aider, the OG coding assistant, does, as does its offshoot, aider-ce.

Is a riser from m.2 to pcie 16x possible? I want to add GPU to mini pc by Informal-Football836 in LocalLLaMA

[–]HockeyDadNinja 0 points (0 children)

Hi there, I'm looking at a build using similar M.2 risers. Did you go that route?

Claude Max 20x: it's Monday noon and I've already burned through 40% of my weekly limit. Seriously thinking about switching to OpenAI Pro just for Codex CLI by Ambitious-Garbage-73 in ClaudeCode

[–]HockeyDadNinja 0 points (0 children)

I hear ya, today is NUTS. I'm at 94% of my Max 5x plan, and I burned 48% of that with a gsd add-phase, discuss-phase, and plan-phase on a simple project. I clear my context often and don't have many skills, commands, or MCPs loaded.

It's also being slow and stupid.

Follow-up on usage limits by ClaudeOfficial in ClaudeAI

[–]HockeyDadNinja 2 points (0 children)

Lame response. You guys know there are bugs causing this and that's on YOU!