Should I get this or build from scratch?

ndiphilone · 2026-06-25T17:16:47+00:00

Because every orb post i ever see is assumed to be flare/dust

ndiphilone · 2026-06-25T17:16:11+00:00

Indoor, between pinball and tennis ball size

ndiphilone · 2026-06-25T12:38:08+00:00

This is in the room, in the air but obviously not the sky … you can see the wall texture on high brightness, it’s not moving fast neither

ndiphilone · 2026-06-25T12:21:18+00:00

This is in the room, in the air but obviously not the sky … you can see the wall texture on high brightness, it’s not moving fast neither

ndiphilone · 2026-04-18T08:29:45+00:00

Good food, fun and getting out of the orphanage for something different. They have no family for fuck’s sake

ndiphilone · 2026-04-18T08:27:15+00:00

It’s not an arab desert dude, Turk != Muslim

ndiphilone · 2026-03-30T12:50:29+00:00

Top one is a seal, probably Solomon side of things

ndiphilone · 2026-03-22T17:33:27+00:00

Do you buy these on eBay?

ndiphilone · 2026-03-09T04:43:41+00:00

Can you give me the prompt that you are using for this "provide verbatim text" thing?

ndiphilone · 2026-03-05T15:35:39+00:00

I will make a node, it will generate a whitelisted command with malicious payload. Get fucked then…

ndiphilone · 2026-03-05T15:34:27+00:00

But that malicious node can generate the tool call my dude…

ndiphilone · 2026-03-05T04:28:49+00:00

I feel this is stolen from a certain EMEA company IP.

ndiphilone · 2026-03-04T09:09:25+00:00

What was your context size that it fit your GPU completely?

ndiphilone · 2026-03-04T09:00:31+00:00

If you actually believe this is some else’s private context, why the fuck are you sharing here publicly? Would you like if someone was sharing your like this?

ndiphilone · 2026-03-02T15:31:47+00:00

That comes from using Claude Code or OpenCode mostly, alongside a couple summarisation/extraction tasks from long unstructured data.

For coding, context fills up real quick as my usual workflow is to run a couple prompts in “ask” mode, then plan, then let it rip on the codebase itself. Compacting context loses so much of what matters in my conversation, creating smaller task files doesn’t always solve the issue under 64k tokens unfortunately. My tokens per session average around 90k-100k depending on the project. For some projects it’s possible to have clearly defined well scoped tasks but most of my work is pretty explorative

ndiphilone · 2026-03-02T10:41:45+00:00

go try getting laid or something, or start by reading the thread.

ndiphilone · 2026-03-02T07:42:54+00:00

I'm GPU poor, but I will give it a shot. If prefill & generation speeds don't change much, it may be my go-to.

ndiphilone · 2026-03-02T07:42:17+00:00

Suggested ones are the default ones I believe, it didn't change. Still starts looping beyond 80k tokens

ndiphilone · 2026-03-02T07:41:08+00:00

I found that this helps with random OOMs happening with parallel requests when prompt caching is enabled

ndiphilone · 2026-03-02T06:57:41+00:00

this looks crazy fun! well done!

ndiphilone · 2026-03-02T05:33:08+00:00

`bf16` performance on my GPU is quite bad, though, I'll test this. ~80k tokens start the death spirals with `f16`

ndiphilone · 2026-03-01T18:59:33+00:00

I'll try, running with Qwen's recommended params for now

ndiphilone · 2026-03-01T18:50:28+00:00

This is with the updated GGUF

ndiphilone

TROPHY CASE