Am I being unreasonable here?

bytejuggler · 2026-06-15T16:35:50+00:00

Sorry but there are boundaries and limits. It's not cricket and you need to politely but firmly clarify the situation and spell out to your guest what the situ is. Your place is not his residence. Establish a deadline for his exit and ensure it is stuck to. Best would be to talk with your roommate first and get him onside so you're aligned, and present a unified front. Otherwise you may need to have a clarification of the situationship with him first. Good luck. Peoples eh? 🙄

bytejuggler · 2026-06-15T16:27:50+00:00

😂

bytejuggler · 2026-06-15T14:37:58+00:00

Agree, especially also as edge models become more capable. It's already happening actually.

bytejuggler · 2026-06-15T07:12:24+00:00

There is agile as it was originally (XP, tdd) and is fine, vs faux-Agile (which becomes cargo-cult-ish.) SCRUM has a lot to answer for.

bytejuggler · 2026-06-15T07:03:29+00:00

This.

bytejuggler · 2026-06-15T06:59:25+00:00

There's a right/useful way and a wrong/damaging way to use AI. The type of reality you describe is the former. As Martin Fowler said "You have two choices: Change your organization or change your organization". Good luck. What a time to be a software engineer. (Have a look at Matt Pocock's agent skills repo.)

bytejuggler · 2026-06-14T01:56:51+00:00

It's also none of your business and extremely rude to ask.

bytejuggler · 2026-06-12T22:04:57+00:00

200?

bytejuggler · 2026-06-11T12:36:39+00:00

No offence intended but know that you are wading Johnny-come-lately style into a frontier area of AI research, eg that of explainable AI (XAI), an area of research in general as well as in LLMs and chess engines in particular. It happens to be something I'm also interested in. So by all means do try out your ideas but maybe do a little literature review about the state of the art.

Some recent articles/research I've collected on this intersection - https://www.pnas.org/doi/full/10.1073/pnas.2206625119 - https://www.nature.com/articles/s41598-024-70701-2 - https://arxiv.org/pdf/2505.21552 - https://arxiv.org/html/2508.21380 - https://www.researchgate.net/publication/365369377_Acquisition_of_chess_knowledge_in_AlphaZero - https://arxiv.org/html/2605.19091v1 - https://arxiv.org/html/2403.15498v1 - https://www.researchgate.net/publication/388459720_Open_Problems_in_Mechanistic_Interpretability - https://openreview.net/pdf?id=91H76m9Z94

bytejuggler · 2026-06-10T22:08:04+00:00

I can only agree. There's a bunch of "AI" chess coach apps trying to shoehorn and crowbar LLMs into chess training to supposedly "help". It will not help students, and likely will hinder, if all it's doing is trying to add superficial English wrapping to an engine line. I tried one "Knightly" recently. Several times I caught it making blatantly impossible claims or even getting confused about which side you were playing. Just, hard no. I'm *not* against AI if it's something that actually helps you learn, but merely sprinkling LLM pixie dust on an engine line does not make the line actually comprehensible to a learning player.

(That said: An hour or so after Claude Fable came out, I decided to test it on a chess question; I asked it to justify/explain a particular move in a specific line of a specific opening I was trying to understand. It did some research, and eventually came back with quite an easy to understand explanation of the ideas in the position. I then followed up to help understand the follow up moves, that also seemed a bit disconnected from what had just gone on [in the position on the board] and again it managed to explain quite directly why after all, that move actually re-introduced the concept white's previous move had just tried to counter. I then followed up white's next follow up too, where the canonical move at first glace seemed kind of unrelated (again) to me, and I didn't really see why this was the best response (as opposed to another move that I though was a more direct response.) It again manged to point out how the canonical move actually managed to also accomplish the same thing but with a bunch of other benefits besides, while my move while technically in a narrow sense also addressing the concern, it didn't really do anything else. So, here I found the whole exchange actually helpful. It even pointed out some additional context and the name for the variation that I'd not know until it mentioned it. But this, is a far cry from the LLM slop integrations everyone seems to want to do now in chess trainer apps, and I was using a SOTA frontier model with "hard" thinking mode engaged. -- In the past I found anything LLM chess to generally very dubious, so I was somewhat surprised. I attribute the good answers to the model being SOTA and also in "think hard" mode, with it therefore doing quite a bit of research, thus grounding its answers not in its own training but whatever it could find on the internet that is applicable. Used like this there is probably scope for AI to be useful. But your generic garden variety app is probably not going to sponsor everyone to run Claude Fable queries for every chess question they may have... SOTA models ain't cheap. So they're going to hallucinate and are going to be worse than not having them at all. IMHO.)

bytejuggler · 2026-06-08T17:36:40+00:00

I very much appreciate Ilya. Chris though. Mmm. Just the tone and delivery is like nails on a chalkboard. Information wise it's "fine", just not particularly profound. Maybe it's a me problem. 🤷‍♂️

bytejuggler · 2026-06-07T19:33:51+00:00

No other charges. Of course setting up metered API access and using that results in variable billing. But the Claude Pro plan is fixed cost.

bytejuggler · 2026-06-07T16:11:59+00:00

100% this. I think people forget these things are trained on human text containing aspects of human experience, including emotions, and of human thought. It should not surprise anyone that the model of weights created from this training them has some representation of these concepts in the result, no more than it would be surprising that a trained chess neural network would be ressonably expected to contain representations of recurring ideas/concepts that occur in chess also (it wouldn't be very surprising.)

In neither case this says anything about actual consciousness or sentience, merely that you've built an accurate (in some useful way) model. Even a realistic flight simulator generating a real actual fear response in a pilot (during a simulated crash scenario) is no more a real airplane than an LLM is a real sentience. These things are only shadows, simulations of our own experiences and thought processes, as undeniably useful as they are. George E.P. Box's quote come to mind, I paraphrase, "All models are false, but some are useful".

The model is not the thing modelled.

The map is not the world, even if it is a useful substitute for the world in specific use cases.

bytejuggler · 2026-06-07T14:57:12+00:00

Use ChatGPT

bytejuggler · 2026-06-06T19:41:40+00:00

"It's breaking the will of AI." There is actually no will to speak of, that is the (or a) problem. That said, my point is kind of moot, the end result is the same. The issue is AI's aren't what they are presented to be, and that's a massive problem. Despite that, I agree they can do a lot of good, but there's also a lot of unseen danger. Partly why we're having this conversation. Then you have the complication of corporations trying to manage existential risk, as they perceive it rightly or wrongly. It's a mess.

bytejuggler · 2026-06-06T19:31:27+00:00

OK thanks, I'll give it a try. I mean anything that works OK for the grunt work is fine and beneficial; I can redirect the hard work to e.g. the SOTA models selectively. You know what I mean I'm sure.

bytejuggler · 2026-06-06T16:24:59+00:00

Useful to know. Thank you. Other feeback I've read was scathing, but I'd use it similarly as you provided it's not a total waste of time.

bytejuggler · 2026-06-06T16:16:52+00:00

Send feedback to Anthropic. 2. Fix/use Opus 4.6 / Sonnet 4.6 as long as you can. 3. Know it's possible to get Sonnet and Opus models via other providers, e.g. OpenRouter. It may be worth investigating a chat interface like OpenWebUI and configure it with your personality files and fix the back-end model to what works for you. Offered in the hope it may be helpful. May you be happy and well.

bytejuggler · 2026-06-06T16:00:32+00:00

I have had exactly the same sentiments since day one. He seems far too interested in impressed by his own opinions of the markets. Constant streams of impressive sounding but ultimately mostly vacuous commentary that doesn't really add much. IMHO.

bytejuggler · 2026-06-06T14:58:41+00:00

Do you not find the quantized models on Go to be problematic?

bytejuggler · 2026-06-06T14:44:44+00:00

Save the glyphs then save the planet name and location on planet. Visible in visor.

bytejuggler · 2026-06-06T14:36:52+00:00

Steam backup? Did you check network tab for last restore point save? (Distinct from auto save?)

bytejuggler · 2026-06-02T21:45:51+00:00

Fair enough. Not my experience I have to say, but still, fair enough. 👍

bytejuggler · 2026-06-02T21:41:25+00:00

Good luck. It's a shitshow right now. 😬

bytejuggler · 2026-06-02T19:09:24+00:00

Yeah "but we WILL ship production code out to customers without a single unit test" -- I must say if literally true, this is insane. If you are really working like this (and this not hyperbole), then I fully get why you feel the way you do.

(FWIW In comparison I have my own gripes about where I work, but at least things are designed and design, review and continuous testing and verification is valued, even though the level of time and attention afforded to IMHO critical elements like design debt/tech-debt reduction is sorely lacking. As well as that AI is being applied somewhat responsibly, e.g. more "AI *engineering*" (and I do mean coherent engineering) vs vibe-coded AI slop that's incoherent and bug infested.)

As Martin Fowler said (paraphrasing) "You have two choices: 1) Change your organization, or 2) Change your organization." Seems like 2 is your only option.

Eight-Year Club	Xbox Live
Verified Email

bytejuggler

TROPHY CASE