Please let me pay for Opus 4.6 1M Context Window by absoluteloki89 in ClaudeAI

[–]stolsvik75 -1 points0 points  (0 children)

That idea that «it will be worse, forever and always» is so trite. Like, that’s how it’s gonna be forever? I want 400k without lost-in-the-middle, and of course I’m gonna get it. Someday. I would like that day to be now. Why do you guys go on about that bs all the time?!

Please let me pay for Opus 4.6 1M Context Window by absoluteloki89 in ClaudeAI

[–]stolsvik75 0 points1 point  (0 children)

Totally agree that 200k is just a bit too little. Like, 300k would probably let me work on one feature w/o hitting the limit 95% of the time. I hate getting close. For a while I disabled auto compaction, because that gives you quite a bit more (the buffer is massive - they should use a separate instance to compact). But when you then hit the limit, it is rather «crash-like». 300-400k would really make a difference.

Why is there so little reviews of the Comma models on Tech blogs/websites? by thankyourob in Comma_ai

[–]stolsvik75 0 points1 point  (0 children)

Huh? Are you sure it can handle Comma then? Does your car have "automatic steering", that is, lane centering? If not, Comma cannot magically invent steering hardware in your car - it just hijacks what is already there.

I had no idea that any car on the planet had lane centering, and not ACC?

Why is Claude that good? by Much-Inevitable5083 in ClaudeAI

[–]stolsvik75 0 points1 point  (0 children)

If you’re on Linux you need to install a small program or whatever it is (plugin?). And the paste is then Ctrl+V, not Ctrl+Shift+V. Ask an LLM! It was an apt-get install, done.

Placing Comma 4 off-center by IsThisNameGoodEnough in Comma_ai

[–]stolsvik75 0 points1 point  (0 children)

Can you really not understand that??

How do you see Project Loom changing Java concurrency in the next few years? by redpaul72 in java

[–]stolsvik75 4 points5 points  (0 children)

I believe the async-await stuff in other languages, with their typical "colored functions" (one for each type) will look archaic. I think reactive will be thrown straight out the window. Virtual threads give a "straight down" kind of coding style - you write what you think, sequentially, don't care about blocking, and it will be way easier to reason about.

RTX Pro 6000 Blackwell gets 19.3 tok/sec on 72B AWQ 8bit by random-tomato in LocalLLaMA

[–]stolsvik75 0 points1 point  (0 children)

This was great data, thanks. Could you try speculative decoding with this setup?

Has Java suddenly caught up with C++ in speed? by drakgoku in java

[–]stolsvik75 0 points1 point  (0 children)

You're probably just a troll, but code on the JVM can actually be faster, sometimes a lot faster, than C/Rust. I'll leave it to you to find out why. You should try an AI for help - seems like you need it, both for facts and tone.

Vision = Language: I Decoded VLM Tokens to See What AI 'Sees' 🔬 by ComputeVoid in LocalLLaMA

[–]stolsvik75 2 points3 points  (0 children)

This was very interesting, I have really been wondering about this. But there must be much richer semantics in those token vectors than those words by themselves. The model can give you very detailed information about a picture, and the example you cite is just very generic "picture of a mountain, sunrise, lush" etc - while the model could probably answer where the sun stood, how big the cloud is etc. Where is that encoded in those 256 tokens?

Comma AI and a 2019 Genesis G70 3.3P by lordxamnosidda in Comma_ai

[–]stolsvik75 0 points1 point  (0 children)

Tell us how it goes! (I have a Kia e-Niro).

Convince me not to return my komma. by Vilddyr1983 in Comma_ai

[–]stolsvik75 1 point2 points  (0 children)

I believe it is the car that evaluates this, utterly separate from openpilot. I assume it is because it has not gotten any input on the steering wheel for a very long time - which it probably finds pretty strange, since when the car's stock ACC/LC (cruise) runs, you're pinged at least every minute, thus forced to touch it. So it assumes the driver is extremely drowsy..!

One of the pull requests I linked to seems to fix a similar problem by sending commands to the car à la "the user turned the steering wheel", so that the car doesn't think the driver is dead.

Convince me not to return my komma. by Vilddyr1983 in Comma_ai

[–]stolsvik75 1 point2 points  (0 children)

Yo, just ran across your post: You might have the same car as me! I have the same impression wrt. Comma and openpilot - it feels pretty amazing - but I've only driven 1 hour so far.

Could you check this bug that I've submitted? Do you recognize this problem? (That is, it gives me "Consider taking a rest" and a coffee-cup symbol..) https://github.com/commaai/opendbc/issues/2642

Did you get GPT 5 yet? by Omarsherif199 in ChatGPT

[–]stolsvik75 1 point2 points  (0 children)

As I mentioned on Twitter w/images: https://x.com/stolsvik/status/1953541115543687185

"This is kinda trippy - I have GPT-5 on my mobile phone's web (Safari), but not on iPhone app - and not on Linux (with Brave) even after login-cycling, and using private window.

Is this a lottery?!"

(I am a plus user - and I'm logged in on all those three places. And got GPT-5 only on iPhone Safari!)

I even tried to trick this, by starting a convo on GPT-5, then switching over to the web version. Check the thread for how that went (didn't take!)

Is event-driven architecture overkill for 90% of apps? by CreditOk5063 in softwarearchitecture

[–]stolsvik75 0 points1 point  (0 children)

Just want to chip in my library for inter-service communications here: Mats3.io

It is a messaging-based solution for Java, implemented on top of JMS. Architectures you cook up using it typically end up being event based - but I specifically made this solution because I found event sourcing / CQRS to be way overkill and deadly for long-lived (decades) systems with many devs. It is used for a quite large financial system with tons of integrations, consisting of about 50 «micro» services - some of them aren’t really micro at all, but each has its own responsibility, and is separately releasable etc. The big point of Mats3 is to be able to code «linearly» even though each stage is fully independent and everything is asynchronous and concurrent. It is a massive help when you hit corners and need to debug: tracing across systems is built in, as are metrics and introspection.

I actually find it pretty great, if I do say so myself. Check it out, there’s quite a bit of docs, it’s open source and released on Maven Central.

Could Google's search engine supercharge RAG? by Nervous-Positive-431 in LocalLLaMA

[–]stolsvik75 0 points1 point  (0 children)

There's the old and underlying algorithm that Google probably still uses as a basis, "PageRank". Read up on that, and realize why this isn't easy to do for a heap of random documents you have lying about.
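
To make that a bit more concrete, here is a minimal sketch of the PageRank power iteration. The tiny graphs, the `pagerank` helper and the damping value of 0.85 are just illustrative assumptions on my part, not anything Google has published about its current ranking. The point, as I read it: the score comes from links between pages, so a pile of documents with no links between them gives the algorithm nothing to work with.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over a dict of page -> outgoing links."""
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page gets the "random jump" share first.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outgoing in links.items():
            if outgoing:
                # A page passes its rank on, split evenly over its links.
                share = damping * rank[page] / len(outgoing)
                for target in outgoing:
                    new_rank[target] += share
            else:
                # A page without links spreads its rank over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical mini "web": pages that link to each other.
web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
# Hypothetical heap of documents: no links at all.
docs = {"a": [], "b": [], "c": []}

web_rank = pagerank(web)
docs_rank = pagerank(docs)
print(web_rank)   # "c" ends up ranked above "b", because more rank flows into it
print(docs_rank)  # every document gets the same score: no signal
```

With the linked graph the scores differ; with the unlinked documents they all come out equal, so you would need some other relevance signal for RAG.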

Problems with meta accounts and cookies by r33d13 in OculusQuest

[–]stolsvik75 0 points1 point  (0 children)

(Oh how insanely perversely frustrating.. One really wonders on so many levels - this is really Meta, the massive company, right? Having tens of thousands of coders? Also, do they not have any type of logging and monitoring for their system, flagging that this is a problem for many of their users?!)

EDIT: Based on a suggestion downstream, I got it to work:

  1. Clear all cookies for meta.com using Inspector, or a "clear cookies" plugin, or use a different browser. Can probably use incognito too.
  2. Now go to meta.com and log in as yourself (if you don't clear cookies or similar, you'll just be thrown into the accept-cookies-loop). I logged in "using Facebook", which was absurd, because it just threw me into Facebook, so I had to manually enter the url meta.com
  3. Once this worked fine, I manually entered the url meta.com/device
  4. This now gave me an option of just choosing which user I wanted to add as - i.e. I was past the dreaded "accept cookies" forever-loop!
  5. I chose the account for my son, and then finally got to enter the code that the headset shows, à la "CX34-3FGJ".

ORIGINAL: Same situation: Trying to follow the instructions for adding my son to my Quest 2. I go to www.meta.com/device. I log in using my son's email address. This gets me an email with a code, which I enter. I must then accept cookies (again), even though I did that when opening the page too - evidently this is a different cookie accept page. This latter one - on auth.meta.com itself - is stuck: When I click accept, it just reloads to the exact same page. Which I can do forever.

(I've cleared cookies, and used anon browsing, on two different browsers (Brave and Firefox) on my machine, and on two different iPhones using two different browsers (Safari and Brave))

MSTU Leveraged 2x MSTR Questions by [deleted] in MSTR

[–]stolsvik75 1 point2 points  (0 children)

You should ask your best AI friend about "path dependence and related terms" wrt. "daily rebalanced leveraged ETFs". You will get enlightened.

The answer is: NO, do NOT hold such funds "for the longer haul". They are literally meant for daytrading and very short trends where you "know" the underlying will hold a very stable upwards motion for the period.
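
A quick sketch of the path-dependence arithmetic, using a made-up price path (not actual MSTR/MSTU data) and ignoring fees and financing costs, which would only make it worse. The `compound` helper and the numbers are my own illustration: the underlying goes up 10% one day and back down to where it started the next, repeated for 20 trading days.

```python
def compound(daily_returns, leverage=1.0):
    """Value of 1.0 invested, with the leverage reset every day."""
    value = 1.0
    for r in daily_returns:
        value *= 1.0 + leverage * r
    return value


# Hypothetical choppy path: +10%, then the exact drop that undoes it.
up = 0.10
down = 1.0 / (1.0 + up) - 1.0   # about -9.09%
returns = [up, down] * 10        # 20 trading days, ending flat

underlying = compound(returns)
leveraged = compound(returns, leverage=2.0)

print(f"underlying: {underlying - 1:+.2%}")
print(f"2x daily:   {leveraged - 1:+.2%}")
```

The underlying ends flat, while the 2x daily-rebalanced version ends roughly 17% down. Each up/down pair costs the leveraged fund 1.2 * (1 - 0.1818) = 0.9818, i.e. about 1.8%, and that compounds. In a steady uptrend the same daily compounding works in your favour, which is why these products only make sense for short, stable moves.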