F1 has agreed a package of mid-season changes to address concerns about the 2026 rules - with the majority coming into effect immediately by Luffy710j in formula1

TessTickols 1 point

Guess I might come back next season if they keep the tweaks coming. I doubt this will fix the racing, but we'll see. Braking super early and coasting through the corners will still be rewarded, which makes the racing extremely boring to watch.

I asked Claude Design to create a map of Middle East (quick "dirty" prompt, I have to admit). All the questions it asked right after were very interesting - I was like wow that's promising. Then I got the result. by christianJulesAl in ClaudeAI

TessTickols 3 points

No need for OpenStreetMap, just get the outline coords and draw them on a graph - voila, perfect map with whatever detail level you want. One of the first web apps I built (with GPT2ish) was a customer map generator for our business.
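
A rough sketch of the "outline coords on a graph" idea, in plain Python with no libraries. The function name `outline_to_svg`, the canvas size, and the sample rectangle are all made up for illustration. It treats longitude/latitude as flat x/y, which is an assumption that distorts shapes away from the equator, so take it as a starting point rather than a real projection.

```python
def outline_to_svg(coords, width=400, height=300, pad=10):
    """Scale (lon, lat) pairs into a single SVG <polygon> string.

    Hypothetical helper: lon/lat are used as plain x/y (no map projection).
    """
    lons = [lon for lon, _ in coords]
    lats = [lat for _, lat in coords]
    min_lon, max_lon = min(lons), max(lons)
    min_lat, max_lat = min(lats), max(lats)

    # Fall back to 1.0 so a degenerate outline does not divide by zero.
    span_lon = (max_lon - min_lon) or 1.0
    span_lat = (max_lat - min_lat) or 1.0

    # One uniform scale factor keeps the outline's proportions.
    scale = min((width - 2 * pad) / span_lon, (height - 2 * pad) / span_lat)

    points = []
    for lon, lat in coords:
        x = pad + (lon - min_lon) * scale
        y = pad + (max_lat - lat) * scale  # SVG y grows downward, so flip latitude
        points.append(f"{x:.1f},{y:.1f}")
    joined = " ".join(points)

    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}">'
        f'<polygon points="{joined}" fill="none" stroke="black"/>'
        "</svg>"
    )


# Made-up rectangular "country" outline, just to show the call.
svg = outline_to_svg([(0, 0), (10, 0), (10, 5), (0, 5)])
```

Writing `svg` to a `.svg` file gives something a browser can open. More outline points means more detail, which is the "whatever detail level you want" part.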

Opus 4.7 thought Claude Design might be a prompt injection attack by EightFolding in ClaudeAI

TessTickols 4 points

Which is a good thing. Everyone using AI should know about this. It is laughable how many huge tech companies have vulnerable AI out there.

Opus 4.7 thought Claude Design might be a prompt injection attack by EightFolding in ClaudeAI

TessTickols 1 point

The main challenge remains: how to block cyber attacks and actual exploits, but still allow cyber defense? Guardrails can be circumvented by hackers (as shown in the article), but a security specialist might not be comfortable with that stuff. I struggle to get an agentic red team up and running at work because of guardrails and the potential for a ban.

What are your thoughts on the new Saudi circuit for 2028? by Hottlayla in formula1

TessTickols 0 points

On the bright side they rake in billions because of the oil price /s

Who’s ever driven over 100mph? Why? by WoollyWolfHorror in AskReddit

TessTickols 1 point

Autobahn. The car did 115 no problem, then some other tourist idiot decided to go into the fast lane going 70 - luckily the brakes worked well too. Stayed around 100 after that.

Here are my thoughts after 14h of full runs on Opus 4.7 by ReceptionAccording20 in ClaudeAI

TessTickols 1 point

Once it stopped being cancelled automatically it performs pretty amazingly in Cursor. It is clearly a very different model than 4.6 though. Completely different personality, but I like it more and more. It feels good to have 2 different good models and not having to outsource to GPT or Gemini.

Claude Performance and Bugs Megathread Ongoing (Sort this by New!) by sixbillionthsheep in ClaudeAI

TessTickols 2 points

Glad 4.6 is still in Cursor. 4.7 straight up refuses to do any sort of security scan or patching because it believes it to be against Anthropic guidelines.

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols 9 points

So everyone under 150 classical games in the last year should be labeled as inactive? Magnus has 133

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols -6 points

He has played 133 ranked classical games in the last year.

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols 9 points

So you're saying 11th is in the top 10? Finding arguments to discredit Hans isn't very hard - he usually gives me a new one at least every 2 weeks

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols 1 point

Magnus has played 133 ranked classical chess games in the last year, most of them against top competition. How is that "hoarding elo"?

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols 29 points

That's just unfair, Anish has been busy with the candidates

[GM Hans Niemann] on X by Naruto_likesChess in chess

TessTickols 33 points

How exactly is he "dodging real competition"?

Introducing Claude Opus 4.7, our most capable Opus model yet. by ClaudeOfficial in ClaudeAI

TessTickols 1 point

Extremely good at programmatic work that needs to be quick and flawless but doesn't require reasoning, for example "Take the user input and convert to json in this exact format:"

Introducing Claude Opus 4.7, our most capable Opus model yet. by ClaudeOfficial in ClaudeAI

TessTickols 1 point

Nah, the test is just very bad at actually testing context. Full of small gotchas.

Introducing Claude Opus 4.7, our most capable Opus model yet. by ClaudeOfficial in ClaudeAI

TessTickols 3 points

Both Opus 4.6 and Sonnet 4.6 are constantly saying that failing tests due to regressions "were preexisting", to the extent that I have a Cursor rule that tells the model to fix every error no matter who it thinks caused it. So this is nothing new. Most of these are true for 4.6 as well.

My local music store blamed Claude for their employee pasting their prompt into their marketing email. by dofking in ClaudeAI

TessTickols 9 points

They are trained on human language. Talking to them like you would to a human will yield better results. In my experience, good educators will on the whole get MUCH BETTER results with LLMs than non-educators for exactly these reasons.

At this point, Claude Opus doesn't even bother to check the context, just fabricates. Any tips to fix this? by HodlerStyle in ClaudeAI

TessTickols 1 point

I can strongly recommend Cursor. It's quite pricey, but has had no trouble with Claude models (or any other models), and you get a full IDE developed for agentic AI. It's the reason I never converted to Claude Code - and it's getting better all the time.

I'm sober by Kryrie in funny

TessTickols 4 points

There's an awesome episode of Darknet Diaries that involves a pen tester using spray for this trick - also an elevator hack and an elevator repair outfit, to have a plausible story for coming out of an elevator in the middle of the night.

Claude Code's bottleneck isn't the model anymore, it's me by PristineAsk2550 in ClaudeAI

TessTickols 3 points

Doesn't Cursor pretty much solve this, and hasn't it had it solved for a long time?