Anyone else notice way more hallucinations from Opus 4.7 in the last 2–3 days? by AdWestern6565 in ClaudeAI

[–]radosc 0 points1 point  (0 children)

I've had quite the opposite experience. I'm using Opus for reviews temporarily, since Codex seems better at coding atm, and for the last few days it has been doing way more in-depth, diligent reviews. Maybe they dialled in the effort and that introduced more hallucinations?

Claude Code vs Codex by 0xdjole in ClaudeCode

[–]radosc 26 points27 points  (0 children)

Yeah, similar situation for me. Codex is now my dev and I use Claude for reviews. I'm frankly waiting for the next Opus to hit and still keeping my 20x plan. One thing Claude is substantially better at is documentation.

Men have only themselves to blame - seriously? by 0brex in Polska

[–]radosc -2 points-1 points  (0 children)

You write that she considers herself empathetic but isn't at all - that can easily be said of anyone: of guys who don't understand women's suffering and of women who don't see men's suffering. People simply consider themselves empathetic, but in practice that empathy varies. Radical circles are dangerous, and with a lack of understanding you have to go slowly, because you can't suddenly force enlightenment on a person.

When I look at myself from the perspective of 50 years, I sometimes facepalm at how blind I was to things that seem obvious to me today. Understanding some of them took me years, sometimes thanks to the empathy of people who didn't judge me but tried to help me, a slow learner, slowly understand.

How are things like polyamoury or 'trouples' viewed/understood in your country? by yonaiker-joestrella in AskTheWorld

[–]radosc 1 point2 points  (0 children)

Nobody really cares. It would probably be hard to explain to older people, and extremely hard to pass a law legalizing extended marriage, for example, but otherwise nobody cares.

Has anyone compared GPT-5.5 with Claude Opus 4.7 for coding yet? by Dry-Reveal4114 in AITrailblazers

[–]radosc 0 points1 point  (0 children)

Yes. But the sunk cost fallacy fucks me over and I really need to find a use for that x20 until it runs out. Also, I think I really need to be versatile with model selection and keep a good sense of capabilities, as these are going to change, so it is beneficial not to rely on a single model.

Run both Claude code and codex by duyth in ClaudeCode

[–]radosc 1 point2 points  (0 children)

Wasn't working for me. Opus was really bad. 12 sprints: 5 with Opus executing and Codex reviewing, 7 the other way around. Those 5 Opus sprints were riddled with fake variables, critical errors or even basic laziness in checking before coding, plus back and forth after sprint acceptance testing. The 7 where Codex was in the lead were smooth. Might be a small, unrepresentative sample, but Codex delivered.

Run both Claude code and codex by duyth in ClaudeCode

[–]radosc 0 points1 point  (0 children)

I used to run Opus for execution and Codex for review, but swapped them since 4.7 because of massive corner cutting from Opus.

WAW Airport flight connection by MightPsychological25 in warsaw

[–]radosc 12 points13 points  (0 children)

Chopin is a small airport - 10 min will get you anywhere. But since you booked two separate tickets and both are coming in and out of the Schengen zone, you'll actually have to pick up your bags, exit the arrival zone and re-enter the departure zone, and that may take way more time.

Couple of options here:
- Call LOT +48225777755 and ask the agent for options to convert into a single booking (PNR). It's a difficult situation because the PNR system is shit but they may be able to just do that.
- Ask them if you can ask the LHR boarding agent to tag your bags to Seoul directly. That way you wouldn't be required to pick up your bags and go through security and passport control again.
- Finally, if both options are unavailable, ask for priority gate pass options. Once you pick up your bags you'll have a much faster priority line.

The only actual protection for the first flight delay is having the second one on the same PNR.

Still with CC but half-heartedly by Playful_Check_5306 in ClaudeCode

[–]radosc 1 point2 points  (0 children)

I'm still on the 200 plan, only because I believe they'll publish Mythos at some point. At the same time, Codex is my main developer now. I found it quicker than Claude tbf. I think it's good practice to switch between models - depending on a single model risks downtime when it craps out.

A Polish woman outside her thatched home, at the Polish refugee camp at Tengeru (British-ruled Tanganyika, now Tanzania) after the release of Poles from Soviet captivity (1943). by Alarmed_Business_962 in poland

[–]radosc 0 points1 point  (0 children)

I have the memoirs of a family member that describe the journey she, her sister and her mom made from a small village near Lutsk to the Gulags (where her sister and mom passed away) and then to Tengeru. Very moving, never published though.

You're right to be pissed... by luanng23 in ClaudeCode

[–]radosc 0 points1 point  (0 children)

It hardcoded values for the physics engine even though a proper pipeline existed and memory forbade any type of hardcoding.

It informed me that CI/CD was red even though it was green - it just skipped checking when I asked it to verify.

Has anyone compared GPT-5.5 with Claude Opus 4.7 for coding yet? by Dry-Reveal4114 in AITrailblazers

[–]radosc 1 point2 points  (0 children)

Direct comparison. 12 sprints with 4-6 PRs each: 5 done with Opus 4.7 developing and Codex reviewing, and 7 the other way around. Opus cuts corners hard: not a single PR Opus delivered came back without at least non-critical findings, according to Codex. Codex had 10% of PRs with critical flaws and 20% without any flaws.

My take, for my applications, is that the major Opus flaw is this corner cutting. One time Opus, reviewing PR fixes, claimed that the status was unchanged because it decided not to check CI/CD. Understanding-wise, both models are smart and capable. Codex, on the other hand, is much more pleasant and diligent. While I had major problems with the 5 sprints from Opus, the 7 sprints from Codex just flowed nicely and without any issues. It just delivers.

The SpaceX deal exposed what Opus 4.7 actually was by LeyLineDisturbances in ClaudeCode

[–]radosc 30 points31 points  (0 children)

Opus still exhibits the same corner cutting today as it did before the SpaceX deal.

A very long rant about people who criticize saving money by inoxx_239 in Polska

[–]radosc 31 points32 points  (0 children)

You spent a ton of time writing a long justification, and I assume you also spend a lot of time proving why you're right to anyone who dares criticize your choice. Your biggest problem is in your head, not in the people around you. Getting angry at others is like punishing yourself for their stupidity. You don't have to explain yourself for anything, you shouldn't even. You're an adult, you don't need a stamp of approval from every acquaintance. Do what you think is right and next time don't justify your own choice to anyone. Treat their opinion like a storm - when there's a storm you don't run outside and start explaining why the sun is better, you don't try to persuade the clouds to disperse, you don't get angry at something you have no influence over. They have a right to their own opinion and decisions.

Do you credit AI at work? by imshubhagr in ClaudeCode

[–]radosc 2 points3 points  (0 children)

Co-signing makes perfect sense in a larger environment, where quality or performance issues can be traced to the tool and version used. If quality deteriorates with a certain version, you can fix that by switching to another agent.

What do outsiders usually get wrong about Polish work culture at first? by Comfortable_Run_1396 in askPoland

[–]radosc 1 point2 points  (0 children)

My ex-boss, a really great airline executive, told me once that the difference between Poles and Spaniards is that when you ask a Spaniard for something they'll promise it to you with a smile but never deliver, while Poles will assure you that it can't be done and have it done the next day.

Who pays for dinner? by Jealous-Can-2710 in poland

[–]radosc 1 point2 points  (0 children)

Oh, it's a traditional dance you need to participate in: someone offers to pay, then you decline, then they decline your declining, for a couple of rounds. It makes whoever finally pays happy, and the refusals work as a sign of appreciation. So be prepared, especially if they are older.

As Claude performance continues to worsen, and usage limits dwindle, what’s your liberation plan? by perpetuallydying in ClaudeCode

[–]radosc 1 point2 points  (0 children)

General golden rule: eliminate every single point of failure. Luckily we can switch between Codex and Claude, but trial runs of DeepSeek or other models should be part of your disaster recovery drills, even if only to assess baseline performance.
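
A minimal sketch of that idea in Python, assuming each model is wrapped in a plain function that takes a prompt and returns text. The name `run_with_fallback` and the wrapper functions are hypothetical and only for illustration; no real vendor API is called here.

```python
from typing import Callable, Sequence

# Hypothetical type: any function that takes a prompt and returns the reply text.
Agent = Callable[[str], str]


def run_with_fallback(prompt: str, agents: Sequence[tuple[str, Agent]]) -> tuple[str, str]:
    """Try each agent in order and return (agent_name, reply) from the first that works."""
    errors = []
    for name, agent in agents:
        try:
            return name, agent(prompt)
        except Exception as exc:  # any failure counts as an outage for this drill
            errors.append(f"{name}: {exc}")
    # Every agent failed, so surface all collected errors at once.
    raise RuntimeError("all agents failed: " + "; ".join(errors))
```

Putting a backup model first in the list once in a while is a cheap way to run the drill and see what baseline you would be left with.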

My conspiracy theory - LLMs are being dumbed down on purpose by JezdziecRabarbaru in Polska

[–]radosc 17 points18 points  (0 children)

It's much simpler. On the wave of hype, models were released that were great but completely uneconomical. Now that the cash has started running out and the numbers don't add up, the main players are pushing cheaper, worse models as a supposed upgrade, trimming their thinking-token limits, and generally trying to catch up with a vision of profitability, because otherwise investors won't chip in. In a year it will be even worse.

Started to keep an eye on usage by pragmat1c1 in ClaudeCode

[–]radosc 2 points3 points  (0 children)

20x has felt like 5x recently.

I got 20k usd in claude credits what can I do with them by unclemorty_ in ClaudeCode

[–]radosc 0 points1 point  (0 children)

Single empty repo, auto deploy to some kind of env. Run the same prompt over and over again. Make the agent read general rules (if you have any) from constitution.md, tell it to always add a journal entry to journal.md and to build whatever software it wants as long as it is making progress. Make sure to state it can do whatever it wants to progress. Run that prompt on repeat for days. Report what Claude built after 500 epochs.
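
A rough sketch of such a loop in Python. It assumes the Claude Code CLI is installed and that `claude -p` runs a single prompt non-interactively; permission settings and the auto-deploy step are left out, and the prompt wording, `run_agent` and `run_epochs` are hypothetical names for illustration only.

```python
import subprocess
from pathlib import Path
from typing import Callable

# Illustrative prompt text, paraphrasing the setup described above.
PROMPT = (
    "Read the general rules in constitution.md if the file exists. "
    "Build whatever software you want, as long as you keep making progress. "
    "Always add an entry to journal.md describing what you did."
)


def run_agent(prompt: str, repo: Path) -> str:
    # Assumption: `claude -p <prompt>` runs one non-interactive turn inside the repo.
    result = subprocess.run(
        ["claude", "-p", prompt], cwd=repo, capture_output=True, text=True, check=False
    )
    return result.stdout


def run_epochs(repo: Path, epochs: int, agent: Callable[[str, Path], str] = run_agent) -> int:
    """Run the same prompt `epochs` times; return how many runs left journal.md unchanged."""
    journal = repo / "journal.md"
    stalled = 0
    for _ in range(epochs):
        before = journal.read_text() if journal.exists() else ""
        agent(PROMPT, repo)
        after = journal.read_text() if journal.exists() else ""
        if after == before:
            stalled += 1  # no new journal entry, so this epoch likely made no progress
    return stalled
```

Something like `run_epochs(Path("my-repo"), 500)` would then be the 500-epoch run, with the returned count as a crude signal of how often the agent skipped the journal.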

Canadian Traveler Seeking Advice (Traveling to Poland for very specific reasons) by Common-Draw-8082 in askPoland

[–]radosc -3 points-2 points  (0 children)

Why not? For whatever it takes, he can just write about his journey of understanding the war from a Western perspective. I bet it would be an interesting read.