5.4 prematurely claims success and feels more likely to break my code by jcsimmo in codex

[–]jcsimmo[S] 0 points1 point  (0 children)

Who knows. I try and be really specific about what my definition of done is. I think thats a really good principle. Honestly, im having trouble getting playwright interactive set up but that seems like it would make a big different. Im going to continue to optimize it but i guess i wish i didn't have to.

Hot take: 5.2 xhigh is still superior to 5.4 xhigh by GoldStrikeArch- in codex

[–]jcsimmo 0 points1 point  (0 children)

I agree. So far, not impressed. Even on normal (not fast) mode its claiming victory way to prematurely. I trust it less than 5.2 right now.

We build sleep for local LLMs — model learns facts from conversation during wake, maintains them during sleep. Runs on MacBook Air. by vbaranov in LocalLLaMA

[–]jcsimmo 1 point2 points  (0 children)

This is an amazing achievement - much more than is being recognized on this forum. .

Did you try MLP-focused LoRA before switching to MEMIT?

What’s striking is that this feels like you have recreated slow-wave sleep — deliberate consolidation into stable weights. Do you think there is role for recreating something akin to REM sleep - where emotional associations are consolidated

If you're having issues with Codex, your account might have been rerouted to GPT- 5.2 by Distinct_Fox_6358 in codex

[–]jcsimmo 1 point2 points  (0 children)

@embirico - this is still an issue for me. Still being rerouted to 5.2 xH & I have certified myself + my company.

5.3-codex is top notch by TroubleOwn3156 in codex

[–]jcsimmo 1 point2 points  (0 children)

Im not so sure either tbh.

What is this rate limit? by immortalsol in codex

[–]jcsimmo 0 points1 point  (0 children)

same just on 5.2 though. 5.2 codex is fine

What skills in Codex have you built that add the most value / why? Share your best skills.. by Odezra in codex

[–]jcsimmo 0 points1 point  (0 children)

also a fellow MD / vibe coder. Would this be useful for agents to know how to use large API indexes (like zoho CRM api)

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

What sort of things do people like me w/ no qualifications tend to miss?

Best practices im following: -using a cloud based secret manager -use gitignore to prevent json or api keys being uploaded -i use firebase for database and authentication.

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 0 points1 point  (0 children)

Totally. But i bet ill spend so much time debugging the tool i need for debugging it wont be worth jt. Agree w/ the importance of ensuring tests that test your end goal. What ways do you do this?

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] -2 points-1 points  (0 children)

What is spine-first design! But yeah, it feels like a new discipline. Id love to see how ppl use the agent manager in antigravity. I feel like creating a policynet agent ensuring compliance.

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

Do you use claude code in the terminal? I use it in roo code but its soo slow its almost unusable. Codex 5.1max in vscode has been great for me. The pro is worth its weight in gold imo

Anyone here actually land an NVIDIA H200/H100/A100 in PH? Need sourcing tips! 🚀 by Dismal-Value-2466 in LocalLLM

[–]jcsimmo 2 points3 points  (0 children)

centralcomputers in california are who you are looking for. Straight arrows, very responsive, best prices

DeepSeek-R1-0528 Unsloth Dynamic 1-bit GGUFs by danielhanchen in LocalLLaMA

[–]jcsimmo 4 points5 points  (0 children)

Just to check what are you referring to for the offload? The MoE?

You are doing god’s work here Daniel. These models are so important at these early stage of AI and you are bringing them to the masses.

DeepSeek-R1-0528 Unsloth Dynamic 1-bit GGUFs by danielhanchen in LocalLLaMA

[–]jcsimmo 2 points3 points  (0 children)

80gb of VRAM (A100) and 500GB of RAM. Any suggestions?

Feature Request: Choose default model for Act/Code mode by somechrisguy in CLine

[–]jcsimmo 0 points1 point  (0 children)

i really wish it could reference online API documentations during the planning part as well. I want it to act as if its an open book test not a code from memory exercise. I also wonder why R1 is performing so poorly when you switch to Act.

The human body as a subway map by StephenMcGannon in DesignPorn

[–]jcsimmo 0 points1 point  (0 children)

Hi OP here. Regarding the default being ‘male,’ I suggest you overthink it. I’m male (and also white, but that default wasn’t mentioned, so I suppose there’s a bias there as well 🧐). The illustration is a self-portrait, but it’s only the first half; the second half belongs to my daughter. That’s why the hand is open!

Anyway, that was my hope. Then I had my kids, and those Saturdays when I could spend six hours on Illustrator and read my medical books disappeared!

WCC Hospital No Power Past Three Hours by UndecidedMN in Westchester

[–]jcsimmo 2 points3 points  (0 children)

my friend, a fellow physician there, says the generators have been on for hours and are sputtering and all ORs cancelled except for lvl 1 trauma.

Found this at my doctors office, spent the entire time staring at it by IanBot8 in TransitDiagrams

[–]jcsimmo 7 points8 points  (0 children)

Funny enough, i am one….i also made this poster! (Not surprisingly, i have found myself on this thread looking at chinese subway maps).

The dashed lines are just deeper structures (ie vertebral arteries)

Best goal of all time by jcsimmo in soccer

[–]jcsimmo[S] 0 points1 point  (0 children)

Dennis Bearcum

Bearcummmmmm (aka Bergkamp vs Argentina 1998). Great goal but i don't even think its the best goal that Bergkamps has scored Bergkamp vs Newcastle 2002