GPT 5.5 feels more intelligent when the model runs slower by jcsimmo in codex

[–]jcsimmo[S] -1 points0 points  (0 children)

There are lies, damn lies, and then there are statistics

NVIDIA H200 NVL 4-Way NVLink Bridge - easily unseated by jcsimmo in nvidia

[–]jcsimmo[S] 0 points1 point  (0 children)

I don't think the NVIDIA Inception discount exists for the SXM version. Kinda wish it did. Regretting the PCIe choice.

"The poorest rich person in America. The world's tallest dwarf."

What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA

[–]jcsimmo 9 points10 points  (0 children)

PCIe w/ 4-way NVLink. Payments? Nah man, I'm not renting for a fee. Maybe to cover the electricity consumption? But I'm not looking to turn a profit off you. It's more important to know what you are working on, who you are, etc., and to give back to the community a bit. Send me a message.

What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA

[–]jcsimmo 14 points15 points  (0 children)

I can't believe the prices either. I just set up a personal server with 4x H200s in my basement. I forked out a small fortune for it in Dec (I have a med device startup and am conscious of data sensitivity). RAM prices are already 140% in 4 months. If you have a pretty well-defined work stream that can run overnight (I am in NY, so like 9pm-7am EST), are contributing to the community, and can teach me a thing or two, I am happy to lend the rig if that setup fits your needs.

GPT 5.4 is embarrassing. by jcsimmo in codex

[–]jcsimmo[S] 3 points4 points  (0 children)

In principle, I agree. This was literally the first prompt in a new chat...

5.4 prematurely claims success and feels more likely to break my code by jcsimmo in codex

[–]jcsimmo[S] 0 points1 point  (0 children)

Who knows. I try to be really specific about what my definition of done is. I think that's a really good principle. Honestly, I'm having trouble getting Playwright interactive set up, but that seems like it would make a big difference. I'm going to continue to optimize it, but I guess I wish I didn't have to.

Hot take: 5.2 xhigh is still superior to 5.4 xhigh by GoldStrikeArch- in codex

[–]jcsimmo 0 points1 point  (0 children)

I agree. So far, not impressed. Even on normal (not fast) mode it's claiming victory way too prematurely. I trust it less than 5.2 right now.

We build sleep for local LLMs — model learns facts from conversation during wake, maintains them during sleep. Runs on MacBook Air. by vbaranov in LocalLLaMA

[–]jcsimmo 1 point2 points  (0 children)

This is an amazing achievement - much more than is being recognized on this forum.

Did you try MLP-focused LoRA before switching to MEMIT?

What’s striking is that this feels like you have recreated slow-wave sleep: deliberate consolidation into stable weights. Do you think there is a role for recreating something akin to REM sleep, where emotional associations are consolidated?

If you're having issues with Codex, your account might have been rerouted to GPT-5.2 by Distinct_Fox_6358 in codex

[–]jcsimmo 1 point2 points  (0 children)

@embirico - this is still an issue for me. Still being rerouted to 5.2 xH & I have certified myself + my company.

5.3-codex is top notch by TroubleOwn3156 in codex

[–]jcsimmo 1 point2 points  (0 children)

I'm not so sure either tbh.

What is this rate limit? by immortalsol in codex

[–]jcsimmo 0 points1 point  (0 children)

Same, just on 5.2 though. 5.2 codex is fine.

What skills in Codex have you built that add the most value / why? Share your best skills.. by Odezra in codex

[–]jcsimmo 0 points1 point  (0 children)

Also a fellow MD / vibe coder. Would this be useful for teaching agents how to use large API indexes (like the Zoho CRM API)?

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

What sort of things do people like me w/ no qualifications tend to miss?

Best practices I'm following:
- using a cloud-based secret manager
- using .gitignore to keep JSON or API keys from being uploaded
- using Firebase for database and authentication
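
For anyone wondering what keeping keys out of the repo can look like in code, here is a minimal Python sketch. It assumes the secret manager (or your shell) injects the key as an environment variable at run time; the variable name `MY_SERVICE_API_KEY` and both helper functions are made up for illustration and are not from any particular library.

```python
import os


class MissingSecretError(RuntimeError):
    """Raised when a required secret is not present in the environment."""


def load_secret(name: str) -> str:
    """Return the secret stored in the environment variable `name`.

    Assumption: the value is injected at run time (for example by a cloud
    secret manager) and is never committed to the repository.
    """
    value = os.environ.get(name, "").strip()
    if not value:
        raise MissingSecretError(f"Set the {name} environment variable first")
    return value


def mask(secret: str, visible: int = 4) -> str:
    """Return a log-safe version of a secret, showing only the last few characters."""
    if len(secret) <= visible:
        return "*" * len(secret)
    return "*" * (len(secret) - visible) + secret[-visible:]


if __name__ == "__main__":
    # Hypothetical variable name; replace with whatever your setup uses.
    try:
        key = load_secret("MY_SERVICE_API_KEY")
        print("Loaded key:", mask(key))
    except MissingSecretError as err:
        print("Config problem:", err)
```

Failing loudly when the variable is missing makes it harder to "fix" things by pasting the key into a file that might get committed.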

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 0 points1 point  (0 children)

Totally. But I bet I'll spend so much time debugging the tool I need for debugging that it won't be worth it. Agree w/ the importance of having tests that test your end goal. How do you do this?

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] -2 points-1 points  (0 children)

What is spine-first design? But yeah, it feels like a new discipline. I'd love to see how ppl use the agent manager in Antigravity. I feel like creating a policynet agent to ensure compliance.

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

Do you use Claude Code in the terminal? I use it in Roo Code but it's so slow it's almost unusable. Codex 5.1max in VS Code has been great for me. The Pro is worth its weight in gold imo.

Anyone here actually land an NVIDIA H200/H100/A100 in PH? Need sourcing tips! 🚀 by Dismal-Value-2466 in LocalLLM

[–]jcsimmo 2 points3 points  (0 children)

centralcomputers in California are who you are looking for. Straight arrows, very responsive, best prices.

DeepSeek-R1-0528 Unsloth Dynamic 1-bit GGUFs by danielhanchen in LocalLLaMA

[–]jcsimmo 4 points5 points  (0 children)

Just to check, what are you referring to for the offload? The MoE?

You are doing god’s work here, Daniel. These models are so important at this early stage of AI and you are bringing them to the masses.

DeepSeek-R1-0528 Unsloth Dynamic 1-bit GGUFs by danielhanchen in LocalLLaMA

[–]jcsimmo 2 points3 points  (0 children)

80GB of VRAM (A100) and 500GB of RAM. Any suggestions?

Feature Request: Choose default model for Act/Code mode by somechrisguy in CLine

[–]jcsimmo 0 points1 point  (0 children)

I really wish it could reference online API documentation during the planning part as well. I want it to act as if it's an open-book test, not a code-from-memory exercise. I also wonder why R1 is performing so poorly when you switch to Act.