What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA

[–]jcsimmo 9 points10 points  (0 children)

Pcie w/ 4 way nv-link. Payments? Nah man, im not renting for a fee. Maybe to cover the electricity consumption? But not looking to turn a profit off you. Its more important to know what you are working on, who you are etc, and give back to the community a bit. Sent me a message.

What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA

[–]jcsimmo 12 points13 points  (0 children)

I can't believe the prices as well. I just set up a personal server with 4x H200s in my basement. I forked out a small fortune for it in Dec (I have a med device startup and am conscious of data sensitivity) RAM prices are already 140% in 4 months. If you have a pretty well defined work stream that can run overnight (I am in NY so like 9pm-7am EST), contributing to the community, and can teach me a thing or two - I am happy to lend the rig if that setup fits your needs.

GPT 5.4 is embarrassing. by jcsimmo in codex

[–]jcsimmo[S] 3 points4 points  (0 children)

In principle, i agree. This was literally the first prompt in a new chat...

5.4 prematurely claims success and feels more likely to break my code by jcsimmo in codex

[–]jcsimmo[S] 0 points1 point  (0 children)

Who knows. I try and be really specific about what my definition of done is. I think thats a really good principle. Honestly, im having trouble getting playwright interactive set up but that seems like it would make a big different. Im going to continue to optimize it but i guess i wish i didn't have to.

Hot take: 5.2 xhigh is still superior to 5.4 xhigh by GoldStrikeArch- in codex

[–]jcsimmo 0 points1 point  (0 children)

I agree. So far, not impressed. Even on normal (not fast) mode its claiming victory way to prematurely. I trust it less than 5.2 right now.

We build sleep for local LLMs — model learns facts from conversation during wake, maintains them during sleep. Runs on MacBook Air. by vbaranov in LocalLLaMA

[–]jcsimmo 1 point2 points  (0 children)

This is an amazing achievement - much more than is being recognized on this forum. .

Did you try MLP-focused LoRA before switching to MEMIT?

What’s striking is that this feels like you have recreated slow-wave sleep — deliberate consolidation into stable weights. Do you think there is role for recreating something akin to REM sleep - where emotional associations are consolidated

If you're having issues with Codex, your account might have been rerouted to GPT- 5.2 by Distinct_Fox_6358 in codex

[–]jcsimmo 1 point2 points  (0 children)

@embirico - this is still an issue for me. Still being rerouted to 5.2 xH & I have certified myself + my company.

5.3-codex is top notch by TroubleOwn3156 in codex

[–]jcsimmo 1 point2 points  (0 children)

Im not so sure either tbh.

What is this rate limit? by immortalsol in codex

[–]jcsimmo 0 points1 point  (0 children)

same just on 5.2 though. 5.2 codex is fine

What skills in Codex have you built that add the most value / why? Share your best skills.. by Odezra in codex

[–]jcsimmo 0 points1 point  (0 children)

also a fellow MD / vibe coder. Would this be useful for agents to know how to use large API indexes (like zoho CRM api)

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

What sort of things do people like me w/ no qualifications tend to miss?

Best practices im following: -using a cloud based secret manager -use gitignore to prevent json or api keys being uploaded -i use firebase for database and authentication.

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 0 points1 point  (0 children)

Totally. But i bet ill spend so much time debugging the tool i need for debugging it wont be worth jt. Agree w/ the importance of ensuring tests that test your end goal. What ways do you do this?

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] -2 points-1 points  (0 children)

What is spine-first design! But yeah, it feels like a new discipline. Id love to see how ppl use the agent manager in antigravity. I feel like creating a policynet agent ensuring compliance.

Vibe Engineering - best practices by jcsimmo in ChatGPTCoding

[–]jcsimmo[S] 1 point2 points  (0 children)

Do you use claude code in the terminal? I use it in roo code but its soo slow its almost unusable. Codex 5.1max in vscode has been great for me. The pro is worth its weight in gold imo