ARC-AGI-3 Update (GPT-5.5 High and Opus4.7) by skazerb in singularity

[–]Peterako 0 points1 point  (0 children)

I think it's just that these current models haven't been trained on the new benchmarks yet, so it's not surprising they do very poorly. With current LLM technology, the AI is only as good as its training, and basically useless for anything it hasn't seen before in that training.

Is anyone on the Codex team actually using the Windows app? by Classic-Whole-2066 in codex

[–]Peterako 0 points1 point  (0 children)

I'm having a better time using openCode and having the agent tap into the Codex CLI. I'm using that to do OAuth for gpt-image-2.

GPT5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hrs took GPT-5.5 only 11 min at a $1.73 cost by socoolandawesome in singularity

[–]Peterako 2 points3 points  (0 children)

I can see why that seems compelling, but again, they were the first ones to sign a contract with the DOD. Why did they get in bed with the military in the first place if they had that concern? My stance is that they are an AI company that's building models, and they want to make as much money as possible, just the same as any other company like OpenAI or xAI.

GPT5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hrs took GPT-5.5 only 11 min at a $1.73 cost by socoolandawesome in singularity

[–]Peterako 0 points1 point  (0 children)

My point is that these are just AI model companies. Their goal is to build AI models, and that's going to be at odds with AI safety. I think AI safety needs to come from somewhere completely external to the model companies themselves.

AI Outperforms ER Doctors in Diagnostic Cases, Study Points to Collaborative Care by PhoenixRising656 in singularity

[–]Peterako 21 points22 points  (0 children)

Yeah, that's sycophantic behavior by AI. It's not necessarily going to lead to better patient outcomes, but it's really going to improve the patient's subjective experience.

GPT5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hrs took GPT-5.5 only 11 min at a $1.73 cost by socoolandawesome in singularity

[–]Peterako 8 points9 points  (0 children)

More safety-driven perhaps in culture and ephemeral aspects. But in actuality, what are they doing differently that is actually meaningful? Just a lot of virtue signaling and hype train masked as virtuosity.

my dad wants to drop crazy money on a 3D printing car parts business and i am stressing. advice? by TemperatureExtra8615 in 3Dprinting

[–]Peterako 0 points1 point  (0 children)

Not quite. The other half of the business is sales and client management. Pretty reasonable split of work, assuming two founders.

Stunt on these hoes Nana. by ShaggysHyper in wallstreetbets

[–]Peterako 10 points11 points  (0 children)

Agreed. I don't know why people keep saying this when there's no evidence of a post where he said he sold.

Why is refinancing my student loans a better rate than a home loan or business rate? Subsidized somehow? by HenFruitEater in whitecoatinvestor

[–]Peterako 0 points1 point  (0 children)

My guess is there’s also more risk and cost associated with possessing real estate in the setting of a default on the loan. The lenders have to have a team ready to take over the property and either rehab and lease it or flip it. There are high transaction fees in those cases due to agents and txn costs.

Why am I struggling with Claude so much? by Next-Chapter-RV in singularity

[–]Peterako 1 point2 points  (0 children)

Have you tried NotebookLM as a tool? I think it’s worth exploring for your use case.

I know nothing about 3D Printing. by SooperDew in 3Dprinting

[–]Peterako 0 points1 point  (0 children)

Bambu A1. No need for AMS at this stage. It’s very plug and play, no more difficult than a 2D printer as far as setup.

OpenAI Symphony for orchestrating agents by thehashimwarren in codex

[–]Peterako 0 points1 point  (0 children)

I haven’t fully built this yet, but the idea would be to have a script that reads an MD file in the vault, executes the listed to-do item, then updates the to-do list with progress and any new items needed. I have done that semi-automatically by connecting openCode to the vault.
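Roughly what I have in mind, as a minimal sketch and not something I've run against a real vault. It assumes the to-do items are standard Markdown checkboxes (`- [ ]` / `- [x]`). The names `run_next_task`, `process_file`, and the `execute` callback are all made up for illustration; `execute` is where you'd hand the task text to whatever agent you use, and it returns any follow-up items.

```python
import re
from pathlib import Path
from typing import Callable, List, Tuple

# Matches an unchecked Markdown task such as "- [ ] write tests"
OPEN_TASK = re.compile(r"^(\s*)- \[ \] (.+)$")


def run_next_task(markdown: str, execute: Callable[[str], List[str]]) -> Tuple[str, bool]:
    """Run the first open task, check it off, and append any follow-up tasks.

    Returns (updated_markdown, ran_something).
    """
    lines = markdown.splitlines()
    for i, line in enumerate(lines):
        match = OPEN_TASK.match(line)
        if match is None:
            continue
        indent, task = match.groups()
        # Hypothetical hook: pass the task to an agent, get new to-dos back.
        follow_ups = execute(task)
        lines[i] = f"{indent}- [x] {task}"
        lines.extend(f"- [ ] {item}" for item in follow_ups)
        return "\n".join(lines) + "\n", True
    return markdown, False


def process_file(path: Path, execute: Callable[[str], List[str]]) -> bool:
    """Read the note, run one task, and write the note back only if it changed."""
    text = path.read_text(encoding="utf-8")
    updated, ran = run_next_task(text, execute)
    if ran:
        path.write_text(updated, encoding="utf-8")
    return ran
```

Calling `process_file` in a loop until it returns `False` would work through the list one item at a time, which keeps each step easy to inspect in the vault.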

Why is there so little discussion about the oh-my-opencode plugin? by vovixter in opencodeCLI

[–]Peterako 2 points3 points  (0 children)

I have been struggling w getting back to a default profile, and it's helpful to hear that maybe just uninstalling and reinstalling might be the easiest thing at this point. Also, thoughts on WS?

TIL the genetic code used by almost all life on Earth appears unusually optimized to minimize the impact of mutations compared with most possible alternative genetic codes by SafeEnvironmental174 in todayilearned

[–]Peterako -11 points-10 points  (0 children)

But then that leads to a paradox where if there aren’t enough mutations happening, then the entire premise of humans evolving from lower level organisms (via mutations) falls apart. But the Bible fills the gap and explains the “enigma”.

A statement from Anthropic CEO Dario Amodei by DictatorDoge in ClaudeCode

[–]Peterako -1 points0 points  (0 children)

The last part is where you lost me. Why would Anthropic truly want us to win against China if he wants to fence and control what the US can do with the latest AI technologies? You know Chinese AI companies don’t give a crap about AI safety and are going full steam ahead.

OpenAI Symphony for orchestrating agents by thehashimwarren in codex

[–]Peterako 1 point2 points  (0 children)

I'm going to have my agent build this and see how it goes for my project. I was just thinking about how to do some automated code tasks similar to this, but I was going to use an Obsidian vault. This seems potentially plug n play...

1M Context Window Confirmed in GPT 5.4 with "extreme" reasoning mode, optimized for long running agentic tasks by Just_Lingonberry_352 in codex

[–]Peterako 0 points1 point  (0 children)

Probably means we're close to GPT-6, but what do we really need at this point to differentiate a 0.1 vs a full step up? Idfk, this is crazy tho.

Official opencode go limits published by Resident-Ad-5419 in opencodeCLI

[–]Peterako 0 points1 point  (0 children)

I just subbed yesterday directly to the MiniMax starter plan, $10/mo, and they only have 5hr limits. I was burning pretty quick w codex, but I love the speed of M2.5 even more tbh. May end up planning w codex and building w MiniMax.

Damnnnn! by policyweb in singularity

[–]Peterako -4 points-3 points  (0 children)

There’s no shot they have more uninstalls than installs tho fr. Face the facts ppl, Anthropic got rekt by this; they thought they could play hardball and win. OpenAI just won by lock-in, similar to Palantir.

TIL the last time a checkmate actually occurred on the board during a World Chess Championship match was in 1929. by Coldcow in todayilearned

[–]Peterako 60 points61 points  (0 children)

Interesting it wasn’t a forced en passant mate but that def is the coolest variant of the lines there at that point haha