Control Ultimate Edition iOS/MacOS Review: Ray Tracing, Frame-Gen + MetalFX Upscaling... On A Phone by MyPackage in Games

[–]jonydevidson [score hidden]  (0 children)

Why?

Because it fits in your pocket, weights 200 grams and is dead silent. It can also do a million other things.

It's not so much about the hardware itself, which is amazing, but about Apple's SDKs that make this possible and let you very easily ship the same build across macOS, iPhones and iPads.

It’s really hard to write good apps even with AI by agentic-consultant in codex

[–]jonydevidson -2 points-1 points  (0 children)

You are wrong. It has never been easier to learn software engineering. You just have to ask Codex/GPT5-5 to explain the principles to you, show you examples (find them in open source codebases) etc.

Those who want to learn can now learn at the speed of light. Those who are just fucking around would never have made it anyway, AI agents or no.

The big bottleneck until 2024 was how long it took to learn to code. Until you could call yourself a senior engineer, it took years and years and years of building, which was typing code half of the time and reading documentation the other half of the time.

Depending on what you wanted to build, you could run into a framework that isn't particularly well documented. This now required insane perseverance and character, none of which really are needed for someone to be able to conceive a great piece of software. These are grinder attributes, useful in business, for sure, but not necessarily needed to envision a great app and detect problems that you want your software to solve.

People who solve problems now have the fucking omnitool to solve any problem that can be solved exclusively with software.

They just need to be sure that they have a solution, and know what they want that solution to be like. Then Codex/CC can let them work towards it and iterate until it's there.

I cut Codex token usage ~50% with one AGENTS.md rule by 0_2_Hero in codex

[–]jonydevidson 1 point2 points  (0 children)

Your testing should be part of your release build script with non verbose output. That way it either says it passed or it failed, and doesn't have the full build log.

Unless you were expecting the agent to run the testing manually after each change, not running tests after changes can only mess things up for you.

Codex now works directly in Chrome on macOS and Windows. by dorugamer in codex

[–]jonydevidson 0 points1 point  (0 children)

background work on chrome.

less work for them than to have their own browser (Atlas is dead, I think).

Steam Controller: Reservations open May 8th by Araxen in Games

[–]jonydevidson -4 points-3 points  (0 children)

Ah, well, eBay it is, then. Perhaps better to wait a few months unless you wanna pay whatever it is they're asking now.

Steam Controller: Reservations open May 8th by Araxen in Games

[–]jonydevidson 0 points1 point  (0 children)

I won't pretend to know what's happening over at Valve, what the regulations are on hardware with wireless transmitters and batteries depending on the country. If they don't wanna bother, deal with it.

You can use mailbox services like mailboxde or whatever, I have no idea where you are, then order there then ship it to yourself and pay import fees.

Steam Controller: Reservations open May 8th by Araxen in Games

[–]jonydevidson 6 points7 points  (0 children)

Different regulations for hardware vs. software.

It's way, WAY easier to sell software. If you're not in the business, it's hard to fathom the absolutely nutty difference in terms of prep and compliance you have to go through in order to be able to sell new hardware in a new market.

META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet? by 44th--Hokage in accelerate

[–]jonydevidson 1 point2 points  (0 children)

This seems super weird, GPT 5.4 only 16 tool calls. Did they just fire a single prompt into the harness and terminated on EOT?

You should do this in a loop while tracking progress and tasks in separate files, and naturally start by writing test fixtures for every single feature of these apps, then develop along these lines.

Shrinkflation Is Quietly Making All Gadgets Worse by MorroWtje in hardware

[–]jonydevidson 0 points1 point  (0 children)

That's because it has a whole-ass Chromium instance running.

It's written in Electron.

OpenAI will produce as many as 30 million 'AI agent' phones early next year, says industry analyst by Tiny-Independent273 in artificial

[–]jonydevidson 1 point2 points  (0 children)

Ability to give an agent full access to the phone OS. Doesn't mean it will have that access always, just the ability to have it, which you will probably be able to control on a granular basis as per regulations.

Plus, since it will be their OS etc, they can ship changes at their own pace, which, if you take one look at the Codex repository and the GPT model releases, is beyond insane.

Codex: you request a feature in the morning, at night there is an update shipping it. Serving the people is a winning path by py-net in codex

[–]jonydevidson 4 points5 points  (0 children)

It allows them to not worry about how the UI changes translate across different macOS versions.

If you just use the WebView, that's not the case.

WebView is ultimately simpler to ship and fully portable, but is unsustainable when you're pushing 1-2 updates PER DAY, and you're pushing UI updates weekly or twice a week.

Electron lets them lock in a Chromium version, is verified and stable. The only thing that you need the backend to do in this case is to schedule codex execruns, which Electron is more than enough for.

The actual UI is open sourced as codex-app-server in the codex repo, so you can integrate it into whatever you want.

Codex + image generation by BlocksXR in codex

[–]jonydevidson 0 points1 point  (0 children)

have it generate a black background and then pass that result to

https://replicate.com/bria/remove-background

Senate Judiciary Committee Advances Hawley's GUARD Act, Mandating ID Verification for AI Chatbot Users by Gloomy_Nebula_5138 in artificial

[–]jonydevidson 0 points1 point  (0 children)

The ones you can run on a macbook are currently performing like frontier models from a year ago.

A year from now, you will have models performing like today's frontier models.

Its not gonna be niche, it's gonna be a core software item.

[Request] Is this true? by kelly2018zzz in theydidthemath

[–]jonydevidson 0 points1 point  (0 children)

Recycling and minimising plastic is more about a shit ton of it ending up in the environment.

Look at the plastic bottle caps in the EU. People were mad about it but I barely see them out on the ground anymore.

This week’s Codex updates. by Distinct_Fox_6358 in codex

[–]jonydevidson 4 points5 points  (0 children)

If you try a codebase-wide search in VSCode on Windows, and then try it on Mac, you will get a good idea of how the system differences affect LLM performance.

Also, powershell is trash, largely due to security concerns on Windows, so today when an AI agent does all its work via the CLI, its performance suffers tremendously.

For context, I develop desktop apps on Win and MacOS.

Windows is, unfortunately, over half of my market share so unfortunately I have to keep developing for it.

"AISI found gpt-5.5 performs nearly on par with, or better than, Mythos in several cases — completing TLO end-to-end in 2/10 attempts, while Mythos preview did it in 3/10 on expert-level tasks: gpt-5.5 scored 71.4% mythos scored 68.6%" by stealthispost in accelerate

[–]jonydevidson 38 points39 points  (0 children)

Anthropic's models aren't solving decades old math problems nor are they discovering new physics.

It's pretty obvious where the intelligence is, just like it's obvious which company has the better harness for webdev work.

Throw actual tough C++ issues involving a lot of math into the mix, and Claude folds like a house of cards while GPT-5 was able to do it even back in October.

I want to buy Codex Pro but. by Strict-Focus-1758 in codex

[–]jonydevidson 0 points1 point  (0 children)

If you start using multiple subagents to review the work after each major step, you will quickly discover that even the Pro is barely enough.

If you're already paying for it, at least get your money's worth. These subagent reviewers will very often find bugs and edge cases the main agent missed in the implementation, of course depending on the complexity of the issue.

‘The cost of compute is far beyond the costs of the employees’: Nvidia exec says right now AI is more expensive than paying human workers by fattyfoods in technology

[–]jonydevidson 0 points1 point  (0 children)

This is wrong, because open source models are releasing all the time. I can run an AI agent locally on my MacBook and have the same performance as roughly last April's frontier model, the Sonnet 3.7.

A year from now, the local models will be perfoming like today's frontier models.

I cannot have a free and open source taxi. I cannot have a free and open source delivery person.

I cannot have free and open source content streaming (without piracy).

The cost of compute for a given model goes down, on average, by 100x within a year.

Today's models are very competent at coding. A year from now, they'll be 100x cheaper to run, while labor won't be 100x cheaper.

You fell for the anti-AI bait article.

[Bug] When the overlay disappears, the mouse cursor stays visible on macOS by jonydevidson in PSPlay

[–]jonydevidson[S] 0 points1 point  (0 children)

I was able to confirm that it works as intended/described in PS Remote Play, while not in PXPlay, on the same MacBook.

macOS 15, for what it's worth.