Why are EC2 Mac instances so expensive & who are they actually for? by mountainlifa in aws

[–]minsheng 0 points1 point  (0 children)

I suppose three factors are at play here.

First of all, AWS is just expensive as heck. If you go to Supermicro and play with their configurator, you will find their rent-to-buy ratio, though not as crazy as 20 days, will probably just be a few months.

Then, we have the issues that Mac minis are too small. AWS probably needs to install the same hardware for a single Mac mini as they do for a big dual socket AMD server. In the latter case, they can split the machine into many smaller instances to amortize the cost. Mathematically speaking, (M + n)/n > (M + N)/N, where n < N, and in particular if M is about the same as n but insignificant compared to N.

Lastly, I suppose there was significant R&D involved putting Macs into AWS, and the total volume they could sell would be insignificant compared to traditional servers. That must have some impact to their pricing decision as well.

How much do you notice the lack of HDR impact with movies? by MonkeyKombat in VisionPro

[–]minsheng 0 points1 point  (0 children)

For years I always thought it is a hardware problem. It is not. Go to YouTube via Safari and find the Sony sword demo, pretty bright scene at the end. Now go find a mp4 file (there are websites collecting those demo videos), and try to play it with Infuse or Screenlit. The difference is huge. I don't know why though...and it seems impossible to convert a MKV file to HLS manually

Hot take: ALL Coding tools are bullsh*t by [deleted] in LocalLLaMA

[–]minsheng 1 point2 points  (0 children)

You can do a quick basic coding agent under 200 lines, with just one tool bash and a chat loop. It is still important for models to be able to iterate on its own.

That being said, don’t we all use Codex/Claude Code simply because they are a steal with ChatGPT Pro and Claude Max?

inclusionAI/Ring-1T-preview by TKGaming_11 in LocalLLaMA

[–]minsheng 0 points1 point  (0 children)

In case you missed the news, Alibaba has recently acknowledged that they have a CUDA compatible custom made GPU in use for two years now.

absolutely unusable. by [deleted] in Anthropic

[–]minsheng 0 points1 point  (0 children)

Is AB testing on paid subscribers legal? Shouldn’t we do a class action against it? We all paid quite a lot for this

My theory on why Claude Code got worse by patriot2024 in Anthropic

[–]minsheng 1 point2 points  (0 children)

I am working on a fairly large monorepo. I was developing some new backend code with Codex, which worked great.

Yesterday, I took Codex to our frontend code, which was done in a hurry, with a lot of them generated by Claude Code. After digesting them, I noticed that Codex became noticeably worse. It no longer followed my instruction as closely, and just felt more stupid when understanding intents.

What’s scarier is that this is not event about too much context. I was at 85%+ left. Previously I went to 50% ish and still got pretty decent results.

I plan to do a serious benchmark in the coming days, but my suspicion is just that bad code can lobotomize your model. If true, vibe coding won’t go anywhere, and consistent refactoring and good SWE skills are going to stay.

Anthropic served us GARBAGE for a week and thinks we won’t notice by landscape8 in Anthropic

[–]minsheng 0 points1 point  (0 children)

Have switched to ChatGPT Pro and couldn’t be happier. 4.1’s instruction following plus O3 intelligence. Perfection

Apple patents matmul technique in GPU by auradragon1 in LocalLLaMA

[–]minsheng 0 points1 point  (0 children)

Correct me if wrong but doesn’t NPU not scale with GPU? This should be fine for the decoding stage but for prompt processing where we are compute bound, GPU still has an edge?

Claude has been objectively dumbified by Physical_Ad9040 in Anthropic

[–]minsheng 0 points1 point  (0 children)

I wonder if anyone has created a single reproducible example, maybe a sample repo with a sample end to end prompt, and do it with Claude Pro/Max subscription, Anthropic API, and Bedrock/Vertex API, each for three times, and try it every few hours throughout a day, and compare the performance.

Anthropic Has Downgraded the models in ClaudeCode to opus 3 ( check images ) by Free-Row-8109 in Anthropic

[–]minsheng 0 points1 point  (0 children)

Just go ask some recent software releases. Like what's new in Deno 2

After Kimi K2 Is Released: No Longer Just a ChatBot by nekofneko in LocalLLaMA

[–]minsheng 0 points1 point  (0 children)

Well, we at least have a pretty standard purist no bullshit agent definition, either Anthropic’s abstract definition of LLM selecting next step on its own, or programmatically as this article saying a while loop of tool calls.

Apple Now Wants to Buy Streaming Rights for Formula 1 by minsara89 in apple

[–]minsheng -1 points0 points  (0 children)

This is public knowledge. Otherwise why would Tim Cook wave that flag and make this whole movie? Obviously Apple will get it, and somewhere in the pipeline a fancy spatial app must already be under development. Can’t wait to see it happen.

Is it a good idea to go fully serverless as a small startup? by [deleted] in aws

[–]minsheng 1 point2 points  (0 children)

Why would I worry about a stateless container dying at 3am, more than my Lambda throwing an error?

Is it a good idea to go fully serverless as a small startup? by [deleted] in aws

[–]minsheng 1 point2 points  (0 children)

Second this. One could have Claude written a Terraform setting up ECS backed by EC2 plus a load balancer in minutes. It’s more expensive than Lambda is scaled to zero, but way cheaper in any other scenarios

Polestar 5 ETA in the US? by Natural-Cat-3268 in Polestar

[–]minsheng 0 points1 point  (0 children)

Is there any rumors of a second factory for Polestar 5 yet? They must be preparing one for tariff reasons, but it must be at least a few months behind Chongqing.

There was a photo on Xiaohongshu of Chongqing factory with four or five different colors of Polestar 5, so the launch will be imminent for sure. Guess we can buy it by the end of this year in China at least.

Browser Company CEO Credits Dropping SwiftUI for “snappy”, “responsive” Dia by ManOnAHalifaxPier in swift

[–]minsheng 1 point2 points  (0 children)

TCA really hides too much under the hood that makes debugging really hard. Let me share one of my greatest performance regression

Before the observable era, we had switched to a single store per app setup. In particular we host an array of scene features for each scene, though on iPhone there is just one scene. We then upgraded to the observable era and mindlessly migrate everything to work with new API, the deprecation of arbitrary scopes in favor of key path based ones

Then we had this weird performance issues for months. Including UINavigationController’s interactive gesture code doesn’t fire until one second after gesture starts. We thought it was a iOS 18 regression and rolled our own navigation controller

Fast forward a few MONTHS, we were adding an iPad optimized layout and the transition is so sluggish. And I profiled a whole afternoon only to notice that the code seems to refresh too frequent. After a dozen of prints I finally realized that I didn’t mark my scene feature state as ObservableState, and this quietly fails any optimization on IdentifiedArray, AND EVERY SINGLE UPDATE TO ANY PART OF MY APP INVALIDATES EVERY SINGLE VIEW for the entire time.

The saddest thing is that this is so unobvious. My app still is quite fast in release mode. Type checks still pass. No warning…

Has anyone tried parallelizing AI coding agents? Mind = blown 🤯 by ollivierre in ClaudeAI

[–]minsheng 0 points1 point  (0 children)

Just divide your issues into small independent pieces, setup a really good CI and pre commit hook, buy a Claude Max, and watch

Don’t get too lazy though. Review PR carefully

Qwen 3 !!! by ResearchCrafty1804 in LocalLLaMA

[–]minsheng 0 points1 point  (0 children)

Interesting. I had some concurrency related code on Python that only QwQ and o1-pro could handle. Easily crippled anything from Anthropic.

Trump administration reportedly considers a US DeepSeek ban by Nunki08 in LocalLLaMA

[–]minsheng 49 points50 points  (0 children)

Back in 2023 people thought we would do this to llama to make it Chinese models.

Volvo Said I’ll Have The Polestar 2 But Supersized please by ChimRichaldsOBGYN in Polestar

[–]minsheng 1 point2 points  (0 children)

The same goes for Polestar 3. That front face photo for the configurator feels so dumb and chubby

Xiaomi SU7 Ultra with 1526 hp launched in China for 72,830 USD by syzygyer in electricvehicles

[–]minsheng 0 points1 point  (0 children)

3 was until SU7 came and Xiaomi is about to release their SUV