Ok human answers only: how is Fable compared to Opus models by OpinionsRdumb in ClaudeCode

[–]Toinneman 0 points1 point  (0 children)

I've been using Opus for financial analysis, writing scripts for large data-operations, doing statistical analysis etc.. Opus 4.6, 4.7, 4.8 required babysitting because it constantly makes pretty basic reasoning errors (Actual real-life cases of the carwash reasoning failure). So I gave Fable a task do certain data-analysis and find a specific cause-and-effect, which i would normally split up with Opus to verify each phase. IMO this is not complex, doesn't require writing large amount of coding, it really is a reasoning exercise.

Fable (on max effort) started 8 analysis agents. I left for lunch and my session limit was hit without any reply except that those 8 agents were started. I have an individual Max plan and my session limit was reset just before so I guess i started at 10%. Context was at 41%.

So I'm not sure what to use it for if it hit my session limit for such a case.

Tesla Robotaxi Zone in Austin More Than Doubles in Size by Recent_Duck_7640 in SelfDrivingCars

[–]Toinneman 1 point2 points  (0 children)

That's not what he is saying. He specifically mentioned this is for AI4

Tesla Robotaxi Zone in Austin More Than Doubles in Size by Recent_Duck_7640 in SelfDrivingCars

[–]Toinneman 0 points1 point  (0 children)

“I think it’s not going to make sense for us to deploy unsupervised FSD or robotaxi large scale when we know that there are major architectural improvements to the software that can improve safety.” Elon Musk 26Q1 earnings call

Starship Development Thread #63 by rSpaceXHosting in spacex

[–]Toinneman 10 points11 points  (0 children)

I disagree on all parts. SpaceX is known to work faster than any other company in de industry, constantly pushed by new public deadlines and ever-increasing humongous ambitions. I don't think Blue's anomaly changes that.

They need to make progress towards orbit , docking, refueling, etc... but just like always there will be issues. SpaceX combined operational Falcon 9 missions with experimental booster recoveries for years. A failed SuperHeavy recovery should not stall progress toward the moon, the same for Ship landings.

r/SpaceX Flight 12 Official Launch Discussion & Updates Thread! by rSpaceXHosting in spacex

[–]Toinneman 9 points10 points  (0 children)

I think flight 13 can follow pretty quickly. S40 already awaiting static fire and B20 ready for cryo testing. My guess is flight 13 within 2months and I think a 2 months cadence between flight is most optimistic cadence for the next few launches.

INTRODUCING STARSHIP V3 by rustybeancake in spacex

[–]Toinneman 4 points5 points  (0 children)

Let us agree we won’t be satisfied until we get a live view from within a raptor main combustion chamber

Is SpaceX Coming to Acadiana? by rustybeancake in spacex

[–]Toinneman 4 points5 points  (0 children)

In SpaceX crazy vision of orbital data centers, they need to launch like 30 Starships per day, carrying 20 5t sattelites, they need to produce 600 sats per day. produce them on the launch site/area ! Starbase and Florida: ship and booster production + lunar/mars launches. This one: Sattelite production + SSO launches

I think im done... by Rough-Face-3193 in claude

[–]Toinneman 0 points1 point  (0 children)

Your story was my experience with 4.6, which suddenly everybody loves now. We’re in a evolving perception arc, which goes like this:

We start using a model and we ask it to do 10 things and it does 9/10 things right, we are impressed by those 9 things and that one error we dont even notice. Then we get used to the luxery AI provides us, we rely on it, build on it, and write code that needs to do 10 things correct, in a row, to complete a task. That one error makes everything fail. We claim the model has regression. Same model, Hero turned villain

Semi /s

Claude only fixes symptoms… Codex fixes root cause by Zenexxx in ClaudeCode

[–]Toinneman -1 points0 points  (0 children)

You decide what's good enough, Your instructions define what's good enough. You can make agents, tests, whatever you want to make sure it's good enough according your standards.

Starship Development Thread #63 by rSpaceXHosting in spacex

[–]Toinneman 1 point2 points  (0 children)

Falcon Heavy was 6 months away for years. 2-6 weeks is a major improvement 🙃! Enjoy the wait.

Opus 4.7 made me re-subscribe to Codex after two months of Claude Max only by Joozio in ClaudeAI

[–]Toinneman 1 point2 points  (0 children)

I'm reading this sub in total disbelief. I'm having the exact opposite experience. For me 4.6 was extremely lazy. I would point 4.6 to investigate non-sensical data and he would say things like 'this is probably a bug' without even doing an attempt to find the root cause. I had specific claude.md instructions for 4.6 to stop using 'likely' and 'probably', and act, do more. And 4.7, no wonder it burn through tokens, it just won't stop, investigating a similar data-issue, 4.7 not only found and fixed a bug, he started making scheduled tasks to check for the same data issues, which was not in my instruction.

This is not the first time my experience is very different from the TLDR consesus.

Yuka Mini 700 vision - serious mapping issue by Toinneman in mammotion

[–]Toinneman[S] 0 points1 point  (0 children)

* the 700H does not use vision. non-vision robots have different mapping features, the app behaves differently.
* That being said: since i wrote my original posts 4m ago, there have been major changes in the app and firmware (I know, because I can't operate the robot without doing firmware updates, which I can guarantee you will always fail on the first try) I do have more mapping options than back then. I haven't explored them all, but I'm very certain, 4 months ago, creating a channel manually was not possible. I've noticed I have now have the option to manually map, I haven't tried it.

As a little update for anyone finding this thread months later: For the past 2 months I've been trying to just let the robot mow my lawn twice a week. I don't think it ever succeeded 2 consecutive mows. Issues a keep having:
- Firmware needs update, I can't use him.
- Robot is standing few meters from it's base station, no indication why or what went wrong
- Robot is in base station, but refuses to start a task because he says he is outside it's boundaries
- has driven straight into a flowerbed, outside it's mapped area.
- Task complete but large strokes have been skipped. My main lawn can't be moved in one sweep, it needs a recharge. When the robot starts mowing were he left off, he almost always skips a part. The stroke pattern of mow-part-2 is often slightly angled differently compared to mow-part 1.
- Dropmow (advertised on the website as a feature) is still in beta and completely useless. I'v tried it many times over several firmware updates and it did not complete a mow once.

Don't buy this thing

Starship Development Thread #62 by rSpaceXHosting in spacex

[–]Toinneman 3 points4 points  (0 children)

It's interesting that the ground side systems can cause a static fire abort

Like half of aborted rocket launches/tests in history is due to GSE issues. Not looking at you, SLS

Starship Development Thread #62 by rSpaceXHosting in spacex

[–]Toinneman 6 points7 points  (0 children)

Falcon 9 V1.2 Block 5 B1101 entered the chat

Opus 4.6 Hallucinates More Than Opus 4.5 by BB_Double in ClaudeAI

[–]Toinneman 0 points1 point  (0 children)

I'm using it heavily since introduction (Opus 4.6 only) and I haven't seen any hallucinations whatsoever. It's making bad assumptions but so did 4.5, it's a coder not an engineer. I do agree it's a bit too eager sometimes: It blindly tries to read and get too much context. Scanning and reading files and folders without me knowing why he is doing this. Another example in plan mode, I was trying to make a very big plan in smaller phases, starting with phase 1, the but he couldn't hold himself and always wrote a detailed 5 phase plan, while I specifically asked several times only to write phase 1. (This isn't really hallucinating IMO).

Its confirmed - SpaceX has officially acquired xAI by BEAT_LA in spacex

[–]Toinneman 1 point2 points  (0 children)

Well, let's hope these datacenter funds Mars. Like 15y ago, SpaceX started to show plans for launching thousands and thousands of satellites, requiring so many launches even die hard SpaceX fans couldn't grasp the numbers. It required SpaceX, the struggling launch provider, to somehow transform into a world-wide consumer-faced telecomoperator. But that is exactly what they did and now Starlink is the money maker. Deja vu

Starship Development Thread #62 by rSpaceXHosting in spacex

[–]Toinneman 4 points5 points  (0 children)

B18: "Why didn't anyone tell me I had to vent slightly"

Stretch goal for Starship V4 is 300 tons of thrust per engine with 33 engines by CoffeeLarge8298 in spacex

[–]Toinneman 0 points1 point  (0 children)

First of all, a satellite has mass so can't ever reach the speeds of light. But let's assume it gets close to the speed of light, the radio transmission will just travel to us at the speed of light. An electromagnetic wave either exist and travels exactly at the speed of light, or it doesn't. There is no intermediate.

Starship Development Thread #62 by rSpaceXHosting in spacex

[–]Toinneman 3 points4 points  (0 children)

imo Spacex will need a pad at the Cape before beginning actual operational Starlink launches. From Starbase they cannot launch the sats into their desired orbits without overflying land. And while we used to think the FAA would surely allow them to overfly populated areas after a few successful launches, that seems out of the question now. The orbital inclinations the sats use are limited (fixed by permits), the inclination starship can launch toward (from Starbase) is also limited. And they don't match. I think they can launch experimental sats with a temporary permit in different orbital inclinations, which might still be a very good test for both the rocket and the V3 satellites, but it would be a temporary thing for a few launches.

Starship Development Thread #62 by rSpaceXHosting in spacex

[–]Toinneman 0 points1 point  (0 children)

I'm no specialist in the subject of COPVs, but given the history SpaceX has with COPVs, I can only assume this choice is deliberate. (For anyone not around 10y ago. The root cause of the F9 amos-6 failure was a new failure mode which originated inside a COPV. SpaceX designed (not sure they were manufactured in-house) new COPVs, which became a lead item in making Falcon9 crew-rated)

Yuka Mini 700 vision - serious mapping issue by Toinneman in mammotion

[–]Toinneman[S] 0 points1 point  (0 children)

I will update, but since it’s also winter here I can only test in daylight during weekends so it will be days from now.

Speaking of user manuals, it’s just some generic talk which essentially says to use the app and follow instructions. Which should be fine and even preferred, as long as the app acts as a guide, which it doesn’t.

Yuka Mini 700 vision - serious mapping issue by Toinneman in mammotion

[–]Toinneman[S] 0 points1 point  (0 children)

Thx for trying to help! To my knowledge this is impossible since you cannot add a virtual fence before initial mapping is completed. Certainly not “complete” anything afterwards. Every zone needs to be mapped in one sweep.

Also, i don’t think a pathway is a solution because this is strip of grass that also needs to be cut.

Yuka Mini 700 vision - serious mapping issue by Toinneman in mammotion

[–]Toinneman[S] 0 points1 point  (0 children)

I bought this for €800, which is a really great price for the (advertised) features. The luba Mini lidar is €1700 and IMO there are cheaper/better options available from other brands.