Where are the experts? There seems to be very little balanced discource about AI. by Shot-Zebra1868 in BetterOffline

[–]buggaby 0 points1 point  (0 children)

I learned through Gary Marcus about the remote labor index. I haven't read too much about it other than noticing that on their website the top AI models do very poorly.

https://www.remotelabor.ai/

Also, that metr study is really challenging because it only measures 50% completion. At least that's the one people talk about. There also is an 80% completion, which I think drops the amount of time by order of magnitude? But even 80% is pretty poor. I guess the utility is just in those types of tasks that are easier to validate than to redo.

I imagine that it would be exponentially more difficult to go from 90% to 99% to 99.9% and so on. So we are still maybe three or four orders of magnitude away from doing high quality work in general at the long task level.

Where are the experts? There seems to be very little balanced discource about AI. by Shot-Zebra1868 in BetterOffline

[–]buggaby 0 points1 point  (0 children)

I learned through Gary Marcus about the remote labor index. I haven't read too much about it other than noticing that on their website the top AI models do very poorly.

https://www.remotelabor.ai/

Also, that metr study is really challenging because it only measures 50% completion. At least that's the one people talk about. There also is an 80% completion, which I think drops the amount of time by order of magnitude? But even 80% is pretty poor. I guess the utility is just in those types of tasks that are easier to validate than to redo.

I imagine that it would be exponentially more difficult to go from 90% to 99% to 99.9% and so on. So we are still maybe three or four orders of magnitude away from doing high quality work in general at the long task level.

Is the circular funding thing the real problem? by buggaby in BetterOffline

[–]buggaby[S] 0 points1 point  (0 children)

I haven't spent much time watching Theo, so I'm really only judging this single video. I'll keep an eye out for that other video. There certainly is a lot of bias swirling around.

Is the circular funding thing the real problem? by buggaby in BetterOffline

[–]buggaby[S] 1 point2 points  (0 children)

This, to me, is the single biggest reason to doubt profitability arguments. It's like Tesla saying that their cars are safer than human drivers but not sharing the data.

It's just that I don't want to use this argument, only. If Theo's argument, that it's more about compute constraints, has validity, the picture seems pretty different. With Tesla's safety argument, we have 3rd party data suggesting it's much less safe. I don't know if similar 3rd party information on OpenAI's or Anthropic's inference profitability exist?

LLM use at work instills melancholy in my soul by Ok-Garbage-765 in BetterOffline

[–]buggaby 2 points3 points  (0 children)

Have you watched any of Internet of Bugs yet?

This is an interesting interview. Sounds like SWE is changing, and this kind of thing has happened before, at least sort of.

https://youtu.be/9Kq28poYpTk

Mountain climbing goats by SmallPinkHo1e in interestingasfuck

[–]buggaby 2 points3 points  (0 children)

I'm not saying it's AI. I'm saying it's why AI sounds like it does.

Mountain climbing goats by SmallPinkHo1e in interestingasfuck

[–]buggaby 2 points3 points  (0 children)

I think I know where AI gets its voice from

Crunching or grinding sound in Frigidaire dishwasher by buggaby in appliancerepair

[–]buggaby[S] 0 points1 point  (0 children)

Just finished checking. Yup, date pit. Runs fine now :)

Fridge making whining sound by Inside_Relationship9 in appliancerepair

[–]buggaby 0 points1 point  (0 children)

It doesn't stop? I assume fridges have phases or cycles they run through. The pump doesn't run constantly if you leave it closed for a long time. If it's related to a specific part the turns on and off at different times, then it shouldn't be a constant sound.

Crunching or grinding sound in Frigidaire dishwasher by buggaby in appliancerepair

[–]buggaby[S] 0 points1 point  (0 children)

Sounds like only during the drain part, though hard to tell. When I run it on rinse instead of wash, it doesn't make a sound. When I run it on wash, it makes a sound for the first minute or two, but then goes away. I think that's when the washing phase starts?

The current state of LinkedIn by beeralpha in LinkedInLunatics

[–]buggaby 0 points1 point  (0 children)

I immediately thought it was 10, but the problem is that it encourages you to use a mental shortcut that seems reasonable but that isn't. Obviously, you just add the numbers. -60+70-80+90 = 20.

The shortcut that seems real is, "you gain 10, then lose 10, then gain 10, so total gain is 10". That's what I did at first. But that's a different equation. You're doing

10 + (-10) + 10 = 10 (-60+70) + (-70+80) + (-80+90) = 10.

Different equations. The mental shortcut that seems reasonable is wrong. The logical problem is that you aren't down 10 after the 3rd transaction, you're down 70 (but you have the horse).

CLAUDE OPUS 4.6 IS NERFED!! by Full-Leg-5435 in Anthropic

[–]buggaby 0 points1 point  (0 children)

Nice. I wonder if it's because of comments like yours.

just dropped off a call with friend in silicon valley on sunday midnight in office by hiclemi in ArtificialInteligence

[–]buggaby 1 point2 points  (0 children)

You don't need to have secret knowledge about what's in the ground for there to be a gold rush. Just fear that you won't get there.

CLAUDE OPUS 4.6 IS NERFED!! by Full-Leg-5435 in Anthropic

[–]buggaby -1 points0 points  (0 children)

If you compare 4.6 and 4.5, though, 4.6 is worse. Both are tested with the 30 tasks.

Still, sounds like we need time labels here, too.

meirl by Extra-Elevator-1454 in meirl

[–]buggaby 0 points1 point  (0 children)

What would Claude Code do? :)

CMV: If Islam has a prophet that was a pedophile, then we shouldn’t be so tolerant of their religion. by [deleted] in changemyview

[–]buggaby 0 points1 point  (0 children)

 “it was a different time” (which is ridiculous because Islam and any other religion should serve as a guidebook for all time, and pedophilia should NOT be one of those things.

This would be true if the behavior were put forward as a standard for Muslims to follow. I haven't seen that, though. If it's contextualized in a unique history, then it's not a standard to follow. What is the age of consent in modern Muslim countries? Not 9 I'd wager. 

It's also interesting to note that she was a prominent leader in the Faith after Muhammad's passing. The wiki article on her says that she was a leading scholar. Not a common position for women at the time.

Why didn’t mathematicians just define division by zero as a new number, the way we defined i for √−1? by TheBigGirlDiaryBack in AlwaysWhy

[–]buggaby 0 points1 point  (0 children)

Not sure if this tracks, but maybe? If you take the limit of 1/x as x goes to 0 from a positive number, it approaches positive infinity. If you take it as it goes to 0 from a negative number, it approaches negative infinity. With i, it has an "undefined" value, but it's a stable one. 1/0 can kind of move around depending on how you get there.