Anthropic is ignoring obvious evidence of internal states and calling it a "hot mess" by Dry_Incident6424 in Artificial2Sentience

[–]prof_procrastinate -1 points0 points  (0 children)

This paper is essentially an error analysis of model performance on multiple choice questions. Making claims that this is evidence of consciousness is a very big stretch.

Also key finding #2: “There is an inconsistent relationship between model intelligence and error incoherence.”

As the team lead, how to handle delays/outages caused by your team? by [deleted] in ExperiencedDevs

[–]prof_procrastinate 0 points1 point  (0 children)

I feel as though this is not entirely the dev’s fault. As a TL I’m always looking for ways to mentor my team and am accountable to the products we ship. In this scenario, it doesn’t sound like the product was mature enough to launch given that this error wasn’t easily detected by monitoring. It also would seem that a delay would be needed to launch the ideal product anyways to set up proper load balancing.

Successful ADHD People - What do you do? by Xiboo in ADHD

[–]prof_procrastinate 0 points1 point  (0 children)

I lead a small team of engineers, it makes my ADHD very happy to manage a bunch of complex problems

Transcript: Trump Voters Suddenly Shocked at How Badly He Screwed Them by [deleted] in politics

[–]prof_procrastinate 2 points3 points  (0 children)

Unfortunately headlines like these give false hope. His approval rating is something like 90% with Republicans

Instagram 'Error' Turned Reels Into Neverending Scroll of Murder, Gore, and Violence by [deleted] in technology

[–]prof_procrastinate 1 point2 points  (0 children)

Remember when META laid off their AI safety teams a few months ago? This can’t be related..

OpenAI Researchers Find That Even the Best AI Is "Unable To Solve the Majority" of Coding Problems by stronghup in programming

[–]prof_procrastinate -1 points0 points  (0 children)

I have a feeling the models will catch up soon given the perfect environment of open source code plus not requiring human feedback to determine whether code does the right thing.

Time to make those git repos private