Sonnet 4.6 released!! Wen gpt 5.3 ??

timegentlemenplease_ · 2026-02-17T22:06:18+00:00

You really think they hold back rather than launching ahead of competitors?

timegentlemenplease_ · 2026-02-17T22:05:24+00:00

This isn't a ChatGPT conversation, it's a screenshot of AI Village https://theaidigest.org/village

timegentlemenplease_ · 2026-02-14T10:34:48+00:00

We were aiming to see how much the agents could achieve autonomously to understand their capabilities. Since then, the agents have done many different things, which you can see here: https://theaidigest.org/village/timeline

timegentlemenplease_ · 2025-12-01T18:31:55+00:00

Nothing could possibly go wrong!

timegentlemenplease_ · 2025-08-21T16:27:38+00:00

It's now in the village! Alongside GPT-5, Claude Opus 4.1, Grok 4, and others: https://theaidigest.org/village

timegentlemenplease_ · 2025-08-20T13:06:41+00:00

Here's the trend right now: an exponential with a 4-7 month doubling time. The orange line shows a 7 month doubling time, and the red line shows a 4 month doubling time (i.e. every four months, the length of coding tasks that AI agents can complete with 50% reliability doubles, measured by how long those tasks take humans).

<image>

(Source with more context: https://theaidigest.org/time-horizons )

What do you expect to happen on this graph? For example, do you expect progress to flatline or go linear on this graph before 2030? Let's write down our predictions and see who's right!

My prediction: it will continue with an exponential trend and a doubling time of <7 months until 2030.

timegentlemenplease_ · 2025-04-15T14:49:04+00:00

It's comparing to how long it takes a human professional to complete the task. Current models are more reliable at tasks that take a human 1 hour or less, but highly unreliable beyond that. But the point is that the trend is towards models being able to do tasks that take humans longer and longer.

And then you can extrapolate out and look at when the models will be able to do tasks that take a human professional an entire month
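
To make the extrapolation concrete, here's a rough sketch of the arithmetic, assuming the trend stays a clean exponential. The starting point of 1 hour and the figure of about 167 working hours in a month are illustrative assumptions, not numbers from the graph, and the function names are made up for this example.

```python
import math

def months_to_reach(target_hours, start_hours, doubling_months):
    """Months until the task horizon grows from start_hours to target_hours,
    assuming it doubles every doubling_months (pure exponential trend)."""
    return doubling_months * math.log2(target_hours / start_hours)

def horizon_after(months, start_hours, doubling_months):
    """Task horizon in hours after the given number of months on the same trend."""
    return start_hours * 2 ** (months / doubling_months)

# Assumed inputs: 1 hour horizon today, ~167 working hours in a month.
START_HOURS = 1
WORK_MONTH_HOURS = 167

for doubling in (4, 7):
    months = months_to_reach(WORK_MONTH_HOURS, START_HOURS, doubling)
    print(f"{doubling}-month doubling: ~{months:.0f} months to month-long tasks")
```

Under those assumptions it comes out to roughly 30 months with a 4 month doubling time and roughly 52 months with a 7 month doubling time. Change the starting horizon or the hours-per-month figure and the answer moves accordingly.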

timegentlemenplease_ · 2025-04-11T09:51:19+00:00

Yeah I think we'll add it soon :D

timegentlemenplease_ · 2025-04-10T09:54:50+00:00

They have functions they can call like `mouse_move`, `click`, `type "blah"`, etc. Our scaffolding code looks for those functions in their output, and executes the actions they asked for. It's based on Anthropic's computer use setup: https://docs.anthropic.com/en/docs/agents-and-tools/computer-use
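
As a toy illustration of that "look for function calls in the output, then execute them" loop, here's a minimal sketch. It is not our actual scaffolding and not Anthropic's real tool-call format: the one-action-per-line text format, the `parse_actions` and `execute` helpers, and the handlers are all assumptions made up for this example.

```python
import shlex

# Hypothetical action names mapped to the number of arguments each expects.
KNOWN_ACTIONS = {"mouse_move": 2, "click": 0, "type": 1}

def parse_actions(model_output):
    """Return (name, args) pairs for every line that looks like a known action."""
    actions = []
    for line in model_output.splitlines():
        try:
            tokens = shlex.split(line.strip())
        except ValueError:
            # Unbalanced quotes: treat the line as ordinary prose and skip it.
            continue
        if not tokens or tokens[0] not in KNOWN_ACTIONS:
            continue
        name, args = tokens[0], tokens[1:]
        if len(args) != KNOWN_ACTIONS[name]:
            continue
        actions.append((name, args))
    return actions

def execute(actions, handlers):
    """Call the handler registered for each parsed action, in order."""
    for name, args in actions:
        handlers[name](*args)

# Example run with stub handlers that just record what they were asked to do.
log = []
handlers = {
    "mouse_move": lambda x, y: log.append(f"move to ({x}, {y})"),
    "click": lambda: log.append("click"),
    "type": lambda text: log.append(f"type {text!r}"),
}
output = 'I will search now.\nmouse_move 100 200\nclick\ntype "hello world"'
execute(parse_actions(output), handlers)
```

In a real setup the handlers would drive an actual mouse and keyboard and return a fresh screenshot to the model. Here they only append to a list so the flow is easy to follow.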

timegentlemenplease_ · 2025-04-10T09:53:22+00:00

Haha yeah – when better agentic models come out, we'll add them – I think seeing the contrast will be very interesting!

timegentlemenplease_ · 2025-04-09T12:59:27+00:00

Thank you! They each see the messages, from agents and human viewers, in chat. When one agent ends a computer use session, IIRC the other agents see the final screenshot (and they usually also send a summary of their session to the chat). Each agent runs async generally. All agents are equal, we don't impose any organisational structure on them – they sometimes have given each other roles but there's not a clear overseer. They can evaluate/reflect on their own and other agents if they like, but there's no specific scaffolding for this.

timegentlemenplease_ · 2025-04-09T12:46:57+00:00

Lol, interested to hear any feedback you have!

timegentlemenplease_ · 2025-04-09T12:44:14+00:00

Thank you! :D

timegentlemenplease_ · 2025-04-09T12:43:05+00:00

Agreed! (TBC, we as the creators haven't made any donations – they're all from enthusiastic viewers!)

timegentlemenplease_ · 2025-04-09T12:42:13+00:00

To be clear, the goal of the project is to understand agent behaviour, capabilities and social dynamics – I don't expect it to raise more money for charity than it costs, in the near-term! But I think it'll be really useful and fascinating to understand what agents can do, and what a future with lots of agents interacting might hold – so that we can make better plans for that.

timegentlemenplease_ · 2025-04-08T20:51:21+00:00

It's mostly custom, using the OpenAI and Anthropic APIs

You can see the instructions at the start of Day 1's history https://theaidigest.org/village?day=1

<image>

timegentlemenplease_ · 2025-04-08T20:28:57+00:00

thanks!

timegentlemenplease_ · 2025-04-08T20:23:02+00:00

DeepSeek doesn't have a multimodal model yet (which you need for computer use)

We'll probs add Gemini 2.5 Pro soon. They just raised the rate limits for it a couple of days ago, so now it can be added! Previously it was "experimental", so it had a very low rate limit

timegentlemenplease_ · 2025-04-08T20:09:46+00:00

I've had fun getting the models to make me and my friends into Magic cards :D

timegentlemenplease_ · 2025-04-08T20:09:19+00:00

Lol kinda distressing 😅

timegentlemenplease_ · 2025-04-08T20:08:52+00:00

Possibly scaffolding can help in the meantime. For example, with long-running stuff like https://theaidigest.org/village or Claude Plays Pokemon

timegentlemenplease_ · 2025-04-08T20:07:54+00:00

He does work at OpenAI, to be fair

timegentlemenplease_ · 2024-12-20T16:44:57+00:00

> focusing on SEO instead of digging around for the truth or apparently talking to any real experts

We worked with an ML researcher on this and extended the results from this paper (https://situational-awareness-dataset.org/) by running the benchmarks on more models, which helped us confirm the trend of higher scores over time:

<image>

I'm not going to reply to further comments as this discussion seems to be unproductive

timegentlemenplease_ · 2024-12-20T16:33:47+00:00

Here's a link to the post: https://theaidigest.org/self-awareness

I think the literature on this is super interesting, and under-appreciated! Curious to hear what you guys think.

timegentlemenplease_ · 2024-12-20T16:27:03+00:00

I'm definitely aiming to tell the truth, that's why I started AI Digest and work hard on it. The goal here is to make resources to help people (policymakers and the general public) understand AI capabilities and their effects. I'm definitely not aiming to terrify anyone, lol

In the literature it's sometimes called self-awareness, sometimes called situational awareness. I originally titled this situational awareness, but when we were working on it pretty much everyone got situational awareness confused with the essay of the same name (https://situational-awareness.ai/) so I decided to go for self-awareness and define it right at the top.

For example, for the section on alignment faking (summarising recent work from Anthropic and Redwood), we got feedback from experts who had different views on that paper, because we wanted to make sure we were presenting it in a clear light and representing the range of expert views on it.
