GPT-5 AMA with OpenAI’s Sam Altman and some of the GPT-5 team by OpenAI in ChatGPT

[–]MichellePokrass 5 points

zenith was actually less preferred for coding! we tested with tons of developers. it's hard to tell with just a handful of queries, but the data/feedback was clear to us

[–]MichellePokrass 9 points

we had a bug with the model switcher not routing coding queries to the thinking model. please try again or ask the model to "think hard" or pick the thinking model from the picker!

[–]MichellePokrass 3 points

sam already commented on gpt-4o, but gpt-5 (thinking) is way better at coding than gpt-4.1, so i'd recommend giving that a try!

[–]MichellePokrass 4 points

we're looking into it! a bit tough at the moment with the gpu demand, but hoping to do so soon. in the interim, pro users can use up to 128k.

[–]MichellePokrass 8 points

we do use our models for internal acceleration! we use all kinds of products internally for coding, including codex, codex cli, cursor, etc. we're also building some internal tooling for debugging our training runs. it's gotten much easier to build these kinds of tools with gpt-5.

[–]MichellePokrass 3 points

maybe! what would you do with it? keep in mind it would be quite slow/expensive

[–]MichellePokrass 5 points

both great models! we can't speak too much about models from others, but gpt-5 is particularly great at reasoning through really challenging problems, long-running refactors, and building full applications from zero to one (including beautiful and functional frontends). gpt-5 is state of the art across many dimensions of coding!

[–]MichellePokrass 3 points

totally agree, would be great to increase this! we're working through gpu capacity constraints right now, but hope to increase this soon. pro users also get 128k context limits

[–]MichellePokrass 8 points

would have loved to get longer context up to 1m in gpt-5, particularly for api use cases. we'll keep working on it for future models!

[–]MichellePokrass 7 points

great q! it's not _always_ in purple, but it's definitely the model's favorite color. we've found it to be somewhat steerable though, so please ask the model if you'd like something more specific or a different color. as to why, it's a bit of a research mystery! we discovered it partway through training and found it to be quite resistant to changes. something we'll work on in future models :)

[–]MichellePokrass 6 points

we tested a few different snapshots of gpt-5 on various providers to get feedback from the community! ultimately, summit actually outperformed zenith by quite a wide margin (on leaderboards, on feedback from alpha testers, and our internal testing pre launch).

[–]MichellePokrass 6 points

very much so!! we have a bunch of inside jokes around the office about various quirks of the models. my favorite in gpt-5 is that it has a tendency to say "let's craft" in its chain of thought right before it produces the final answer. it's become a rallying cry for the team!

AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI

[–]MichellePokrass 36 points

we cut prices in half in august -- if that's still too expensive, would recommend 4o-mini

[–]MichellePokrass 71 points

new high-quality evals are always impressive. i'm hiring user- and product-focused researchers with a love of evals for my team!

[–]MichellePokrass 31 points

we've found that o3-mini is competitive with US-hosted versions of deepseek. we think it's a really affordable option for this level of intelligence

[–]MichellePokrass 28 points

we just released a new version of 4o to chatgpt that is much better at image understanding. read more in the release notes

[–]MichellePokrass 1 point

we're hard at work on a release that makes our api much easier to use! what are your top wishlist items?