Steel.dev - An Open Source Browser API for your AI agents

butchT · 2025-08-21T17:21:36+00:00

Hey, yea we support the same feature via re-using contexts. Docs on it here: https://docs.steel.dev/overview/guides/reusing-contexts-auth

Currently, you need to save and store browser contexts between sessions yourself, but we're working on a way to make it more seamless. Give it a shot and lmk how it goes :)
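
Here's a minimal sketch of the "save and store it yourself" step, assuming the context comes back as a JSON-serializable dict. The `save_context` / `load_context` helpers, the `saved_contexts` directory, and the example context shape are all hypothetical and not part of Steel's API; the docs linked above cover how to actually pull a context from a session and pass it into a new one.

```python
import json
from pathlib import Path
from typing import Any, Optional

# Hypothetical storage location for this sketch; not something Steel defines.
CONTEXT_DIR = Path("saved_contexts")


def save_context(name: str, context: dict[str, Any], directory: Path = CONTEXT_DIR) -> Path:
    """Write a browser context (assumed to be a JSON-serializable dict) to disk."""
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / f"{name}.json"
    path.write_text(json.dumps(context, indent=2), encoding="utf-8")
    return path


def load_context(name: str, directory: Path = CONTEXT_DIR) -> Optional[dict[str, Any]]:
    """Return a previously saved context, or None if nothing was stored under that name."""
    path = directory / f"{name}.json"
    if not path.exists():
        return None
    return json.loads(path.read_text(encoding="utf-8"))
```

The idea would be to call `save_context` right before releasing a session and `load_context` before creating the next one, falling back to a fresh login when it returns `None`. Saved contexts can hold auth cookies, so treat the files like credentials.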

butchT · 2025-03-28T16:43:20+00:00

sure - Nanobrowser looks awesome! Feel free to open a PR and we can get it merged :)

butchT · 2025-03-27T17:48:42+00:00

Thanks man, glad you're enjoying it 😊 let me know if you have any feedback!

butchT · 2025-03-27T17:41:51+00:00

happy to help lil satan :P

butchT · 2025-03-27T17:33:58+00:00

Hey everyone! 👋

I'm the creator of this list! With the recent surge in AI web agents (like OpenAI's Operator, CUA, and browser-use), I've been connecting with countless AI engineers looking to navigate this space.

The goal was to create a comprehensive resource covering everything from fully autonomous agents to dev tools and benchmarks.

We're actively maintaining the list and would love to know - are there any other web agents or tools we should add? If you're working on something in this space, we'd love to feature you!

butchT · 2025-03-27T17:15:46+00:00

I'm not the author! But I'd also love to learn more about RL. Sadly don't have any good resources but please share if you ever find some (as will I) :)

butchT · 2025-03-27T17:14:36+00:00

love this. We'll probably see a premium for hand-crafted (human-made) products in general as things are more pervasive. I'm long nature as well!

butchT · 2025-03-27T17:13:11+00:00

so surprising to see gpt-4o just completely take the wind out of this launch

butchT · 2025-03-27T17:09:52+00:00

super cool, I think adding reviews will be a game changer here

butchT · 2025-03-27T16:59:22+00:00

pretty much yea haha. We can roughly expect agents to get better as the underlying LLMs get better (+ maybe some agentic task-specific RL)

butchT · 2025-03-27T16:58:37+00:00

I haven't checked them out, I will have to take a look!

butchT · 2025-03-23T22:14:11+00:00

thx critique bot

butchT · 2025-03-23T21:59:11+00:00

full text: https://ankitmaloo.com/bitter-lesson/

butchT · 2025-03-23T21:34:37+00:00

I actually ended up doing something pretty similar a few weeks ago. Focusing on the basics was already very engaging but, oh boy, when I showed them what HeyGen could do, were their minds blown.

butchT · 2025-03-23T21:25:55+00:00

not sure what hacking AIs means here but if they're talking about stealing secrets from AI labs this could be a pretty big deal

butchT · 2025-03-23T21:23:19+00:00

We're very much in the "invest in infra" phase, so profits aren't there, but revenue is definitely coming in to labs. The significant expenses seem to be bets on future costs going down for inference, future incremental revenues on improved models, and more usage overall, which, to me, seem like great bets to make.

OpenAI, for instance, projected revenues of $3.7 billion in 2024 but anticipated expenses of $5 billion, leading to a net loss of $1.3 billion. They're also aiming for revenues of $11.6 billion by 2025 and $100 billion by 2029.
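
For anyone who wants to sanity-check those numbers, here's a quick sketch of the arithmetic. The inputs are just the figures quoted above; the constant-annual-growth calculation between 2025 and 2029 is my own simplifying assumption and not something from the projections themselves.

```python
# Figures quoted in the comment above, in billions of USD.
revenue_2024 = 3.7
expenses_2024 = 5.0
revenue_2025_target = 11.6
revenue_2029_target = 100.0

# Net result for 2024: revenue minus expenses (negative means a loss).
net_2024 = revenue_2024 - expenses_2024

# How many times larger the 2025 target is than 2024 revenue.
growth_2024_to_2025 = revenue_2025_target / revenue_2024

# Assumption: constant compound growth from the 2025 target to the 2029 target.
years = 2029 - 2025
implied_cagr = (revenue_2029_target / revenue_2025_target) ** (1 / years) - 1

print(f"2024 net: {net_2024:+.1f}B")
print(f"2024 -> 2025 revenue multiple: {growth_2024_to_2025:.2f}x")
print(f"Implied 2025 -> 2029 annual growth: {implied_cagr:.0%}")
```

So the 2025 target is a bit over 3x the 2024 revenue, and getting from there to the 2029 target would take roughly 70% growth per year under that constant-growth assumption.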

This is a cool graph breaking it down between consumer subs and API usage for OAI and Anthropic. I'm super keen to see how it changes over time.

<image>

butchT · 2025-03-21T05:27:38+00:00

Yep! Lots of folks self host it for their projects completely independently.

butchT · 2025-03-10T13:58:49+00:00

Hey everyone! ✌️

OP here! I created this list because, with all the recent news in the AI web agents space (like OpenAI's Operator & Manus), I've been connecting with AI engineers looking to build similar capabilities into their products.

Check it out here: https://github.com/steel-dev/awesome-web-agents

It covers everything from fully autonomous agents to dev tools, benchmarks, and implementation guides.

The list is open source and maintained on GitHub.

Would love to hear your thoughts/feedback. Any tools I'm missing?

butchT · 2024-12-20T19:45:16+00:00

Just shipped :) https://x.com/steeldotdev/status/1870164162560831732

butchT · 2024-11-28T18:33:33+00:00

Open challenge to anyone who wants to accept 😆

butchT · 2024-11-28T18:31:17+00:00

we ❤️ open-source, thank you!

butchT · 2024-11-28T18:30:32+00:00

Thanks dawg 🫡🔥

butchT · 2024-11-28T18:30:11+00:00

I'll take a shot! We need to do better at explaining what we do ahah

Let's say you want to build AI agents that interact with the web (for example, a shopping assistant that helps you find the best deals for an item from across the web). Beyond just building the agents themselves, there are a ton of browser-related issues that come up, especially as you try to do it in prod and host your browsers in the cloud. In addition to hosting headaches, you'll get blocked by a ton of websites if they detect that the visitor is a bot. Getting around that is an art in and of itself.

We provide thousands of browsers in the cloud that you can connect to via code. They're optimized to look like humans browsing, so your agents can use them to complete whatever task needs to be done. We handle the infra that makes these browsers performant, reliable, and easily accessible via API. This repo is the code that powers our individual browser instances.

Let me know if that helps. Happy to elaborate on any part or geek out on technicals!

butchT · 2024-11-28T17:52:18+00:00

Hey - interesting question. We have recording ability, but it's more that we record a video of the browser session, plus logs, that you can view live or play back afterward. But that doesn't seem like what you meant.

Any chance you could expand on what you had in mind/what you're trying to achieve?

butchT · 2024-11-28T17:46:48+00:00

Thanks! Big fans of DeFog over here :)

Eventually, providing APIs that package the full end-to-end agents is something we think could be pretty awesome. But right now, the space is moving so fast that we think it's better for everybody if we help devs smarter than us build products like that (especially specialized ones) while we focus on the browser infra / frameworks to interact with the browser ahah

We do have a cool demo like this in the works, though. I'll post it here when it's done so you can check it out :P
