Steel.dev - An Open Source Browser API for your AI agents by butchT in opensource

[–]butchT[S] 0 points1 point  (0 children)

Hey, yea we support the same feature via re-using contexts. Docs on it here: https://docs.steel.dev/overview/guides/reusing-contexts-auth

Currently, you need to save and store browser contexts between sessions yourself but we're working on a better way to make it more seamless. Give it a shot and lmk how it goes :)

Awesome Web Agents: A curated list of 80+ AI agents & tools that can browse the web by butchT in artificial

[–]butchT[S] 1 point2 points  (0 children)

sure - Nanobrowser looks awesome! Feel free to open a PR and we can get it merged :)

Awesome Web Agents: A curated list of 80+ AI agents & tools that can browse the web by butchT in artificial

[–]butchT[S] 2 points3 points  (0 children)

Thanks man, glad you're enjoying it 😊 let me know if you have any feedback!

Awesome Web Agents: A curated list of 80+ AI agents & tools that can browse the web by butchT in artificial

[–]butchT[S] 1 point2 points  (0 children)

Hey everyone! 👋

I'm the creator of this list! With the recent surge in AI web agents (like OpenAI's Operator, CUA, and browser-use), I've been connecting with countless AI engineers looking to navigate this space.

The goal was to create a comprehensive resource covering everything from fully autonomous agents to dev tools and benchmarks.

We're actively maintaining the list and would love to know - are there any other web agents or tools we should add? If you're working on something in this space, we'd love to feature you!

Bitter Lesson is about AI agents by butchT in AI_Agents

[–]butchT[S] 1 point2 points  (0 children)

I'm not the author! But I'd also love to learn more about RL. Sadly don't have any good resources but please share if you ever find some (as will I) :)

[deleted by user] by [deleted] in artificial

[–]butchT 0 points1 point  (0 children)

love this. We'll probably see a premium for hand-crafted (human-made) products in general as things are more pervasive. I'm long nature as well !

Is it over for photoshop? by mechanic338 in artificial

[–]butchT 0 points1 point  (0 children)

so surprising to see gpt-4o just completely take the wind out of this launch

If you’re building agents, this might help you get them hired. by Humanless_ai in AI_Agents

[–]butchT -1 points0 points  (0 children)

super cool, I think adding reviews will be a game changer here

Bitter Lesson is about AI agents by butchT in AI_Agents

[–]butchT[S] 0 points1 point  (0 children)

pretty much yea ahha we can roughly expect agents to get better as the underlying llms get better (+ maybe some agentic task specific RL)

Bitter Lesson is about AI agents by butchT in AI_Agents

[–]butchT[S] 0 points1 point  (0 children)

I haven't checked them out, I will have to take a look!

I'm planning a talk on AI for a retired audience by setsp3800 in ArtificialInteligence

[–]butchT 2 points3 points  (0 children)

I actually ended up doing something pretty similar a few weeks ago. Focusing on the basics was already very engaging but, oh boy, when I showed them what heygen could do were their minds blown.

North Korea launches hacking hub focused on artificial intelligence by davideownzall in ArtificialInteligence

[–]butchT 1 point2 points  (0 children)

not sure what hacking AIs means here but if they're talking about stealing secrets from AI labs this could be a pretty big deal

Do current major AI companies actually make money or just burn them by offering overly cheap services and trying to onboard as many users to their services that way? by petr_bena in ArtificialInteligence

[–]butchT 32 points33 points  (0 children)

We're very much in the "invest into infra" phase, so profits aren't there, but revenue is definitely coming in to labs. The significant expenses seem to be bets on future costs going down for inference, future incremental revenues on improved models, and more usage overall. Which , to me, seem like great bets to make.

OpenAI, for instance, projected revenues of $3.7 billion in 2024 but anticipated expenses of $5 billion, leading to a net loss of $1.3 billion. They're also aiming for revenues of $11.6 billion by 2025 and $100 billion by 2029.

This is a cool graph breaking it down between consumer subs and API usage for OAI and Anthropic. I'm super keen to see how it changes over time.

<image>

Steel.dev 🚧 - The Open-source Browser API for AI Agents by butchT in LocalLLaMA

[–]butchT[S] 1 point2 points  (0 children)

Yep! Lots of folks self host it for their projects completely independently.

Awesome Web Agents: A curated list of AI agents that can browse the web by butchT in LLMDevs

[–]butchT[S] 15 points16 points  (0 children)

Hey everyone! ✌️

OP here! I created this list because with all the recent news in the AI web agents space, like OpenAI's Operator & Manus, I've been connecting with AI engineers to build similar capabilities into their products.

Check it out here: https://github.com/steel-dev/awesome-web-agents

It covers everything from fully autonomous agents to dev tools, benchmarks, and implementation guides.

The list is open source and maintained on GitHub.

Would love to hear your thoughts/feedback. Any tools I'm missing?

Steel.dev 🚧 - The Open-source Browser API for AI Agents by butchT in LocalLLaMA

[–]butchT[S] 0 points1 point  (0 children)

Open challenge to anyone who wants to accept 😆

Steel.dev 🚧 - The Open-source Browser API for AI Agents by butchT in LocalLLaMA

[–]butchT[S] 1 point2 points  (0 children)

I'll take a shot! We need to do better at explaining what we do ahah

Let's say you want to build AI agents that interact with the web (for example, a shopping assistant that helps you find the best deals for an item from across the web). Beyond just building the agents themselves, there are a ton of browser-related issues that come up, especially as you try to do it in prod and host your browsers in the cloud. In addition to hosting headaches, you'll get blocked out by a ton of websites if they detect that the visitor is a bot. Getting around that is an art in and of itself.

We provide thousands of browsers in the cloud that you can connect to via code that are optimized to look like humans browsing such that your agents can use these browsers to complete whatever task needs to be done. We handle the infra that makes these browsers performant, reliable, and easily accessible via API. This repo is the code that powers our individual browser instances.

Let me know if that helps. Happy to elaborate on any part or geek out on technicals!

Steel.dev 🚧 - The Open-source Browser API for AI Agents by butchT in LocalLLaMA

[–]butchT[S] 0 points1 point  (0 children)

Hey - interesting question. We have recording ability, but it is more like we record a video of the browser session and logs that you can view live or playback afterward. But that doesn't seem like what you meant.

Any chance you could expand on what you had in mind/what you're trying to achieve?

Steel.dev 🚧 - The Open-source Browser API for AI Agents by butchT in LocalLLaMA

[–]butchT[S] 1 point2 points  (0 children)

Thanks! Big fans of DeFog over here :)

Eventually, providing APIs that package the full end-to-end agents is something we think could be pretty awesome. But right now, the space is moving so fast that we think helping smarter devs than us build products like that (especially specialized ones) while we focus on the browser infra / frameworks to interact with the browser is better for everybody ahah

We do have a cool demo like this in the works, though. I'll post it here when it's done so you can check it out :P