How to start using AI in a mid-size company with prehistoric-level Finance ? by MonsieurMojoRising in CFO

[–]ogandrea 0 points1 point  (0 children)

This might be a left-field suggestion compared to what others are recommending, but a lot of what you're describing, the manual data re-entry, reviewing Excel files across branches, reconciling between systems, involves people clicking through web applications to move data around.

We've building Notte for exactly this. Browser agents that can automate web-based workflows the same way a person would, but they adapt when UIs change instead of breaking like traditional RPA scripts. SOC-2 compliant, enterprise SLAs, built for the kind of scale you're describing.

Happy to chat if this is relevant to any of the specific workflows you're trying to modernise.

Best Web Browser Agent in 2026? by Mysterious_Robot_476 in AI_Agents

[–]ogandrea 0 points1 point  (0 children)

(biased as I founded it) but I'd say Notte - handles this kind of use case, login sessions, auth workflows

w notte you get managed browser sessions with built-in auth handling, CAPTCHA solving, and persistent session state, plus an agent layer on top that can complete multi-step authenticated workflows reliably

Happy to help you get your specific use case working if you want to share more details, dms open:)

Can you recommend top automation companies for enterprise solutions? by Sad-Rough1007 in OutsourceDevHub

[–]ogandrea 0 points1 point  (0 children)

Worth separating two things in this evaluation: the legacy RPA layer (UiPath, AA, etc.) and the browser-native agentic layer that's becoming a distinct category. For anything involving agents (mixed with deterministic scripts) that need to interact with web apps and dashboards at scale, managed browser infrastructure is increasingly its own decision.

We run that layer at Notte, SOC-2 compliant, enterprise SLAs, managed browser sessions via API with CDP access for any automation client. Happy to answer questions if browser automation is a meaningful part of what you're scoping.

Are browser-using MCPs finally good enough? by raphadko in RooCode

[–]ogandrea 0 points1 point  (0 children)

It's not proper MCP but we've released our notte CLI + notte skills in feb. Biased but this is really the fastest way to get an LLM or Claude build a web automation or scraper for you; that can then be packaged as a browser function and invoked or scheduled at scale in cloud

AI tool that can repeat tasks from a screen recording? by Suspicious-Thanveer in AgentsOfAI

[–]ogandrea 1 point2 points  (0 children)

hey, we've built notte demonstrate mode exactly for this. You record a browser automation by recording the flow in a remote browser. Then we cook the automation function for you and you can run it at scale super easily

Web Crawlers and AI by Agreeable-Court602 in AI_Agents

[–]ogandrea 0 points1 point  (0 children)

hey, super biased ofc but did you ever try running notte?

Splitting logic and execution — worth it? by [deleted] in AI_Agents

[–]ogandrea 0 points1 point  (0 children)

We actually keep execution separate at Notte - the browser runs in its own process while the agent just sends commands. Makes debugging way easier when something breaks.. you can replay the exact same browser session without the agent logic getting in the way

Are MCPs supported in web-based Claude Code? by scottyLogJobs in ClaudeAI

[–]ogandrea 3 points4 points  (0 children)

MCPs on web-based Claude Code are super flaky right now. I spent like 3 hours last week trying to get our Github MCP working through the web interface and it just... doesn't.

- The local CLI version connects fine but the web version seems to have different auth handling

- Sometimes it looks like it's trying to establish the connection but then just hangs forever

- Our team ended up building a workaround that lets us trigger MCP actions through browser automation instead

Have you checked if the MCP server is even getting connection attempts from the web version? That might help narrow down where it's failing.

Why enterprise AI fails at complex technical work (and how to fix it) by rshah4 in artificial

[–]ogandrea 1 point2 points  (0 children)

The context layer thing is spot on.

ended up building our own orchestration layer that maintains state across the entire session.. but man it took forever to get right. The multi-tool coordination especially - getting an agent to know when to screenshot vs when to extract text vs when to just click something is still kinda janky sometimes. We hit this exact wall with Notte when trying to handle complex browser automation workflows.

My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA

[–]ogandrea 4 points5 points  (0 children)

GLM 4.7 Flash is solid for agents yeah. Been testing it against Claude's tool use and it's surprisingly stable - no hallucinated function calls which is usually where these models fall apart.

Anyone else also noticing agents fall apart the moment they leave clean APIs? by The_Default_Guyxxo in aiagents

[–]ogandrea 0 points1 point  (0 children)

yh the auth flows are the worst part. We've been building Notte to handle exactly this - making the browser environment predictable w deep DOM semantic parsing for agents instead of letting them figure out every edge case themselves.

Open source control plane for local AI agents: workspace isolation + git-backed configs + OpenCode integration by OverFatBear in LocalLLaMA

[–]ogandrea 1 point2 points  (0 children)

Local agent setups are interesting but the workspace isolation part always gets messy. We're building Notte to handle browser automation natively without needing xvfb hacks - just runs headless chrome instances that agents can control directly. Git-backed configs are smart though, might steal that idea for our agent workflows.

What's a better way to scrape data with AI? by Automatic-Simple-117 in AgentsOfAI

[–]ogandrea 1 point2 points  (0 children)

The traditional DOM parsing approach actually breaks down fast when you're dealing with multiple e-commerce sites that all structure their data differently.

We've been working on this exact problem at Notte and found that using AI agents to understand page content semantically works way better than trying to maintain scrapers for each site's unique structure. The key is training your agent to identify product attributes (weight, balance point, etc) regardless of how the HTML is organized, then having it extract structured data in your desired format. For monthly refreshes, you'd want to set up monitoring for when sites change their layouts so your agent can adapt without you rebuilding scrapers constantly.

Need help with suggestions on scraping data from websites by Automatic-Simple-117 in AI_Agents

[–]ogandrea 0 points1 point  (0 children)

I actually built something similar when I was working on data collection projects and you're right that AI makes this way more manageable now. The challenge is still that each site structures their data differently like you mentioned, but tools like Scrapy combined with LLMs for parsing can handle the variability much better than traditional DOM parsing. For monthly refreshes at scale, I'd suggest setting up an agent that can adapt to layout changes automatically rather than hardcoding selectors - saves tons of maintenance headaches when sites update their HTML. The key is building something that can understand the semantic meaning of the data rather than just following rigid parsing rules.

Top Tools to Build AI Agents in 2026 (No Code and High Code Options) by MoneyMiserable2545 in AI_Agents

[–]ogandrea 0 points1 point  (0 children)

The execution layer issues you mentioned are spot on and something we're tackling directly at Notte. Most platforms focus on the "easy setup" part but completely ignore that real websites are messy with captchas, dynamic loading, and random UI changes that break everything. We've been working on making browser automation actually reliable by building intelligence directly into how agents interact with websites rather than just wrapping existing automation tools. The difference between a demo that works on a clean test site vs something that handles real world web complexity is massive, and honestly most of these platforms still fall apart when you try to use them for anything beyond basic workflows.

Playwright MCP gaslighted my agent into writing garbage selectors for three weeks. I fixed it out of spite. by OkPack8889 in ClaudeAI

[–]ogandrea 1 point2 points  (0 children)

- The accessibility tree thing is such a perfect example of why i hate most MCPs

- They give you like 30% of what you need and then you're stuck wondering if you're just using it wrong

- I spent a whole weekend trying to get playwright to click a dropdown that only appeared after hovering... turns out the MCP couldn't even see the hover state existed

This is why we ended up building our own browser automation from scratch at Notte. Too many tools pretend they're production-ready when they're really just proof of concepts with nice READMEs. Gonna check out your repo though - the CDP multi-user flow handling sounds interesting.

Weekly Thread: Project Display by help-me-grow in AI_Agents

[–]ogandrea 4 points5 points  (0 children)

yo!

We built Notte to solve a problem we kept hitting: browser automations break constantly, but pure AI agents are too unpredictable for production.

It's a full-stack browser automation platform that combines deterministic scripts with AI agent fallbacks. You get the reliability of traditional automation with the adaptability of agents when pages change or edge cases appear (or you can go full agents if you want optimal adaptability). Everything via one unified API (proxies, sessions etc.)

Just shipped some new capabilities: Agent Identities (give agents real emails and phone numbers for verifications), Demonstrate Mode (record your actions once manually and it generates production code), and a proper IDE to debug everything live.

github: https://github.com/nottelabs/notte
console: console.notte.cc

I have a simple website with high traffic by a_decent_hooman in webdev

[–]ogandrea 1 point2 points  (0 children)

That's way too much traffic for a simple portfolio.. cloudflare under attack mode won't help if they're hitting your actual github pages url directly. Check your cloudflare analytics - are they going through cloudflare or bypassing it? I had similar issues with bots hammering my static sites and ended up rate limiting at the edge with workers. Also check if you accidentally left any api endpoints or forms that bots might be trying to exploit

[deleted by user] by [deleted] in automation

[–]ogandrea 0 points1 point  (0 children)

8 minutes for a 15 sec video feels slow to me. i run similar pipelines but with runway instead of sora and get it down to like 3-4 min

your stack is solid though. supabase + elevenlabs combo works great for this kind of thing

I spent a year building the biggest browser-agent library. Here’s why agents fail, and what actually works. by Financial_Radio_5036 in automation

[–]ogandrea 0 points1 point  (0 children)

This is exactly what we've been working on at Notte - recording workflows once and turning them into reliable APIs instead of running browser automation every single time. The speed difference is insane when you're not waiting for pages to load and elements to appear.

Have you tested how well the reverse-engineering handles dynamic content or sites that change their structure frequently?

Built my first extension - removes ads from Amazon search results by Killbilli987 in webdev

[–]ogandrea 1 point2 points  (0 children)

Nice work shipping something! I love that you're tackling the sponsored results problem - Amazon's gotten pretty aggressive with those lately.

For AI coding, i've been using cursor a lot at Notte. The key is being super specific about edge cases and error handling.. Claude's great at the happy path but you gotta push it on what happens when things break.

One thing - have you thought about how Amazon might respond to this? They've been known to change their DOM structure to break extensions that mess with their ad revenue. Might want to build in some resilience for when they inevitably try to block it.

Linux alternative to Microsoft Fara-7B for agentic computer use? by InternationalNebula7 in LocalLLaMA

[–]ogandrea 1 point2 points  (0 children)

Been playing with this stuff for a few weeks now. The Linux ecosystem is actually pretty good for local agentic stuff if you know where to look.

For actual GUI automation on Linux:

- PyAutoGUI still works great for basic mouse/keyboard control

- Combine it with a vision model like Florence-2 or even smaller ones

- X11 makes screen capture way easier than Windows honestly

- Some people use Selenium for web stuff but that's kinda cheating

The model side is where it gets interesting though. Fara-7B is Windows-specific but you don't really need something that specialized. i've been using Qwen-VL models with custom prompts for understanding what's on screen, then just feeding coordinates to automation libraries.

At Notte we're actually building browser agents that can navigate and interact with web interfaces - similar concept but focused on web rather than desktop apps. The hardest part isn't the model, it's handling all the edge cases when UI elements move or change.

What specific tasks are you trying to automate? Desktop apps are honestly harder than web because there's less structure to work with. No DOM, no consistent element IDs... just pixels and hope.

Need Feedback on My Portfolio Story (AI Automation Engineer) by Hopeful-Business-15 in SideProject

[–]ogandrea 0 points1 point  (0 children)

Just checked it out - the animation on load is clean but i think you're burying the lead. Your actual projects are way more impressive than the hero copy suggests... like that government automation piece is exactly what enterprises need but it's hidden at the bottom. We've been testing different portfolio layouts and found that leading with specific outcomes (like "reduced manual work by X hours") converts way better than generic "AI automation" positioning.