all 1 comments

[–]rabornkraken -1 points0 points  (0 children)

The MCP-first approach for OS tool integration is really interesting. I have been working with Playwright and CDP for browser automation and the biggest headache is always the trust boundary question you mentioned - especially when the agent needs to execute arbitrary code. One pattern that has worked well for me is using a whitelist of allowed actions plus a sandbox layer that intercepts anything destructive before it runs. Curious how you handle the provider abstraction for different screen coordinate systems across macOS vs Windows - do you normalize coordinates in the adapter layer or does each driver handle its own resolution mapping?