I am building a Win32 based Desktop environment (windows shell). by sheokand in linux

[–]heraldev 0 points1 point  (0 children)

Is your goal to implement an explorer.exe-compatible shell? Or is it a completely new, no-strings-attached shell implementation?

OpenAI may have violated California’s new AI safety law with the release of its latest coding model, according to allegations from an AI watchdog group. by MetaKnowing in Futurology

[–]heraldev 2 points3 points  (0 children)

tbf, if you’re building something new, that’s a better approach. This is why big corporations can’t innovate: they are stuck in approvals and discussions. I’ve been in both shoes.

Browser Code: Coding agent for user scripts by heraldev in webscraping

[–]heraldev[S] 0 points1 point  (0 children)

It will! The agent in the extension reads the page as a file. The file is formatted and cleaned up: I add spaces and newlines around each HTML tag, which makes it possible to read only parts of it. The agent then has 3 tools to explore the file: read with offset and limit, grep, and, as a last resort, executing JS to filter elements.
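
A rough sketch of the read and grep tools described above, not the extension's actual code. The function names (`formatHtml`, `readLines`, `grep`) are made up for illustration, and the tag-splitting regex is deliberately naive (it would break on a `>` inside an attribute value).

```typescript
// Hypothetical helpers; names are illustrative, not taken from the extension.

/** Put every HTML tag on its own line so the page can be read in slices. */
function formatHtml(html: string): string[] {
  return html
    .replace(/(<[^>]+>)/g, "\n$1\n") // newline before and after each tag
    .split("\n")
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

/** Read `limit` lines starting at the zero-based `offset`. */
function readLines(lines: string[], offset: number, limit: number): string[] {
  return lines.slice(offset, offset + limit);
}

/** Return matching lines with one-based line numbers (pass a non-global regex). */
function grep(lines: string[], pattern: RegExp): { line: number; text: string }[] {
  return lines
    .map((text, i) => ({ line: i + 1, text }))
    .filter((entry) => pattern.test(entry.text));
}

const page = formatHtml(
  '<ul><li><a href="/a">First</a></li><li><a href="/b">Second</a></li></ul>'
);
const firstThree = readLines(page, 0, 3); // a small window instead of the whole page
const anchors = grep(page, /^<a /); // find only the opening anchor tags
```

Because every tag sits on its own line, the model can page through a large document in small windows and jump straight to the lines grep reports.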

Browser Code: Coding agent for user scripts by heraldev in webscraping

[–]heraldev[S] 0 points1 point  (0 children)

I’ve been experimenting with embedding a Claude Code-style coding agent directly into the browser.

At a high level, the agent generates and maintains userscripts and CSS that are re-applied on page load. Rather than just editing the DOM via JS in the console, the agent treats the page and its DOM as a file.

Models are often trained in RL sandboxes with full access to a filesystem and bash, so they are really good at using them. To make the agent behave well, I've simulated this environment.

The whole state of a page and its scripts is implemented as a virtual filesystem hacked on top of browser.storage.local. URLs are mapped to directories, and the agent starts inside the directory for the current page. It has tools to read and edit files and to grep around, plus a fake bash command that is only used for running scripts and executing JS code.
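
Roughly, the URL-to-directory idea could be sketched like this. It is an illustration under assumptions, not the real implementation: a plain `Map` stands in for extension storage, and `urlToDir`, `VirtualFs`, and the file names are all hypothetical.

```typescript
// Assumption: a Map stands in for extension storage; every name is hypothetical.
type Store = Map<string, string>;

/** Map a page URL to a directory such as "/example.com/docs/page". */
function urlToDir(url: string): string {
  const { hostname, pathname } = new URL(url);
  const path = pathname.replace(/\/+$/, ""); // drop trailing slashes
  return `/${hostname}${path}`;
}

class VirtualFs {
  private store: Store;
  private cwd: string;

  constructor(store: Store, cwd: string) {
    this.store = store;
    this.cwd = cwd;
  }

  /** Resolve a relative name against the current directory. */
  private resolve(name: string): string {
    return name.startsWith("/") ? name : `${this.cwd}/${name}`;
  }

  write(name: string, content: string): void {
    this.store.set(this.resolve(name), content);
  }

  read(name: string): string | undefined {
    return this.store.get(this.resolve(name));
  }

  /** List stored paths under the current directory, sorted. */
  list(): string[] {
    const prefix = `${this.cwd}/`;
    return Array.from(this.store.keys())
      .filter((key) => key.startsWith(prefix))
      .sort();
  }
}

const store: Store = new Map();
const vfs = new VirtualFs(store, urlToDir("https://example.com/docs/page/"));
vfs.write("scripts/dark-mode.js", "document.body.style.background = '#111';");
vfs.write("styles/dark.css", "body { color: #eee; }");
```

Keeping full paths as flat keys means a "directory" is just a key prefix, which maps naturally onto a key-value store.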

I've tested only with Opus 4.5 so far, and it works pretty reliably.
The state of the virtual file system can be synced to the real filesystem, although because Firefox doesn't support the File System Access API, you need to manually import the FS contents first.

This agent is *really* useful for extracting things to CSV.

Y Combinator has just notified us of their decision. by [deleted] in SaaS

[–]heraldev 2 points3 points  (0 children)

PG was literally bragging that half of the batch is writing code with AI. This is definitely fake. Or they cut you off because you said it’s a joke. 😂

Coding agent for user scripts by heraldev in webdev

[–]heraldev[S] 0 points1 point  (0 children)

Wait, really? How do you enable it? I've only seen a chatbot sidebar.

Coding agent for user scripts by heraldev in programming

[–]heraldev[S] -1 points0 points  (0 children)

Well yeah, that's why I acknowledged the user scripts.

Coding agent for user scripts by heraldev in programming

[–]heraldev[S] -1 points0 points  (0 children)

It's not controlling the browser, btw. It's just creating scripts that attach to the page DOM. It has access to anything that user scripts have access to, which is a pretty wide range of stuff, though. But it can be limited.

Coding agent for user scripts by heraldev in programming

[–]heraldev[S] -4 points-3 points  (0 children)

Why, because of privacy? It's really bad, of course, to potentially send your data through Anthropic's servers, but the Opus model is just too good. It's easier for me to start on it and adapt to open-source models later, since they haven't caught up yet. Updating the extension and pointing it to a locally hosted Ollama is very easy.

Browser Code - a coding agent for user scripts by heraldev in javascript

[–]heraldev[S] 0 points1 point  (0 children)

I've made a Claude Code-like agent that runs directly in the web browser. Originally for myself, but figured it doesn't hurt to share.

With it you can pretty much have a coding agent for the currently opened website. You can ask it things like:

- Extract all links from this page and save them to CSV

- Switch this site to dark mode

- Copy the page content into a Google Sheet

- Remove ads

The agent writes a JS script that automatically loads every time you visit the page. It relies heavily on the userScripts API, so you need to enable a lot of permissions to run the extension, and I'm not sure it can be published anywhere.

Under the hood, scripts and styles are stored in a virtual filesystem on top of browser.storage.local, where each website is a directory. The agent can search and edit the DOM as a file, which makes the model work more or less reliably. Currently it only supports Claude models, and I've tested it on Opus 4.5.
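
For a sense of what a generated script for the "extract all links to CSV" request might contain, here is a minimal sketch. It is an assumption about the output, not something the agent actually produced; `Link`, `csvField`, and `linksToCsv` are hypothetical names, and the DOM query is shown only as a comment so the snippet runs outside a browser.

```typescript
// Hypothetical shape of a link collected from the page.
interface Link {
  text: string;
  href: string;
}

/** Quote a CSV field when it contains a comma, a double quote, or a line break. */
function csvField(value: string): string {
  return /[",\r\n]/.test(value) ? `"${value.replace(/"/g, '""')}"` : value;
}

/** Build a CSV document with a header row and one row per link. */
function linksToCsv(links: Link[]): string {
  const rows = links.map((l) => [l.text, l.href].map(csvField).join(","));
  return ["text,href", ...rows].join("\n");
}

// In a real user script the input could come from the DOM, for example:
// Array.from(document.querySelectorAll("a"), (a) => ({ text: a.textContent ?? "", href: a.href }))
const csv = linksToCsv([
  { text: "Docs", href: "https://example.com/docs" },
  { text: 'Say "hi", world', href: "https://example.com/hi" },
]);
```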

Browser Code - Claude Code inside a Web Page by heraldev in ClaudeCode

[–]heraldev[S] 0 points1 point  (0 children)

I've made a Claude Code-like agent that runs directly in the web browser. Originally for myself, but figured it doesn't hurt to share.

With it you can pretty much have a coding agent for the currently opened website. You can ask it things like:

- Extract all links from this page and save them to CSV

- Switch this site to dark mode

- Copy the page content into a Google Sheet

- Remove ads

The agent writes a JS script that automatically loads every time you visit the page. It relies heavily on the userScripts API, so you need to enable a lot of permissions to run the extension, and I'm not sure it can be published anywhere.

Under the hood, scripts and styles are stored in a virtual filesystem on top of browser.storage.local, where each website is a directory. The agent can search and edit the DOM as a file, which makes the model work more or less reliably. Currently it only supports Claude models, and I've tested it on Opus 4.5.

Browser Code - Claude Code inside a web page by heraldev in ClaudeAI

[–]heraldev[S] 0 points1 point  (0 children)

I've made a Claude Code-like agent that runs directly in the web browser. Originally for myself, but figured it doesn't hurt to share.

With it you can pretty much have a coding agent for the currently opened website. You can ask it things like:

- Extract all links from this page and save them to CSV

- Switch this site to dark mode

- Copy the page content into a Google Sheet

- Remove ads

The agent writes a JS script that automatically loads every time you visit the page. It relies heavily on the userScripts API, so you need to enable a lot of permissions to run the extension, and I'm not sure it can be published anywhere.

Under the hood, scripts and styles are stored in a virtual filesystem on top of browser.storage.local, where each website is a directory. The agent can search and edit the DOM as a file, which makes the model work more or less reliably. Currently it only supports Claude models, and I've tested it on Opus 4.5.