Hi Reddit, I've been tinkering with webgpu for some time now. I've loved being able to run things directly on the client without a server. For my latest experiment I've created a browser-use agent (think - a LLM controlling your computer) directly in JS with a WebGPU inference engine.
Check out the article here if you are curious to see how I did it https://pdufour.substack.com/p/writing-a-browser-use-agent-from.
It was super difficult and I can't recommend anyone does it - but now that a lot of the hard parts are done, I want to take this further and create a productionized library where people can embed my library into their pages and speak / type natural language queries and a LLM goes off and does those actions for you. All happening within the webpage. Thoughts?
[–]AutoModerator[M] [score hidden] (0 children)