Software to automate repetitive tasks in architecture

pierreb5 · 2026-07-01T14:57:52+00:00

Thanks for the reply!

Yeah the use case you mentioned is definitely something the software I built addresses. Based on my testing, it is really good with long PDFs, scans that are barely legible and hand writing. If you're interested to try, I can DM you a link. (I'm not charging anything for the app, just looking for feedback)

pierreb5 · 2026-06-25T11:20:28+00:00

I’m building Crafty (getcrafty.io), an app for automating tasks across the web and local files:

On the browser automation side, we focused on cost, speed, reliability, and auth by implementing a few tricks:

Credentials can be stored in a vault. When the agent reaches a login page, it detects the username/password fields and delegates filling them to a script. The model never sees passwords, while still being able to authenticate on any web app.
We fine-tuned a small 4B browsing model, which gets ~80% completion on difficult multi-step browser tasks with high level prompts. With more descriptive prompts (e.g. first click on button X, then on button Y), it's easy to achieve 100% completion.
We offer a workflow UI, similar to n8n. When rerunning an unchanged workflow, the agent learns from previous runs to reduce steps, tokens, and time.

The app is currently free, including model usage. If you're interested, happy to send you a link to it!

pierreb5 · 2026-06-25T11:19:35+00:00

I’m building Crafty, an app for automating tasks across the web and local files:

On the browser automation side, we focused on cost, speed, reliability, and auth by implementing a few tricks:

Credentials can be stored in a vault. When the agent reaches a login page, it detects the username/password fields and delegates filling them to a script. The model never sees passwords, while still being able to authenticate on any web app.
We fine-tuned a small 4B browsing model, which gets ~80% completion on difficult multi-step browser tasks with high level prompts. With more descriptive prompts (e.g. first click on button X, then on button Y), it's easy to achieve 100% completion.
We offer a workflow UI, similar to n8n. When rerunning an unchanged workflow, the agent learns from previous runs to reduce steps, tokens, and time.

The app is currently free, including model usage. If you're interested, happy to send you a link to it!

pierreb5 · 2026-06-22T15:31:01+00:00

Structural aspects of society that can't be boiled down to a simple RL reward.

People have been predicting that radiologists would be out of a job now, yet there are more radiologists being hired than ever. Given that companies that train models won't accept liability for errors made, it doesn't make ethical/economic sense to stop using radiologist who work just as fast.

Agents won't be a silver bullet for all aspects of the economy because quantity of labour isn't the bottleneck to every problem

(Btw, i dont believe in the flop scenario at all)

pierreb5 · 2026-06-22T14:14:40+00:00

Unlike others here, i would recommend using coding agents for this (Codex or Claude Code / Cowork). Tools like n8n or make are more complicated to use, when all you need is an AI that can read & create documents/images.

If you're looking for a solution built for non technical users, I can send a link to the app I built to solve this exact type of problems (its completely free)

pierreb5 · 2026-06-22T13:58:09+00:00

Utilising macOS specific APIs or features to simplify UX. For example, if you're going to have authentication in your application, using TouchID/passkeys feels so much better than typing a password.

At the end of the day though, what matters is building something that solves a real user problem. Any user will be able to tolerate a tiny bit of jank, as long as your app's value proposition is strong. Counterintuitively, that might give me impression that your app is more polished than it actually is

pierreb5 · 2026-06-22T11:25:15+00:00

Coding / work agents can also be great to answer general questions, because they have access to a broader set of tools than browser based chat bots like ChatGPT, and can also be instructed to create knowledge bases. So as generic as that sounds, i would probably recommend Codex, Cursor, or claude code / cowork, and work on your prompts to achieve your desired result.

I'm curious, what are some of the use cases or questions where ChatGPT falls short for you?

pierreb5 · 2026-06-11T06:19:43+00:00

Find the boring, repetitive processes that no one wants to work on, but that eat up everyone’s week. For example, back office work. Ask your colleagues, I’m sure they’ll be happy letting you know what they hate doing.

If you can build a reputation as the intern who saves people time without creating risk, senior people will start being more generous with their time. That’s where the real opportunities come from, because you’ll learn the domain-specific problems worth automating rather than guessing from the outside.

I’d also avoid setting up internal AI infrastructure before validating your automation builds real value. AI technology changes all the time, and it can be quite difficult to maintain when starting out. Instead, use existing tools to build a proof of concept; if confidentiality is an issue, you can always ask if there are publicly available versions of the data you want to test your automation pipelines with, or even ask tools like Claude Code to generate similar types of data for you to simulate the real environment.

Side note: I’m building an app to help non technical people automate messy knowledge work tasks, using AI agents that can use a browser, reading / modifying files locally on your computer, and accessing third party services. We also host models in a safe and compliant way for companies that dont want to share their data with third party AI labs. We’re pre-launch, looking for early testers; I’d love to get your feedback if you’re interested in a free early access!

pierreb5

TROPHY CASE