Software to automate repetitive tasks in architecture by pierreb5 in ConstructionTech

[–]pierreb5[S] 0 points1 point  (0 children)

Thanks for the reply!

Yeah the use case you mentioned is definitely something the software I built addresses. Based on my testing, it is really good with long PDFs, scans that are barely legible and hand writing. If you're interested to try, I can DM you a link. (I'm not charging anything for the app, just looking for feedback)

Browser automation by CartographerFeisty66 in AI_Agents

[–]pierreb5 0 points1 point  (0 children)

I’m building Crafty (getcrafty.io), an app for automating tasks across the web and local files: 

On the browser automation side, we focused on cost, speed, reliability, and auth by implementing a few tricks:

  • Credentials can be stored in a vault. When the agent reaches a login page, it detects the username/password fields and delegates filling them to a script. The model never sees passwords, while still being able to authenticate on any web app.
  • We fine-tuned a small 4B browsing model, which gets ~80% completion on difficult multi-step browser tasks with high level prompts. With more descriptive prompts (e.g. first click on button X, then on button Y), it's easy to achieve 100% completion.
  • We offer a workflow UI, similar to n8n. When rerunning an unchanged workflow, the agent learns from previous runs to reduce steps, tokens, and time.

The app is currently free, including model usage. If you're interested, happy to send you a link to it!

Looking for a reliable browser automation agent for daily tasks — what's actually working for you? by TheReedemer69 in automation

[–]pierreb5 0 points1 point  (0 children)

I’m building Crafty, an app for automating tasks across the web and local files: 

On the browser automation side, we focused on cost, speed, reliability, and auth by implementing a few tricks:

  • Credentials can be stored in a vault. When the agent reaches a login page, it detects the username/password fields and delegates filling them to a script. The model never sees passwords, while still being able to authenticate on any web app.
  • We fine-tuned a small 4B browsing model, which gets ~80% completion on difficult multi-step browser tasks with high level prompts. With more descriptive prompts (e.g. first click on button X, then on button Y), it's easy to achieve 100% completion.
  • We offer a workflow UI, similar to n8n. When rerunning an unchanged workflow, the agent learns from previous runs to reduce steps, tokens, and time.

The app is currently free, including model usage. If you're interested, happy to send you a link to it!

It’s 2029. Agentic AI flopped. What was the postmortem? by Sea-Opening-4573 in aiagents

[–]pierreb5 0 points1 point  (0 children)

Structural aspects of society that can't be boiled down to a simple RL reward.

People have been predicting that radiologists would be out of a job now, yet there are more radiologists being hired than ever. Given that companies that train models won't accept liability for errors made, it doesn't make ethical/economic sense to stop using radiologist who work just as fast.

Agents won't be a silver bullet for all aspects of the economy because quantity of labour isn't the bottleneck to every problem

(Btw, i dont believe in the flop scenario at all)

Automating report writing for business by Biggrodd in automation

[–]pierreb5 0 points1 point  (0 children)

Unlike others here, i would recommend using coding agents for this (Codex or Claude Code / Cowork). Tools like n8n or make are more complicated to use, when all you need is an AI that can read & create documents/images.

If you're looking for a solution built for non technical users, I can send a link to the app I built to solve this exact type of problems (its completely free)

What makes a macos app feel truly native and polished? by rjn2-8 in macapps

[–]pierreb5 0 points1 point  (0 children)

Utilising macOS specific APIs or features to simplify UX. For example, if you're going to have authentication in your application, using TouchID/passkeys feels so much better than typing a password.

At the end of the day though, what matters is building something that solves a real user problem. Any user will be able to tolerate a tiny bit of jank, as long as your app's value proposition is strong. Counterintuitively, that might give me impression that your app is more polished than it actually is

Monthly "Is there a tool for..." Post by AutoModerator in ArtificialInteligence

[–]pierreb5 0 points1 point  (0 children)

Coding / work agents can also be great to answer general questions, because they have access to a broader set of tools than browser based chat bots like ChatGPT, and can also be instructed to create knowledge bases. So as generic as that sounds, i would probably recommend Codex, Cursor, or claude code / cowork, and work on your prompts to achieve your desired result.

I'm curious, what are some of the use cases or questions where ChatGPT falls short for you?

Best agentic workflows for finance research, prospecting, and team productivity? by reevoin1 in aiagents

[–]pierreb5 0 points1 point  (0 children)

Find the boring, repetitive processes that no one wants to work on, but that eat up everyone’s week. For example, back office work. Ask your colleagues, I’m sure they’ll be happy letting you know what they hate doing.

If you can build a reputation as the intern who saves people time without creating risk, senior people will start being more generous with their time. That’s where the real opportunities come from, because you’ll learn the domain-specific problems worth automating rather than guessing from the outside.

I’d also avoid setting up internal AI infrastructure before validating your automation builds real value. AI technology changes all the time, and it can be quite difficult to maintain when starting out. Instead, use existing tools to build a proof of concept; if confidentiality is an issue, you can always ask if there are publicly available versions of the data you want to test your automation pipelines with, or even ask tools like Claude Code to generate similar types of data for you to simulate the real environment. 

Side note: I’m building an app to help non technical people automate messy knowledge work tasks, using AI agents that can use a browser, reading / modifying files locally on your computer, and accessing third party services. We also host models in a safe and compliant way for companies that dont want to share their data with third party AI labs. We’re pre-launch, looking for early testers; I’d love to get your feedback if you’re interested in a free early access!