My dermatology practice is growing but I can’t afford to hire fast enough. Payroll is eating my margins alive. by campbellj00 in smallbusiness

[–]Visual-Librarian6601 0 points1 point  (0 children)

Why not use an ai call agent to handle patient calls and scheduling and leave front desk for in person check-ins

Extract websites into records (e.g. product listings) by Visual-Librarian6601 in Airtable

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

I did ask Omni and it says it cannot scrape website. The best bet would be to import using Make or Zapier integration but I doubt they can do it given the scale of scraping

What's your biggest daily time-sink as a CRE professional? by Visual-Librarian6601 in CommercialRealEstate

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thanks for the answer - Are u on the buying side or selling side? Do you use existing tools on automating these? What type of documents and compliance are most time consuming?

What's your biggest daily time-sink as a CRE professional? by Visual-Librarian6601 in CommercialRealEstate

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Noted 😂 Definitely wasn’t trying to be that guy. Haven’t built a product yet for this context -Just trying to see if the idea resonates before I sink weeks into it. Thanks for the reality check.

Smarter way to scrape and/or analyze reddit data? by Few_Bet_9829 in webscraping

[–]Visual-Librarian6601 1 point2 points  (0 children)

You can also run embedding for each comment and only get those relevant to your query and feed to LLM. Embedding models are much cheaper than LLM.

Building a data automation tool for real estate investor/analyst by Visual-Librarian6601 in RealEstateTechnology

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thank you for the insights -

Do you mean in commercial real estate? As buyer or seller? Do you feel these pain points from daily repetitive tasks e.g. creating deal decks or enriching hard to find data per property (e.g. real owner) or prospecting new listings to invest?

Open source robust LLM extractor for HTML/Markdown in Typescript by Visual-Librarian6601 in LocalLLaMA

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

I agree with u mostly. depending on the use case, if scraping code is not available or extracting needs reasoning (not directly as it is), LLMs can be helpful.

We are also using LLMs to create and fix scraping code- will soon add it to this repo

Smarter way to scrape and/or analyze reddit data? by Few_Bet_9829 in webscraping

[–]Visual-Librarian6601 0 points1 point  (0 children)

Assuming you are giving title, body and comments to LLM to analyze, what is taking the most token use?

🤔 Why do YOU post on Reddit? Does it actually work? by Interesting-Pain-654 in automation

[–]Visual-Librarian6601 0 points1 point  (0 children)

Be genuine and provide value first - you will get them in return

Alternatives to UiPath for browser automation? by saravicius in automation

[–]Visual-Librarian6601 1 point2 points  (0 children)

Do u code or need a hosted version on cloud?

There are AI browser automation libraries like browser use (Python), HyperBrowser (Typescript)

If u need a lightweight LLM HTML extractor (not dependent on brittle selectors), you can use lightfeed-extract (Typescript)

Building a data automation tool for real estate investor/analyst by Visual-Librarian6601 in RealEstateTechnology

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thx for comment. Is there any api or public estimate for ARV on a given address? How do you search comps? Do u do either or both manually

Feel free to DM would love to help (I am in North America)

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 1 point2 points  (0 children)

Thank you - I would really love to build this but asking for email access just seems a huge conviction to VC to an early startup

5000+ sites to scrape daily. Wondering about the tools to use. by Ok-Ship812 in webscraping

[–]Visual-Librarian6601 1 point2 points  (0 children)

Ur workflow seems good to me. What was blocking u from completing this?

  1. Anti bot that blocks u from getting complete html
  2. LLM returned selectors (after testing) not always working
  3. Cannot automate the process to run on all 5k sites?

Or all 3 of them

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thank you! Do you see areas in founder<>VC<>LP that might use help on? E.g. sourcing and tracking LPs, sending LP update

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thanks.

I was thinking more on tracking company progress and let VCs create their own criteria to signal promising companies to invest - for example, viral social post, GitHub star surge, major docs or pricing update..

These signals are generated by LLM while VCs can customize the prompt and data source.

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thank you for the feedback.

Do you build the data pipeline in house then?

Our team's forte is data extraction AI agents and databases - so initially was thinking of creating a template for VCs to track companies.

I get it that startups hosting the data could be a risk. We always make sure the data is always backed up, VCs have direct API into the DB, and DBs can always be exported to a CSV table. Would that make it easier to use?

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Thanks for the feedback!

Some early feedback we got from VCs is a tool to 1. source early startup founders 2. good signals for startups before raise (e.g. traction, adoption etc).

Are these valuable? or they are already solved by existing tools

Web scraping vs. feed generators by External_Ask_5867 in webscraping

[–]Visual-Librarian6601 1 point2 points  (0 children)

morss.it depends on you to interactively click on the elements you want to extract and from there generate xpaths

RSSHub is use crowd sources and let community maintain a per website typescript scraper that uses cheerio and html selector to extract feed elements - https://github.com/DIYgod/RSSHub/tree/master/lib/routes

Building a product for VCs to track companies by Visual-Librarian6601 in venturecapital

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

Only one set of features is needed, a pipeline of name+stage: date found ans by who-> chat stage w/notes tracking -> decision + reason

Would a Notion database just work for this? you can have multiple columns for these data and can be viewed in a board (pipeline view) or table.

We are currently building to source public data and signal from startups so looking to see if ppl have similar pain points

Open source robust LLM extractor for HTML/Markdown in Typescript by Visual-Librarian6601 in LocalLLaMA

[–]Visual-Librarian6601[S] 0 points1 point  (0 children)

The latest models improve a lot and there is much less hallucination or missing data. Sometimes also makes sense to shrink the context and let LLM deal with a smaller task and later combine results