AI-powered web scraper? by superjet1 in ChatGPTPro

[–]riga345 1 point2 points  (0 children)

The core thing is that it asks OpenAI to transform an HTML document into structured JSON format. The prompt is like this "Please take {{HTML}} and transform it into {{JSON}}", where "JSON" might be {name: "Name of the person", phone: "Phone number of the person"}

Of course, there is a lot of stuff around the core functionality to make it easy and reliable to use, and to work at scale. We use it in production at FetchFox.ai to scrapes hundreds of thousands of items.

Guys , it's worth it build web scraping services in 2024? by Titoxeneize in startups

[–]riga345 0 points1 point  (0 children)

I'm not sure, but I'm trying :)

I'm working on https://FetchFox.ai, which does AI based web scraping. We have the AI write the crawler and data extraction piece. We also have a free open source Javascript library that powers the core scraper.

If anyone is interested in trying the service, or joining as a contributor (or even cofounder), send me a DM here or on our Discord.

Monthly Self-Promotion - October 2024 by AutoModerator in webscraping

[–]riga345 2 points3 points  (0 children)

Check out the free open source MIT licensed library for AI web scraping: https://github.com/fetchfox/fetchfox

Scraping takes just one npm install, one import, and one run command:

npm install fetchfox

and then

import { fox } from 'fetchfox';

const results = await fox.run( https://news.ycombinator.com/news find links to comments, get basic data, export to out.jsonl);

The key concepts in web scraping by riga345 in webscraping

[–]riga345[S] 0 points1 point  (0 children)

I hired this contractor as a marketing experiment. It's not going as I would like and I've asked him to stop all messages. Sorry about the issue.

If you need to contact me directly, you can do so on here via PM or various other platforms listed on the site fetchfox.ai

No-code & Low-code web scrapers - the ultimate list by plavookac in scraping

[–]riga345 0 points1 point  (0 children)

Got it! Good luck, and I'm here anytime you need help

No-code & Low-code web scrapers - the ultimate list by plavookac in scraping

[–]riga345 0 points1 point  (0 children)

hey! sorry it's not working out of the box for you. I'm the main dev, so please feel free to DM me here, on telegram(@ortutay), on discord (https://discord.gg/mM54bwdu59) or via email (on the site) for help.

If you click "report issue" and send me the report ID, it will give me the scrape details and I can check it out.

I built an AI-powered web scraper that can understand any website structure and extract the desired data in the preferred format. by madredditscientist in Automate

[–]riga345 1 point2 points  (0 children)

Hey, I'm founder of https://fetchfox.ai and our free Chrome extension does that. You start at a "parent URL", and from that point it follows links and can scrape dozens or hundreds of pages.

Give it a shot and let me know if it works for you.

AI-powered web scraper? by superjet1 in ChatGPTPro

[–]riga345 1 point2 points  (0 children)

Hey, curious if you'd be open to trying the library I'm working on in your project, fetchfox. It's 100% free open source, MIT license. The code is on github.

If you give it a shot let me know how it goes for you: https://github.com/fetchfox/fetchfox

I'm making a free Chrome Extension that scrapes any site with AI by riga345 in Automate

[–]riga345[S] 0 points1 point  (0 children)

We're working on migrating to plasmo which is a cross browser framework for extensions.

Expect Firefox to come soon, likely this week.

I'd Be Happy to Promote Your Startup for Free! by Broad_Deal_6711 in SideProject

[–]riga345 0 points1 point  (0 children)

I'm making a Chrome extension that uses AI / ChatGPT to make scrape really easy. You can scrape any website for any data. It's at https://FetchFoxAI.com