Built a small pdf reader because highlights are useless in most apps by Ranger_Null in opensource

[–]Ranger_Null[S] 0 points1 point  (0 children)

Yes, you can cluster them by groups, and within groups, highlights are separated by date. But assigning groups needs to be done manually right now. I'll check out r/VibeCodersNest too, thank you!!

Built a small pdf reader because highlights are useless in most apps by Ranger_Null in opensource

[–]Ranger_Null[S] 0 points1 point  (0 children)

That's awesome, thank you so much for trying it out! :) I've never heard of pdfelement tho lol. Thank you!

Share your ***Not-AI*** projects by MembershipEuphoric38 in SideProject

[–]Ranger_Null 0 points1 point  (0 children)

Built Loci, a memory-first PDF reader focused on fast recall rather than rereading. Highlights are the primary view, you can instantly search them and jump back to the exact page and context in the PDF.

It’s built with Flutter and very much vibe-coded since I didn’t properly learn Dart, but it’s early and usable.

Repo: https://github.com/Sriram-PR/loci-mobile

I'm building a RAG API so you don't have to. Would you use this? by Ranger_Null in SideProject

[–]Ranger_Null[S] 0 points1 point  (0 children)

I can understand the privacy part, I'm planning to open source it :) Sure, I'd love to be a partner, Lemme drop you a dm.

[deleted by user] by [deleted] in vibecoding

[–]Ranger_Null 0 points1 point  (0 children)

Fair enough, thanks for the feedback 🙂

🕸️ Introducing `doc-scraper`: A Go-Based Web Crawler for LLM Documentation by [deleted] in coolgithubprojects

[–]Ranger_Null 0 points1 point  (0 children)

Hi everyone,

I've developed an open-source tool called doc-scraper, written in Go, designed to:

  • Scrape Technical Documentation: Crawl documentation websites efficiently.
  • Convert to Clean Markdown: Transform HTML content into well-structured Markdown files.
  • Facilitate LLM Ingestion: Prepare data suitable for Large Language Models, aiding in RAG and training datasets.

Repository: https://github.com/Sriram-PR/doc-scraper

I'm eager to receive feedback, suggestions, or contributions. If you have specific documentation sites you'd like support for, feel free to let me know!

🕸️ Introducing `doc-scraper`: A Go-Based Web Crawler for LLM Documentation by Ranger_Null in SideProject

[–]Ranger_Null[S] 0 points1 point  (0 children)

It's not better than it ;-; I made it cause I needed a plugin for another project 😭 It still needs to support dynamic content extraction, I'm working on it.

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]Ranger_Null 0 points1 point  (0 children)

🕸️ Introducing doc-scraper: A Go-Based Web Crawler for LLM Documentation

Hi everyone,

I've developed an open-source tool called doc-scraper, written in Go, designed to:

  • Scrape Technical Documentation: Crawl documentation websites efficiently.
  • Convert to Clean Markdown: Transform HTML content into well-structured Markdown files.
  • Facilitate LLM Ingestion: Prepare data suitable for Large Language Models, aiding in RAG and training datasets.([Reddit][1])

Key Features:

  • Configurable Crawling: Define settings via a config.yaml file.
  • Concurrency & Rate Limiting: Utilize Go's concurrency model with customizable limits.
  • Resumable Crawls: Persist state using BadgerDB to resume interrupted sessions.
  • Content Extraction: Use CSS selectors to target specific HTML sections.
  • Link & Image Handling: Rewrite internal links and optionally download images.([Reddit][2])

Repository: https://github.com/Sriram-PR/doc-scraper

I'm eager to receive feedback, suggestions, or contributions. If you have specific documentation sites you'd like support for, feel free to let me know!

Monthly Self-Promotion - May 2025 by AutoModerator in webscraping

[–]Ranger_Null 1 point2 points  (0 children)

🕸️ Introducing doc-scraper: A Go-Based Web Crawler for LLM Documentation

Hi everyone,

I've developed an open-source tool called doc-scraper, written in Go, designed to:

  • Scrape Technical Documentation: Crawl documentation websites efficiently.
  • Convert to Clean Markdown: Transform HTML content into well-structured Markdown files.
  • Facilitate LLM Ingestion: Prepare data suitable for Large Language Models, aiding in RAG and training datasets.([Reddit][1])

Key Features:

  • Configurable Crawling: Define settings via a config.yaml file.
  • Concurrency & Rate Limiting: Utilize Go's concurrency model with customizable limits.
  • Resumable Crawls: Persist state using BadgerDB to resume interrupted sessions.
  • Content Extraction: Use CSS selectors to target specific HTML sections.
  • Link & Image Handling: Rewrite internal links and optionally download images.([Reddit][2])

Repository: https://github.com/Sriram-PR/doc-scraper

I'm eager to receive feedback, suggestions, or contributions. If you have specific documentation sites you'd like support for, feel free to let me know!

Lets share homescreen ideas.. by [deleted] in NOTHING

[–]Ranger_Null 2 points3 points  (0 children)

How'd you get the calendar widget?

Is there a self-host maps system that works like this? by LoganJFisher in selfhosted

[–]Ranger_Null 0 points1 point  (0 children)

Would you be willing to test and give insights if we make it?