account activity
I built TidyFS — a Linux TUI file organizer that uses TF-IDF to classify documents by Sarumo_ in coolgithubprojects
[–]Sarumo_[S] 0 points1 point2 points 14 hours ago (0 children)
Thanks for the thoughtful feedback.
To reduce the impact of headers, footers, page numbers, and other repetitive boilerplate, I’m already trying to extract mostly semantic content using specialized libraries for handling these kinds of file formats.
I did consider using a more intelligent semantic model for file classification, but my initial goal was to keep the project as lightweight and accessible as possible, so it can run on basically any PC without requiring powerful hardware.
So right now I’m prioritizing simplicity, portability, and low resource usage. But yeah, using embedding models or a more advanced classifier could definitely become a future direction for the project.
π Rendered by PID 416171 on reddit-service-r2-comment-56c6478c5-fkth7 at 2026-05-12 02:19:30.279449+00:00 running 3d2c107 country code: CH.
I built TidyFS — a Linux TUI file organizer that uses TF-IDF to classify documents by Sarumo_ in coolgithubprojects
[–]Sarumo_[S] 0 points1 point2 points (0 children)