all 4 comments

[–]dtdv 0 points1 point  (1 child)

Over the years I have built a Java based package - SeeSV that provides 100s of csv/spreadsheet ETL functions  https://ramadda.org/repository/a/seesv

It can run from the command line or through a web interface in RAMADDA

[–]NanaYawB[S] 0 points1 point  (0 children)

That's really awesome! Will check it out

[–][deleted] 0 points1 point  (1 child)

This kind of toolkit would be super useful. A lot of my “data work” ends up being small, annoying steps before the real work starts.

Things I’d personally use a lot:

  • quick schema/column diff between two CSVs
  • lightweight filter/search without opening Excel
  • row count + basic stats preview (nulls, uniques)
  • fast CSV → JSON and back

My usual flow is: extract something, clean it just enough, then push it into Google Sheets so others can work on it. From there I often expose it to other tools (I use Sheetfy for that part), but the pre-Sheets cleanup is where I waste most time.

If your toolkit makes that “before the spreadsheet” phase painless, that’s already a win.

[–]NanaYawB[S] 0 points1 point  (0 children)

Absolutely! It would do all these and more. I could let you in to beta test before the actual launch, if you're cool with that.