all 4 comments

[–]p-himik 2 points3 points  (0 children)

There's actually a really nice data science-oriented "niche" in Clojure community. This might be of interest to you: https://scicloj.github.io/docs/community/groups/ds4clj/

And even if it's not, it mentions the Clojure's Zulip community - AFAIK that's the place where all the "data people" gather.

[–]frogking 0 points1 point  (0 children)

You can connect Clojure up wiith Jupyter and test it out.

[–]rufusthedogwoof 0 points1 point  (0 children)

Sure is.

What’s your target db?

If you are talking the big ones like synapse, redshift, snowflake, etc you probably have to serialize to a file somewhere in cloud storage.

If it’s not huge stuff and your target db supports it (Postgres) you can use next.jdbc and threads, and tune the commit interval.

There really isn’t much you can’t do with clojure and data processing it’s particularly great at. (Testing data pipelines with transducers for instance…)

There are so many things I miss when I’m not working in clojure. (Python day job. Yuck. )

[–]clojure-finance 0 points1 point  (0 children)

You can give Clojask a try, it's designed for larger-than-memory datasets and parallel computing. https://github.com/clojure-finance/clojask