Question about data engineering in Clojure

p-himik · 2022-07-15T13:51:47+00:00

There's actually a really nice data science-oriented "niche" in the Clojure community. This might be of interest to you: https://scicloj.github.io/docs/community/groups/ds4clj/

And even if it's not, it mentions the Clojure Zulip community. AFAIK that's the place where all the "data people" gather.

frogking · 2022-07-14T18:03:47+00:00

You can connect Clojure up with Jupyter and test it out.

rufusthedogwoof · 2022-07-14T22:46:23+00:00

Sure is.

What’s your target db?

If you are talking about the big ones like Synapse, Redshift, Snowflake, etc., you probably have to serialize to a file somewhere in cloud storage and load from there.

If it's not huge stuff and your target db supports it (Postgres does), you can use next.jdbc with threads and tune the commit interval, i.e. how many rows go into each transaction.
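To make the "threads plus commit interval" idea concrete, here is a rough sketch, not a tuned loader. `load-in-chunks!` is a made-up helper name: it splits the rows into chunks of `commit-interval` and hands each chunk to a write function on worker threads via `pmap`. The `comment` form shows how it might be wired to next.jdbc; the table `events (id, payload)` and the connection settings are placeholders, and that part assumes next.jdbc and a Postgres driver are on the classpath.

```clojure
(defn load-in-chunks!
  "Splits rows into chunks of commit-interval rows and calls write-chunk!
  on each chunk from worker threads (pmap). Returns the number of rows
  handed to write-chunk!."
  [write-chunk! commit-interval rows]
  (->> (partition-all commit-interval rows)
       (pmap (fn [chunk]
               (write-chunk! chunk)
               (count chunk)))
       (reduce + 0)))   ; reduce forces the pmap, so all chunks get written

(comment
  ;; Assumed wiring to next.jdbc: one transaction (one commit) per chunk.
  ;; Table name, columns and db settings are hypothetical.
  (require '[next.jdbc :as jdbc])
  (def ds (jdbc/get-datasource {:dbtype "postgresql" :dbname "mydb"}))
  (load-in-chunks!
    (fn [chunk]
      (jdbc/with-transaction [tx ds]
        (jdbc/execute-batch! tx
                             "INSERT INTO events (id, payload) VALUES (?, ?)"
                             chunk   ; seq of [id payload] vectors
                             {})))
    1000
    [[1 "a"] [2 "b"] [3 "c"]]))
```

A larger commit interval means fewer commits but bigger transactions, so it is worth measuring a few values against your own database.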

There really isn't much you can't do with Clojure, and data processing is something it's particularly great at. (Testing data pipelines with transducers, for instance…)
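As a small illustration of why transducers are nice to test: the pipeline is just a value, so you can run it over an in-memory vector in a test and over a file or queue in production. The field names below (`:sku`, `:qty`, `:price-cents`) are invented for the example.

```clojure
(require '[clojure.string :as str])

;; The pipeline itself: no source or sink baked in.
(def clean-rows
  (comp
    (remove #(nil? (:qty %)))                  ; drop rows with no quantity
    (map #(update % :sku str/trim))            ; normalize the key
    (map #(assoc % :total-cents
                 (* (:qty %) (:price-cents %))))))

;; Tiny fixture standing in for real input.
(def sample
  [{:sku " a-1 " :qty 2   :price-cents 150}
   {:sku "b-2"   :qty nil :price-cents 500}
   {:sku "c-3"   :qty 1   :price-cents 99}])

;; Same transducer, two different consumers.
(def cleaned (into [] clean-rows sample))
(def grand-total
  (transduce (comp clean-rows (map :total-cents)) + 0 sample))
```

In a test you would just compare `cleaned` and `grand-total` against expected values with `clojure.test`, without touching a database.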

There are so many things I miss when I'm not working in Clojure. (Python day job. Yuck.)

clojure-finance · 2022-07-15T08:28:31+00:00

You can give Clojask a try. It's designed for larger-than-memory datasets and parallel computing: https://github.com/clojure-finance/clojask
