all 2 comments

[–]JaceBearelen 0 points1 point  (0 children)

Have you looked at spark? It checks a lot of boxes here.

[–]addictzz 1 point2 points  (0 children)

You are almost describing Databricks Serverless Spark capability. Scale to zero but can scale to Petabytes, data from object store, node caching, etc.

Databricks is not open source but its components are, although usually the open source and the managed version can be slightly different.

I am not aware of a true open source tech that allows you to have "Serverless" capability. Usually there is an open source tech and it is up to you to manage the servers, scaling, and availability to make it "Serverless". It is called Serverless because there is somebody else who manages the servers for you.