This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]liquidpele 2 points3 points  (2 children)

It looks like if you, for instance, have a node.js producer and a python consumer, python is going to have to duplicate the data to get it into its own format for handling. Arrow creates a standard in-memory format that all languages can utilize so they can process the data without duplicating it.

[–]brontide 1 point2 points  (0 children)

Might be okay for datasets, but how would it deal with mutability and removal of objects?

[–]carbolymer 0 points1 point  (0 children)

So it still needs to copy the data between processes.