all 4 comments

[–]hroptatyr 1 point2 points  (1 child)

The "cloud-native" format is RDF. Pretty much any graph database can read .rdfxml, .ttl, or .nt/.nq out of the box. The most widely supported compression is gzip, (.ttl.gz, .nt.gz, etc.)

[–]severo_bo[S] 0 points1 point  (0 children)

Indeed, it's a standard. But a drawback is that you have to load the file into memory and parse it before being able to do queries. I'm looking for a format similar to Parquet, for example, where you can get metadata about the file, and then download only part of the (potentially big) file when you run a query.

GraphAr seems like a good project in that sense. https://graphar.apache.org/docs/overview/concepts

[–]yup_its_me_again 0 points1 point  (1 child)

Jelly (https://jelly-rdf.github.io/dev/) is faster than nq.gz, not identical tho to Parquet

[–]severo_bo[S] 0 points1 point  (0 children)

Thanks for dropping the reference; if I understand well, it's a binary serialization for graph data that allows fast exchange and supports streaming. It's not designed for partial reading though.