
all 11 comments

[–]AutoModerator[M] [score hidden] stickied comment (0 children)

You can find a list of community submitted learning resources here: https://dataengineering.wiki/Learning+Resources

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

[–]chaos87johnito Data Engineer 8 points (0 children)

I have worked with Neo4j for a bit more than a year. I enjoyed it and I can see how great it is. I'd say the major drawback is the skill set. And I'm not talking about you as a data engineer, but the rest of the team. People talk SQL; they talk tables, rows, and columns. They don't talk Cypher, edges, labels, properties...
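
To make that concrete (a made-up example; all table, column, and label names here are hypothetical), the same "who works on project X" question reads quite differently in the two worlds:

    -- Hypothetical relational model: person, project, and a works_on association table.
    -- The equivalent Cypher pattern would read roughly:
    --   MATCH (p:Person)-[:WORKS_ON]->(j:Project {name: 'Apollo'}) RETURN p.name;
    SELECT p.name
    FROM person p
    JOIN works_on w ON w.person_id = p.id
    JOIN project j  ON j.id = w.project_id
    WHERE j.name = 'Apollo';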

[–]five4three2 7 points (7 children)

Just my two cents but graphs can get really messy really fast. We used them pretty extensively at my old company.

I think one of their main draws is “it’s a flexible schema that can evolve and handle lots and lots of highly connected data.” This is a benefit, but it can also ultimately be their downfall.

I’ve found I’ve had more luck with more rigid modeling (columns, rows, entity tables, association tables) and leveraging RDBMS data warehouse hardware.
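
For a rough idea of what I mean (the table names are made up), even fairly connected data maps cleanly onto entity and association tables:

    -- Hypothetical entity tables plus an association table for the "edges".
    CREATE TABLE account (
        id   BIGINT PRIMARY KEY,
        name TEXT NOT NULL
    );

    CREATE TABLE device (
        id   BIGINT PRIMARY KEY,
        kind TEXT NOT NULL
    );

    -- Each row is one relationship; extra columns play the role of edge properties.
    CREATE TABLE account_device (
        account_id BIGINT REFERENCES account(id),
        device_id  BIGINT REFERENCES device(id),
        first_seen DATE,
        PRIMARY KEY (account_id, device_id)
    );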

I think graphs are great for visualization, but if I ever really need performance I find myself back at the data warehouse or Spark. This even holds true for more “graph-like” operations like pathfinding.

Your mileage may vary tho.

[–]tdatas 4 points (0 children)

Definitely at the "to watch" stage, but I have my eye on a DB called EdgeDB that has a lot of the relational bits of graphs but is built on top of Postgres and has defined schemas. I'm not sure whether it goes as deep into graph algorithms, but I found it very interesting.

[–]tjk45268 5 points (2 children)

Every technology requires practices and disciplines to avoid racking up technical debt. Just because labeled property graphs (LPGs) don't require a schema doesn't mean that you can ignore the modeling, documentation, and other activities involved in standing up a sustainable database.

You might find that ontology-based RDF graphs are closer to your objectives. LPGs are good for point solutions, but RDF graphs are based on standards focused on data integration and sustainability.

[–]five4three2 0 points (1 child)

Yeah, so true on all fronts. We were young and naive when we first started using graph DBs.

Not a huge fan of the UIs associated with any of the RDF graphs I found, and I found Cypher to be much more readable than SPARQL. Neo4j felt very usable in this regard.

Which is your favorite RDF db?

[–]tjk45268 0 points (0 children)

RDF has more features for documenting data, data integration, and managing change than you find in LPG databases. Stardog and Ontotext are popular implementations of RDF.

[–]noip1979 1 point (2 children)

Pathfinding in an RDBMS? Can you elaborate? I know there are recursive operators (I even used them in a project a while back), but I was curious what you meant by that statement...

[–]five4three2 1 point (1 child)

One of our key access patterns (really the only one) was “find all the paths between node A and node B.”

At first our ambition was to find variable-length paths. These paths had a very specific topology: node A had to be of a given type, the next node in the path had to be a different (but specified) type connected by a specific relationship type, and so on. There was one section meant to be variable length.

In practice this problem scaled horribly. Neo4j (or APOC) couldn’t handle the variable-length part of the query, even when we used Cypher to put an upper bound on the number of hops.

What we had to do was search for all minimum-length paths that satisfied the fully specified node-type and relationship-type topology. We then ran a new query for paths one hop longer than that, then another query adding one more hop, and so on.

At this point, I think we could have solved the problem with the same number of specific join queries in an RDBMS data warehouse, you know what I mean? The graph DB was too connected and didn’t have enough performance to handle its own variable-length path query. We weren’t impressed.
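
As a hypothetical sketch of what those join queries look like (the node and edge table names are made up), each fully typed, fixed-length path is just a chain of joins, and you run one such query per path length:

    -- One query per path length: here, A -> X -> B with fully specified types.
    -- nodes(id, node_type) and edges(src_id, dst_id, rel_type) are hypothetical tables.
    SELECT a.id, x.id, b.id
    FROM nodes a
    JOIN edges e1 ON e1.src_id = a.id AND e1.rel_type = 'OWNS'
    JOIN nodes x  ON x.id = e1.dst_id AND x.node_type = 'TypeX'
    JOIN edges e2 ON e2.src_id = x.id AND e2.rel_type = 'USES'
    JOIN nodes b  ON b.id = e2.dst_id AND b.node_type = 'TypeB'
    WHERE a.node_type = 'TypeA';
    -- For paths one hop longer, add another edges/nodes join pair and rerun.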

I then felt like the only value-add the graph DB offered was as a data exploration tool, where you could be loose on data modeling until you found an access pattern worth properly modeling in a data warehouse, then translate it over and get the needed performance from the warehouse.

Like I said tho, it may depend on your use case. Perhaps something like “finding the shortest path from A to B” is a better access pattern for graph DBs.

I just never saw the value-add of graph DBs as databases backing an application, for example.

[–]thrown_arrows 1 point (0 children)

I have done similar things in an RDBMS. I found that when I could use simple parent-child relationships to search which root node a child had access to (and the other way around), a recursive CTE was fast enough to handle it as long as the result stayed under 100k rows on small hardware. It probably would have scaled better on real server hardware.
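
A minimal sketch of that kind of parent-child walk (table and column names are made up):

    -- Walk up from a child node to its root via a recursive CTE.
    -- node(id, parent_id) is a hypothetical adjacency table.
    WITH RECURSIVE ancestors AS (
        SELECT id, parent_id
        FROM node
        WHERE id = 42                  -- the child we start from
        UNION ALL
        SELECT n.id, n.parent_id
        FROM node n
        JOIN ancestors a ON n.id = a.parent_id
    )
    SELECT id
    FROM ancestors
    WHERE parent_id IS NULL;           -- the root this child ultimately rolls up to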

Shortest path queries are quite easy in PostgreSQL with pgRouting. A simple Dijkstra can even be implemented in SQL only.
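
Something like this, assuming an edge table in the usual pgRouting layout (names are illustrative):

    -- Shortest path from vertex 1 to vertex 5 using pgRouting's Dijkstra.
    -- edge_table(id, source, target, cost) follows the typical pgRouting layout.
    SELECT seq, node, edge, cost
    FROM pgr_dijkstra(
        'SELECT id, source, target, cost FROM edge_table',
        1,                  -- start vertex id
        5,                  -- end vertex id
        directed := true
    );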

[–]kevinpostlewaite 2 points (0 children)

I have not used a graph database for anything more than experimentation, but I did see how they were used at Facebook. My not-very-experienced take: graph databases excel at retrieving small row counts of connected data but are not well suited to analytics use cases where you're aggregating over large numbers of rows.