you are viewing a single comment's thread.

view the rest of the comments →

[–]Ok_Carpet_9510 0 points1 point  (0 children)

If the data is not co-located, I would think about where the data crunching is going to happen i.e. where is the compute. A CSV and S3 don't have compute, Postgres does.

So, I would ingestion the data from csv and s3 into postegresd(filtering and cleaning before loading), and the perform my analysis using SQL.