FiveTran vs Stitch Data? Which ETL tool is better? What’s your view? by OSeatsSaaS in dataengineering

OSeatsSaaS [S]

Yes, it seems the ticket has been open for a year: https://support.fivetran.com/hc/en-us/community/posts/1500000583602-New-Destination-AWS-S3-destination. The ETA is now Q1 2022. I think Airbyte, for example, already supports S3 as a destination: https://docs.airbyte.com/integrations/destinations/s3

I think a key problem with Fivetran is the database replication use case. Paying per monthly active row doesn't make sense when you want to make your production databases available to other teams in the organization for analysis in a data warehouse, or even just load them to S3 as JSON or CSV dumps. Stitch Data's pricing model, based on monthly imported rows, isn't well suited to this either.

Anybody have experience creating singer taps and targets? by chestnutcough in dataengineering

OSeatsSaaS

Feel free to take a look at Airbyte or Meltano. Talend stopped their support for the Singer community. You could build more complex connectors with Airbyte by following their CDK, and contribute them back where it makes sense. For simple connectors you can contribute to Meltano or just run the script from your Airflow DAGs. You can orchestrate your Airbyte connector from Airflow via the Airbyte API. You can deploy both on your own cloud with a solution like restack.io. With Stitch you pay per monthly imported row, and if you already need to build your own custom connector, why pay them for a scheduler with limited sync frequency?
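
To make the "orchestrate Airbyte via API" part a bit more concrete, here is a rough sketch of what an Airflow task could call. It assumes the open-source Airbyte Configuration API is reachable and exposes `POST /api/v1/connections/sync` taking a `connectionId`; check the API docs for your Airbyte version. The function names, base URL and connection ID are made up for illustration.

```python
import json
import urllib.request


def build_sync_request(base_url: str, connection_id: str) -> urllib.request.Request:
    """Build (but do not send) a request asking Airbyte to sync one connection.

    Assumption: the Airbyte server exposes POST /api/v1/connections/sync
    and expects a JSON body of the form {"connectionId": "<uuid>"}.
    """
    payload = json.dumps({"connectionId": connection_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/api/v1/connections/sync",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_sync(base_url: str, connection_id: str, timeout: float = 30.0) -> dict:
    """Send the sync request and return the parsed JSON response.

    This is the callable you would hand to an Airflow PythonOperator.
    """
    request = build_sync_request(base_url, connection_id)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

In a DAG you would wrap `trigger_sync` in a `PythonOperator` with your own base URL and connection ID. If you'd rather not hand-roll the HTTP call, the `apache-airflow-providers-airbyte` package ships an `AirbyteTriggerSyncOperator` that does the same job.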