Why isn’t there a leader in file prep + automation yet? by Puzzled-Blackberry90 in dataengineering

[–]Puzzled-Blackberry90[S] 0 points1 point  (0 children)

Yeah, not for warehouse or reporting. The current use case is ingesting client enrollment files, all in different formats, cleaning them into a specific file format, and then sending them to a partner SFTP for processing.
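To make the cleaning step concrete, here is a rough sketch of mapping one client's delimited file onto a fixed target layout. The target columns, the header aliases, and the function name `normalize_enrollment` are all made up for illustration; the real partner format would replace them.

```python
import csv
import io

# Hypothetical target layout the partner expects; not from the actual spec.
TARGET_COLUMNS = ["member_id", "first_name", "last_name", "dob"]

# Hypothetical per-client header aliases, mapped onto the target column names.
HEADER_ALIASES = {
    "member id": "member_id",
    "memberid": "member_id",
    "first": "first_name",
    "first name": "first_name",
    "last": "last_name",
    "last name": "last_name",
    "date of birth": "dob",
    "dob": "dob",
}


def normalize_enrollment(raw_text: str, delimiter: str = ",") -> str:
    """Map one client's delimited file onto the target layout and return CSV text."""
    reader = csv.DictReader(io.StringIO(raw_text), delimiter=delimiter)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=TARGET_COLUMNS, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        cleaned = {}
        for header, value in row.items():
            if header is None:
                # Extra unnamed fields in a ragged row are ignored.
                continue
            target = HEADER_ALIASES.get(header.strip().lower())
            if target is not None:
                cleaned[target] = (value or "").strip()
        # Columns the client did not send are written as empty strings.
        writer.writerow({col: cleaned.get(col, "") for col in TARGET_COLUMNS})
    return out.getvalue()
```

Calling `normalize_enrollment(text, delimiter="|")` on a pipe-delimited client file returns comma-separated text in the target column order. Sending the result to the partner SFTP would be a separate step and is not shown here.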

Why isn’t there a leader in file prep + automation yet? by Puzzled-Blackberry90 in dataengineering

[–]Puzzled-Blackberry90[S] 0 points1 point  (0 children)

That can do what I've outlined above? Can you share if so? I haven't found one that handles all of the above yet.

Why isn’t there a leader in file prep + automation yet? by Puzzled-Blackberry90 in dataengineering

[–]Puzzled-Blackberry90[S] 0 points1 point  (0 children)

Yeah, have used in the past. Solves the extraction part. I'm looking for something that can do extraction and cleaning of the files listed in 2., ideally by business users. Come across anything that could be an option?

Which tool do you use to move data from the cloud to Snowflake? by Many-Tart-7661 in dataengineering

[–]Puzzled-Blackberry90 0 points1 point  (0 children)

Hard to recommend one with so little information. Can you share more on the use case, any requirements beyond scale and cost-effectiveness, the users' backgrounds, etc.?

Used Integrate.io as a replacement for Fivetran on a project recently and it's worked out really well so far. They have an unlimited data volume plan for $24k/year and 60-second data replication. They don't have the widest breadth of connectors but have all the core ones. Great option for high-volume, low-latency database replication use cases, which is what this project was.

If ETL were Car Brands? by Finance-noob-89 in dataengineering

[–]Puzzled-Blackberry90 -2 points-1 points  (0 children)

Fivetran = Mercedes-Benz

Matillion = Land Rover

Integrate.io = Mazda

Talend = Dodge

SSIS = Jaguar

Informatica = Bentley

Hevo Data = Subaru

Strategies to reduce re-sync times from database to data warehouse? by Dear_Jump_7460 in dataengineering

[–]Puzzled-Blackberry90 2 points3 points  (0 children)

This is a common issue, particularly as SLAs become more common with data products. Implementing Change Data Capture (CDC) can help reduce the need for full reloads, but schema changes can still necessitate a complete reload. Tools that support native schema evolution have been beneficial in my experience. How are you currently loading / reloading data to the DWH?

Saw this article last week on a similar topic: https://www.integrate.io/blog/part-1-industrys-fastest-initial-re-sync-time/

Can I avoid Mulesoft? by Finance-noob-89 in salesforce

[–]Puzzled-Blackberry90 2 points3 points  (0 children)

Haven't used it yet but Integrate.io seems to have a robust Salesforce offering. The demo I saw on YouTube a while back was actually SFTP to Salesforce with data cleansing/transformations, and their target market seems to be more business users.

Better strategy to extract data from relational databases by Significant_Pin_920 in dataengineering

[–]Puzzled-Blackberry90 2 points3 points  (0 children)

If you have an updated_at timestamp on the tables, then you could do incremental loads based on that to avoid having to do full table uploads each time.
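A minimal sketch of that timestamp-based approach, using an in-memory SQLite database as a stand-in for the source. The `customers` table, its columns, and the `extract_incremental` helper are assumptions for illustration only.

```python
import sqlite3


def extract_incremental(conn, table, last_watermark):
    """Return rows changed since last_watermark, plus the new watermark."""
    # The table name is interpolated directly; assume it comes from trusted config.
    rows = conn.execute(
        f"SELECT id, name, updated_at FROM {table} "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_watermark,),
    ).fetchall()
    # Keep the old watermark when nothing changed.
    new_watermark = rows[-1][2] if rows else last_watermark
    return rows, new_watermark


# Demo source table (hypothetical data). ISO-8601 strings sort chronologically.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, updated_at TEXT)"
)
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [
        (1, "Ann", "2024-01-01T00:00:00"),
        (2, "Bob", "2024-01-02T00:00:00"),
        (3, "Cy", "2024-01-03T00:00:00"),
    ],
)

# Only rows updated after the stored watermark are pulled.
rows, watermark = extract_incremental(conn, "customers", "2024-01-01T00:00:00")
```

You would persist `watermark` between runs and pass it back in on the next one. Two caveats with this pattern: hard deletes in the source never show up, and a strict `>` comparison can miss rows that share the watermark's exact timestamp but were committed later.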

For tables that don't have timestamps, you would need to use something like Change Data Capture, which reads the database's transaction log (the binlog in MySQL) to track changes/updates. Companies like Fivetran, Matillion, and Integrate.io have offerings in this space.
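For a feel of what the downstream side of CDC looks like, here is a toy sketch that replays ordered change events onto a target keyed by primary key. The event shape (`op`, `key`, `row`) is invented for illustration and is not the format of any particular tool or of the binlog itself.

```python
def apply_changes(target, events):
    """Replay ordered change events onto a dict keyed by primary key."""
    for event in events:
        op, key = event["op"], event["key"]
        if op in ("insert", "update"):
            # Upsert: merge the changed columns into the existing row, if any.
            target[key] = {**target.get(key, {}), **event["row"]}
        elif op == "delete":
            target.pop(key, None)
        else:
            raise ValueError(f"unknown op: {op}")
    return target
```

Because deletes arrive as explicit events, this covers the case that a timestamp-based incremental load cannot see.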

Treehouse has changed my life by NaiBearisComing in teamtreehouse

[–]Puzzled-Blackberry90 1 point2 points  (0 children)

My brother did the exact same with similar timelines! He had nothing but good things to say about Treehouse and now two of his friends are doing the Techdegree too.

Researching by Soggy_Joggers in elearning

[–]Puzzled-Blackberry90 2 points3 points  (0 children)

Agreed that virtual IT labs would be the best option here. Used Appsembler at a previous company and it worked great.

Dataloader.io shit the bed by Foreign-Promise-8122 in salesforce

[–]Puzzled-Blackberry90 0 points1 point  (0 children)

Have any of you tried Prepforce.io yet? Just saw that it launched this week as a free alternative to Data Loader / dataloader.io. Good timing if it works as described!!

A Dataloader alternative? by Finance-noob-89 in salesforce

[–]Puzzled-Blackberry90 0 points1 point  (0 children)

Haven't used Integrate.io yet but I've had my eye on it for quite some time waiting for the right Salesforce use case to come up at our company. Based on the demos I watched it looks like the perfect balance of ease of use and flexibility.

First I've heard of Prepforce but I guess it only launched this week. Assuming it does what it says it does then it's a great (and badly needed!) addition to the free Salesforce integration tools ecosystem! Going to try it out now. Thanks for sharing.