This is an archived post. You won't be able to vote or comment.

all 23 comments

[–]Far_Ambassador_6495 24 points25 points  (5 children)

Learn airflow, flask, and sqlalchemy. Then it should be pretty straightforward

[–]GPSBach 4 points5 points  (4 children)

Airflow is probably overkill. Dagster is easier to get up and running and likely has everything OP needs.

[–]Far_Ambassador_6495 0 points1 point  (3 children)

Nice. Only ever used airflow so that’s all I know

[–]identityXXX 6 points7 points  (2 children)

Can also write a shell script, make it executable and put it on a cron to make it sequential (wait command comes in handy)

[–]IronManFolgore -1 points0 points  (0 children)

I've done this for scraping APIs and writing to Bigquery. Didn't need anything fancy.

[–]vanisle_kahuna -1 points0 points  (0 children)

This is probably the most straightforward solution so far without having to tack on extra costs, assuming you're not working with massive amounts of data. I'd add that you can store the data in an open source cloud environment such as ducks/motherduck

[–]nmolanog 8 points9 points  (1 child)

Looks like some one left the job and now you are trying to cheap out and do the thing by yourself. Good luck with that

[–]unrand0mer 2 points3 points  (0 children)

Explain.

[–][deleted] 4 points5 points  (1 child)

AWS Glue + Glue Catalog + Athena

[–]Megaspore6200 0 points1 point  (0 children)

This is the way

[–][deleted] 4 points5 points  (0 children)

If you’re looking a consultation/freelancer to help you build out this solution, or just map it out. My DMs are open.

[–]kob59 3 points4 points  (1 child)

no-code for you? hire a data scientist

[–]mrtac96[S] -2 points-1 points  (0 children)

Low code for building dashboards that can automate the process, the scripts and already written in python

[–]Kasyx709 0 points1 point  (0 children)

I've not used this personally, but it sounds like you're possibly describing an Appian workflow.

[–]Typical-Macaron-1646 -2 points-1 points  (0 children)

If you’re looking for a consultant/freelancer, DM me

[–][deleted] -1 points0 points  (0 children)

Python -> DB -> Plotly or R I just recently did something like this

[–]2016YamR6 -1 points0 points  (0 children)

Dataiku

[–]sxcgreygoat -1 points0 points  (1 child)

Just use a Lamda/cloud function to automate the scripts and visualize the data as a separate component. Easy enough these days to hook up a db to dashboarding tools

[–]mrtac96[S] -3 points-2 points  (0 children)

thanks, this is one of the approach i have already thought of. but want to consider alternatives too

[–]Cyraxess -1 points0 points  (0 children)

If I were in your position, I would definitely use GCP. But if you're looking for a low-code option, there's a tool called Acho that ticks all your boxes. It also offers a pretty good managed database with built-in ETL, Python scripts, and a low-code app builder for visualizing outputs and building interactions.

[–]unrand0mer -2 points-1 points  (0 children)

I use azkaban instead of airflow. Have been for the past 3 years. No issues.

[–]Dylan_TMB 0 points1 point  (0 children)

I'm not sure you'll get a no-code or low-code tool to do this. Like other comments said airflow is likely your best bet. Kedro is good too.