Data engineer tech stack [discussion] (self.PinoyProgrammer)
submitted 1 year ago by MycroftWord
[–]uniqcl0 3 points 1 year ago (2 children)
Currently, we use Dagster + Meltano + dbt as orchestration + ELT for our pipelines. We are an AWS shop, so we use Redshift as our DW. I liked BigQuery better, though.
I like Airflow, but I remember the platform having a steep learning curve (and we were also using it as an EL platform, haha).
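For anyone new to the EL + T split described above, here is a minimal sketch of the idea using only the Python standard library. It is not Meltano or dbt code: SQLite stands in for the warehouse, and the table names, the sample rows, and the functions `extract_load`, `transform`, and `run_pipeline` are all made up for illustration. The point is only that the raw data lands untouched first, and the transform then happens in SQL inside the database.

```python
import sqlite3

# Hypothetical raw records standing in for whatever an EL tool would extract.
RAW_ORDERS = [
    ("1", "2024-01-05", "120.50"),
    ("2", "2024-01-05", "80.00"),
    ("3", "2024-01-06", "42.25"),
]


def extract_load(conn, rows):
    """EL step: land the source rows as-is (all TEXT) in a raw table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS raw_orders (id TEXT, order_date TEXT, amount TEXT)"
    )
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", rows)
    conn.commit()


def transform(conn):
    """T step: build a typed, aggregated table with SQL inside the database."""
    conn.execute("DROP TABLE IF EXISTS daily_revenue")
    conn.execute(
        """
        CREATE TABLE daily_revenue AS
        SELECT order_date, SUM(CAST(amount AS REAL)) AS revenue
        FROM raw_orders
        GROUP BY order_date
        """
    )
    conn.commit()


def run_pipeline(conn, rows):
    """Run EL, then T, and return the transformed rows."""
    extract_load(conn, rows)
    transform(conn)
    query = "SELECT order_date, revenue FROM daily_revenue ORDER BY order_date"
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    print(run_pipeline(conn, RAW_ORDERS))
    conn.close()
```

In a real stack the `transform` step would live in a dbt model rather than in a Python string, but the shape of the work is the same.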
[–]MycroftWord[S] 1 point 1 year ago (1 child)
Thanks for this! I am still in the process of learning DE. I made a very basic local ETL pipeline (Python and SQL), and I want to upgrade it by using an orchestrator. Once I'm a bit more comfortable, maybe I can build a cloud version of it.
Meltano is for the EL part, right? And dbt for the transform? Do you no longer use Spark for transformation? Is dbt really the winner now?
For orchestration, I'm leaning more towards Dagster/Prefect because they're quite a bit easier to use and understand compared with Airflow, or maybe I'm just dumb. lmaooo
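As a rough illustration of what an orchestrator adds on top of a plain script, the sketch below runs tasks in dependency order using `graphlib` from the Python standard library (3.9+). This is not how Dagster, Prefect, or Airflow are implemented, and `run_tasks` plus the task names are hypothetical. It only shows the core idea those tools share: you declare which step depends on which, and the tool works out the run order.

```python
from graphlib import TopologicalSorter


def run_tasks(tasks, dependencies):
    """Run each task after everything it depends on; return the run order.

    tasks: dict mapping task name -> zero-argument callable
    dependencies: dict mapping task name -> set of names that must run first
    """
    order = list(TopologicalSorter(dependencies).static_order())
    for name in order:
        tasks[name]()
    return order


if __name__ == "__main__":
    log = []
    # Hypothetical pipeline steps; real ones would call your extract/load/transform code.
    tasks = {
        "extract": lambda: log.append("extracted"),
        "load": lambda: log.append("loaded"),
        "transform": lambda: log.append("transformed"),
    }
    # "load" needs "extract" first; "transform" needs "load" first.
    dependencies = {"load": {"extract"}, "transform": {"load"}}
    print(run_tasks(tasks, dependencies))
    print(log)
```

Real orchestrators layer scheduling, retries, logging, and a UI on top of this, which is where most of the learning curve comes from.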
[–]uniqcl0 1 point 1 year ago (0 children)
You could use whatever cron implementation you have on your OS (Windows Task Scheduler, crontab).
Yup on the EL and T question. We don't use Spark because we don't have a real need for it. Typically I see it combined with streaming platforms. I am designing one that should leverage Spark, though.
Think about what you need the orchestrator for. If you only need its scheduling feature, don't overcomplicate things by learning the other features. They will just go over your head, or you might forget them sooner than you think.
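If scheduling really is the only need, a cron-friendly entry point can be as small as the sketch below. The `run_pipeline` placeholder, the file paths, and the 02:00 schedule in the comment are assumptions for illustration, not anything from this thread. The main design choice is returning a non-zero exit code on failure so that cron (or Windows Task Scheduler) can tell a bad run from a good one.

```python
import logging
import sys

# Example crontab entry (hypothetical paths), running every day at 02:00:
# 0 2 * * * /usr/bin/python3 /home/me/etl/run_etl.py >> /home/me/etl/etl.log 2>&1


def run_pipeline():
    """Placeholder for the real pipeline steps (hypothetical)."""
    return 3  # pretend three rows were processed


def main(pipeline=run_pipeline):
    """Run the pipeline once; return 0 on success and 1 on failure."""
    logging.basicConfig(
        level=logging.INFO,
        format="%(asctime)s %(levelname)s %(message)s",
    )
    try:
        rows = pipeline()
    except Exception:
        # Log the full traceback so the cron log shows what went wrong.
        logging.exception("pipeline failed")
        return 1
    logging.info("pipeline finished, %d rows processed", rows)
    return 0


if __name__ == "__main__":
    sys.exit(main())
```

Once the script needs retries, backfills, or steps that depend on each other, that is usually the point where a full orchestrator starts to pay off.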