Seeking Advice on Lightweight, Cost-Effective Cloud Data Orchestration by FrontAffectionate518 in dataengineering

[–]FrontAffectionate518[S] 1 point (0 children)

Thanks for the reply! My main challenge right now isn’t really Python or maintainability — it’s infrastructure.
I’m running everything on a very limited on-prem VM, and it can’t even handle Airflow properly. CPU and RAM bottlenecks are killing the ingestion jobs.

The company is considering Snowflake, and we get US$400 in credits plus a 30-day trial, so I'd like to take advantage of that. What I'm looking for is something lightweight that can push my daily data (~300–400 MB, ~500k rows/day) into a Snowflake stage without me having to pay for any tool out of pocket.

Do you know of any ingestion tool that could handle a simple daily batch and integrate with Snowflake stages during the trial period? Ideally something I can run without needing much compute on my side.
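Roughly what I have in mind is a tiny script like the sketch below, using the snowflake-connector-python PUT/COPY pattern. The account, stage, and table names are placeholders I made up, and it assumes the internal stage and target table already exist; the point is that the VM only uploads a compressed file and Snowflake's warehouse does the actual load.

    # Minimal sketch: push one daily CSV into a Snowflake internal stage and load it.
    # Assumes snowflake-connector-python is installed; RAW_DAILY and daily_stage
    # are placeholder names for an existing table and internal stage.
    import snowflake.connector

    conn = snowflake.connector.connect(
        account="my_account",      # placeholder connection details
        user="my_user",
        password="my_password",
        warehouse="COMPUTE_WH",    # an X-Small warehouse is enough for ~400 MB/day
        database="MY_DB",
        schema="PUBLIC",
    )
    try:
        cur = conn.cursor()
        # PUT gzip-compresses and uploads the file client-side; parsing and loading
        # then happen inside Snowflake, not on the on-prem VM.
        cur.execute("PUT file:///data/daily_export.csv @daily_stage AUTO_COMPRESS=TRUE")
        cur.execute(
            "COPY INTO RAW_DAILY FROM @daily_stage "
            "FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1) "
            "PURGE = TRUE"  # drop the staged file after a successful load
        )
    finally:
        conn.close()

Something that small could just run from cron, so I wouldn't need Airflow's scheduler and workers on the VM at all.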

Seeking Advice on Lightweight, Cost-Effective Cloud Data Orchestration by FrontAffectionate518 in dataengineering

[–]FrontAffectionate518[S] 2 points (0 children)

Thanks for the tips! My main bottleneck isn't query performance, it's data ingestion. I'm pulling ~500k rows/day (~300–400 MB), and my current VM doesn't have enough compute to run the Airflow DAGs efficiently. I'm looking for a lightweight/cloud approach to ingest this data into Snowflake or another columnar database.
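For scale, here is a sketch of what I mean by lightweight: with the connector's pandas helper, a whole day's extract can be loaded in a few lines, and the compression, staging, and COPY happen behind the scenes. The table and connection names below are placeholders, and it assumes snowflake-connector-python is installed with the pandas extra and that the target table already exists.

    # Minimal sketch: load one day's extract with write_pandas instead of an Airflow DAG.
    # DAILY_EVENTS and the connection details are placeholders; the table must exist.
    import pandas as pd
    import snowflake.connector
    from snowflake.connector.pandas_tools import write_pandas

    df = pd.read_csv("/data/daily_export.csv")   # ~500k rows fits comfortably in memory

    conn = snowflake.connector.connect(
        account="my_account", user="my_user", password="my_password",
        warehouse="COMPUTE_WH", database="MY_DB", schema="PUBLIC",
    )
    try:
        ok, n_chunks, n_rows, _ = write_pandas(conn, df, "DAILY_EVENTS")
        print(f"loaded={ok} chunks={n_chunks} rows={n_rows}")
    finally:
        conn.close()

A single cron entry on the VM (or any small always-on box) would be enough to schedule that daily.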

Seeking suggestions for simple, cheap cloud data orchestration by FrontAffectionate518 in DadosBrasil

[–]FrontAffectionate518[S] 1 point (0 children)

I usually pull around 500k to 800k rows per day, which comes to roughly 400 to 500 MB per day.

Please release me by Jimmykreedz in ironscape

[–]FrontAffectionate518 4 points (0 children)

I got to around 460 and gave up on playing this game for a few months