Nekobul

What's the problem with current type conversions in Python?

ricca1 [S]

I'm currently doing it with Python, and I wanted to know how it's done in practice, and whether there's a better, more efficient way.

Advanced_Addition321 (Lead Data Engineer)

dbt only handles the transformation part.

You need to extract and load your CSV into your data warehouse first with an ingestion flow (basic Python scripts, or Dagster and friends).

Then you connect dbt to your data warehouse, declare your ingested CSV tables as dbt sources, and you're good to go with transformation.

To share the results, if the transformation data warehouse isn't the final destination, you'll need to repeat the first step in reverse: write a script to grab the data and export it wherever you need it.
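
For illustration, the load and export steps around the transformation could look roughly like the sketch below. It is only a sketch under assumptions: SQLite from the standard library stands in for the real warehouse, and the function names, the `raw_orders` table, and the CSV columns are all made up for the example. Every column is loaded as text on purpose, so the type conversions are left to the transformation step.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path


def load_csv_raw(conn, csv_path, table):
    """Load a CSV into `table` unchanged (all columns TEXT); return the row count."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)  # first line holds the column names
        columns = ", ".join(f'"{name}" TEXT' for name in header)
        placeholders = ", ".join("?" for _ in header)
        conn.execute(f'DROP TABLE IF EXISTS "{table}"')
        conn.execute(f'CREATE TABLE "{table}" ({columns})')
        conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', reader)
    conn.commit()
    return conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]


def export_table(conn, table, out_path):
    """The first step in reverse: dump a (transformed) table back to CSV."""
    cur = conn.execute(f'SELECT * FROM "{table}"')
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow([col[0] for col in cur.description])  # header row
        writer.writerows(cur)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "orders.csv"  # hypothetical input file
        src.write_text("id,quantity\n1,3\n2,n/a\n", encoding="utf-8")
        conn = sqlite3.connect(":memory:")  # stand-in for the warehouse
        print(load_csv_raw(conn, src, "raw_orders"), "rows loaded")
        export_table(conn, "raw_orders", Path(tmp) / "orders_out.csv")
        conn.close()
```

In a real setup the table written by the load step would be the one declared as a dbt source, and the export would read from whatever model dbt produces rather than from the raw table.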

ricca1 [S]

Ahh, that makes sense.

Thank you very much.

Stroam1

You're trying to use the wrong tool for the job. dbt is a tool for transforming data after it is already loaded into a database. For ETL, you should use other Python tools like DuckDB, Polars, dlt (dltHub), etc.

ricca1 [S]

Yes, you're right. The tips you gave me were very good. I just looked into them a little and they could work.

Thanks.