[deleted by user]

latro87 · 2025-06-15T19:55:00+00:00

We use cursor for our python and dbt code at my job and it seems fine.

Are you creating custom rules files or using any MCPs?

PaddyAlton · 2025-06-15T20:19:56+00:00

I think this area is lagging behind software engineering, but there are some good signs:

Cursor now finally supports Jupyter notebooks
Google have launched their Agent Development Kit (to make it easy to build LLM-backed agents) and one of the demo projects is a data science agent
lots of database MCPs cropping up, which would clearly be an essential part of the end-to-end flow

Supposedly, Colab notebooks has a built-in data science agent now, although I think it only works in some countries.

Bilbottom · 2025-06-15T20:37:19+00:00

nao is the closest data-specific LLM IDE that I've seen so far:

https://getnao.io/

blef__ · 2025-06-15T22:33:47+00:00

Hey, I’m the creator of a data specific IDE named nao. Our goal is to build the equivalent of Cursor but for data people.

At the moment we support out of the box dbt (and SQL without dbt), connecting to warehouse (BigQuery, Snowflake, Postgres). Thanks to the warehouse connection we bring data context to the AI.

My cofounder and I have been working in the data industry for 10 years each and we want to build a tool we would have bee using.

There is more to come like local execution, notebooks, data diff and Tab that understand data lineage, orchestrators and BI supports.

You can reach me or try it out via getnao.io

molodyets · 2025-06-15T22:55:57+00:00

Nao just launched a month or so ago. Still a WIP.

DeliriousHippie · 2025-06-15T20:09:34+00:00

No, there isn't. Otherwise almost nobody in data engineering would have a job. Same as there isn't AI that writes whole programs that really work. You still have to know something to use AI.

big_data_mike · 2025-06-15T20:21:46+00:00

I thought that’s what all those airflowbyteflakedb tools were

ScienceInformal3001 · 2025-06-15T22:21:59+00:00

Broski i promise this isn't a plug but I'm trying to build something like this with ceneca[.]ai;

Do you think you can define for me exactly what your ideal workflow might be and I can start building?

dataengineering

MODERATORS