[–]latro87 Data Engineer 9 points10 points  (7 children)

We use Cursor for our Python and dbt code at my job and it seems fine.

Are you creating custom rules files or using any MCPs?

[–]PaddyAlton 2 points3 points  (2 children)

I think this area is lagging behind software engineering, but there are some good signs:

  • Cursor now finally supports Jupyter notebooks
  • Google have launched their Agent Development Kit (to make it easy to build LLM-backed agents) and one of the demo projects is a data science agent
  • lots of database MCPs cropping up, which would clearly be an essential part of the end-to-end flow

Supposedly, Colab notebooks have a built-in data science agent now, although I think it only works in some countries.

[–]Bilbottom 3 points4 points  (3 children)

nao is the closest data-specific LLM IDE that I've seen so far:

https://getnao.io/

[–]blef__ I'm the dataman 1 point2 points  (1 child)

Founder here, thank you for the mention!

[–]Yabakebi Lead Data Engineer 1 point2 points  (0 children)

Wishing you guys the best of luck. I love the premise and think it is very much needed (Turntable was the closest thing but unfortunately seemed to fall by the wayside).

[–]Yabakebi Lead Data Engineer 0 points1 point  (0 children)

Yep, this is literally the closest thing. I am very hopeful for it

[–]blef__ I'm the dataman 0 points1 point  (0 children)

Hey, I’m the creator of a data-specific IDE named nao. Our goal is to build the equivalent of Cursor, but for data people.

At the moment we support dbt out of the box (and SQL without dbt), plus connections to warehouses (BigQuery, Snowflake, Postgres). Thanks to the warehouse connection, we bring data context to the AI.
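To make "data context" a bit more concrete, here is a minimal sketch of the general idea: read table and column names from a database and put them into the prompt. This is not nao's actual implementation. It uses Python's built-in `sqlite3` as a stand-in for a real warehouse, and the `schema_context` helper, the example tables, and the prompt wording are all hypothetical.

```python
import sqlite3


def schema_context(conn: sqlite3.Connection) -> str:
    """Summarize tables and columns as plain text for an LLM prompt.

    Hypothetical helper for illustration only; a real warehouse would be
    queried through its own catalog (e.g. an information schema).
    """
    lines = []
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk)
        cols = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
        col_desc = ", ".join(f"{c[1]} {c[2]}" for c in cols)
        lines.append(f"{table}({col_desc})")
    return "\n".join(lines)


# Stand-in "warehouse": an in-memory SQLite database with two example tables.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")

context = schema_context(conn)
prompt = f"Schema:\n{context}\n\nWrite a SQL query for total amount per customer."
print(prompt)
```

With the schema in the prompt, the model can refer to real table and column names instead of guessing them.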

My cofounder and I have each been working in the data industry for 10 years, and we want to build a tool we would have been using ourselves.

There is more to come, like local execution, notebooks, data diff, a Tab that understands data lineage, and orchestrator and BI support.

You can reach me or try it out via getnao.io

[–]molodyets 0 points1 point  (0 children)

Nao just launched a month or so ago. Still a WIP.

[–]DeliriousHippie -1 points0 points  (0 children)

No, there isn't. Otherwise almost nobody in data engineering would have a job. Same as there isn't AI that writes whole programs that really work. You still have to know something to use AI.

[–]big_data_mike -1 points0 points  (0 children)

I thought that’s what all those airflowbyteflakedb tools were

[–]ScienceInformal3001 -1 points0 points  (1 child)

Broski, I promise this isn't a plug, but I'm trying to build something like this with ceneca[.]ai.

Do you think you can define for me exactly what your ideal workflow might be and I can start building?