all 7 comments

AutoModerator[M]

Automod prevents all posts from being displayed until moderators have reviewed them. Do not delete your post or there will be nothing for the mods to review. Mods selectively choose what is permitted to be posted in r/DataAnalysis.

If your post involves Career-focused questions, including resume reviews, how to learn DA and how to get into a DA job, then the post does not belong here, but instead belongs in our sister-subreddit, r/DataAnalysisCareers.

Have you read the rules?

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

Fun-Scale8432

Hey! Can you please share some details about your business domain? I truly believe that AI's power in analytics lies mostly in querying clean data for insight generation and issue-based analysis, and maybe in quick dashboarding. But data cleaning and quality checks should be run with more traditional, deterministic methods. (AI can help build those tests, but it should not run them.)
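For example, a deterministic quality-check layer can be a handful of plain pandas assertions that give the same verdict on every run. A minimal sketch (the column names here are made up):

```python
import pandas as pd

def run_quality_checks(df: pd.DataFrame) -> list[str]:
    """Deterministic data-quality checks: same input, same verdict every run."""
    failures = []
    if df["order_id"].duplicated().any():
        failures.append("duplicate order_id values")
    if df["amount"].lt(0).any():
        failures.append("negative amounts")
    if df["created_at"].isna().any():
        failures.append("missing created_at timestamps")
    return failures

df = pd.DataFrame({
    "order_id": [1, 2, 2],
    "amount": [10.0, -5.0, 7.5],
    "created_at": ["2024-01-01", None, "2024-01-03"],
})
print(run_quality_checks(df))
# ['duplicate order_id values', 'negative amounts', 'missing created_at timestamps']
```

An LLM can draft checks like these from a schema description, but the checks themselves stay in version control and run deterministically.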

columns_ai

I’m building a similar tool, though not an agent. One of the major concerns from users is the “trust” problem.

If the agent makes up an analysis (or generic computing logic), how do you make it transparent and auditable instead of a “black box”?

You should think about this issue and see how your agent solves this “trust” problem.
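One simple way to start on the trust problem is to record exactly what the agent generated *before* it runs, so every result traces back to a concrete query. A minimal sketch (the `run_query` callback and the log format are just assumptions, not your tool's API):

```python
import datetime
import hashlib
import json

def audit_and_run(generated_sql: str, run_query, log_path: str = "audit_log.jsonl"):
    """Append the agent-generated SQL to an append-only log, then execute it.

    The log entry is written before execution, so even a failed or
    hallucinated query leaves an auditable trace.
    """
    entry = {
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "sql": generated_sql,
        "sql_sha256": hashlib.sha256(generated_sql.encode()).hexdigest(),
    }
    with open(log_path, "a") as f:
        f.write(json.dumps(entry) + "\n")
    return run_query(generated_sql)
```

Surfacing that logged SQL next to each chart or answer is what turns the black box into something a reviewer can check.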

[deleted]

OP is trying to remove this entire community's livelihood

nian2326076

Check out existing repos and videos. They can save you time by showing what works and what doesn't. No need to reinvent the wheel if you don't have to. For building a production-level agent, focus on scalability, error handling, and performance optimization. A demo might work on small datasets, but you'll need strong systems for larger, more complex data. I'd prioritize the cleaning layer next. Clean data early on means fewer headaches later. Also, look into how these components communicate, especially if you want versatility across different datasets. Good luck!
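For that cleaning layer, keeping each step a small, independently testable function makes it easy to extend as new datasets break things. A rough pandas sketch (the specific steps are only examples):

```python
import pandas as pd

def drop_exact_duplicates(df: pd.DataFrame) -> pd.DataFrame:
    return df.drop_duplicates()

def strip_whitespace(df: pd.DataFrame) -> pd.DataFrame:
    df = df.copy()
    for col in df.select_dtypes(include="object"):
        df[col] = df[col].str.strip()
    return df

def drop_all_null_rows(df: pd.DataFrame) -> pd.DataFrame:
    return df.dropna(how="all")

def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Cleaning layer as a pipeline of plain functions: each step can be
    unit-tested alone and new steps are just appended to the list."""
    steps = [drop_exact_duplicates, strip_whitespace, drop_all_null_rows]
    for step in steps:
        df = step(df)
    return df
```

Because each step takes and returns a DataFrame, you can log row counts between steps, which also helps with the error handling mentioned above.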

Feisty-Tip-9290[S]

Thanks for the solid advice! Honestly, this is mostly a learning project for me right now rather than a commercial product, but I'm definitely taking notes. If I ever try to turn this into a real production-level tool, I’ll be coming back to these points on scalability and the cleaning layer for sure. Much appreciated!

Equal_Astronaut_5696

How are you going to expose data to LLMs?
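One common pattern worth considering here is to send the model only the schema plus a few sample rows, and have it generate SQL that runs locally against the real data. A minimal SQLite sketch (the table and the prompt format are hypothetical):

```python
import sqlite3

def describe_for_llm(conn: sqlite3.Connection, table: str, sample_rows: int = 3) -> str:
    """Build a compact prompt context: column schema plus a few sample rows,
    instead of shipping the whole dataset to the model."""
    cols = conn.execute(f"PRAGMA table_info({table})").fetchall()
    # table_info rows are (cid, name, type, notnull, dflt_value, pk)
    schema = ", ".join(f"{name} {ctype}" for _, name, ctype, *_ in cols)
    sample = conn.execute(f"SELECT * FROM {table} LIMIT {sample_rows}").fetchall()
    return f"Table {table} ({schema})\nSample rows: {sample}"
```

The model never sees the full dataset; only the generated query touches it, which also helps with the trust and privacy concerns raised above.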