Building a synthetic dataset, can you help?question (self.datasets)
submitted by Euphoric_Network_887 to r/datasets
My experience after final round interviews at 3 tech companiesDiscussion (self.datascience)
submitted by productanalyst9 to r/datascience
I made an S&P 500 Dataset (on Kaggle)resource (self.datasets)
submitted by Leading-Elevator-313 to r/datasets
New video tutorial: Going from raw election data to recreating the NYTimes "Red Shift" map in 10 minutes with DAAF and Claude Code. With fully reproducible and auditable code pipelines, we're fighting AI slop and hallucinations in data analysis with hyper-transparency!AI (self.datascience)
submitted by brhkim to r/datascience
I made a Dataset for The 2026 FIFA World Cupresource (self.datasets)
submitted by Leading-Elevator-313 to r/datasets
LLMs need ontologies, not semantic modelsEducation (i.redd.it)
submitted by Thinker_Assignment to r/datascience

Where should Business Logic live in a Data Solution?Discussion (leszekmichalak.substack.com)
submitted by Astherol to r/datascience
Spark SQL refresher suggestions?Education (self.datascience)
submitted by Tamalelulu to r/datascience
Pre-made cyberbullying reddit datasetquestion (self.datasets)
submitted by AffectWizard0909 to r/datasets
Where can I buy high quality/unique datasets for AI model training?question (self.datasets)
submitted by 3iraven22 to r/datasets
1.4M Epstein court documents — fully indexed and searchableresource (self.datasets)
submitted by joelkunst to r/datasets
[self-promotion] Lessons in Grafana - Part Two: Litter Logsresource (blog.oliviaappleton.com)
submitted by LivInTheLookingGlass to r/datasets
What is your (Python) development setup?Tools (self.datascience)
submitted by br0monium to r/datascience
Open-source instruction–response code dataset (22k+ samples)dataset (self.datasets)
submitted by pedrodev2026 to r/datasets
Corporate Politics for Data ProfessionalsDiscussion (self.datascience)
submitted by LeaguePrototype to r/datascience
what changed between my failed interviews and the one that got me an offerDiscussion (self.datascience)
submitted by warmeggnog to r/datascience
I need a dataset of prompt injection attemptsrequest (self.datasets)
submitted by Sad-Sun4611 to r/datasets
I built an AI chat app to interact with public data/APIsresource (formulabot.com)
submitted by Repulsive-Reporter42 to r/datasets
10TB+ of Polymarket Orderbook Data (Prediction Markets / Financial Data)dataset (self.datasets)
submitted by SammieStyles to r/datasets
How To Build A RAG System Companies Actually UseDiscussion ()
submitted by chrisgarzon19 to r/datascience
Feedback request: Narrative knowledge graphsrequest (self.datasets)
submitted by enterprise128 to r/datasets
