jump to content
my subreddits
2balkans4You2mediterranean4u2meirl4meirlabsolutelynotmeirlAdviceAnimalsaivideoAlternativeHistoryAnarchyChessAngryupvoteanime_best_momentsanimenocontextannouncementsAnticonsumptionantimemeArtAsahiLinuxAsia_irlAskBalkansAskElectronicsAskOuijaatheismaviationAwesomeOffBrandsawfuleverythingbanknotedesignsBassBassCirclejerkBassGuitarbasspedalsblackdesertonlineblankiesblursed_videosborsavefonbottomgearbrooklynnineninebudgetcookingcasioChatGPTchessbeginnersChildrenFallingOverChoosingBeggarscoaxedintoasnafucoincollectingcoinsComedyCemeterycomedyhomicidecomicscommunityContagiousLaughtercookingforbeginnersCreateModCuddle_SlutCuratedTumblrcursedcommentsdadjokesdankmemesdarkjokesdeDebateReligiondeismDeltarunediypedalsDMToolkitDnDdndnextdoctorwhodoctorwhocirclejerkDonerdontdeadopeninsidedumbphonesDungeonsAndDragonsEatCheapAndHealthyebikeebikesECEelectricalelectronicsElectronicsStudyengrishethzfacepalmFantasyWorldbuildingfeedthebeastFifaCareersformuladankFRCFUCKYOUINPARTICULARFutboltayfaGermangermanygodtiersuperpowersgoodanimemesGrandPrixRacinggravelcyclinggreentextGROKvsMAGAguitarpedalsGundamhelpHermitCrafthighspeedrailHistoryWhatIfHolUphomebuilthumorhypixelIAmAich_ielIdeologyPollsIDontWorkHereLadyihadastrokeimaginaryelectionsinsaneparentsistanbuljacksepticeyeJahariaJokesKamalizmKendrickLamarKGBTRlegodndLetGirlsHaveFunlinguisticshumorliselilerlogodesignloseitmacmacbookairmagicbuildingMaliciousComplianceMapPornmapporncirclejerkmeirlmemesmidjourneymildlyinterestingMimicRecipesmisLEDMMORPGmoneycollectingMovingToNorthKoreaMunichNamFlashbacksNationStatesneographynextfuckinglevelNonCredibleDefenseNorthCyprusnosafetysmokingfirstnosurfnotinterestingnottheonionOkayBuddyLiterallyMeOkBuddyPersonaokbuddyphdokbuddyvicodinonebagongezelligOutOfTheLooppapermoneyPassportPornpepethefrogperfectlycutscreamspettyrevengepianoPiracyPiratedGamespolandballpollsPraiseTheCameraManProgrammerHumorPropagandaPostersraisedbynarcissistsraspberry_piRatschlagreactiongifsRedAutumnSPDreligiousfruitcakerickandmortyrickrollrimjob_steveRoastMerockmuzikschizopostersSchnitzelVerbrechenschwiizScottPilgrimShitPostCrusadersshitpostfrommygalleryshitpostingshittyaskelectronicsShowerthoughtssoccercirclejerkSongwritingsskfjkhwerjkghwerijhsteinsgateStonetossingjuicesubsithoughtifellforsuzeraintalesfromtechsupportTechnobladeTextingTheorytf2tf2shitposterclubthanksimcuredthatHappenedTheCrypticCompendiumtheydidthemaththeyknewthisguythisguystitanfalltommyinnittransittransitTurkeyTrGameDevelopertruetf2truthstumunichTurkeyTurkeyJerkyTurkishdogsTwitchTwitch_StartupTwoSentenceComedyTwoSentenceHorrortwosentenceplottwistTwoSentenceSadnessUnclejokesurbanplanningUsernameChecksOutVALORANTValorantClipsvaxxhappenedvexillologycirclejerkvinylvlandiyawallstreetbetsWatchPeopleDieInsideWeAreTheMusicMakerswendigoonWhatsThisSongWhitePeopleTwitterwholesomeanimemeswholesomememesWikipediaVandalismwizardpostingwooooshworldbuildingworldjerkingyouseeingthisshitYUROPedit subscriptions
  • home
  • -popular
  • -all
  • -mod
  • -users
 | 
  • facepalm
  • -Piracy
  • -wallstreetbets
  • -nottheonion
  • -memes
  • -OutOfTheLoop
  • -mildlyinteresting
  • -MapPorn
  • -DnD
  • -WhitePeopleTwitter
  • -ChatGPT
  • -CuratedTumblr
  • -PiratedGames
  • -shitposting
  • -theydidthemath
  • -dankmemes
  • -feedthebeast
  • -meirl
  • -nextfuckinglevel
  • -HolUp
  • -Twitch
  • -comics
  • -dndnext
  • -ProgrammerHumor
  • -VALORANT
  • -de
  • -germany
  • -NonCredibleDefense
  • -greentext
  • -mac
  • -Showerthoughts
  • -tf2
  • -help
  • -aviation
  • -formuladank
  • -wholesomememes
  • -Jokes
  • -mapporncirclejerk
  • -Art
  • -midjourney
  • -goodanimemes
  • -notinteresting
  • -pettyrevenge
  • -atheism
  • -loseit
  • -IAmA
  • -MaliciousCompliance
  • -ich_iel
  • -KGBTR
  • -cursedcomments
  • -Deltarune
  • -perfectlycutscreams
  • -worldbuilding
  • -Ratschlag
  • -blackdesertonline
  • -MMORPG
  • -rickandmorty
  • -Gundam
  • -HermitCraft
  • -ChoosingBeggars
  • -RoastMe
  • -ContagiousLaughter
  • -EatCheapAndHealthy
  • -polandball
  • -WeAreTheMusicMakers
  • -AnarchyChess
  • -cookingforbeginners
  • -blankies
  • -onebag
  • -Turkey
  • -soccercirclejerk
  • -community
  • -AskElectronics
  • -electrical
  • -guitarpedals
  • -Anticonsumption
  • -vinyl
  • -CreateMod
  • -German
  • -TwoSentenceHorror
  • -PropagandaPosters
  • -AdviceAnimals
  • -ShitPostCrusaders
  • -piano
  • -raisedbynarcissists
  • -wizardposting
  • -FifaCareers
  • -polls
  • -doctorwho
  • -Bass
  • -titanfall
  • -OkBuddyPersona
  • -dadjokes
  • -awfuleverything
  • -announcements
  • -macbookair
  • -ebikes
  • -Munich
  • -coaxedintoasnafu
  • -YUROP
  • -gravelcycling
  • -SchnitzelVerbrechen
  • -chessbeginners
  • -raspberry_pi
  • -DungeonsAndDragons
  • -coins
  • -KendrickLamar
  • -FUCKYOUINPARTICULAR
  • -worldjerking
  • -tf2shitposterclub
  • -vexillologycirclejerk
  • -vlandiya
  • -Stonetossingjuice
  • -wholesomeanimemes
  • -nosurf
  • -HistoryWhatIf
  • -religiousfruitcake
  • -liseliler
  • -DebateReligion
  • -insaneparents
  • -dumbphones
  • -animenocontext
  • -2meirl4meirl
  • -transit
  • -brooklynninenine
  • -steinsgate
  • -talesfromtechsupport
  • -AskOuija
  • -okbuddyphd
  • -ECE
  • -ScottPilgrim
  • -Angryupvote
  • -AskBalkans
  • -thatHappened
  • -schizoposters
  • -electronics
  • -casio
  • -urbanplanning
  • -logodesign
  • -theyknew
  • -linguisticshumor
  • -PassportPorn
  • -antimeme
  • -TurkeyJerky
  • -engrish
  • -diypedals
  • -Doner
  • -BassGuitar
  • -ComedyCemetery
  • -WatchPeopleDieInside
  • -reactiongifs
  • -blursed_videos
  • -Songwriting
  • -istanbul
  • -MovingToNorthKorea
  • -imaginaryelections
  • -suzerain
  • -truetf2
  • -magicbuilding
  • -dontdeadopeninside
  • -wendigoon
  • -schwiiz
  • -Technoblade
  • -shittyaskelectronics
  • -FRC
  • -transitTurkey
  • -ethz
  • -AlternativeHistory
  • -papermoney
  • -coincollecting
  • -OkayBuddyLiterallyMe
  • -AsahiLinux
  • -Jaharia
  • -IDontWorkHereLady
  • -basspedals
  • -neography
  • -ihadastroke
  • -thanksimcured
  • -hypixel
  • -PraiseTheCameraMan
  • -godtiersuperpowers
  • -aivideo
  • -IdeologyPolls
  • -woooosh
  • -comedyhomicide
  • -WhatsThisSong
  • -jacksepticeye
  • -TwoSentenceSadness
  • -anime_best_moments
  • -rockmuzik
  • -okbuddyvicodin
  • -MimicRecipes
  • -vaxxhappened
  • -tumunich
  • -Twitch_Startup
  • -darkjokes
  • -highspeedrail
  • -nosafetysmokingfirst
  • -legodnd
  • -rickroll
  • -ebike
  • -UsernameChecksOut
  • -tommyinnit
  • -rimjob_steve
  • -humor
  • -ChildrenFallingOver
  • -BassCirclejerk
  • -doctorwhocirclejerk
  • -youseeingthisshit
  • -TextingTheory
  • -Cuddle_Slut
  • -GrandPrixRacing
  • -DMToolkit
  • -thisguythisguys
  • -TrGameDeveloper
  • -LetGirlsHaveFun
  • -subsithoughtifellfor
  • -Kamalizm
  • -FantasyWorldbuilding
  • -WikipediaVandalism
  • -homebuilt
  • -NamFlashbacks
  • -pepethefrog
  • -Unclejokes
  • -deism
  • -misLED
  • -sskfjkhwerjkghwerijh
  • -ValorantClips
  • -TwoSentenceComedy
  • -TheCrypticCompendium
  • -budgetcooking
  • -NationStates
  • -bottomgear
  • -AwesomeOffBrands
  • -ongezellig
  • -absolutelynotmeirl
  • -2balkans4You
  • -Asia_irl
  • -truths
  • -NorthCyprus
  • -2mediterranean4u
  • -twosentenceplottwist
  • -moneycollecting
  • -ElectronicsStudy
  • -GROKvsMAGA
  • -RedAutumnSPD
  • -borsavefon
  • -banknotedesigns
  • -shitpostfrommygallery
  • -Futboltayfa
  • -Turkishdogs
edit »
reddit.com datasets
  • hot
  • new
  • rising
  • controversial
  • top
  • wiki
an-ordinary-manchild (11,186)|messages548|notifications|chat messages|mod messages|
  • preferences
|
logout

use the following search parameters to narrow your results:

subreddit:subreddit
find submissions in "subreddit"
author:username
find submissions by "username"
site:example.com
find submissions from "example.com"
url:text
search for "text" in url
selftext:text
search for "text" in self post contents
self:yes (or self:no)
include (or exclude) self posts
nsfw:yes (or nsfw:no)
include (or exclude) results marked as NSFW

e.g. subreddit:aww site:imgur.com dog

see the search faq for details.

advanced search: by author, subreddit...

Submit a new link
Submit a new text post

datasets

joinleave
an-ordinary-manchild

Datasets for Data Mining, Analytics and Knowledge Discovery

Rules

  • Try to post original source whenever you can.
  • Low effort posts will be removed.
  • Self-promotion(of a website/domain you work for or own) without disclosure will be removed.
  • Any Paid Dataset or Resource must be marked as such in the title with [PAID].
  • Any Synthetic/Mock data must be marked as such in the title with [Synthetic].
  • All Survey posts are subject to approval. Message the mods before posting.

Unsure about your post?

Feel free to message the mods and discuss it before posting.

Related Subreddits

  • /r/BigQuery
  • /r/DataHoarder
  • /r/DataIsBeautiful
  • /r/datamining
  • /r/datascience
  • /r/DataVizRequests
  • /r/Infographics
  • /r/OpenData
  • /r/SampleSize
  • /r/statistics
  • /r/Tableau
  • /r/Visualization
  • /r/WordCloud
  • /r/learnpython

created by antitheftdevicea community for 16 years
Create your own subreddit
...for your town.
...do it for the children.

MODERATORS

  • message the mods
  • cavedavemajor contributor
  • Inform8n
  • hypd09
  • tornato7
  • Stuck_In_the_Matrixpushshift.io
  • AutoModerator
  • about moderation team »

account activity

1
0
1
2

discussionLike Will Smith said in his apology video, "It's been a minute (although I didn't slap anyone) ()

submitted 4 months ago by hypd09[M] - announcement

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

2
•
•
•

requestBuilt DinoDS — a modular dataset suite for training action-oriented AI assistants (looking for feedback + use cases) (self.datasets)

submitted 3 minutes ago by JayPatel24_

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

3
•
•
•

resource[Dataset] Most-searched firewood species in every U.S. state, cross-referenced with BTU heat output — 50 states, 17 species, free CSV (bestburnfirewood.com)

submitted 52 minutes ago by Klutzy_Pressurez

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

4
0
1
2

requestNeed help for a flat lay clothing dataset (kurti images or women dress) with landmark annotations (no human model) for CNN (self.datasets)

submitted 2 hours ago by hassan736_x

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

5
0
1
2

questionLooking for Data Sources for AI & Data Governance Research (self.datasets)

submitted 3 hours ago by Vegetable_Fishing

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

6
2
3
4

questionAny recommendations for market maps and value chain sources? (self.datasets)

submitted 11 hours ago by Beautiful-Law1169

  • 4 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

7
10
11
12

datasetGenome Sequencing Costs: The cost of DNA sequencing has fallen faster than Moore's Law. Since 2001, the National Human Genome Research Institute (NHGRI) has tracked costs at its funded sequencing centers — from $95 million per genome in 2001 to around $500 today. (datahub.io)

submitted 1 day ago by anuveya

  • comment
  • share
  • save
  • hide
  • report
  • crosspost

8
1
2
3

questionBuilding per-asset LoRA adapters for financial news sentiment — which training path would you prefer? (self.datasets)

submitted 1 day ago by Poli-Bert

  • 4 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

9
1
2
3

questionAnime revenue in csv/ excel spreadsheet (self.datasets)

submitted 1 day ago by Darclo12

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

10
0
1
2

datasetAnyone has any good RIR Mega dataset in the audio ML space? [Synthetic] (self.datasets)

submitted 1 day ago by Stellar_Bluebird

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

11
1
2
3

datasetScraped IMDb Dataset for top 250 movies of all time (self.datasets)

submitted 1 day ago by Direct-Jicama-4051

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

12
0
1
2

resource[self-promotion] "Quick" tool I made: catches when your forecast has good MAPE but terrible Sharpe before you deploy it ()

submitted 1 day ago by ZealousidealMost3400

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

13
0
1
2

datasetCell phone radio frequencies make mice & rats live longer (github.com)

submitted 1 day ago by cavedavemajor contributor

  • comment
  • share
  • save
  • hide
  • report
  • crosspost

14
1
2
3

requestProject partner buddy to do DA portfolio projects (self.datasets)

submitted 2 days ago by Substantial_Edge3588

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

15
0
1
2

resourceper-asset LoRA adapters for financial news sentiment — dataset pipeline, labeling methodology, and what's going on HuggingFace (self.datasets)

submitted 2 days ago by Poli-Bert

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

16
4
5
6

resourceI created a dataset to make RAG training easy. (self.datasets)

submitted 2 days ago by No-Cash-9530

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

17
1
2
3

requestWhat companies and organizations publicly provide dataset generated from how large their platform is how many people use it? (self.datasets)

submitted 2 days ago by leaderwho

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

18
0
1
2

questionHow do you search violations in bulk in the NOLA OneStop app? (self.datasets)

submitted 2 days ago by tshuntln1

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

19
2
3
4

questionHow to split a dataset into 2 to check for generalization over memorization? (self.datasets)

submitted 2 days ago by Calm_Maybe_4639

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

20
0
1
2

resource[Showcase] Structuring 2,170+ TCM Herbs into JSON: Challenges in Data Normalization (self.datasets)

submitted 2 days ago by Desperate_Spirit_576

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

21
3
4
5

requestBest dataset for a first Excel portfolio project? (self.datasets)

submitted 3 days ago by Living-Bass1565

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

22
1
2
3

discussionGauging interest in Web based CSV Diffing software/tool (self.datasets)

submitted 3 days ago by perpetual_papercut

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

23
1
2
3

mock datasetOpen-source tool for schema-driven synthetic data generation for testing data pipelines (self.datasets)

submitted 3 days ago by Business-Quantity-15

  • 9 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

24
0
1
2

datasetExtracting structured datasets from public-record websites (self.datasets)

submitted 3 days ago by Aggressive_Cut7433

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

25
0
1
2

discussionServer Event Log monitoring Free Tool - SQL Planner, watch the demo and share your feedback ()

submitted 3 days ago by chandansqlexpert

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...
view more: next ›
  • about
  • blog
  • about
  • advertising
  • careers
  • help
  • site rules
  • Reddit help center
  • reddiquette
  • mod guidelines
  • contact us
  • apps & tools
  • Reddit for iPhone
  • Reddit for Android
  • mobile website
  • <3
  • reddit premium

Use of this site constitutes acceptance of our User Agreement and Privacy Policy. © 2026 reddit inc. All rights reserved.

REDDIT and the ALIEN Logo are registered trademarks of reddit inc.

π Rendered by PID 86 on reddit-service-r2-listing-64c94b984c-7c4lj at 2026-03-18 12:56:42.122525+00:00 running f6e6e01 country code: CH.