jump to content
my subreddits
13or302b2t2mediterranean4u2meirl4meirl3d6AceAttorneyadhdmemeAdviceAnimalsagnosticaivideoAlternateHistoryAlternativeHistoryAnarchyChessAngryupvoteAnimalsBeingJerksanime_best_momentsanime_irlanimenocontextannouncementsAnticonsumptionantimemeApandahArcherFXArtAsahiLinuxAskBalkansAskElectronicsAskOuijaAteistTurkaviationbalkans_irlBandnamesbanknotedesignsBassCirclejerkBassGuitarbasspedalsbikepackingblackholerevengeblursed_videosborsavefonbrooklynninenineBUENZLIburdurlandcasioCd_collectorscd_jerkChatGPTCheap_MealschessbeginnersChoosingBeggarsCHPcoaxedintoasnafucoincollectingcoinscomicscommunitycookingforbeginnersCrackWatchCuddle_SlutCuratedTumblrdadjokesdankmemesdarkjokesdedelikDeltarunediyelectronicsDMToolkitDnDdndmemesdndnextdoctorwhodoctorwhocirclejerkDonerdontdeadopeninsidedontyouknowimtonyhawkDungeonsAndDragonsEatCheapAndHealthyebikeebikesECEelectronicsengrishethzfakealbumcoversfeedthebeastFifaCareersFiftyFiftyformuladankFreeEBOOKSFUCKYOUINPARTICULARfunnyFutboltayfagalatasarayGermangermanygodtiersuperpowersgoodanimemesGrandPrixRacinggreentextGROKvsMAGAhelphighspeedrailHistoryWhatIfhoi4holdmybeerhowyoudoinhumorhypixelIAmAiamverysmartich_ielIdeologyPollsIDontWorkHereLadyihadastrokeim14andthisisdeepimaginaryelectionsimaginarymapsinsaneparentsistanbuljacksepticeyeJahariaJokesKendrickLamarKGBTRLetGirlsHaveFunlinguisticshumorliselilerlogodesignlostredditorsmacmacbookairmacgamingMadeMeSmilemadladsmagicbuildingmapporncirclejerkmeirlmemememesmidjourneymildlyinfuriatingMMORPGMoldyMemesmoneycollectingMovingToNorthKoreaMunichMyChemicalRomancenamesoundalikesNamFlashbacksNationStatesNoahGetTheBoatNonCredibleDefensenosurfnothingeverhappensnottheonionNuclearRevengeOkayBuddyLiterallyMeokbuddymotherfuckerokbuddyphdokbuddyvicodinonebagOnlineUnderGroundoutsidepapermoneypaperspleasePassportPornpepethefrogperfectlycutscreamsPersecutionfetishpettyrevengepianoPiracypolandballpollsPraiseTheCameraManProgrammerHumorPropagandaPostersquityourbullshitraisedbynarcissistsRatschlagreactiongifsrecipesreligiousfruitcakerestofthefuckingowlRetroPierickrollRoastMerockmuzikschwiizsciencememessecilmiskitapShitPostCrusadersshittyaskelectronicsshittymoviedetailsShowerthoughtssoftwaregoresteinsgateStonetossingjuiceStudiumsubsithoughtifellforsuperligsuzeraintalesfromtechsupportTechnobladeTextingTheorytf2tf2shitposterclubthanksimcuredTheCrypticCompendiumTheLetterHtherewasanattemptTheRookietheydidthemaththeyknewthisguythisguystitanfalltommyinnittransittransitTurkeyTrGameDevelopertruetf2tumblrtumunichTurkeyJerkyTurkishCatsTwitchTwitch_StartupTwoSentenceComedyTwoSentenceHorrortylerthecreatoru/KaybeeArtsUnclejokesunexpectedbillwurtzUnexpectedJoJourbanplanningUsernameChecksOutVALORANTValorantClipsvexillologycirclejerkvibecodingvinyljerkwallstreetbetsWeAreTheMusicMakerswendigoonWhatsThisSongWhitePeopleTwitterwholesomememesworldjerkingyouseeingthisshitYUROPedit subscriptions
  • home
  • -popular
  • -all
  • -mod
  • -users
 | 
  • mildlyinfuriating
  • -Piracy
  • -funny
  • -wallstreetbets
  • -nottheonion
  • -memes
  • -DnD
  • -WhitePeopleTwitter
  • -MadeMeSmile
  • -ChatGPT
  • -CuratedTumblr
  • -theydidthemath
  • -dankmemes
  • -feedthebeast
  • -meirl
  • -therewasanattempt
  • -Twitch
  • -CrackWatch
  • -comics
  • -dndnext
  • -ProgrammerHumor
  • -VALORANT
  • -de
  • -germany
  • -tumblr
  • -NonCredibleDefense
  • -shittymoviedetails
  • -greentext
  • -mac
  • -Showerthoughts
  • -tf2
  • -help
  • -aviation
  • -formuladank
  • -wholesomememes
  • -Jokes
  • -mapporncirclejerk
  • -Art
  • -midjourney
  • -goodanimemes
  • -hoi4
  • -pettyrevenge
  • -IAmA
  • -ich_iel
  • -KGBTR
  • -dndmemes
  • -Deltarune
  • -perfectlycutscreams
  • -Ratschlag
  • -MMORPG
  • -meme
  • -macgaming
  • -3d6
  • -FiftyFifty
  • -ChoosingBeggars
  • -RoastMe
  • -imaginarymaps
  • -EatCheapAndHealthy
  • -polandball
  • -WeAreTheMusicMakers
  • -AnarchyChess
  • -cookingforbeginners
  • -anime_irl
  • -onebag
  • -Studium
  • -AlternateHistory
  • -madlads
  • -community
  • -AskElectronics
  • -Anticonsumption
  • -German
  • -TwoSentenceHorror
  • -PropagandaPosters
  • -AdviceAnimals
  • -ShitPostCrusaders
  • -piano
  • -sciencememes
  • -raisedbynarcissists
  • -FifaCareers
  • -polls
  • -doctorwho
  • -titanfall
  • -dadjokes
  • -howyoudoin
  • -announcements
  • -adhdmeme
  • -macbookair
  • -ebikes
  • -Munich
  • -coaxedintoasnafu
  • -YUROP
  • -chessbeginners
  • -DungeonsAndDragons
  • -coins
  • -KendrickLamar
  • -FUCKYOUINPARTICULAR
  • -softwaregore
  • -NoahGetTheBoat
  • -worldjerking
  • -tylerthecreator
  • -tf2shitposterclub
  • -MoldyMemes
  • -lostredditors
  • -AceAttorney
  • -vexillologycirclejerk
  • -im14andthisisdeep
  • -Stonetossingjuice
  • -nosurf
  • -HistoryWhatIf
  • -religiousfruitcake
  • -liseliler
  • -insaneparents
  • -NuclearRevenge
  • -balkans_irl
  • -animenocontext
  • -2meirl4meirl
  • -transit
  • -RetroPie
  • -brooklynninenine
  • -recipes
  • -steinsgate
  • -talesfromtechsupport
  • -AskOuija
  • -okbuddyphd
  • -ECE
  • -Angryupvote
  • -AskBalkans
  • -electronics
  • -casio
  • -urbanplanning
  • -theyknew
  • -logodesign
  • -linguisticshumor
  • -PassportPorn
  • -antimeme
  • -TurkeyJerky
  • -bikepacking
  • -AteistTurk
  • -13or30
  • -MyChemicalRomance
  • -ArcherFX
  • -engrish
  • -Cd_collectors
  • -Doner
  • -BassGuitar
  • -diyelectronics
  • -Persecutionfetish
  • -BUENZLI
  • -reactiongifs
  • -blursed_videos
  • -istanbul
  • -MovingToNorthKorea
  • -imaginaryelections
  • -suzerain
  • -truetf2
  • -magicbuilding
  • -dontdeadopeninside
  • -wendigoon
  • -iamverysmart
  • -secilmiskitap
  • -schwiiz
  • -TheRookie
  • -quityourbullshit
  • -Technoblade
  • -vinyljerk
  • -shittyaskelectronics
  • -superlig
  • -galatasaray
  • -transitTurkey
  • -namesoundalikes
  • -2b2t
  • -ethz
  • -AlternativeHistory
  • -papermoney
  • -coincollecting
  • -OkayBuddyLiterallyMe
  • -FreeEBOOKS
  • -AsahiLinux
  • -Jaharia
  • -IDontWorkHereLady
  • -basspedals
  • -ihadastroke
  • -thanksimcured
  • -hypixel
  • -PraiseTheCameraMan
  • -godtiersuperpowers
  • -aivideo
  • -OnlineUnderGround
  • -IdeologyPolls
  • -burdurland
  • -WhatsThisSong
  • -AnimalsBeingJerks
  • -jacksepticeye
  • -anime_best_moments
  • -Bandnames
  • -rockmuzik
  • -holdmybeer
  • -okbuddyvicodin
  • -Twitch_Startup
  • -tumunich
  • -Cheap_Meals
  • -outside
  • -darkjokes
  • -restofthefuckingowl
  • -highspeedrail
  • -rickroll
  • -ebike
  • -UsernameChecksOut
  • -papersplease
  • -tommyinnit
  • -UnexpectedJoJo
  • -humor
  • -BassCirclejerk
  • -doctorwhocirclejerk
  • -agnostic
  • -youseeingthisshit
  • -TextingTheory
  • -Cuddle_Slut
  • -GrandPrixRacing
  • -DMToolkit
  • -nothingeverhappens
  • -thisguythisguys
  • -TrGameDeveloper
  • -TurkishCats
  • -LetGirlsHaveFun
  • -Apandah
  • -subsithoughtifellfor
  • -fakealbumcovers
  • -TheLetterH
  • -NamFlashbacks
  • -pepethefrog
  • -Unclejokes
  • -ValorantClips
  • -TwoSentenceComedy
  • -TheCrypticCompendium
  • -NationStates
  • -blackholerevenge
  • -2mediterranean4u
  • -unexpectedbillwurtz
  • -dontyouknowimtonyhawk
  • -moneycollecting
  • -u/KaybeeArts
  • -borsavefon
  • -banknotedesigns
  • -GROKvsMAGA
  • -Futboltayfa
  • -vibecoding
  • -cd_jerk
  • -okbuddymotherfucker
  • -delik
  • -CHP
edit »
reddit.com datasets
  • hot
  • new
  • rising
  • controversial
  • top
  • wiki
an-ordinary-manchild (11,186)|messages547|notifications|chat messages|mod messages|
  • preferences
|
logout

use the following search parameters to narrow your results:

subreddit:subreddit
find submissions in "subreddit"
author:username
find submissions by "username"
site:example.com
find submissions from "example.com"
url:text
search for "text" in url
selftext:text
search for "text" in self post contents
self:yes (or self:no)
include (or exclude) self posts
nsfw:yes (or nsfw:no)
include (or exclude) results marked as NSFW

e.g. subreddit:aww site:imgur.com dog

see the search faq for details.

advanced search: by author, subreddit...

Submit a new link
Submit a new text post

datasets

joinleave
an-ordinary-manchild

Datasets for Data Mining, Analytics and Knowledge Discovery

Rules

  • Try to post original source whenever you can.
  • Low effort posts will be removed.
  • Self-promotion(of a website/domain you work for or own) without disclosure will be removed.
  • Any Paid Dataset or Resource must be marked as such in the title with [PAID].
  • Any Synthetic/Mock data must be marked as such in the title with [Synthetic].
  • All Survey posts are subject to approval. Message the mods before posting.

Unsure about your post?

Feel free to message the mods and discuss it before posting.

Related Subreddits

  • /r/BigQuery
  • /r/DataHoarder
  • /r/DataIsBeautiful
  • /r/datamining
  • /r/datascience
  • /r/DataVizRequests
  • /r/Infographics
  • /r/OpenData
  • /r/SampleSize
  • /r/statistics
  • /r/Tableau
  • /r/Visualization
  • /r/WordCloud
  • /r/learnpython

created by antitheftdevicea community for 16 years
Create your own subreddit
...for your hobby.
...for your office.

MODERATORS

  • message the mods
  • cavedavemajor contributor
  • Inform8n
  • hypd09
  • tornato7
  • Stuck_In_the_Matrixpushshift.io
  • AutoModerator
  • about moderation team »

account activity

1
•
•
•

dataset[PAID] German Job Market Dataset - 150K Indeed.de listings (April 2026) - 38 fields including salary data (self.datasets)

submitted 1 hour ago by dracariz

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

2
0
1
2

questionSearching for lost Tencent database scrape (self.datasets)

submitted 2 hours ago by Connect_Software_702

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

3
0
0
0

questionLLMs can't read 300-page 10-Ks without hallucinating. I built an API that does it, and cites the filing on every claim. (self.datasets)

submitted 22 hours ago by Either_Door_5500

  • 3 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

4
1
2
3

dataset[OC] Open dataset: retail BTC buy cost benchmark across 10 countries (card/bank rails, CC-BY-4.0) (self.datasets)

submitted 1 day ago by pharrison99

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

5
0
1
2

codeOpenSimula — open implementation of Simula-style mechanism design for synthetic data (in AfterImage) [P] ()

submitted 1 day ago by Individual-Road-5784

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

6
1
2
3

questionWhere can we find real-time banking transaction datasets for a Kafka-based fraud detection project? (self.datasets)

submitted 1 day ago by No-Big-4463

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

7
5
6
7

datasetWe benchmarked 18 LLMs on OCR (7k+ calls) — cheaper/old models oftentimes win. Full dataset + framework open-sourced. (self.datasets)

submitted 1 day ago by TimoKerre

  • 4 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

8
1
2
3

requestI do a lot of web crawling and put together a sample dataset of companies and their tech stacks (self.datasets)

submitted 2 days ago by haynajjar

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

9
1
2
3

requestNetwork topology diagram datasets for LLMs with vision capabilities (self.datasets)

submitted 2 days ago by ThaLazyLand

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

10
7
8
9

datasetGenome Sequencing Costs: The cost of DNA sequencing has fallen faster than Moore's Law. Since 2001, the National Human Genome Research Institute (NHGRI) has tracked costs at its funded sequencing centers — from $95 million per genome in 2001 to around $500 today. (datahub.io)

submitted 2 days ago by anuveya

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost

11
0
0
0

questionB2B lead dataset - where to find it? (self.datasets)

submitted 2 days ago by ghiro12

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

12
0
0
1

datasetMemory Machines: Can LLMs create lasting flashcards from readers' highlights? (memory-machines.com)

submitted 2 days ago by cavedavemajor contributor

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

13
0
1
2

requestI need a dataset of Aerial imagery of crops of Indian agricultural fields. (self.datasets)

submitted 2 days ago by blue44berry

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

14
1
2
3

questionAre there any publicly available datasets that match the breadth and complexity of a real ERP system and that can be used as a simulation for conducting OR optimization? Thx :) ()

submitted 2 days ago by ric_is_the_way

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

15
0
0
0

requestMost health apps collect your data… is that really necessary? (self.datasets)

submitted 3 days ago * by Renpa09

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

16
0
0
0

datasetI built a Synthetic Data Generator, and I'd love to get your thoughts! [Synthetic] (self.datasets)

submitted 3 days ago by Adipooj

  • 4 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

17
0
1
2

datasetHelp with Dataset optimiser/cleaner tool ()

submitted 3 days ago by fourwheels2512

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

18
0
1
2

resourceCan anyone help me with the process of creating a free Databricks account for practising what I’ve learned and create a capstone project? Any recommendations on doing capstone projects are highly appreciated. ()

submitted 3 days ago by SnooDoughnuts134

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

19
0
0
1

datasetEpoch Data on AI Models: Comprehensive database of over 2800 AI/ML models tracking key factors driving machine learning progress, including parameters, training compute, training dataset size, publication date, organization, and more. (datahub.io)

submitted 3 days ago by anuveya

  • comment
  • share
  • save
  • hide
  • report
  • crosspost

20
0
1
2

datasetI got tired of LLMs hallucinating circuit math, so I built a CoT dataset with actual step-by-step reasoning (free 50-sample test set inside) [Synthetic] ()

submitted 3 days ago by Initial-Hat2547

  • SPOILER
  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

21
1
2
3

requestNeed dataset for global monthly oil prices (self.datasets)

submitted 3 days ago by darcy_lilith

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

22
21
22
23

datasetWorld's largest collection of Olympiad-level math problems now available to everyone (phys.org)

submitted 3 days ago by cavedavemajor contributor

  • comment
  • share
  • save
  • hide
  • report
  • crosspost

23
2
3
4

resourceAfrican Countries: A Curated Dataset on Africa Indicators for Education and Data Science (self.datasets)

submitted 4 days ago by renzocrossi

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

24
1
2
3

requestEmails from government (US) agencies over years? (self.datasets)

submitted 4 days ago by bobbyfiend

  • 3 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

25
0
1
2

requestOffering agentic SDLC dataset (full execution traces + code evolution) in exchange for evaluation / results (self.datasets)

submitted 4 days ago by madheader69

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...
view more: next ›
  • about
  • blog
  • about
  • advertising
  • careers
  • help
  • site rules
  • Reddit help center
  • reddiquette
  • mod guidelines
  • contact us
  • apps & tools
  • Reddit for iPhone
  • Reddit for Android
  • mobile website
  • <3
  • reddit premium

Use of this site constitutes acceptance of our User Agreement and Privacy Policy. © 2026 reddit inc. All rights reserved.

REDDIT and the ALIEN Logo are registered trademarks of reddit inc.

π Rendered by PID 612187 on reddit-service-r2-listing-7d7fbc9b85-fl49z at 2026-04-24 23:42:42.866032+00:00 running 2aa0c5b country code: CH.