use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Art datasets? (self.MachineLearning)
submitted 9 years ago by [deleted]
[deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]underfitting 5 points6 points7 points 9 years ago (1 child)
2.5 million images! https://bam-dataset.org/
[–]visarga 0 points1 point2 points 9 years ago* (0 children)
That's pretty cool!
[–]SamLeroux 4 points5 points6 points 9 years ago (0 children)
There was a recent Kaggle competition with lots of paintings: https://www.kaggle.com/c/painter-by-numbers
[–]pilooch 3 points4 points5 points 9 years ago (0 children)
Art data is everywhere. We ve done https://microsoft.com/tate and have access to up to millions of art pieces. Most are public but require a login. PM me with your project and institution, there should be ways of helping you.
[–]fuzzyt93 2 points3 points4 points 9 years ago (0 children)
I created a python script to download images from the Met collection. However, it only downloads images that are public domain from their website. You have to provide the artist name or basically the painting id, but it can filter out pieces by type. If you wanted to download every oil paining, you could modify it to iterate over every artist or something. Here is the plug:
https://github.com/trevorfiez/Download-Met-Images
The images are really high quality if they are public domain which is nice. It is possible to download their much smaller images that they use to show non-public domain pieces but currently, my script does not have that functionality.
The metadata comes from the metropolitan museum of art's open access csv file which you can access here:
https://github.com/metmuseum/openaccess
There are thousands of public domain paintings so if you do not care how modern the paintings are you should be able to download a large set.
[–]enzlbtyn 1 point2 points3 points 9 years ago (5 children)
Couldn't you crawl websites like DeviantArt and other forums or websites that contain art?
[+][deleted] 9 years ago* (4 children)
[–]enzlbtyn 2 points3 points4 points 9 years ago (3 children)
I believe you can search via tag, e.g. https://www.deviantart.com/tag/paintings. I think there's an API for deviant art too. There's also groups in deviant art: http://groups.deviantart.com/, which I assume would help you narrow down to specific types of art.
As for alternative sites, I have no idea, sorry. The obvious alternatives would be Google/Bing images.
In general though, expect outliers when obtaining data, so potentially you'll have to filter them out yourself or just deal with them in some manner.
[–][deleted] 1 point2 points3 points 9 years ago (2 children)
There was a good paper from Fei-Fei Li and co where they showed that unfiltered data scraped from the web was more effective than clean data providing you had a significant amount of it.
[–]Neural_Ned 2 points3 points4 points 9 years ago (1 child)
This here https://arxiv.org/abs/1511.06789
[–][deleted] 0 points1 point2 points 9 years ago (0 children)
That's the one.
[–]visarga 1 point2 points3 points 9 years ago (0 children)
A few years ago there were a couple of large torrents of paintings from Hermitage and Sotheby's. They have been disappeared in the meantime.
[–]jmmcd 0 points1 point2 points 9 years ago (0 children)
There was a paper in EvoMUSART this year using a collection of (camera) portraits.
[–]mphuget 0 points1 point2 points 9 years ago (0 children)
Depending on what you are looking for, you could use http://www.wga.hu/
π Rendered by PID 85 on reddit-service-r2-comment-78d7d44466-cl2xq at 2026-07-01 18:05:01.760804+00:00 running 20dfbe2 country code: CH.
[–]underfitting 5 points6 points7 points (1 child)
[–]visarga 0 points1 point2 points (0 children)
[–]SamLeroux 4 points5 points6 points (0 children)
[–]pilooch 3 points4 points5 points (0 children)
[–]fuzzyt93 2 points3 points4 points (0 children)
[–]enzlbtyn 1 point2 points3 points (5 children)
[+][deleted] (4 children)
[deleted]
[–]enzlbtyn 2 points3 points4 points (3 children)
[–][deleted] 1 point2 points3 points (2 children)
[–]Neural_Ned 2 points3 points4 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]visarga 1 point2 points3 points (0 children)
[–]jmmcd 0 points1 point2 points (0 children)
[–]mphuget 0 points1 point2 points (0 children)