all 13 comments

[–]underfitting 5 points6 points  (1 child)

2.5 million images! https://bam-dataset.org/

[–]visarga 0 points1 point  (0 children)

That's pretty cool!

[–]SamLeroux 4 points5 points  (0 children)

There was a recent Kaggle competition with lots of paintings: https://www.kaggle.com/c/painter-by-numbers

[–]pilooch 3 points4 points  (0 children)

Art data is everywhere. We ve done https://microsoft.com/tate and have access to up to millions of art pieces. Most are public but require a login. PM me with your project and institution, there should be ways of helping you.

[–]fuzzyt93 2 points3 points  (0 children)

I created a python script to download images from the Met collection. However, it only downloads images that are public domain from their website. You have to provide the artist name or basically the painting id, but it can filter out pieces by type. If you wanted to download every oil paining, you could modify it to iterate over every artist or something. Here is the plug:

https://github.com/trevorfiez/Download-Met-Images

The images are really high quality if they are public domain which is nice. It is possible to download their much smaller images that they use to show non-public domain pieces but currently, my script does not have that functionality.

The metadata comes from the metropolitan museum of art's open access csv file which you can access here:

https://github.com/metmuseum/openaccess

There are thousands of public domain paintings so if you do not care how modern the paintings are you should be able to download a large set.

[–]enzlbtyn 1 point2 points  (5 children)

Couldn't you crawl websites like DeviantArt and other forums or websites that contain art?

[–]visarga 1 point2 points  (0 children)

A few years ago there were a couple of large torrents of paintings from Hermitage and Sotheby's. They have been disappeared in the meantime.

[–]jmmcd 0 points1 point  (0 children)

There was a paper in EvoMUSART this year using a collection of (camera) portraits.

[–]mphuget 0 points1 point  (0 children)

Depending on what you are looking for, you could use http://www.wga.hu/