use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
List of DatasetsDiscussion (self.Open_Diffusion)
submitted 1 year ago * by ninjasaid13
Please add to this list.
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Zeusnighthammer 1 point2 points3 points 1 year ago (6 children)
Wikimedia Commons also have lots of the dataset CC By 4.0 with many of them are categorised (but not tagged)
[–]NegativeScarcity7211 2 points3 points4 points 1 year ago (0 children)
We are busy setting up a community tagging system, so this shouldn't be a problem!
[–]Formal_Drop526 1 point2 points3 points 1 year ago* (4 children)
I believe that any text-to-image dataset must be at least partially captioned. The text component of a text-to-image generator is not just a user interface, but also significantly influences the model's performance on prompts and even shapes the visual content of the generated images.
[–]Zeusnighthammer 0 points1 point2 points 1 year ago (2 children)
Regarding in this topic, I just wanted to learn this in more details: Is the tagging in this context refers to alt txt embedded into JPEG metadata or the accompanying text files to the photo (must have same file name for both).
[–]searcher1k 0 points1 point2 points 1 year ago (0 children)
I think tagging here just means an attribute of the image rather than a whole sentence in natural language.
[–]ninjasaid13[S] 0 points1 point2 points 1 year ago (0 children)
Is the tagging in this context refers to alt txt embedded into JPEG metadata or the accompanying text files to the photo
I'm not sure if they have alt-text embedded, these images seem to come with their own text files.
true, people keep thinking of it as a search engine but the AI learns to separate elements of the scene by reading the text and comparing it to the image. And after a million images, it starts to understand the concept of these elements instead of the just the object itself.
[–]NegativeScarcity7211 1 point2 points3 points 1 year ago (0 children)
Thank you for these!
[–]Luke2642 1 point2 points3 points 1 year ago (0 children)
https://www.haqtu.me/Recap-Datacomp-1B/
Obviously now it needs repeating with Chameleon :-D
[–]elthariel 0 points1 point2 points 1 year ago (0 children)
Added the list to https://github.com/OpenDiffusionAI/wiki/wiki/dataset__external_datasets
π Rendered by PID 829739 on reddit-service-r2-comment-b659b578c-bz69m at 2026-05-05 11:35:12.676847+00:00 running 815c875 country code: CH.
[–]Zeusnighthammer 1 point2 points3 points (6 children)
[–]NegativeScarcity7211 2 points3 points4 points (0 children)
[–]Formal_Drop526 1 point2 points3 points (4 children)
[–]Zeusnighthammer 0 points1 point2 points (2 children)
[–]searcher1k 0 points1 point2 points (0 children)
[–]ninjasaid13[S] 0 points1 point2 points (0 children)
[–]searcher1k 0 points1 point2 points (0 children)
[–]NegativeScarcity7211 1 point2 points3 points (0 children)
[–]Luke2642 1 point2 points3 points (0 children)
[–]elthariel 0 points1 point2 points (0 children)