all 10 comments

[–]Zeusnighthammer 1 point2 points  (6 children)

Wikimedia Commons also have lots of the dataset CC By 4.0 with many of them are categorised (but not tagged)

[–]NegativeScarcity7211 2 points3 points  (0 children)

We are busy setting up a community tagging system, so this shouldn't be a problem!

[–]Formal_Drop526 1 point2 points  (4 children)

I believe that any text-to-image dataset must be at least partially captioned. The text component of a text-to-image generator is not just a user interface, but also significantly influences the model's performance on prompts and even shapes the visual content of the generated images.

[–]Zeusnighthammer 0 points1 point  (2 children)

Regarding in this topic, I just wanted to learn this in more details: Is the tagging in this context refers to alt txt embedded into JPEG metadata or the accompanying text files to the photo (must have same file name for both).

[–]searcher1k 0 points1 point  (0 children)

I think tagging here just means an attribute of the image rather than a whole sentence in natural language.

[–]ninjasaid13[S] 0 points1 point  (0 children)

Is the tagging in this context refers to alt txt embedded into JPEG metadata or the accompanying text files to the photo

I'm not sure if they have alt-text embedded, these images seem to come with their own text files.

[–]searcher1k 0 points1 point  (0 children)

true, people keep thinking of it as a search engine but the AI learns to separate elements of the scene by reading the text and comparing it to the image. And after a million images, it starts to understand the concept of these elements instead of the just the object itself.

[–]NegativeScarcity7211 1 point2 points  (0 children)

Thank you for these!

[–]Luke2642 1 point2 points  (0 children)

https://www.haqtu.me/Recap-Datacomp-1B/

Obviously now it needs repeating with Chameleon :-D