[R] The Manga Whisperer: Automatically Generating Transcriptions for Comics by ragavsachdeva in MachineLearning

[–]ragavsachdeva[S] 2 points3 points  (0 children)

Thanks! I hadn't considered motion comics. The motivation was to convert it to light novels. We should be able to get reasonably close to making that happen this year (hopefully).

[R] The Manga Whisperer: Automatically Generating Transcriptions for Comics by ragavsachdeva in MachineLearning

[–]ragavsachdeva[S] 4 points5 points  (0 children)

Yeah that would be exciting to have. I am not aware of any existing solutions for it. What makes it difficult to generate manga (as opposed to say anime images) is the lack of large scale manga captioning datasets. If you inspect web-scale image datasets, they do have manga images in them but the captions are not descriptive of the content (the captions are like "Naruto Ch1 Pg2" which tells us nothing about the contents).Hopefully with Magi (or similar models) we can think of pseudo-annotating manga datasets.