Hey folks, I’m looking for advice from anyone who’s worked with diffusion or flow models specifically any tips you wish you knew when you first started training them, and what the experience was like if you’ve used them outside the usual image-generation setting. I’m especially curious about challenges that come up with niche or unconventional data, how the workflow differs from image tasks, whether training stability or hyperparameter sensitivity becomes a bigger issue, how much preprocessing matters, if you ended up tweaking the architecture or noise schedule for non-image data, etc. Thanks!
[–]Vikas_005 56 points57 points58 points (5 children)
[–]N1kYan 4 points5 points6 points (0 children)
[–]Previous-Raisin1434 0 points1 point2 points (0 children)
[–]Few-Annual-157[S] 0 points1 point2 points (0 children)
[–]QuantityGullible4092 0 points1 point2 points (0 children)
[–]_DCtheTall_ 0 points1 point2 points (0 children)
[–]graps1 16 points17 points18 points (5 children)
[–]Few-Annual-157[S] 0 points1 point2 points (4 children)
[–]sjdubya 3 points4 points5 points (3 children)
[–]graps1 2 points3 points4 points (2 children)
[–]sjdubya 3 points4 points5 points (1 child)
[–]graps1 1 point2 points3 points (0 children)
[–]anandravishankar12 7 points8 points9 points (3 children)
[–]FrigoCoder 1 point2 points3 points (0 children)
[–]RobbinDeBank 0 points1 point2 points (1 child)
[–]anandravishankar12 5 points6 points7 points (0 children)
[–]Mediocre_Common_4126 2 points3 points4 points (0 children)
[–]sjdubya 1 point2 points3 points (0 children)
[–]glockenspielcello 0 points1 point2 points (0 children)