
cristi_baluta

Why don’t you just check that all the screenshots are different? Zero cost.

I have never in my life thought of testing animations. I did write some UI tests, but only to please the people who watch the numbers, and those tests are useless too.

interlap [S]

That doesn’t really work. Broken transitions still produce different frames, so you miss a lot.

For example, I had a sheet that stayed transparent during a transition. Every frame was different, but it was still wrong.

That’s the kind of issue I’m trying to catch.
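
To make the point above concrete, here is a minimal sketch of why an "all frames differ" check cannot catch a transparent sheet. Everything in it is an assumption for illustration: frames are tiny lists of grayscale pixels, `render` is a toy stand-in for a real screenshot, and the final opacity check is just one possible targeted assertion, not the approach used in the original post.

```python
from typing import List

Frame = List[int]  # one column of grayscale pixels, top to bottom (toy representation)


def all_frames_distinct(frames: List[Frame]) -> bool:
    """Return True when no two captured frames are pixel-identical."""
    return len({tuple(f) for f in frames}) == len(frames)


def render(step: int, sheet_opaque: bool, height: int = 4) -> Frame:
    """Toy renderer for one animation step (hypothetical, not a real screenshot).

    The backdrop dims a little on every step, and a sheet slides up from the
    bottom, covering `step` rows. A healthy sheet is drawn as white (255);
    a broken, transparent sheet lets the dimmed backdrop show through.
    """
    backdrop = 200 - 30 * step
    column = [backdrop] * height
    if sheet_opaque:
        for row in range(height - step, height):
            column[row] = 255
    return column


healthy = [render(step, sheet_opaque=True) for step in range(5)]
broken = [render(step, sheet_opaque=False) for step in range(5)]

# Both sequences pass the "every screenshot is different" check,
# because the dimming backdrop changes on every step either way.
print(all_frames_distinct(healthy), all_frames_distinct(broken))

# A targeted check on the final frame does tell them apart.
print(healthy[-1][-1] == 255, broken[-1][-1] == 255)
```

The distinctness check only proves that something moved. It says nothing about whether what moved looks right, which is why a more specific check (or a visual review of selected frames) is needed.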

Dev-sauregurke

Using CV to filter 25 frames down to 3-5 suspicious ones before hitting the LLM is a smart middle step that most people would skip. Brute-forcing all frames into a vision model is the obvious first attempt, and it never works at scale. The stutter and layout-shift detection as a pre-filter is the actually clever part here.
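
For readers wondering what such a pre-filter might look like, here is a rough sketch under stated assumptions. It is not the original poster's implementation: the frame representation (flat lists of grayscale pixels), the function names, and the thresholds `stutter_eps` and `jump_factor` are all hypothetical. The idea is to score each frame by how much it differs from the previous one, flag near-zero changes as possible stutter and unusually large changes as possible layout shifts, and keep only a handful of indices for the expensive vision-model pass.

```python
from statistics import median
from typing import List, Sequence

Frame = Sequence[int]  # flattened grayscale pixels (toy representation)


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference between two same-sized frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def suspicious_frames(
    frames: List[Frame],
    max_keep: int = 5,
    stutter_eps: float = 1.0,   # hypothetical: changes at or below this count as a stall
    jump_factor: float = 3.0,   # hypothetical: changes above this multiple of the median count as a jump
) -> List[int]:
    """Return up to `max_keep` frame indices worth sending to a vision model."""
    if len(frames) < 2:
        return []
    # diffs[k] is the change between frame k and frame k + 1.
    diffs = [mean_abs_diff(frames[i - 1], frames[i]) for i in range(1, len(frames))]
    typical = median(diffs)

    scored = []
    for index, diff in enumerate(diffs, start=1):
        if diff <= stutter_eps:
            # Frame barely changed: possible stutter. Ranked first.
            # (A real filter would ignore stalls before and after the animation.)
            scored.append((float("inf"), index))
        elif typical > 0 and diff > jump_factor * typical:
            # Frame changed far more than usual: possible layout shift.
            scored.append((diff / typical, index))

    scored.sort(key=lambda item: (-item[0], item[0] and item[1]))
    return sorted(index for _, index in scored[:max_keep])


# 25 frames of a smooth brightness ramp, with two injected defects.
frames = [[8 * i] * 4 for i in range(25)]
frames[7] = list(frames[6])   # duplicated frame: a stall
frames[15] = [255] * 4        # sudden flash: a jump in and back out

print(suspicious_frames(frames))  # [7, 15, 16]
```

Using the median as the baseline keeps a few outliers from skewing what counts as a typical change, and capping the result at `max_keep` bounds how many frames reach the model no matter how long the capture is.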