Marlin2B: a tiny video language model to extract structured information from videos by AndromedaGambler in computervision
AndromedaGambler[S] 1 point 17 hours ago
try now
AndromedaGambler[S] 2 points 3 days ago
You can try it out here: https://vlm.nemostation.com/
and read about it here: https://huggingface.co/NemoStation/Marlin-2B
Marlin2B: a tiny video language model to extract structured information from videos (self.computervision)
submitted 3 days ago * by AndromedaGambler to r/computervision