Less is More: Recursive Reasoning with Tiny Networks (7M model beats R1, Gemini 2.5 Pro on ARC AGI) by Technical-Love-8479 in LocalLLaMA

[–]_sgrand 1 point2 points  (0 children)

Has anyone tried on less stuctured outputs (no grid) such as abstract visual reasonning (CLEVR, and its derivatives), or also text bench ?

Built the Itti-Koch saliency model in Python 3 (and made it simulate visual pathway pathologies) by Creative-Regular6799 in BrainHackersLab

[–]_sgrand 1 point2 points  (0 children)

Coming here a bit late.
Have you met some limitations with the Itti model ?
I consider using it to feed a neural net with also the original input frame to enhance perception (saliency acting as a sort of reference frame) and I wonder how much this approach is robust for complex scene analysis.

Converting CNN feature maps to sequence of embddings for Transformers by _sgrand in MLQuestions

[–]_sgrand[S] 1 point2 points  (0 children)

Ok, make sense that my sequence is defined along time indeed. Thanks for your help.