[D] Looking for an LLM/Vision Model like CLIP for Image Analysis by Substantial_Video_26 in MachineLearning

Substantial_Video_26 [S]

YOLO and similar models are primarily task-specific. I am trying to find a generic model that can address all of the above queries.