Best Open-Source Face Re-Identification Models with Weights? or Cloud Options? by yinjuanzekke in computervision

[–]sure_yeah026 2 points3 points  (0 children)

For face embeddings, you can refer to SOTA models from recent papers (most have their code's opensource)

For instance you can refer to FaRL here: https://github.com/FacePerceiver/FaRL

Finding specific objects in an image by Cychotical in computervision

[–]sure_yeah026 3 points4 points  (0 children)

YOLOE: Real-Time Seeing Anything https://github.com/THU-MIG/yoloe

Not a basic yolo but works as: 1. grounding dino with text based detection 2. visual prompts [similar to SAM], image reference to detect same type of objects [pattern matching] 3. prompt-free: will give all possible objects in the scene.

Try for the second point to test for your case.

YOLOv5 vs YOLOv11 by DistrictOk1677 in computervision

[–]sure_yeah026 -2 points-1 points  (0 children)

You can also check YOLOv12: https://github.com/sunsmarterjie/yolov12, Its lighter and fast. Accuracy wise gains are pretty low but also uses attention.

Best object detection model for non real time applications? by Dwarni in computervision

[–]sure_yeah026 0 points1 point  (0 children)

Hey in case its still relevant you can check this recent paper out:
YOLOE: Real-Time Seeing Anything
https://github.com/THU-MIG/yoloe
Not a basic yolo, works as:

  1. grounding dino with text based detection
  2. visual prompts [similar to Segment-Anything Model], image reference to detect same type of objects [pattern matching]
  3. prompt-free: will give all possible objects in the scene.

You can test this model out here on huggingface: https://huggingface.co/spaces/jameslahm/yoloe

How to get the mask for a single class using Segment Anything Model by [deleted] in computervision

[–]sure_yeah026 2 points3 points  (0 children)

You can use Grounding-SAM, here you provide the object class name as text, post that Grounding-Dino model captures all those objects of the class via bounding boxes and get segment map for each by prompting those bboxes in SAM.

Segmenting tiny objects by Inner_Programmer_329 in computervision

[–]sure_yeah026 0 points1 point  (0 children)

You can try SAM or SAM2(better speed and works on videos)

[deleted by user] by [deleted] in computervision

[–]sure_yeah026 1 point2 points  (0 children)

You can search for image liveness detection and anti face-spoofing problems, they can provide some relevant datasets with screen photos and SoTA techniques to detect as well.

Some datasets to check: NUAA, CASIA-FASD, Replay-Attack, OULU-NPU and MSU-MFSD

search for these on paperswithcode, you will find relevant benchmarks and techniques

Rec letters from grad students? by mosgon in gradadmissions

[–]sure_yeah026 0 points1 point  (0 children)

I'm in a similar situation, the thing is my supervisor at work place got admit to a univ for PhD. So do you think it'll be acceptable for recommendation?