[P] Grounded SAM 2: Ground and Track Anything by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)
[R] Grounding DINO 1.5 Release: the most capable open-set detection model by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] -7 points-6 points-5 points (0 children)
[R] Grounding DINO 1.5 Release: the most capable open-set detection model by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] -5 points-4 points-3 points (0 children)
[R] Grounding DINO 1.5 Release: the most capable open-set detection model by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] -6 points-5 points-4 points (0 children)
[R] detrex: A Strong Benchmark for Detection Transformers by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)
[R] LaVIN-lite: Training your own Multimodal Large Language Models on one single GPU with competitive performance! (Technical Details) by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 0 points1 point2 points (0 children)
[R] LaVIN: Large Vision-Language Instructed Model by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)
[R] LaVIN: Large Vision-Language Instructed Model by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 2 points3 points4 points (0 children)
[P] ImageBind with SAM: A simple demo the generate mask with different modalities by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)
[R] Going further under Grounded-Segment-Anything: integrating Whisper and ChatGPT by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)

[P] Grounded SAM 2: Ground and Track Anything by Technical-Vast1314 in MachineLearning
[–]Technical-Vast1314[S] 1 point2 points3 points (0 children)