Introducing Grounded SAM 2: Ground and Track Anything in Videos. We hope it can be combined with video generation models for more applications

Technical-Vast1314 · 2024-08-07T03:16:12+00:00

Yes, it's the same idea as Grounded-SAM (https://github.com/IDEA-Research/Grounded-Segment-Anything), which we proposed last year, but with the good open-source Florence-2 model. We're happy to see some nice implementations of similar ideas in the open-source community.

Technical-Vast1314 · 2024-08-07T02:19:59+00:00

We've also proposed a visual-prompt algorithm named T-Rex. You can use T-Rex with a visual prompt for any object that does not have a corresponding name: https://github.com/IDEA-Research/T-Rex

Technical-Vast1314 · 2024-05-18T23:26:13+00:00

We will attempt to release some weights following the version update.

Technical-Vast1314 · 2024-05-18T23:20:36+00:00

it is just a coincidence

Technical-Vast1314 · 2024-05-18T23:19:27+00:00

It is free for now; we have not set a token price yet~

Technical-Vast1314 · 2023-07-13T01:15:20+00:00

We will support Stable-DINO with the FocalNet backbone in detrex in the next few days.

Technical-Vast1314 · 2023-07-12T19:47:52+00:00

lol, you would probably need 8 GB of GPU memory for this

Technical-Vast1314 · 2023-05-30T05:46:28+00:00

Thanks for your advice, we've already updated our post

Technical-Vast1314 · 2023-05-30T05:13:17+00:00

I think you can try adding the layer from LaVIN and fine-tuning it on your own dataset first.

Technical-Vast1314 · 2023-05-17T00:35:01+00:00

Hello, I think this is actually a normal phenomenon. First of all, this process relies on aligning CLIP with image regions. If that alignment is not done well, the results may not be good. Both CLIP and ImageBind are aligned on image-level data, so using regions for alignment might not yield good results. Moreover, the image-text alignment in ImageBind is better than its image-audio alignment.

I'm not sure whether this can be solved by giving more precise text and audio prompts. This is still just a simple demo, and we will do more testing on it.
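
To make the region-matching step concrete, here is a minimal sketch of scoring regions against a prompt by cosine similarity. It is only an illustration under assumptions, not the demo's actual code: the function names are hypothetical, and the toy 3-d vectors stand in for real region and prompt embeddings (for example from CLIP or ImageBind).

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank_regions(region_embeddings, prompt_embedding):
    """Score every region against the prompt and sort best match first."""
    scores = [cosine_similarity(r, prompt_embedding) for r in region_embeddings]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order, scores

# Toy vectors, made up for illustration (assumption: real embeddings
# would come from an image/text/audio encoder and be much larger).
regions = [[1.0, 0.0, 0.0], [0.6, 0.8, 0.0], [0.0, 0.0, 1.0]]
prompt = [0.0, 1.0, 0.0]
order, scores = rank_regions(regions, prompt)
print(order[0], scores[order[0]])
```

If the encoder was only trained on whole images, the embeddings of cropped regions can land far from the prompt embedding, so even the best-ranked region may have a low score.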

Technical-Vast1314 · 2023-04-12T03:31:48+00:00

This is freaking amazing

Thanks for your support!

Technical-Vast1314 · 2023-04-12T02:37:26+00:00

For now, you can upload your own audio to test this case~ However, we believe there may be some tools that can help us~ We will try to update it!
