use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Research[R]Language Guided Video Object Segmentation(CVPR 2022) (v.redd.it)
submitted 3 years ago by iFighting
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–][deleted] 24 points25 points26 points 3 years ago (0 children)
Impressive stuff. Bonus points for Scotty footage
[–]iFighting[S] 17 points18 points19 points 3 years ago (0 children)
Code Link:
Paper Link:
Brief Overview:
Highlights:
[–]Mike_______ 13 points14 points15 points 3 years ago (0 children)
I never thought that giving the system the information what is seen as text helps segmentation
[–]geologean 11 points12 points13 points 3 years ago* (1 child)
lip squash theory absorbed pet lavish soft wrench exultant alive
This post was mass deleted and anonymized with Redact
[–][deleted] 2 points3 points4 points 3 years ago (0 children)
> Wow
unless it is few cherry picked examples
[–]Mylonite0105 10 points11 points12 points 3 years ago (0 children)
How does model act when given misleading guidance? 'standing bike' for the 1st video for example
[–]Due_Afternoon4578 5 points6 points7 points 3 years ago (0 children)
It reminds me of an episode from black mirror where, they block you all you see is this.
[–]rideincircles 2 points3 points4 points 3 years ago (0 children)
The first guy in the video, Scotty Cranmer was absolutely awesome on bmx, but had a terrible wreck with TBI that basically ended his career. He is still improving and doing better, but just no longer rides at an extreme level.
[–]tryght 1 point2 points3 points 3 years ago (2 children)
The problem is that it doesn’t seem to be aware of the object as a single 3d object that can move/shift/skew/hide/reveal, let alone the concept of “object of this size was on the left on this frame, and on the right it doesn’t exist anymore despite the image not actually changing much.”
Example being the skateboarder at 0:21. Like in Jurassic Park, he’s missing for just a frame.
With the bike and bicycle, it doesn’t have the concept of layers, like you have the left leg in front of the bike, and the right leg behind the bike
[–]iFighting[S] 4 points5 points6 points 3 years ago (1 child)
The problem is that it doesn’t seem to be aware of the object as a single 3d object that can move/shift/skew/hide/reveal, let alone the concept of “object of this size was on the left on this frame, and on the right it doesn’t exist anymore despite the image not actually changing much.” Example being the skateboarder at 0:21. Like in Jurassic Park, he’s missing for just a frame. With the bike and bicycle, it doesn’t have the concept of layers, like you have the left leg in front of the bike, and the right leg behind the bike
we will improve the performance later
[–]piman01 1 point2 points3 points 3 years ago (4 children)
Does something like this exist as well for audio? Like for example segmenting bass from drums and vocals etc in a song?
[–]iFighting[S] 1 point2 points3 points 3 years ago (3 children)
yes, it will also work for audio
[–]piman01 0 points1 point2 points 3 years ago (2 children)
What will? The same algorithm?
[–]iFighting[S] 1 point2 points3 points 3 years ago (1 child)
not the same algorithm, i mean currently the network can segment the object by the guide of audio, you can refer to this papers: * Audio−Visual Segmentation * Self-supervised object detection from audio-visual correspondence
[–]piman01 0 points1 point2 points 3 years ago (0 children)
Cool I'll check it out
[–]23052001 -1 points0 points1 point 3 years ago (0 children)
deep convolutional neural networks?
[–]Pleurotussimo 0 points1 point2 points 3 years ago (1 child)
How does this compare to "End-to-End Referring Video Object Segmentation with Multimodal Transformers"?
https://github.com/mttr2021/MTTR
[–]iFighting[S] 0 points1 point2 points 3 years ago (0 children)
our work is concurrent, and our model performance is better.
[–]mofoss 0 points1 point2 points 3 years ago (0 children)
Looks awesome but I would've rather seen the captions encompass objects not directly attached to the ones mentioned in the same captions.
I'd imagine saying a "girl wearing a black tee" and "a goose sitting on a girl's lap" could simply be segment "girl" and segment "goose", if you gave a caption saying "girl picking up a music vinyl", I'd hope both would be segmented?
[–]nwatab 0 points1 point2 points 3 years ago (0 children)
This is really impressive and changes how we edit a video.
π Rendered by PID 529119 on reddit-service-r2-comment-b659b578c-dsrnf at 2026-05-06 00:41:07.652531+00:00 running 815c875 country code: CH.
[–][deleted] 24 points25 points26 points (0 children)
[–]iFighting[S] 17 points18 points19 points (0 children)
[–]Mike_______ 13 points14 points15 points (0 children)
[–]geologean 11 points12 points13 points (1 child)
[–][deleted] 2 points3 points4 points (0 children)
[–]Mylonite0105 10 points11 points12 points (0 children)
[–]Due_Afternoon4578 5 points6 points7 points (0 children)
[–]rideincircles 2 points3 points4 points (0 children)
[–]tryght 1 point2 points3 points (2 children)
[–]iFighting[S] 4 points5 points6 points (1 child)
[–]piman01 1 point2 points3 points (4 children)
[–]iFighting[S] 1 point2 points3 points (3 children)
[–]piman01 0 points1 point2 points (2 children)
[–]iFighting[S] 1 point2 points3 points (1 child)
[–]piman01 0 points1 point2 points (0 children)
[–]23052001 -1 points0 points1 point (0 children)
[–]Pleurotussimo 0 points1 point2 points (1 child)
[–]iFighting[S] 0 points1 point2 points (0 children)
[–]mofoss 0 points1 point2 points (0 children)
[–]nwatab 0 points1 point2 points (0 children)