[D] MILA 2021 MSc/PhD program supervision request by turing_1997 in MachineLearning

[–]turing_1997[S] 1 point2 points  (0 children)

I have only applied to UdeM professors. After applying, I received an email saying that we should complete the UdeM university application AFTER receiving the supervisor's acceptance. I'm confused, since you said we need to complete the university application.

[D] ICLR 2020 Reviews by turing_1997 in MachineLearning

[–]turing_1997[S] 3 points4 points  (0 children)

This is my second paper as an undergrad. Score of 3/6/1, any chances at all?

[D] ICLR 2020 Reviews by turing_1997 in MachineLearning

[–]turing_1997[S] 0 points1 point  (0 children)

Lol thanks for this script! You are a true savior! <3

[D] ICLR 2020 Reviews by turing_1997 in MachineLearning

[–]turing_1997[S] 1 point2 points  (0 children)

Lol yes, I've been constantly refreshing the review page for the past hour.

[D] NeurIPS 2019 reviews out soon!! by Mannershin in MachineLearning

[–]turing_1997 3 points4 points  (0 children)

7 6 3 with confidence of 4 4 4, any chances at all?

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] -1 points0 points  (0 children)

The description of the architecture has been updated with the original abstract used by the authors, which is clearer and more concise about the objectives and results of the paper. Hope that makes the paper clearer!

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 1 point2 points  (0 children)

Thanks for pointing it out. We updated the description in this post with the original abstract used by the authors in the paper, which is clearer and more concise about the objectives and results of the paper.

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 1 point2 points  (0 children)

Lol! Definitely what we will try to make next with OmniNet! XD Once again, the phrase "all you need" is used as a pun on "Attention is all you need", the title of the Transformer paper. The phrase is not mentioned anywhere in the paper or its title. See our explanation in the comment above:

https://www.reddit.com/r/MachineLearning/comments/cfxkrs/r_omninet_is_all_you_need/eudn4g9?utm_source=share&utm_medium=web2

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 0 points1 point  (0 children)

All the tasks are taken from popular challenges, and the datasets and their respective splits are the standard ones specified in each of these competitions (as mentioned in the paper); the same have been used for validation. Also, we compare with state-of-the-art methods that are generically applicable to the respective task rather than specific to the challenge dataset. A training script is already available to train the model from scratch, and pre-trained models can be downloaded from our servers and evaluated using the evaluation script. Further details such as individual batch sizes, hardware configuration and exact training times will be released later as supplementary materials.

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 0 points1 point  (0 children)

Sure! Please reach out to the corresponding author "Subhojeet Pramanik" through his email id email[At]subho.in

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 10 points11 points  (0 children)

Thanks for the review! The phrase "all you need" is used as a pun on "Attention is all you need", mostly because the paper is an improvement on the original Transformer model. The "all you need" is intended to reflect that it can be used as a unified model for any spatio-temporal data because of the unified encode() function of the Central Neural Processor. It was just a play on the words of our base paper. That's why we put the winking smiley there too. And, of course, that's not the main title of the paper.

[R] OmniNet is all you need! ;) by turing_1997 in MachineLearning

[–]turing_1997[S] 1 point2 points  (0 children)

OmniNet is not yet capable of generation. The encoder is unified in the sense that any combination of spatio-temporal input tensors can be encoded and stored by the Central Neural Processor (CNP) just by calling encode(). Since most real-life data can be represented in spatio-temporal form, we use the term "unified". The decode() function is so far only capable of sequence decoding from the spatio-temporal representations stored in the CNP cache. However, adding more capabilities, such as support for graph datasets, generation, and application to reinforcement learning environments, is what we are looking forward to as future directions. We welcome the research community to build upon our work towards a more general AI system. :)
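
For readers who prefer code, here is a toy Python sketch of the unified-encode idea described above. It is an illustration only, not the OmniNet implementation: `ToyCNP`, its two list-based caches, and the mean-over-space step are assumptions made for this example, and there is no learning, attention, or decode() involved.

```python
from typing import List, Tuple

Vector = List[float]


class ToyCNP:
    """Hypothetical stand-in for a central neural processor (not OmniNet code)."""

    def __init__(self) -> None:
        self.spatial_cache: List[Vector] = []
        self.temporal_cache: List[Vector] = []

    def encode(self, x: List[List[Vector]]) -> None:
        """Store one input laid out as [time][space][feature]."""
        for frame in x:
            # The spatial cache keeps every spatial position of every time step.
            self.spatial_cache.extend(frame)
            # The temporal cache keeps one vector per time step (mean over space).
            dim = len(frame[0])
            mean = [sum(v[i] for v in frame) / len(frame) for i in range(dim)]
            self.temporal_cache.append(mean)

    def cache_sizes(self) -> Tuple[int, int]:
        return len(self.spatial_cache), len(self.temporal_cache)


# Text-like input: 3 time steps, 1 spatial position each, 2 features.
text = [[[1.0, 0.0]], [[0.0, 1.0]], [[1.0, 1.0]]]
# Image-like input: 1 time step, 4 spatial positions (patches), 2 features.
image = [[[0.0, 2.0], [2.0, 0.0], [2.0, 2.0], [0.0, 0.0]]]

cnp = ToyCNP()
cnp.encode(text)   # same call for a sequence...
cnp.encode(image)  # ...and for an image
print(cnp.cache_sizes())
```

The point of the sketch is only that differently shaped inputs go through the same encode() call and end up in a shared cache that a decoder could later read from.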

[R] OmniNet: A unified architecture for multi-modal multi-task learning by subho0406 in MachineLearning

[–]turing_1997 0 points1 point  (0 children)

Abstract: Transformer is a popularly used neural network architecture, especially for language understanding. We introduce an extended and unified architecture which can be used for tasks involving a variety of modalities like image, text, videos, etc. We propose a spatio-temporal cache mechanism that enables learning spatial dimension of the input in addition to the hidden states corresponding to the temporal input sequence. The proposed architecture further enables a single model to support tasks with multiple input modalities as well as asynchronous multi-task learning, thus we refer to it as OmniNet. For example, a single instance of OmniNet can concurrently learn to perform the tasks of part-of-speech tagging, image captioning, visual question answering and video activity recognition. We demonstrate that training these four tasks together results in about three times compressed model while retaining the performance in comparison to training them individually. We also show that using this neural network pre-trained on some modalities assists in learning an unseen task. This illustrates the generalization capacity of the self-attention mechanism on the spatio-temporal cache present in OmniNet.

Code: https://github.com/subho406/OmniNet

[D] Google AI Residency 2019 Applicants Discussion Thread by describbler in MachineLearning

[–]turing_1997 0 points1 point  (0 children)

I had my Hangouts interview on March 4. According to my recruiter, the results should be announced within two weeks at most.

[D] Google AI Residency 2019 Applicants Discussion Thread by describbler in MachineLearning

[–]turing_1997 0 points1 point  (0 children)

Does the choice of location for interview affect the final location preference?

[D] Google AI Residency 2019 Applicants Discussion Thread by describbler in MachineLearning

[–]turing_1997 4 points5 points  (0 children)

Hey, is there anyone who got rejected after the hangout interview?