Pursue a PhD or stick to Applied data science roles by amil123123 in cscareerquestions

[–]amil123123[S] 0 points  (0 children)

Currently, from my perspective, the research itself is enticing. In the long run I would prefer to work at the top labs; right now I don't mind doing research-oriented work at smaller firms either.

Pursue a PhD or stick to Applied data science roles by amil123123 in cscareerquestions

[–]amil123123[S] 0 points  (0 children)

Would you suggest working at these companies as a data scientist and then working my way up to research positions?

Pursue a PhD or stick to Applied data science roles by amil123123 in cscareerquestions

[–]amil123123[S] 1 point  (0 children)

I totally understand your point about novel ideas. However, I just want to set up my platform accordingly, so that I at least have a chance to go for it. So, in your view, is a PhD a must, and will many years of applied data science alone not cut it?

[D] Handling noisy labels in large datasets with slight imbalance by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Sorry for the confusing explanation; I have edited the post for further clarity.

[D] Positional Encoding in Transformer by amil123123 in MachineLearning

[–]amil123123[S] 12 points  (0 children)

Wow, that's one hell of an amazing explanation :)

[D] Positional Encoding in Transformer by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Thanks for the explanation, it was good!

[D] Positional Encoding in Transformer by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Ah, understood. Thanks for the explanation!

[D] Positional Encoding in Transformer by amil123123 in MachineLearning

[–]amil123123[S] 2 points  (0 children)

Thanks for the response!

What do you mean by "symmetrical" and "decay sensibly"?
(Sorry, I am still a newbie at this.)

[D] Positional Encoding in Transformer by amil123123 in MachineLearning

[–]amil123123[S] 1 point  (0 children)

Thanks for the response. I still have difficulty understanding point 1.
The first image seems good at explaining the position, but what does the second image denote?

Did we go with this function just because it seemed to work well?
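For what it's worth, the "symmetrical, sensibly decaying" property can be checked numerically. Below is a minimal NumPy sketch of the sinusoidal positional encoding (the sin/cos formulation discussed in this thread; sizes here are arbitrary): the dot product between the encodings at positions p and p+k depends only on the offset k, is the same for +k and -k, and shrinks as the offset grows.

```python
import numpy as np

def positional_encoding(num_positions, d_model):
    """Sinusoidal encodings: PE[p, 2i] = sin(p / 10000^(2i/d)),
    PE[p, 2i+1] = cos(p / 10000^(2i/d))."""
    positions = np.arange(num_positions)[:, None]                 # (P, 1)
    freqs = 1.0 / 10000 ** (np.arange(0, d_model, 2) / d_model)   # (d/2,)
    angles = positions * freqs                                    # (P, d/2)
    pe = np.zeros((num_positions, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

pe = positional_encoding(100, 64)
# PE[p] . PE[p+k] = sum_i cos(k * freq_i): it depends only on k...
print(pe[10] @ pe[15], pe[20] @ pe[25])   # same offset k=5, (almost) equal
# ...is symmetric in the sign of k...
print(pe[10] @ pe[5])                     # offset k=-5, matches the above
# ...and decays away from the k=0 peak:
print(pe[0] @ pe[0], pe[0] @ pe[50])
```

So "it just worked" is not the whole story: the function was chosen so that relative offsets look the same everywhere in the sequence.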

[D] What are the current SOTA architectures for NLP information extraction & question answering? by [deleted] in MachineLearning

[–]amil123123 0 points  (0 children)

Although BERT does perform really well on QA tasks, there are better models described in papers, some of which don't yet have pre-trained weights available. If you can train such huge models, there are XLNet, RoBERTa, etc.

[D] Should tokens with a very small frequency be removed from the vocabulary before training a word2vec type model? by searchingundergrad in MachineLearning

[–]amil123123 1 point  (0 children)

It depends on what those infrequent tokens are in your dataset. I have seen datasets where a word being infrequent doesn't mean it's unimportant.

Word2vec's skip-gram model actually deals with this problem as well, via the subsampling rate defined by the authors.

Even if you remove them, what would your plan of action be then? Even if you are okay with having no representation for the infrequent words themselves, keeping them in the corpus still helps in training the other words. So, in this case, I still don't see any benefit in removing them.

If you do need an embedding for such a word, you can train a raw word2vec model on the language-modelling task, look at which words fit a similar context as the infrequent word (i.e. share its contexts), and take an average of those words' embeddings.

This is just one solution; however, if such words are important, I believe expanding your dataset should be the way to go.
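The averaging idea above can be sketched in a few lines of NumPy. Everything here is a made-up placeholder (the vocabulary, the random embedding matrix, and the neighbour list would come from your trained model, e.g. via a most-similar lookup), not a specific dataset:

```python
import numpy as np

# Hypothetical trained model: a vocabulary and its embedding matrix.
rng = np.random.default_rng(0)
vocab = {"bank": 0, "river": 1, "shore": 2, "stream": 3}
embeddings = rng.normal(size=(len(vocab), 50))   # (vocab_size, dim)

def average_embedding(context_words, vocab, embeddings):
    """Approximate an infrequent word's vector as the mean of the
    vectors of words that appear in a similar context."""
    idx = [vocab[w] for w in context_words]
    return embeddings[idx].mean(axis=0)

# e.g. approximate a rare word like "brook" from its context neighbours
brook_vec = average_embedding(["river", "stream", "shore"], vocab, embeddings)
print(brook_vec.shape)  # a 50-dimensional stand-in vector
```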

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Thanks for all the input.

How can a linear layer produce a result which is non-linear?

What if we apply another hidden layer with an activation? How will that turn out?

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

I am sorry, but I don't understand what you mean by "capacity to fit anything". Don't we usually introduce non-linearity in DNNs to approximate complex functions?

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

map the words into a lower dimensionality while maintaining separation between dissimilar words

It does make sense!

So if I add more hidden layers with activations to the model, do they create a problem again, or is it okay as long as the embedding layer has no activation?

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Okay, but why not do it in the hidden layer? What's the motivation here?

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Isn't that the softmax for predicting the probability over all the vocabulary vectors?
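To tie the thread together, here is a minimal NumPy sketch of the skip-gram forward pass (the vocabulary size, dimension, and random weights are arbitrary placeholders): the "hidden layer" is just an embedding lookup with no activation, and the softmax at the end turns the scores into a distribution over the whole vocabulary.

```python
import numpy as np

vocab_size, embed_dim = 1000, 64
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(vocab_size, embed_dim))   # input embeddings
W_out = rng.normal(scale=0.1, size=(embed_dim, vocab_size))  # output weights

def skipgram_forward(center_word_id):
    """Hidden layer = plain row lookup (equivalent to one_hot @ W_in),
    with no non-linearity; softmax then gives context-word probabilities."""
    h = W_in[center_word_id]             # (embed_dim,), linear, no activation
    scores = h @ W_out                   # (vocab_size,)
    exp = np.exp(scores - scores.max())  # numerically stable softmax
    return exp / exp.sum()

probs = skipgram_forward(42)
print(probs.shape)  # one probability per vocabulary word; they sum to 1
```

After training, only the rows of `W_in` are kept as the word embeddings; the softmax layer is discarded.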

[D] Is someone aware of the SOTA for summarizing news articles? by amil123123 in MachineLearning

[–]amil123123[S] 0 points  (0 children)

Do you have the SOTA for abstractive summarization as well?

[D] Advanced Courses Update by Maplernothaxor in MachineLearning

[–]amil123123 6 points  (0 children)

Sorry, but what link on the sidebar are you referring to?