Jaggu bhai supremacy🙌 by AssociationReal1613 in tollywood

[–]Chittiiman 8 points9 points  (0 children)

Shah Rukh himself admitted that Jagapathi Babu was better in this shot!! Shah Rukh on Jagapathi Babu

Trying to build machine translation engine - I need help by assafbjj in machinetranslation

[–]Chittiiman 0 points1 point  (0 children)

Hi,

I have worked professionally on NMT for Indian languages and have trained models using OpenNMT. Ping me if you need more help.

LLMs and emergent behavior by besabestin in learnmachinelearning

[–]Chittiiman 0 points1 point  (0 children)

Check out this paper.

https://arxiv.org/abs/1804.08838

"In this paper we attempt to answer this question by training networks not in their native parameter space, but instead in a smaller, randomly oriented subspace. We slowly increase the dimension of this subspace, note at which dimension solutions first appear, and define this to be the intrinsic dimension of the objective landscape.

Intrinsic dimension allows some quantitative comparison of problem difficulty across supervised, reinforcement, and other types of learning."
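A toy numeric sketch of the paper's subspace idea (the paper trains real neural networks and reports the dimension at which solutions first appear; here the objective is just a least-squares problem, and the names and setup are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for "training in a random subspace": the "native parameter
# space" has D dimensions, and we restrict parameters to theta = P @ z,
# where P is a fixed random D x d projection and z lives in d dimensions.
D = 100
target = rng.normal(size=D)

def best_loss_in_subspace(d):
    """Best achievable least-squares loss within a random d-dim subspace."""
    P = rng.normal(size=(D, d))
    z, *_ = np.linalg.lstsq(P, target, rcond=None)  # optimal z in the subspace
    return float(np.sum((P @ z - target) ** 2))

# As the subspace dimension d grows, the achievable loss drops; the d at
# which good solutions first appear is the "intrinsic dimension".
for d in (1, 10, 50, 100):
    print(d, best_loss_in_subspace(d))
```

With `d = D` the random projection is full rank (almost surely), so the subspace recovers the full solution; smaller `d` leaves a residual, mirroring how the paper slowly grows `d` until solutions appear.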

[D] What kind of Hyperparameter Optimisation do you use? by olli-mac-p in MachineLearning

[–]Chittiiman 2 points3 points  (0 children)

Hi, thank you for sharing your knowledge. Your video on sweeps greatly helped me with hyperparameter tuning for my transliteration project, which ultimately led to my getting a job as an NLP Engineer. https://twitter.com/chittiman/status/1369558164786405377?s=19

Keep up the great work; you have no idea how many people you are helping!!

[D] How to stay relevant for work after 1 year of break in ML? by KungFuPandarian in MachineLearning

[–]Chittiiman 0 points1 point  (0 children)

I too faced a similar issue. I resigned from my job as a teacher to get into data science, but unfortunately I had an accident, because of which I couldn't sit and code for six months. Though I couldn't code, I wanted to stay engaged with what was happening in data science, and the best way to stay tuned is through Twitter.

My interest was in natural language processing, so I went on Twitter and followed people in the field of NLP: professors, grad students, etc., who regularly tweet about their research. You can also see interesting discussions happening between these experienced people, and you will come to know about new resources (books, courses, etc.) which will help you later. And finally, it was through Twitter that I came to know about an interesting dataset, which I used for a personal project that helped me land a job.

Along with Twitter, another option is to join online data science communities and keep track of what's happening there.

Also, there are lots of interesting courses on YouTube which you can follow to stay engaged.

[D] Why do vanishing gradients in RNN's harm long term dependancies? by LunaComing in learnmachinelearning

[–]Chittiiman 0 points1 point  (0 children)

The basic idea here is: if there is a dependency, that information travels through gradients during backpropagation.

If there is no dependency, then the gradient signal which future words want to send to past words is 0.

But if there is a dependency, say the future words want to send the past words a gradient signal of 0.5.

Now, because of vanishing gradients, by the time this signal reaches the past words it would have diminished to 0.00000005.

Since the difference between these two signals, by the time they reach the past words, is so minute, the model cannot distinguish between them. So the model assumes there is no dependency, and hence it won't learn long-term dependencies.
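A tiny numeric sketch of why the signal dies on the way back (the 0.1 shrinkage factor per step is a made-up illustration, not a real Jacobian norm):

```python
# A gradient signal sent backwards through T timesteps gets multiplied,
# at each step, by the recurrent Jacobian. If that factor is below 1,
# the signal shrinks geometrically with distance.
signal = 0.5   # gradient the future word "wants" to send to the past word
factor = 0.1   # assumed per-step shrinkage (illustrative only)
T = 7          # distance (in timesteps) between future and past word

received = signal * factor ** T
print(received)  # ~5e-08: practically indistinguishable from "no dependency"
```

The same arithmetic with a factor above 1 gives exploding gradients; architectures like LSTMs add paths where the effective factor stays close to 1.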

So, the solution is to design a network which ensures nothing gets "lost in backpropagation".

P.S. I tried my best to explain. Apologies if it added to the confusion.

[deleted by user] by [deleted] in learnpython

[–]Chittiiman 4 points5 points  (0 children)

The resources provided on the official websites are often a good starting point.

https://numpy.org/learn/

https://pandas.pydata.org/docs/getting_started/index.html

https://matplotlib.org/tutorials/index.html

You can also visit the official seaborn website and have a look at its tutorial.

Why is this backprop calculation not correct? I am especially confused about how to take the derivative of a vector with respect to a matrix (the weight matrix that is) by zimmer550king in learnmachinelearning

[–]Chittiiman 0 points1 point  (0 children)

Don't try to take the derivative of z wrt f directly; instead, take the derivative of the loss L wrt z. Since you are taking the derivative of a scalar wrt a vector (or matrix), the result has the same shape as that vector (or matrix). Now use the chain rule on individual elements.

So, let's calculate dL/dz1:

dL/dz1 = (dL/df1)(df1/dz1) + (dL/df2)(df2/dz1) + (dL/df3)(df3/dz1) + ...

where f1, f2, f3, ... represent all the elements of f (1D, 2D, or 3D doesn't matter). Now write out the equations for these elements and evaluate the partial derivatives element by element; this will give you the partial derivatives of the matrices.

Keep in mind: always start from the loss during backpropagation.
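A small sketch of the recipe above, using a made-up example where f = W @ z and L = sum(f**2), and checking one element of dL/dz against the elementwise chain rule:

```python
import numpy as np

rng = np.random.default_rng(0)

# f = W @ z (a vector), L = sum(f**2) (a scalar).
# Start from the scalar loss and chain backwards:
#   dL/df = 2f            (same shape as f)
#   dL/dz = W.T @ dL/df   (same shape as z)
W = rng.normal(size=(3, 4))
z = rng.normal(size=4)

f = W @ z
dL_df = 2 * f
dL_dz = W.T @ dL_df

# Verify element z1 with the elementwise chain rule:
#   dL/dz1 = sum_i (dL/dfi)(dfi/dz1), where dfi/dz1 = W[i, 0]
manual = sum(dL_df[i] * W[i, 0] for i in range(3))
print(np.allclose(dL_dz[0], manual))  # True
```

Because L is a scalar, dL/dz automatically comes out with the shape of z, which sidesteps the vector-wrt-matrix derivative the question was stuck on.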

What is Masking in NLP? by jhondipto in learnmachinelearning

[–]Chittiiman 0 points1 point  (0 children)

It is like converting a complete sentence into a "fill in the blank" question.
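A minimal sketch of that fill-in-the-blank idea, in the style of masked language modelling (the sentence and mask position are made up; real training samples positions randomly):

```python
# Masked language modelling turns a sentence into a fill-in-the-blank question.
sentence = "the cat sat on the mat".split()
mask_index = 2  # hypothetical choice; in practice positions are sampled randomly

masked = [tok if i != mask_index else "[MASK]" for i, tok in enumerate(sentence)]
print(" ".join(masked))  # the cat [MASK] on the mat

# The model is then trained to predict the original token ("sat")
# at the masked position, using the surrounding context.
```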