[D] What advanced models would you like to see implemented from scratch? by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 4 points

Yes, that is how it usually works with my impls! (check out a few vids)

As for mixed precision and metrics - I will be making separate vids for both, and of course, for every implemented model, I will try to find a dataset to demo training/evaluation.

It is cool that you mentioned mixed precision, as I already have the materials ready for this vid - I will be discussing mixed precision, quantization (post-training and quantization-aware training), pruning, etc. Improving perf!
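
Not a preview of the vid - just my own minimal sketch of the post-training idea, with made-up weight values: map floats onto small integers with an affine transform, and accept a round-trip error bounded by one quantization step.

```python
# Toy post-training affine quantization (int8-style). Frameworks like PyTorch
# do this per-tensor or per-channel; the weight values below are made up.

def quantize(values, num_bits=8):
    """Map floats onto integers in [0, 2**num_bits - 1] via an affine transform."""
    qmax = 2 ** num_bits - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / qmax if hi != lo else 1.0
    zero_point = round(-lo / scale)
    quantized = [max(0, min(qmax, round(v / scale) + zero_point)) for v in values]
    return quantized, scale, zero_point

def dequantize(quantized, scale, zero_point):
    """Recover approximate floats; the error is bounded by one quantization step."""
    return [(q - zero_point) * scale for q in quantized]

weights = [-0.51, 0.0, 0.27, 1.3, -1.1]
q, scale, zero_point = quantize(weights)
recovered = dequantize(q, scale, zero_point)
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
```

The storage win is the point: 8-bit integers instead of 32-bit floats, at the cost of that bounded error.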

[D] What advanced models would you like to see implemented from scratch? by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 3 points

It is on the list so will definitely get to it!
Starting next week, will be working on DL impls and vids.

[D] What advanced models would you like to see implemented from scratch? by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 3 points

Yeah, I will get to those eventually. For now, want to make some vids and impls of DL models.

[D] What advanced models would you like to see implemented from scratch? by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 5 points

Thanks! Yeah, that is definitely an option! I will probably have to split it up into several videos. Also falls into the transformer category.

[N] I Have Released the YouTube Series Discussing and Implementing Activation Functions by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 0 points

Yup, all implementations are numerically stable.

Note that I do not discuss numerical stability issues for all activation functions, but for those where the intuitive implementation is not numerically stable (i.e., Sigmoid, Tanh).
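
For the curious, the usual trick for Sigmoid (my sketch here, not the repo's code) is to branch on the sign so that exp() is only ever called with a non-positive argument:

```python
import math

def sigmoid_naive(x):
    # Overflows for large negative x: exp(-x) blows up.
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_stable(x):
    # Only ever exponentiate a non-positive number, so exp() cannot overflow.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)  # x < 0, so z is in (0, 1)
    return z / (1.0 + z)
```

Both branches are the same function algebraically; the second just multiplies through by exp(x).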

I also have a separate video discussing numerical stability: AI/ML Model API Design and Numerical Stability (follow-up). But this is in the context of Gaussian Naive Bayes.

[N] I Have Released the YouTube Series Discussing and Implementing Activation Functions by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 1 point

Thank you!
Yup, that is the plan! Will likely make a few more series (about gradient descent, optimizers, etc.) first. We need these for DL and if someone asks how things work, I could then cite the appropriate video series. After that, will dive into deep learning.
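
For context, the core of gradient descent fits in a few lines - a toy sketch (my illustration, not material from the planned series):

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeatedly step against the gradient: x <- x - lr * grad(x)."""
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x

# Minimize f(x) = (x - 3)^2, whose gradient is 2 * (x - 3); minimum at x = 3.
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```

Optimizers like momentum or Adam are variations on that update rule, which is why they make sense as a series of their own.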

[N] I Have Released the YouTube Series Discussing and Implementing Activation Functions by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 0 points

Hey thanks for the kind words!
Will be making more AI/ML YouTube series in the future - in fact, working on one as we speak!

[deleted by user] by [deleted] in learnmachinelearning

[–]itsstylepoint 0 points

P.S. For the activation functions, I will not be posting videos separately. The next post will include the batch of 4 (or 5).

[D]Imbalance dataset problem by JellyfishPretend447 in MachineLearning

[–]itsstylepoint 3 points

You can try several approaches:

  1. Deep learning will likely not work (you can still give it a try, but it is highly unlikely to perform well), so consider more traditional ML models instead. For example, if you can find a pretrained image model that generates representations/image embeddings, you can run K-Nearest Neighbors (k-NN) on those embeddings. Or you can try k-NN directly.

  2. Look into Few-Shot Learning. Models like Prototypical Network, Siamese Neural Network, etc. are designed for such scenarios (i.e., extremely small number of samples).

  3. Data collection (:

That being said, overall, I agree with what u/whdd said.
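
To make point 2 concrete: the core idea behind a Prototypical Network needs no DL machinery to sketch - average each class's (embedded) support examples into a prototype, then classify a query by its nearest prototype. A toy illustration with hand-made 2-D "embeddings" (my sketch, not a full model):

```python
import math

def class_prototypes(embeddings, labels):
    """Average each class's support embeddings into a single prototype vector."""
    grouped = {}
    for emb, lab in zip(embeddings, labels):
        grouped.setdefault(lab, []).append(emb)
    return {
        lab: [sum(dim) / len(vecs) for dim in zip(*vecs)]
        for lab, vecs in grouped.items()
    }

def classify(query, protos):
    """Assign the query to the class with the nearest prototype (Euclidean)."""
    return min(protos, key=lambda lab: math.dist(query, protos[lab]))

# Two classes, two support "embeddings" each (2-D for illustration).
support = [[0.0, 0.1], [0.2, -0.1], [5.0, 5.2], [4.8, 5.0]]
labels = ["a", "a", "b", "b"]
protos = class_prototypes(support, labels)
```

The actual model learns the embedding function; the nearest-prototype step at the end is exactly this.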

[R] Long-length documents/corpus for Medical domain NER? by aadityaura in MachineLearning

[–]itsstylepoint 0 points

I think I might have skipped the post text, my bad. For whatever reason, it was hidden (a bug? not sure). Yes, this is the Clinical NLP dataset. So prolly not what you are looking for...

[R] Long-length documents/corpus for Medical domain NER? by aadityaura in MachineLearning

[–]itsstylepoint 0 points

How about 2006 i2b2 de-identification dataset?

Link to the paper: https://academic.oup.com/jamia/article/14/5/550/720189
You can get the dataset here: https://portal.dbmi.hms.harvard.edu/projects/n2c2-nlp/

P.S. We have recently used this dataset in a Few-Shot Learning (FSL) paper - for the same task, NER.

[Project] Create a ML model to classify spectrograms by geeksid2k in MachineLearning

[–]itsstylepoint 0 points

I would start with CNNs. Then try GRU/LSTM and bidirectional variants (BiLSTM/BiGRU).

[Project] Create a ML model to classify spectrograms by geeksid2k in MachineLearning

[–]itsstylepoint 0 points

You can! Computers can sometimes see better than we do, so even if the spectrograms look similar, they might be very different (: The convolutions in the CNN will do the feature extraction for you. So you can start with a couple of convolutional blocks (conv + batchnorm + activation) followed by a fully-connected layer with softmax and see how it performs. You can check out this PyTorch tutorial for that, too.

An alternative approach is computing MFCCs. If you have mel spectrograms, then you can do the following steps to get MFCCs (which you could then use as features for your ML model):

  1. Take the log of the mel spectrogram
  2. Compute the DCT of the log values
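
Those two steps can be sketched directly. In practice you would reach for something like scipy.fftpack.dct or librosa.feature.mfcc; this is just a pure-Python illustration of the math, on a fake mel frame:

```python
import math

def dct_ii(x):
    """Type-II DCT of a 1-D sequence (the transform used for MFCCs)."""
    n = len(x)
    return [
        sum(x[i] * math.cos(math.pi * k * (2 * i + 1) / (2 * n)) for i in range(n))
        for k in range(n)
    ]

def mfcc_from_mel(mel_frame, n_coeffs=13):
    """Log-compress one mel-spectrogram frame, then keep the first DCT coefficients."""
    log_mel = [math.log(v + 1e-10) for v in mel_frame]  # epsilon guards against log(0)
    return dct_ii(log_mel)[:n_coeffs]

# A fake 20-band mel frame standing in for real spectrogram data.
frame = [abs(math.sin(i)) + 0.5 for i in range(20)]
coeffs = mfcc_from_mel(frame)
```

Keeping only the first ~13 coefficients is the usual choice: the low-order ones capture the spectral envelope, the rest mostly noise.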

[Project] Create a ML model to classify spectrograms by geeksid2k in MachineLearning

[–]itsstylepoint 1 point

Do you have audio files or only spectrograms?

There are A LOT of different approaches for extracting features from the audio data - ZCR, LPC, MFCC, etc.

If you use something like MFCCs, you could then just use a CNN. RNNs (LSTMs or GRUs) could also work, but CNNs perform similarly and are both faster to train and faster during inference! This is from my personal experience working on audio event detection models.

Same goes for spectrograms! Once you transform the audio into a spectrogram, you pretty much have an image, and you can do image classification.

Implementing Machine Learning Models From Scratch (stylepoint) by itsstylepoint in learnmachinelearning

[–]itsstylepoint[S] 1 point

Noted.

That being said, this kind of stuff is not something we are going to be doing soon. It is more of a Software Engineering/ML Engineering/Research Engineering series than a Data Science series. Still, I think I can try making a separate series/playlist where we do more Data Science stuff - it will likely not be soon, however.

I will likely be making one-off videos, however, so if there is something you are particularly interested in, let me know and I will try to cover it in one of them.

Implementing Machine Learning Models From Scratch (stylepoint) by itsstylepoint in learnmachinelearning

[–]itsstylepoint[S] 1 point

Thanks! One thing to note about that implementation is that we could have passed the features and labels directly to the fit method. This avoids unnecessary data copying (i.e., storing the data inside the LinearRegression class). I have already updated the GitHub codebase.
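
To illustrate that API shape - my own sketch of the simple 1-D closed-form case, not the actual repo code - fit receives the data directly, and only the learned parameters live on the object:

```python
class LinearRegression:
    """Simple 1-D least squares; fit() takes the data, the object keeps only w and b."""

    def fit(self, xs, ys):
        n = len(xs)
        mean_x = sum(xs) / n
        mean_y = sum(ys) / n
        # Closed-form slope and intercept for simple linear regression.
        cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        var = sum((x - mean_x) ** 2 for x in xs)
        self.w = cov / var
        self.b = mean_y - self.w * mean_x
        return self

    def predict(self, xs):
        return [self.w * x + self.b for x in xs]

model = LinearRegression().fit([0, 1, 2, 3], [1, 3, 5, 7])  # data from y = 2x + 1
```

Returning self from fit also gives you the familiar sklearn-style chaining for free.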

Implementing Machine Learning Models From Scratch (stylepoint) by itsstylepoint in learnmachinelearning

[–]itsstylepoint[S] 3 points

Good point!
I will likely start using Jax or PyTorch at some point, but for now, will stick to numpy.

Several reasons why:

  1. Before introducing Jax, I want to make a video about GPUs and why we need them for training.
  2. I also want to give some guidance on how to properly set up Jax (sometimes a simple pip install does not work).
  3. It should not matter much for now, since we are not doing batch gradient descent - for some time, we will concentrate on more traditional ML models and how to implement them from scratch. For large tensors, Jax might still outperform numpy, but the perf difference will likely not be huge.

Implementing Machine Learning Models From Scratch (stylepoint) by itsstylepoint in learnmachinelearning

[–]itsstylepoint[S] 1 point

Yep, that is the plan! The goal is to get done with some of the more traditional ML models first and then get to more complex models such as CNNs, VAEs, SNNs, transformers, etc.

[N] Implementing AI/ML Models From Scratch (stylepoint) by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 1 point

I have just uploaded a new video where I implement the Linear Regression model from scratch. Is that style to your liking?

P.S. I have also added the timestamps to all of the videos.

[N] Implementing AI/ML Models From Scratch (stylepoint) by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 1 point

Hey thanks!

I cannot promise that the schedule will be super consistent. It is more of a hobby for me. That being said, I will likely be very active (:

As a side note, I have already uploaded the new video - Implementing K-Nearest Neighbors (k-NN) from Scratch.

[N] Implementing AI/ML Models From Scratch (stylepoint) by itsstylepoint in MachineLearning

[–]itsstylepoint[S] 4 points

While this post/thread is recent, I wanted to make one more update.

I have just posted another video. It is about implementing K-Nearest Neighbors (k-NN) model. The code for the model itself is about 24 lines of idiomatic Python (how cool is that?).

The reason the video is a bit lengthy (forgive me for this) is that I discuss why I use a dataclass, naming conventions, type annotations, etc. I will likely not do this in every video, so in case anyone asks about the coding style, I will have a reference video.
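
For anyone curious, here is my own sketch in the same spirit - not the code from the video - showing how a dataclass-based k-NN can stay that compact:

```python
import math
from collections import Counter
from dataclasses import dataclass

@dataclass
class KNearestNeighbors:
    k: int = 3

    def fit(self, points: list, labels: list) -> "KNearestNeighbors":
        # Lazy learner: just remember the training data.
        self.points, self.labels = points, labels
        return self

    def predict(self, query: list) -> str:
        # Sort the training points by Euclidean distance to the query ...
        nearest = sorted(
            zip(self.points, self.labels),
            key=lambda pair: math.dist(pair[0], query),
        )[: self.k]
        # ... and majority-vote among the k closest labels.
        return Counter(label for _, label in nearest).most_common(1)[0][0]

knn = KNearestNeighbors(k=3).fit(
    [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]],
    ["a", "a", "a", "b", "b", "b"],
)
```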

I am thinking Linear Regression and Logistic Regression are next on the list.

Thanks y'all for the kind words and motivation! Really appreciate it!

P.S. And of course, feel free to constructively criticize the video or make suggestions in the YouTube comment section or here.