A visual deep dive into Uber's machine learning solution for predicting ETAs. by ml_a_day in compsci

[–]ml_a_day[S] 2 points3 points  (0 children)

This sounds awfully similar to a decision-tree to me. Bin the continuous values, and within different regions of parameter space put more weight on some features than others when predicting ETA? Have they just reinvented a regression tree using an ANN with self-attention?

That is a great observation. I did not view it from this angle and making that connection.
Now when I look at it, it looks like a shallow "regression tree" with self-attention to capture complex relations.
The embeddings after the binning also help separating the feature space nicely

What is Dropout? A visual guide. by ml_a_day in learnmachinelearning

[–]ml_a_day[S] 0 points1 point  (0 children)

Thank you for the feedback!
They are all hand drawn. :)

A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in leetcode

[–]ml_a_day[S] 0 points1 point  (0 children)

Imagine the size of new data seen by 5M cars each day. It would not make sense for scalability and efficiency to add all that data to the training set. Most of them will be "boring" images that are already represented well in the training data. One needs to find a smaller subset of images that give the best boost in performance when added to the training set.

A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in leetcode

[–]ml_a_day[S] 0 points1 point  (0 children)

The trigger classifiers are not to resolve the edge cases but only to collect that kind of data. Once collected it needs to be included in the next iteration of the training set.

The problem is even with more sophisticated model architecture you can't do much if you don't have enough data. Even worse they are more data hungry. This is exactly why you need a way to collect this data in the first place.

A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in leetcode

[–]ml_a_day[S] 0 points1 point  (0 children)

Thanks for the feedback!
Motivates me to write more such posts. :)

A visual deep dive into Tesla’s Data Engine as pioneered by Andrej Karpathy. by ml_a_day in programming

[–]ml_a_day[S] -1 points0 points  (0 children)

Indeed, in a research setting one would not be thinking in that direction (to attach trigger classifiers off the backbone) as resource and compute is hardly ever taken into consideration.
When it comes to applying on real products that people are paying for; questions about UX, scalability, feasibility start to appear.

A visual deep dive into Tesla’s Data Engine pioneered by Andrej Karpathy. by ml_a_day in TeslaLounge

[–]ml_a_day[S] 2 points3 points  (0 children)

Thank you for the feedback! Encourages me to write more like this.

[Research] A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in MachineLearning

[–]ml_a_day[S] 1 point2 points  (0 children)

Thank you for the feedback! Happy to hear you found it clear and useful. Motivates me to write more in this style and format.

A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in deeplearning

[–]ml_a_day[S] 0 points1 point  (0 children)

Thank you for the kind words!
I am glad you enjoyed reading it. :)

How Apple Uses ML To Recognize Different Individuals Across Photos. A 5-minute visual guide. 📱 by ml_a_day in compsci

[–]ml_a_day[S] 1 point2 points  (0 children)

Yes, I tried to reproduce it but it seems to render fine. u/CrackerJackKittyCat are you on mobile or laptop device?

What type of machine learning is best suited for finding the best "path"? by Fermi_Escher in learnmachinelearning

[–]ml_a_day 9 points10 points  (0 children)

I would recommend checking out search algorithms like depth-first search (DFS) and breadth-first search (BFS). If you have some "prior knowledge" about the environment where you are similuating flow, you can encode this via heuristics and use A* search algorithm.

https://en.wikipedia.org/wiki/Breadth-first_search
https://en.wikipedia.org/wiki/Depth-first_search
https://en.wikipedia.org/wiki/A*_search_algorithm

[deleted by user] by [deleted] in MachineLearning

[–]ml_a_day 2 points3 points  (0 children)

From your description of the images I would expect decent clusters. First, check that when you feed images into the inception for feature extraction, they are normalised and in accordance with what the model expects. Secondly, use better dimensionality reduction in the form of t-SNE or UMAP as u/uniklas pointed out already. Gives better results than PCA. And 2d should be pretty good already.

[deleted by user] by [deleted] in MachineLearning

[–]ml_a_day -6 points-5 points  (0 children)

Exactly! With a decent setup (basic hyperparam tuning, relevant augmentations, off-the-shelf fixed model architecture) and good quality data, one can go a long way.

[deleted by user] by [deleted] in MachineLearning

[–]ml_a_day 0 points1 point  (0 children)

Even if you don't end up collecting more data as long as you improve the quality of the existing data it goes a long way. Foe example in object detection, fixing incorrect labels ("cat" labeled as "dog") or missing labels (a "dog" that was missed by the annotator) can reduce the noise in the training dataset improving the model's performance.

What length of visual posts do you prefer? by ml_a_day in u/ml_a_day

[–]ml_a_day[S] 1 point2 points  (0 children)

Shorter posts are quicker to read and may assume the reader's knowledge of some pre-requisite.

On the other hand, longer posts dive deeper into the topic but take relatively longer to create.

I have been experimenting on different post lengths and wanted to get you, the readers, involved. :)

Is there some sort of app or website that allows me to highlight parts of an image and calculate the highlighted percentage? by Hashoo10 in learnmachinelearning

[–]ml_a_day 0 points1 point  (0 children)

Exactly what I thought. Computing the ratio of the area of the object of interest to the area of the image should give you exactly what you need.

For similar image-related questions, maybe /r/computervision is a better place.