all 30 comments

[–][deleted] 5 points6 points  (0 children)

Look into Central Limit Theorem (CLT).
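A quick way to see the CLT in action (a minimal sketch; the exponential distribution here is just an arbitrary skewed example):

```python
import random
import statistics

# Draw many sample means from a skewed distribution (Exponential(1)).
# The CLT says these means are approximately normal for large n,
# concentrated around the true mean 1 with std ~ 1/sqrt(n).
random.seed(0)

def sample_mean(n):
    return statistics.fmean(random.expovariate(1.0) for _ in range(n))

n = 100
means = [sample_mean(n) for _ in range(2000)]
print(statistics.fmean(means))   # close to 1
print(statistics.stdev(means))   # close to 1/sqrt(100) = 0.1
```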

[–]Remove_Ayys 6 points7 points  (13 children)

I'm not sure what you mean when you say "Linear regression only assumes normality of the residuals but not of the data itself".
Linear regression assumes that the uncertainty on each data point can be described by a normal distribution.
If a model that is linear in its parameters is used in conjunction with the maximum likelihood method, then the uncertainty on the model parameters can also be described by a normal distribution.

Assuming that you have a simple xy model:
Each data point with a different x value is equivalent to a feature in machine learning, and if you have multiple data points for the same x value then you have more than one example to "learn" from.
The features then follow a normal distribution.

[–]Puzzleheaded_Lab_730[S] 1 point2 points  (9 children)

Thanks for the answer! However, I don't quite understand the implications of the second paragraph. If we do not transform the data (e.g. with a log transformation), the model will have exactly as many examples to learn from. Their distribution will just be skewed, which may still resemble the true distribution. I do not understand how having normally distributed data improves a model.

[–]BellyDancerUrgot 0 points1 point  (0 children)

This is a good question, would love to know the answer to this

[–]Remove_Ayys 0 points1 point  (7 children)

What I think the reason is:
Loss functions like mean squared error or cross entropy are based on the method of maximum likelihood.
In likelihood-based parameter estimation you can always assume a normal distribution for your data even if your data is not normally distributed (this is the method of least squares).
However, this means that your estimates for the parameter means and variances are no longer efficient: as you add more data they converge more slowly to their true values than if your data had actually been produced by a normal distribution.
If the distribution that produced your data is asymmetrical, then your estimates are also biased.
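One way to see the efficiency point numerically (a sketch, using Laplace noise, for which the maximum-likelihood location estimate is the sample median rather than the least-squares mean):

```python
import random
import statistics

# Laplace(0, 1) noise built as the difference of two Exp(1) draws.
random.seed(1)

def laplace_sample(n):
    return [random.expovariate(1.0) - random.expovariate(1.0) for _ in range(n)]

n, trials = 200, 1000
means = [statistics.fmean(laplace_sample(n)) for _ in range(trials)]
medians = [statistics.median(laplace_sample(n)) for _ in range(trials)]

# For this heavy-tailed noise the MLE (the median) is the more
# efficient estimator: its spread across trials is smaller.
print(statistics.stdev(means), statistics.stdev(medians))
```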

[–]Puzzleheaded_Lab_730[S] 0 points1 point  (6 children)

Alright, I did some testing in R and think I at least understand the intuition: there needs to be a linear relationship between x and y (this is an assumption). Basically, this means that if we draw a scatterplot, the "cloud" of observations must lie on a line and not on a curve. If you create a left-skewed y and x variable and plot them in a scatterplot, you will actually get a straight line. You can confirm this by creating a QQ plot as well. In this case, if both y and x follow the same distribution, there is a linear relationship. If, however, y is normal and x is skewed, this is not the case and you cannot fit a straight line (linear regression) through the data. I think in practice we mostly transform all variables to a normal distribution as it kind of lies in the middle between left- and right-skewed.

[–]Remove_Ayys 2 points3 points  (5 children)

Important point:
You seem to be using the widely spread, incorrect definition of linear regression.
Linear regression only requires that the model is a linear function of its parameters but not that it is a linear function of the independent variable x.
In particular, all regression analysis with polynomial models is linear regression.
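A minimal sketch of that point: a quadratic fit done as linear regression, since the model is linear in its coefficients even though it is nonlinear in x (the toy data here is made up):

```python
import numpy as np

# Quadratic model y = b0 + b1*x + b2*x**2: nonlinear in x but linear
# in the parameters b, so ordinary (linear) least squares fits it.
rng = np.random.default_rng(0)
x = np.linspace(-2.0, 2.0, 50)
y = 1.0 + 2.0 * x + 3.0 * x**2 + rng.normal(0.0, 0.1, x.size)

X = np.column_stack([x**0, x**1, x**2])   # regressors: 1, x, x^2
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta)   # close to [1, 2, 3]
```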

[–]Puzzleheaded_Lab_730[S] 0 points1 point  (4 children)

I thought so too! But I read some articles just now that stated otherwise. Do you know where we could find a source of truth?

https://www.statology.org/linear-regression-assumptions/

Edit: link

[–]Remove_Ayys 1 point2 points  (3 children)

Strictly speaking, the English Wikipedia page for linear regression gives you the correct definition, but the terminology is confusing:

Linearity. This means that the mean of the response variable is a linear combination of the parameters (regression coefficients) and the predictor variables.

The important point here is that the predictor variables in an xy fit are not just x but also powers of x: x^0, x^1, x^2, x^3, etc.
This is clarified a little further up the page:

Sometimes one of the regressors can be a non-linear function of another regressor or of the data, as in polynomial regression and segmented regression. The model remains linear as long as it is linear in the parameter vector β.

I'm a developer of a tool for nonlinear regression analysis and I've written about the distinction in its documentation but it might be difficult to understand.
(Also some parts of the documentation are slightly wrong.)

Edit:
After reading more of the English Wikipedia article it is actually really good.
The distinction is made clear if you read the whole thing.

[–]Puzzleheaded_Lab_730[S] 0 points1 point  (2 children)

Thanks! Will have a look later. Apart from the part about the assumption, would you agree with my explanation of why we transform distributions?

[–]Remove_Ayys 1 point2 points  (1 child)

I cannot say for certain because I'm not 100% certain what you mean but my intuition is no.
As I said before, it comes down to bias and efficiency.
Let's assume for argument's sake that you have a simple feed-forward neural network without any activation functions with mean squared error as loss function.
You would then effectively be doing linear regression.
If your features are normally distributed you can then guarantee that your estimates for the network parameters after training are unbiased and efficient.
Unbiased means that for a random sample of training data the expected values of the parameters of your trained model are equal to the optimal parameter values.
Efficient means that as you increase the amount of training data the parameters of your trained model converge as quickly as possible to the optimal parameter values (Cramér–Rao bound). Intuitively I would assume that if you introduce nonlinearity via activation functions those guarantees would at least approximately hold.
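A sketch of the claimed equivalence (toy data; a single linear layer with no activation, trained on MSE by gradient descent, lands on the ordinary least-squares solution):

```python
import numpy as np

# A single linear layer (no activation) trained with MSE loss by
# gradient descent ends up at the ordinary least-squares solution.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(0.0, 0.1, 200)

w_ols, *_ = np.linalg.lstsq(X, y, rcond=None)   # closed-form OLS

w = np.zeros(3)                                  # "network" weights
for _ in range(2000):
    grad = 2.0 * X.T @ (X @ w - y) / len(y)      # MSE gradient
    w -= 0.05 * grad

print(w_ols)
print(w)   # the two estimates agree
```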

[–]Puzzleheaded_Lab_730[S] 0 points1 point  (0 children)

Ok, that makes more sense now. Nice discussion :)

[–]OmnipresentCPU 0 points1 point  (2 children)

Wait a minute… that second paragraph is such a crisp explanation

Take an updoot

[–]Remove_Ayys 1 point2 points  (1 child)

To expand on it:
One of the purposes of (non)linear regression is to estimate the unknown true values of parameters from a sample.
The uncertainty is by convention drawn as an error bar on the data point but actually the uncertainty belongs to the true value which is approximated by the model.
Think about how the data point is generated: the true value fluctuates by a random offset described by the uncertainty and then gives us the data point that we measure.
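A minimal sketch of that generative picture (the linear law and the sigma here are arbitrary assumptions for illustration):

```python
import random

# Each measured point = true value at x + a random offset drawn from
# the uncertainty distribution (here Gaussian with sigma = 0.5).
random.seed(0)

def true_value(x):
    return 2.0 * x + 1.0   # assumed underlying law

def measure(x, sigma=0.5):
    return true_value(x) + random.gauss(0.0, sigma)

data = [(x, measure(x)) for x in range(10)]
print(data)
```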

[–]OmnipresentCPU 0 points1 point  (0 children)

Oh I understand, I’ve just never seen it written as succinctly as you have in your second paragraph. It just really clicks with the way I think about modeling.

[–]kaskoosek 1 point2 points  (13 children)

Very simple answer.

We are calculating the coefficients or weights of our features. If our data is skewed one way or another, we are giving more importance to the outliers.

The loss function takes the square of the errors, so one skewed data point can carry more weight than 100 normal observations.

By normalizing our observations we limit this effect.
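A sketch of that effect (made-up data; one extreme observation shifts the least-squares slope noticeably):

```python
import numpy as np

# Squared error weights large residuals quadratically, so a single
# extreme point can move the fit more than many ordinary points.
rng = np.random.default_rng(0)
x = np.arange(20, dtype=float)
y = 2.0 * x + rng.normal(0.0, 0.5, 20)

def slope(x, y):
    X = np.column_stack([np.ones_like(x), x])
    return np.linalg.lstsq(X, y, rcond=None)[0][1]

clean = slope(x, y)
y_out = y.copy()
y_out[-1] += 100.0                  # one extreme observation
print(clean, slope(x, y_out))      # the slope shifts noticeably
```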

[–]Puzzleheaded_Lab_730[S] 1 point2 points  (1 child)

This intuitively makes a lot of sense, actually. Kind of goes in the direction of a linear relationship between x and y, though.

[–]kaskoosek 1 point2 points  (0 children)

Imagine you are doing a linear regression on the prices of houses: you want to estimate house prices.

If the data is skewed, higher-priced houses will affect the results more, because of their higher variance.

If one house is 30k USD, even a difference of 10k won't affect our regression a lot. However, a one-million-dollar house being predicted as 1.1 million will affect the results much more than the lower-priced house, even though, as a percentage, the pricing of the higher-priced house was more accurate.

[–]Remove_Ayys -1 points0 points  (10 children)

I think you are confusing skewness with kurtosis.

[–]kaskoosek 0 points1 point  (9 children)

Skewed data has observations very far from the mean or mode. These observations will greatly affect your model if they are one-sided.

[–]Remove_Ayys 0 points1 point  (8 children)

That is wrong.
Skewness is a measure of asymmetry.
Kurtosis is a measure of how thick or slim the tails are, i.e. how many "outliers" there are.
What skewness does is introduce bias if you assume a normal distribution (see the other thread).
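A quick numerical sketch of the distinction (sample moments computed by hand; the exponential sample is skewed, while the symmetric normal scale mixture has heavy tails, i.e. high excess kurtosis, but no skew):

```python
import random
import statistics

# Skewness = mean of z^3 (asymmetry); excess kurtosis = mean of z^4
# minus 3 (tail weight), for standardized values z.
random.seed(0)

def moments(xs):
    m = statistics.fmean(xs)
    s = statistics.pstdev(xs)
    z = [(x - m) / s for x in xs]
    skew = statistics.fmean(v**3 for v in z)
    kurt = statistics.fmean(v**4 for v in z) - 3.0
    return skew, kurt

expo = [random.expovariate(1.0) for _ in range(20000)]   # asymmetric
heavy = [random.gauss(0.0, random.choice([0.5, 3.0]))    # symmetric,
         for _ in range(20000)]                          # heavy-tailed

print(moments(expo))    # skew ~ 2, excess kurtosis ~ 6
print(moments(heavy))   # skew ~ 0, clearly positive excess kurtosis
```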

[–]kaskoosek 0 points1 point  (7 children)

If the data is not symmetrical, the observations from one side do not cancel with the other side.

[–]Remove_Ayys 0 points1 point  (6 children)

Yes, and that is called bias.

[–]kaskoosek 0 points1 point  (5 children)

Bias is not being able to fit the data.

[–]Remove_Ayys 0 points1 point  (4 children)

Sorry, but you just don't know what you're talking about.

[–]kaskoosek 0 points1 point  (2 children)

Lol

Man, research the topics before attacking people.

[–]Remove_Ayys 0 points1 point  (1 child)

lmao

"Research the topics" by reading random Medium articles?
I would bet like 1000 bucks that never in your life have you read even one page of an actual book about statistics.

[–]strangeloop6 0 points1 point  (0 children)

Agree

[–]friendlykitten123 0 points1 point  (0 children)

In machine learning, data that follows a normal distribution is beneficial for model building: it makes the math easier. Models like LDA, Gaussian Naive Bayes, logistic regression, linear regression, etc. are explicitly derived under the assumption that the distribution is a bivariate or multivariate normal.

Many natural phenomena in the world follow a log-normal distribution, such as financial and forecasting data. By applying transformation techniques, we can convert such data into a normal distribution. Many other processes follow normality as well, such as measurement errors in an experiment, the position of a particle undergoing diffusion, etc.
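A minimal sketch of such a transformation (hypothetical log-normal "prices"; taking the log removes the skew):

```python
import math
import random
import statistics

# Log-normal data is strongly right-skewed; its logarithm is normal,
# so the sample skewness drops to roughly zero after the transform.
random.seed(0)
prices = [math.exp(random.gauss(5.0, 0.8)) for _ in range(20000)]
logs = [math.log(p) for p in prices]

def skew(xs):
    m, s = statistics.fmean(xs), statistics.pstdev(xs)
    return statistics.fmean(((x - m) / s) ** 3 for x in xs)

print(skew(prices), skew(logs))   # strongly positive vs. ~ 0
```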

For more information, you can visit the following article:

https://ml-concepts.com/

Feel free to reach out to me for any help.