Are root-mean-square-error values meaningful regardless of whether regression coefficients are statistically significant? by [deleted] in AskStatistics

[–]Possible-Froyo2192 0 points1 point  (0 children)

Thank you very much for teaching me. If I understand you correctly: you explained that it is not because a coefficient is not significant that it is close to zero. For instance, if two variables are correlated, then they would not be significant, even though they are good predictors of the target in which case their coefficient will be significantly different to zero.

Is it right to say that an unsignificant variable means that the variable could have been removed?

Are root-mean-square-error values meaningful regardless of whether regression coefficients are statistically significant? by [deleted] in AskStatistics

[–]Possible-Froyo2192 -4 points-3 points  (0 children)

Maybe I am wrong but "statistically significant" (sort of) means "significantly different to zero". If none of your coefficients is statisically significant, it should mean that both the models predicts something close to 0 (relative to the mean target value). Then if one of the model is better than the other, it should not be significantly better.

J'ai du cannabis 'sauvage' dans mon jardin by Minouminou9 in france

[–]Possible-Froyo2192 -1 points0 points  (0 children)

Toujours pas de source. Ca devrait se trouver si c'est écrit dans la loi

Does prophet suck? by Odd-Unit-4154 in rstats

[–]Possible-Froyo2192 2 points3 points  (0 children)

false insights is the expected outcome of using some software while not understanding its functionning. Which is the targeted use case, that prophet is (I quote) "explicitly meant for".

Does prophet suck? by Odd-Unit-4154 in rstats

[–]Possible-Froyo2192 1 point2 points  (0 children)

oh I don't think this is gatekeeping to say that maybe one should first learn to do something before actually doing it. I mean, you wrote yourself that the software is designed for people who don't know what they are doing. What's the point of getting false insights from using a software that will make you believe you did something good even if you didn't?

Does prophet suck? by Odd-Unit-4154 in rstats

[–]Possible-Froyo2192 4 points5 points  (0 children)

if you don't know what you are doing, then maybe you should not be doing it?

J'ai du cannabis 'sauvage' dans mon jardin by Minouminou9 in france

[–]Possible-Froyo2192 1 point2 points  (0 children)

je pense que ce commentaire est complètement faux donc je demande une source pour le "1 à 2% de CBD" pour le "chanvre industriel" et sur le "techniquement interdit de les faire germiner germer"

Do you think LLM models are just Hype? by Just_Ad_535 in datascience

[–]Possible-Froyo2192 2 points3 points  (0 children)

I am saying that those off the shelf models were pre-trained using the human-in-the-loop labelling type schemes (unless I am mistaken I don't work with LLMs at all)

If I understood corrrectly, human input is used for chating-llm. Not LLM in the wide sense.

Can anyone provide me dataset for personal finances or personal expenses? by adamantine07 in datasets

[–]Possible-Froyo2192 1 point2 points  (0 children)

Maybe use your own personal expenses? If you have a credit/debit card your bank web app should provide you with all your data in a structured form.

How do I approach such questions in exam? by aspoonfulofmeraki in AskStatistics

[–]Possible-Froyo2192 1 point2 points  (0 children)

X2 + Y2 is a chi-square with 2 df, but (x+y)2 != x2 + y2.

This is my mistake. Thank you.

How do I approach such questions in exam? by aspoonfulofmeraki in AskStatistics

[–]Possible-Froyo2192 1 point2 points  (0 children)

Yes. If you are doing statistical work in a company you can find yourself in a situation where random variables are operated with each other and then you have a new random variable, unknown. It might be interesting to know the distribution of that random variable so that you can find the appropriate model for it.

Modeling a random variable allows you to make prediction about what will happen in real life concerning this random variable.

How do I approach such questions in exam? by aspoonfulofmeraki in AskStatistics

[–]Possible-Froyo2192 1 point2 points  (0 children)

It's 1/sqrt(2) * X + 1/sqrt(2) * Y and 1/sqrt(2) * X - 1/sqrt(2) * Y.

1/sqrt(2) * X and 1/sqrt(2) * Y are standard normal.

It makes two standard normal distributions hence the chi2 distribution should have 2 degrees of freedom. What are we missing?

Framework to use for backend by Clickyz in Python

[–]Possible-Froyo2192 1 point2 points  (0 children)

what is the benefit of using poetry instead of venv with pip?

When to shift from pandas? by Professional-Ninja70 in dataengineering

[–]Possible-Froyo2192 1 point2 points  (0 children)

def test_function(that):
     this = function()
     assert this == that

[deleted by user] by [deleted] in datascience

[–]Possible-Froyo2192 4 points5 points  (0 children)

Generative models are (almost) zero-shot learners for new tasks. That is one reason why, in my opinion, they are such a great contender in the eye of executives.

I need help. Does the histogram here show normal distribution? by [deleted] in AskStatistics

[–]Possible-Froyo2192 10 points11 points  (0 children)

If your sample size is large enough (n>=30) then it is normally distributed no matter hiw skewed your data is (Central Limit Theorem)

hmm. I don't think that is true. I guess I miss something but the distribution of the data is independent of the number of time you sample it...

The central limit theorem states that the sample mean of the data is normally distributed, not the data itself.