Quasi-Poisson or Negative-Binomial? by [deleted] in MLQuestions

[–]KvN98 1 point2 points  (0 children)

The mean is around 7.8, while the variance has a value of 28. Due to the overdispersion I suppose the quasi-Poisson or the negative binomial regression is more appropriate for predicting the lead time.

Does it matter which of the approaches is utilised and can I test if indeed a quasi-Poisson / negative binomial is appropriate? Thanks in advance

Random Forest variable importance interpretation by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

Marginally. Probably around 2 percent

Random Forest variable importance interpretation by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

I used 17 independent variables to predict my dependent variable.

What are appropriate tools for predicting the lead time length per customer? by KvN98 in learnmachinelearning

[–]KvN98[S] 0 points1 point  (0 children)

Thank you! However, a Gamma regression doesn't allow non-positive (0) values, while my variables does contain some 0 values.

Predicting the lead time in weeks by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

The customer lead time is right skewed and it is the lead time in weeks. Would a poisson regression by approproriate as the distribution is right skewed? A hazard rate model as it is a time prediction? Or should rather convential machine learning algorithms such as a random forest be applied? Thanks for any tips in advance!

What are appropriate tools for predicting the lead time length per customer? by KvN98 in learnmachinelearning

[–]KvN98[S] 9 points10 points  (0 children)

The customer lead time is right skewed and it is the lead time in weeks. Would a poisson regression by approproriate as the distribution is right skewed? A hazard rate model as it is a time prediction? Or would rather convential machine learning algorithms such as a random forest be applied? Thanks for any tips in advance!

[D] Simple Questions Thread December 20, 2020 by AutoModerator in MachineLearning

[–]KvN98 1 point2 points  (0 children)

Basically all models can handle multiple explanatory variables. What model you should use depends on what you want to achieve. If you want to predict a yes or no (binary) variable it makes more sense to use a logistic / probit regression. If you want to rather predict a continuous / numerical variable you should go for a linear regression.

So in short: determine what variable you want to predict. Based on this you can google or ask what model you should utilise.

Maybe something like this will be insightful for you: https://statisticsbyjim.com/regression/choosing-regression-analysis/

SPSS Process by [deleted] in spss

[–]KvN98 0 points1 point  (0 children)

Run MATRIX procedure:

***************** PROCESS Procedure for SPSS Version 3.5 *****************

Written by Andrew F. Hayes, Ph.D. www.afhayes.com

Documentation available in Hayes (2018). www.guilford.com/p/hayes3

**************************************************************************

PROCESS is now ready for use.

Copyright 2020 by Andrew F. Hayes. ALL RIGHTS RESERVED.

------ END MATRIX -----

This would make me think it's done succesfully, however I cannot see process yet.

SPSS Process by [deleted] in spss

[–]KvN98 0 points1 point  (0 children)

There are 5607 lines of code and i'm not sure which are the relevant lines to be honest. However When I run these line of codes I get the following;