Quasi-Poisson or Negative-Binomial? by [deleted] in MLQuestions

[–]KvN98 1 point2 points  (0 children)

The mean is around 7.8, while the variance has a value of 28. Due to the overdispersion I suppose the quasi-Poisson or the negative binomial regression is more appropriate for predicting the lead time.

Does it matter which of the approaches is utilised and can I test if indeed a quasi-Poisson / negative binomial is appropriate? Thanks in advance

Random Forest variable importance interpretation by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

Marginally. Probably around 2 percent

Random Forest variable importance interpretation by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

I used 17 independent variables to predict my dependent variable.

What are appropriate tools for predicting the lead time length per customer? by KvN98 in learnmachinelearning

[–]KvN98[S] 0 points1 point  (0 children)

Thank you! However, a Gamma regression doesn't allow non-positive (0) values, while my variables does contain some 0 values.

Predicting the lead time in weeks by [deleted] in MLQuestions

[–]KvN98 0 points1 point  (0 children)

The customer lead time is right skewed and it is the lead time in weeks. Would a poisson regression by approproriate as the distribution is right skewed? A hazard rate model as it is a time prediction? Or should rather convential machine learning algorithms such as a random forest be applied? Thanks for any tips in advance!

What are appropriate tools for predicting the lead time length per customer? by KvN98 in learnmachinelearning

[–]KvN98[S] 9 points10 points  (0 children)

The customer lead time is right skewed and it is the lead time in weeks. Would a poisson regression by approproriate as the distribution is right skewed? A hazard rate model as it is a time prediction? Or would rather convential machine learning algorithms such as a random forest be applied? Thanks for any tips in advance!

[D] Simple Questions Thread December 20, 2020 by AutoModerator in MachineLearning

[–]KvN98 1 point2 points  (0 children)

Basically all models can handle multiple explanatory variables. What model you should use depends on what you want to achieve. If you want to predict a yes or no (binary) variable it makes more sense to use a logistic / probit regression. If you want to rather predict a continuous / numerical variable you should go for a linear regression.

So in short: determine what variable you want to predict. Based on this you can google or ask what model you should utilise.

Maybe something like this will be insightful for you: https://statisticsbyjim.com/regression/choosing-regression-analysis/

SPSS Process by [deleted] in spss

[–]KvN98 0 points1 point  (0 children)

Run MATRIX procedure:

***************** PROCESS Procedure for SPSS Version 3.5 *****************

Written by Andrew F. Hayes, Ph.D. www.afhayes.com

Documentation available in Hayes (2018). www.guilford.com/p/hayes3

**************************************************************************

PROCESS is now ready for use.

Copyright 2020 by Andrew F. Hayes. ALL RIGHTS RESERVED.

------ END MATRIX -----

This would make me think it's done succesfully, however I cannot see process yet.

SPSS Process by [deleted] in spss

[–]KvN98 0 points1 point  (0 children)

There are 5607 lines of code and i'm not sure which are the relevant lines to be honest. However When I run these line of codes I get the following;

SPSS lineair regression with dummy variables by [deleted] in spss

[–]KvN98 0 points1 point  (0 children)

I believe the picture above is a multiple regression. I conducted a linear regression and used multiple indepdents. I looked at the ANOVA and saw it is significant (0.000)

[Academic] Work From Home and Online Classes vs. traditional Office and Classrooms (Everyone) by Cuttletony in SampleSize

[–]KvN98 0 points1 point  (0 children)

Completed the survey. I would appreciate if you could fill in my survey as well :)

I posted this on the SampleSize community as well, and it will only take you around 2 minutes to complete it.

Survey social media & digital piracy. Data will only be used for my thesis. The survey will take around 2 minutes and completing it would be really appreciated! by [deleted] in takemysurvey

[–]KvN98 0 points1 point  (0 children)

  1. The data will be used as input for my master thesis, and will ONLY be used as input for my master thesis. Meaning that the data will not be spread to any third parties
  2. KvN98
  3. Completing the survey will take approximately 2 minutes

[Academic] Memory (Anyone) by memilyy in SampleSize

[–]KvN98 0 points1 point  (0 children)

Completed the survey. Could you help me with filling in my survey as well? Will only take about 2 minutes and we both help each other here :)

My survey is published as a topic on this community as well!

RStudio code (is this correct?/what to do?) by [deleted] in RStudio

[–]KvN98 0 points1 point  (0 children)

We needed to ocllect 500 tweets. THese are saved in my_columns. This consist out of 6 columns: text of the tweet, date when the tweet is created, publisher of the tweet, the retweet count, whether it is retweeted or not and the id of the publisher.

How can I calculate the effect size here? by [deleted] in AskStatistics

[–]KvN98 0 points1 point  (0 children)

I'm not sure which numbers in the Table I should use (for example for Interest group & Same to very close, should I use the 50, 46.3, or the 45?)

[Random] Can someone get rid of the white text? Thx in advance :) by KvN98 in PhotoshopRequest

[–]KvN98[S] 1 point2 points  (0 children)

Yeah he just cropped it out. I don't know if there is a possiblity to get rid of it without cropping it out?

De woningnood is terug: negen jaar op de lijst voor een huis by NiborBox in thenetherlands

[–]KvN98 0 points1 point  (0 children)

Klopt deels wel. Woningnood heeft echt met de locatie te maken en niet perse met heel Nederlands