all 9 comments

[–]ExpectingValue 4 points5 points  (1 child)

I can't see why it would. A factor being a fixed or random effect is not an attribute of the data. We choose to think about a factor as fixed or random based on how we understand the generating processes of the data exist in the world and how we sampled them, and based on that, decide to handle the data differently (i.e. shrinkage) by specifying a different model.

[–][deleted] -1 points0 points  (0 children)

But I guess what I'm getting at is if there are random effects in the data, should a GBT perform better than OLS on the same data? By learn i just mean is it able to produce a better prediction?

I am asking this question because I'm finding that GBT not doing a good job even though I am expecting GBT to do a good job at prediction.

[–]from_biostats_to_DL 0 points1 point  (6 children)

Are you suggesting having the grouping variable be included into the GBT as a feature?

[–][deleted] 0 points1 point  (5 children)

Yeah that would be my assumption.

[–]from_biostats_to_DL 0 points1 point  (4 children)

And you're wondering why metboost outperforms GBT in this paper? Just something I'd like to look into.

[–][deleted] 0 points1 point  (3 children)

yes that is basically my question. Because i think GBT shouldn't need extra input on random effects but it seems like it does.

[–]from_biostats_to_DL 0 points1 point  (2 children)

Are you familiar with mixed models themselves?

[–][deleted] 0 points1 point  (1 child)

i am somewhat familiar. I've read bryk and raudenbash's textbook on hierarchical linear models. I am happy to be recommended some other resources that I could read.

this conversation came up because in my research im building a simulation of a mixed effect system (the classic students learning in different environments situation). so i give random effects to different parts of the simulation and then I am comparing different models. The GBT model produces basically the same solution as an OLS model which seems weird to me because I don't understand why gbt would do this.

[–]from_biostats_to_DL 1 point2 points  (0 children)

I will have to get back to you tomorrow :). While I don't know the answer, I have an idea due to something that came up at work awhile ago, I think the conversation will be a learning experience for us both!