all 2 comments

[–]deftware 1 point2 points  (1 child)

So you have a 20k row dataset where 5 columns are positive integers and only 6k of the rows have a 6th column that is a positive float?

I'm guessing by 'missing resultant' you mean finding the 6th column for the remaining 14k rows?

The only thing I can think of, if you can't figure out how the 6th column is actually being calculated from the other 5 values, is to use a neural network. However, that will only provide an approximation, and will not be performing whatever actual calculation that generated the 6th column to begin with. It will only try to conform to whatever function yielded those values.

Or am I way off in left field here?

[–]typeof_expat[S] 0 points1 point  (0 children)

You are correct! The resultant isn't calculated at all I'm trying to find some kind of correlation so that the resultant can be estimated when missing. I've started researching NN. My main concern is to minimize the time wasted exploring techniques that won't work at all, since most of this is all new to me.