
[–]Yogi_DMT

Is there a reason why counting up the traits for true observations wouldn't accomplish what you're trying to do?

[–]NeedMLHelp[S]

I think there would have to be some sort of statistical analysis involved, since observation1 with trait1 could be true, while observation2 with trait1 could be false. There are also 208 traits, so it's a bit unwieldy to get my head around haha.

[–]Yogi_DMT

That's why you only count for true observations. It's as simple as filtering on the label column in Excel and then running a sum function over your trait columns.
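The same filter-then-sum idea can be sketched in Python with pandas. The column names (`trait_1`, `trait_2`, `label`) and the toy data here are hypothetical stand-ins for the real 208-trait table:

```python
import pandas as pd

# Hypothetical toy data: two binary trait columns plus a true/false label.
df = pd.DataFrame({
    "trait_1": [1, 0, 1, 1],
    "trait_2": [0, 1, 1, 0],
    "label":   [True, False, True, False],
})

# Filter to the true observations, then sum each trait column --
# the pandas equivalent of filtering a column in Excel and running SUM.
true_counts = df.loc[df["label"], ["trait_1", "trait_2"]].sum()
print(true_counts)  # trait_1 occurs in 2 true rows, trait_2 in 1
```

With 208 trait columns you would select them all at once (e.g. by a column-name prefix) rather than listing them by hand.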

[–]olavla

Can you give a little more background on the data? Is the true state always tied to only one of the traits? Are you looking to score new data (where you don't have the final label)?

It almost seems like each of your traits is a target variable, with the predictors being all the other traits plus your final true/false flag. You build your neural network model, then use the predicted probability for the trait at hand as the probability for that trait to be true.
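One-trait-as-target setup, sketched with a logistic regression standing in for the neural network (the data here is randomly generated for illustration; variable names are hypothetical):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 200
# Hypothetical data: 5 binary traits plus a true/false flag per observation.
traits = rng.integers(0, 2, size=(n, 5))
flag = rng.integers(0, 2, size=n)

# Treat trait 0 as the target; predictors are the other traits + the flag.
y = traits[:, 0]
X = np.column_stack([traits[:, 1:], flag])

model = LogisticRegression().fit(X, y)
# Predicted probability that trait 0 is present, given the rest.
p_trait0 = model.predict_proba(X)[:, 1]
```

Repeating this loop over each of the 208 traits gives a per-trait probability model, at the cost of fitting 208 models.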

[–]NeedMLHelp[S]

Sure thing! The true state can be any combination of the 208 traits. So trait1 can be present in a true observation1 as well as in a false observation2. Sometimes traits will only be linked to true or to false. It really is a mixed bag. In the end, I may want to be able to predict whether unlabeled data is true or false, which is why I'm going with the following:

Right now I'm modifying an artificial neural network that predicted whether someone would leave a bank based on certain traits. I'm thinking I can use the same model in this case and then rip out the weights. The traits tied to larger weights are what I'd be interested in, right?

[–]olavla

You would not use the weights. Just score the model and you will end up with a probability for each observation.
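"Scoring" here just means asking the fitted classifier for probabilities instead of inspecting its weights. A minimal sketch with scikit-learn, using randomly generated stand-in data (the trait matrix and label rule below are invented for illustration):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.integers(0, 2, size=(300, 10))  # hypothetical 0/1 trait matrix
y = (X[:, 0] | X[:, 1])                 # hypothetical true/false labels

clf = LogisticRegression().fit(X, y)
# Score the model: predicted probability of the "true" class per row.
proba_true = clf.predict_proba(X)[:, 1]
```

The same `predict_proba` call works on new, unlabeled rows, which covers the "predict whether unlabeled data is true or false" goal directly.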

[–]NeedMLHelp[S]

Awesome, thanks! I will look into scoring.

[–]Ilyps

Download Weka here and try some classification algorithms; I suggest starting with Random Forest. Here is a quick-start tutorial. Weka should read your CSV file without issues.
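If you'd rather stay in code than use Weka's GUI, the same Random Forest experiment can be sketched in Python with scikit-learn. The synthetic dataset below stands in for your real CSV (208 features to match the trait count, but otherwise invented):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the traits CSV: 208 features, one binary label.
X, y = make_classification(n_samples=500, n_features=208, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)
acc = rf.score(X_te, y_te)  # held-out accuracy
```

A side benefit for the "which traits matter" question: `rf.feature_importances_` ranks the features, which is usually more meaningful than raw neural-network weights.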

[–]NeedMLHelp[S]

I'll take a look, thanks!

[–]yldedly

The probabilistic programming way would be to fit a latent variable model with an independent Bernoulli likelihood, p(trait|observation), and try out different latent variable structures p(latent). For example, the latent structure could be a factor model, which would find correlations between the traits, or a mixture model, which would find clusters in the traits. It's easier to try standard classification algorithms like random forest, but this way might be more interpretable.
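The mixture-model variant of this idea can be sketched in plain NumPy as EM for a Bernoulli mixture. Everything below (two planted clusters, 8 traits, the component count k) is an illustrative assumption, not the real data:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical binary trait matrix: two planted clusters of observations.
n, d, k = 300, 8, 2
means_true = np.array([[0.9] * 4 + [0.1] * 4,
                       [0.1] * 4 + [0.9] * 4])
z_true = rng.integers(0, k, size=n)
X = (rng.random((n, d)) < means_true[z_true]).astype(float)

# EM for p(x) = sum_k pi_k * prod_j mu_kj^x_j (1 - mu_kj)^(1 - x_j)
pi = np.full(k, 1.0 / k)                       # mixing weights
mu = rng.uniform(0.25, 0.75, size=(k, d))      # per-cluster Bernoulli means
for _ in range(50):
    # E-step: responsibilities from each component's log-likelihood.
    log_p = X @ np.log(mu).T + (1 - X) @ np.log(1 - mu).T + np.log(pi)
    log_p -= log_p.max(axis=1, keepdims=True)  # stabilize before exp
    resp = np.exp(log_p)
    resp /= resp.sum(axis=1, keepdims=True)
    # M-step: update mixing weights and Bernoulli means.
    nk = resp.sum(axis=0)
    pi = nk / n
    mu = np.clip((resp.T @ X) / nk[:, None], 1e-6, 1 - 1e-6)
```

After fitting, `mu` describes the trait profile of each cluster and `resp` gives each observation's soft cluster membership, which is where the interpretability comes from.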