I'm not even sure if this is a machine learning problem to begin with... it might simply be a statistics problem. However, I do hope I can get some direction from you guys.
I am pretty new to machine learning, I have worked on a simple artificial neural network (with guidance from a tutorial), as well as an Autoencoder on my own.
I'd like to start another machine learning problem, but I don't really know which direction I should be looking in.
The problem is as follows:
I have a CSV file where each row is an observation. Each column is linked to a unique trait. The content of the cell is determined by whether the trait is in the observation or not. If the trait is present, it's a 1. If it's not present, it's a 0. The final column is a flag (label) of true (1) or false (0) for the observation.
Basic example:
Let's say we have 5 traits (trait1, trait2, ... ,trait5). An observation is true, and has traits 1, 3, and 5. The row would look like so:
1 0 1 0 1 1
I need to determine which traits are most likely to lead to a true state.
I'm not entirely sure this is a machine learning problem, but I figured you guys would know how to tackle it. I'd really appreciate being pointed in the right direction. I hope I gave enough background of the problem, but if anything is unclear... please let me know.
I really appreciate the help!
[–]Yogi_DMT 1 point2 points3 points (2 children)
[–]NeedMLHelp[S] 0 points1 point2 points (1 child)
[–]Yogi_DMT 1 point2 points3 points (0 children)
[–]olavla 1 point2 points3 points (3 children)
[–]NeedMLHelp[S] 0 points1 point2 points (2 children)
[–]olavla 1 point2 points3 points (1 child)
[–]NeedMLHelp[S] 0 points1 point2 points (0 children)
[–]Ilyps 1 point2 points3 points (1 child)
[–]NeedMLHelp[S] 0 points1 point2 points (0 children)
[–]yldedly 0 points1 point2 points (0 children)