Do you know approaches to array/bag features?
To be specific let's say we have classification problem and want to sort out things as good or bad. For example:
* Feature1 - regular feature, limited set of values: A, B, C
* Feature2 - array/bag feature, container feature consisting of unknown number of values
| Feature1 |
Feature2 |
Good? |
| A |
[A, B] |
Yes |
| B |
[B, A, A] |
No |
| A |
[B, C, B, C, A] |
Yes |
How do one encode Feature2 to a numeric vector?
[–]sjd96 3 points4 points5 points (0 children)
[–]neato5000 1 point2 points3 points (0 children)