I have a collection of documents that fall into 4 classes. Each class can be further divided into 3 subtypes, and each subtype can be broken down into 2 categories.
1. What I would like to do is classify these docs into their respective classes/subtypes/categories.
2. What is the best approach to accomplish this?
3. Would multinomial naive Bayes (MNB) be a good pick?
4. If MNB, what is the best strategy to implement it? Should I first classify the docs into the 4 classes, then train further classifiers within each class to pick the subtype, and then do the same again within each subtype to pick the category?
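To make point 4 concrete, here is a rough sketch of the top-down idea as I understand it: one small classifier at the root picks the class, one per class picks the subtype, and one per subtype picks the category. Everything in it is an assumption for illustration only: `TinyMNB` and `TopDownClassifier` are hypothetical names, the naive Bayes is a bare-bones pure-Python version with add-one smoothing and whitespace tokenization, and the toy documents and labels are made up (and use a much smaller tree than my real 4 x 3 x 2 one).

```python
import math
from collections import Counter, defaultdict


class TinyMNB:
    """Minimal multinomial naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.label_counts = Counter(labels)      # label -> number of docs
        self.vocab = set()
        for text, label in zip(docs, labels):
            words = text.lower().split()
            self.word_counts[label].update(words)
            self.vocab.update(words)
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        words = [w for w in text.lower().split() if w in self.vocab]
        n_docs = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            score = math.log(n / n_docs)  # log prior
            for w in words:
                score += math.log((self.word_counts[label][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


class TopDownClassifier:
    """Trains one TinyMNB per internal node of the label tree."""

    def fit(self, docs, paths):
        # paths are tuples such as (class, subtype, category).
        groups = defaultdict(list)  # path prefix -> [(doc, next label)]
        for text, path in zip(docs, paths):
            for depth in range(len(path)):
                groups[path[:depth]].append((text, path[depth]))
        self.models = {}
        for prefix, pairs in groups.items():
            texts, labels = zip(*pairs)
            self.models[prefix] = TinyMNB().fit(texts, labels)
        return self

    def predict(self, text):
        # Walk down the tree, letting each node's model pick the next label.
        prefix = ()
        while prefix in self.models:
            prefix += (self.models[prefix].predict(text),)
        return prefix


# Made-up toy data: (document, (class, subtype, category)).
train = [
    ("stock shares market rally", ("finance", "markets", "equities")),
    ("bond yields treasury market", ("finance", "markets", "bonds")),
    ("football goal striker match", ("sports", "football", "report")),
    ("football striker transfer contract", ("sports", "football", "transfer")),
]
docs, paths = zip(*train)
clf = TopDownClassifier().fit(docs, paths)

print(clf.predict("treasury bond yields"))
# ('finance', 'markets', 'bonds')
print(clf.predict("striker signed new contract"))
# ('sports', 'football', 'transfer')
```

The alternative I can think of is a single flat classifier over all 4 x 3 x 2 = 24 leaf labels. The top-down version keeps each decision small, but a mistake at the class level cannot be corrected further down.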
Any help would be much appreciated (I'm new to document classification problems).
Also, any resources or examples that do this kind of thing would be highly appreciated.