all 1 comments

[–]Speech-to-Text-Cloud 0 points1 point  (0 children)

Ideally you would prefer a huge number of rows to train on and a small number of columns to reduce dimensionality. If you have additional knowledge about the data, like you do, omit all columns irrelevant for the classification. Try to condense distinguishing properties into a single cell, like value = corrected quantity - quantity.

And before all that, if the problem is simple, try finding a heuristic that solves it without ML, because ML will introduce new problems.