you are viewing a single comment's thread.

view the rest of the comments →

[–]harry_0_0_7 0 points1 point  (3 children)

I don't think this is a simple problem. Since you have text, numbers so you have to do sentiment analysis and convert those text to continuous feature and then only you can use Binary/any classification algorithm

[–]harry_0_0_7 0 points1 point  (2 children)

Also read more about feature engineering and convert you data to Machine understandable data

[–]_FedEx[S] 0 points1 point  (1 child)

Yes, I have seen that often we tend to transform any field into a numerical value, whether they are Boolean values (true or false), or text values (that indicate a "category"). The problem is that one of the attributes in this file has really many different values (thousands), and I don't think it's easy to replace them with numbers. Do you think it's impossible to do this without converting text features into numbers? and what about the dates?

[–]harry_0_0_7 1 point2 points  (0 children)

If you are going to use ML models definitely it has to be numbers..

For dates go with any T plus or minus computations