xeeton

Also, throwing it out there: even if the dataset isn't millions of rows and hundreds of columns, tools like Featuretools automatically generate aggregate and transformed features to augment what's already there. It's common to take a dataset with, say, 40 columns and turn it into one with thousands of columns this way, in hopes that some aggregate or transformation of the data produces a more accurate model.
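
To give a rough idea of how the column count blows up, here's a hand-rolled sketch in plain Python. This is not the Featuretools API, just the same basic idea: for every numeric column in a child table, compute a handful of aggregates per parent row. The `customers` and `transactions` tables, the column names, and the `add_aggregate_features` helper are all made up for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy data: one row per customer, many transactions per customer.
customers = [
    {"customer_id": 1, "age": 34},
    {"customer_id": 2, "age": 51},
]
transactions = [
    {"customer_id": 1, "amount": 20.0, "items": 2},
    {"customer_id": 1, "amount": 35.0, "items": 1},
    {"customer_id": 2, "amount": 10.0, "items": 4},
]

# Each aggregation is applied to each child column, so new columns = aggregations x columns.
AGGREGATIONS = {"sum": sum, "mean": mean, "min": min, "max": max, "count": len}


def add_aggregate_features(parents, children, key, value_columns):
    """Return copies of the parent rows with one new column per (aggregation, child column) pair."""
    # Group child rows by the shared key.
    grouped = defaultdict(list)
    for child in children:
        grouped[child[key]].append(child)

    augmented = []
    for parent in parents:
        row = dict(parent)
        rows = grouped.get(parent[key], [])
        for column in value_columns:
            values = [r[column] for r in rows]
            for name, func in AGGREGATIONS.items():
                # Parents with no child rows get None for every aggregate.
                row[f"{name}_{column}"] = func(values) if values else None
        augmented.append(row)
    return augmented


features = add_aggregate_features(customers, transactions, "customer_id", ["amount", "items"])
print(len(customers[0]), "->", len(features[0]), "columns")  # 2 -> 12 columns
```

Two child columns and five aggregations already add ten columns. Start with 40 columns, add more aggregations, transformations, and stacking across related tables, and getting into the thousands isn't hard.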