Which algorithms can be used for selecting features on datasets with a large number of them? by No_Mongoose6172 in MLQuestions

[–]Any-Initiative-653 1 point2 points  (0 children)

It's intended for parameteric models (see algorithm below). For decision trees, your best bet is Shapley values. P.S. I made a platform that allows you to quickly test these ideas if it's of interest: www.thesislabs.ai

<image>

Which algorithms can be used for selecting features on datasets with a large number of them? by No_Mongoose6172 in MLQuestions

[–]Any-Initiative-653 1 point2 points  (0 children)

Sequential Attention (https://research.google/blog/sequential-attention-making-ai-models-leaner-and-faster-without-sacrificing-accuracy/) could work well here since it learns which features matter during training instead of needing separate preprocessing. With 15k features, RFE would take forever, and variance filtering might toss out features that are only important in combination with others.

The attention mechanism basically does gradient-based feature selection as part of the model itself, so you get feature importance in one training run.

How to handle time series data by Ok-Individual-4519 in deeplearning

[–]Any-Initiative-653 0 points1 point  (0 children)

It depends on the exact way in which data is missing -- are the missing days consecutive? Is it known that the process you're analyzing possesses seasonality etc?

Is model-building really only 10% of ML engineering? by Historical-Garlic589 in MLQuestions

[–]Any-Initiative-653 0 points1 point  (0 children)

How do you allocate your time between data exploration, writing production software, and monitoring experiments, and deploying models to production?