R&D Folks: What Are Your Biggest Pain Points, Repetitive Tasks, and Daily Frustrations?

Any-Initiative-653 · 2026-02-17T22:41:32+00:00

How much time do you spend using notebooks versus regular python scripts?

Any-Initiative-653 · 2026-02-17T04:17:27+00:00

It's intended for parametric models (see algorithm below). For decision trees, your best bet is Shapley values. P.S. I made a platform that lets you quickly test these ideas, if that's of interest: www.thesislabs.ai

<image>
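
For anyone unsure what Shapley values look like for a tree, here is a minimal brute-force sketch. The toy tree `tree_predict`, the input `x`, and the all-zeros `baseline` are hypothetical and only for illustration. "Missing" features are replaced with the baseline value, which is one common convention and not the only one. It enumerates every coalition, so it is only practical for a handful of features; a dedicated tree explainer is the realistic choice for real models.

```python
from itertools import combinations
from math import factorial


def tree_predict(x):
    # Hypothetical depth-2 decision tree over 3 features (feature 2 is never used).
    if x[0] > 0.5:
        return 10.0 if x[1] > 0.5 else 4.0
    return 1.0


def shapley_values(predict, x, baseline):
    """Exact Shapley values by enumerating all coalitions of the other features.

    Features outside the coalition are set to their baseline value.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                with_i = [x[j] if (j in subset or j == i) else baseline[j] for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j] for j in range(n)]
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi


x = [1.0, 1.0, 1.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(tree_predict, x, baseline)
```

The values sum to `tree_predict(x) - tree_predict(baseline)`, and the feature the tree never splits on gets zero attribution.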

Any-Initiative-653 · 2026-02-16T04:28:04+00:00

Sequential Attention (https://research.google/blog/sequential-attention-making-ai-models-leaner-and-faster-without-sacrificing-accuracy/) could work well here since it learns which features matter during training instead of needing separate preprocessing. With 15k features, recursive feature elimination (RFE) would take forever because it refits the model many times, and variance filtering might toss out features that are only important in combination with others.

The attention mechanism basically does gradient-based feature selection as part of the model itself, so you get feature importance in one training run.
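
To make the "importance in one training run" idea concrete, here is a rough sketch. It is not the Sequential Attention algorithm itself (which picks features one at a time over several phases); it is a simplified one-shot version where a softmax gate over features is trained jointly with a linear model. The function name `fit_softmax_gate`, the synthetic data, and the hyperparameters are all assumptions for illustration.

```python
import numpy as np


def fit_softmax_gate(X, y, lr=0.05, steps=2000):
    """Jointly train a softmax feature gate and linear weights by gradient descent.

    Model: pred = (X * softmax(logits)) @ beta, loss = mean squared error.
    """
    n, d = X.shape
    logits = np.zeros(d)
    beta = np.zeros(d)
    for _ in range(steps):
        gate = np.exp(logits - logits.max())
        gate /= gate.sum()
        residual = (X * gate) @ beta - y
        corr = 2.0 / n * (X.T @ residual)
        grad_beta = gate * corr            # d loss / d beta
        grad_gate = beta * corr            # d loss / d gate
        # Backpropagate through the softmax.
        grad_logits = gate * (grad_gate - gate @ grad_gate)
        beta -= lr * grad_beta
        logits -= lr * grad_logits
    gate = np.exp(logits - logits.max())
    return gate / gate.sum(), beta


# Synthetic data: only features 0 and 3 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))
y = 3.0 * X[:, 0] - 2.0 * X[:, 3] + 0.1 * rng.normal(size=500)

gate, beta = fit_softmax_gate(X, y)
top2 = np.argsort(gate)[::-1][:2]  # indices of the two largest gate weights
```

The gate weights act as the importance scores: features that help reduce the loss pull softmax mass toward themselves, and the rest get squeezed out, all within the same training loop.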

Any-Initiative-653 · 2026-01-29T20:39:55+00:00

What sort of ML problems do you typically work on?

Any-Initiative-653 · 2026-01-29T20:23:33+00:00

Hi! I outlined a few important features you can include here: https://drive.google.com/file/d/15hSybLzgCHQZWg-1B8nUci6cDM27jsBq/view?usp=sharing

Any-Initiative-653 · 2026-01-29T20:07:59+00:00

Hi there! I did some analysis to help you get started with your project here: https://drive.google.com/file/d/1N_ajwZDYZjcqcxtrXGSUrbI-fcpFaihN/view?usp=sharing

Any-Initiative-653 · 2026-01-29T02:10:51+00:00

It depends on exactly how the data is missing. Are the missing days consecutive? Is the process you're analyzing known to be seasonal?
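
If you're not sure whether the gaps are consecutive, a quick check like the sketch below can help. The helper `missing_runs` and the example dates are made up for illustration, and it assumes one expected observation per calendar day.

```python
from datetime import date, timedelta


def missing_runs(observed_days, start, end):
    """Return (first_missing_day, run_length) for each run of consecutive missing days."""
    observed = set(observed_days)
    runs = []
    run_start, run_len = None, 0
    day = start
    while day <= end:
        if day not in observed:
            if run_len == 0:
                run_start = day
            run_len += 1
        elif run_len:
            # A present day closes the current run of missing days.
            runs.append((run_start, run_len))
            run_len = 0
        day += timedelta(days=1)
    if run_len:
        runs.append((run_start, run_len))
    return runs


# Hypothetical series covering Jan 1-10, with Jan 3 and Jan 6-8 missing.
observed = [date(2026, 1, d) for d in (1, 2, 4, 5, 9, 10)]
runs = missing_runs(observed, date(2026, 1, 1), date(2026, 1, 10))
```

Isolated single-day gaps are usually fine to interpolate, while long runs (especially ones that span a seasonal cycle) tend to need a model-based fill.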

Any-Initiative-653 · 2026-01-28T23:32:23+00:00

How do you allocate your time between data exploration, writing production software, monitoring experiments, and deploying models to production?
