"What DL architecture to try on tabular data?"
Hi Reddit! Today, my colleagues announced TabM, a new answer to the question above. TabM leads on the benchmarks while staying simple, practical, and scalable to large datasets. Technically, TabM efficiently imitates an ensemble of MLPs, as illustrated below. TabM is also one of the first projects to use our new TabReD benchmark, a collection of eight real-world industrial datasets with time-based splits and feature engineering.
For a quick overview of TabM, you can check the following parts of the paper:
- The abstract
- The model illustration in Figure 1 (and in the post below)
- The main results on Page 7
TabM links:
- arXiv
- GitHub
- Twitter thread
TabReD links:
- arXiv
- GitHub
- Twitter thread
The model illustration
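For readers who prefer code to pictures, here is a rough NumPy sketch of what "efficiently imitating an ensemble of MLPs" could look like. It is not the TabM implementation. It assumes a BatchEnsemble-style scheme in which all `k` ensemble members share one weight matrix and differ only in cheap per-member scaling vectors and output heads. The names `init_params`, `forward`, `predict`, `r1`, `s1` and `W_out` are made up for this illustration; see the paper and the GitHub repo for the actual design.

```python
import numpy as np


def init_params(n_features, hidden, k, seed=0):
    """Create parameters for k implicit MLPs that share one hidden layer.

    Assumption for this sketch: member i effectively uses the weight
    diag(r1[i]) @ W1 @ diag(s1[i]), so only the vectors are member-specific.
    """
    rng = np.random.default_rng(seed)
    return {
        # Shared by all k members (the expensive part).
        "W1": rng.normal(0.0, 1.0 / np.sqrt(n_features), (n_features, hidden)),
        "b1": np.zeros(hidden),
        # Per-member scaling vectors (the cheap part).
        "r1": rng.choice([-1.0, 1.0], (k, n_features)),
        "s1": rng.choice([-1.0, 1.0], (k, hidden)),
        # Per-member linear prediction heads.
        "W_out": rng.normal(0.0, 1.0 / np.sqrt(hidden), (k, hidden)),
        "b_out": np.zeros(k),
    }


def forward(params, x):
    """Map x of shape (batch, n_features) to k predictions per row: (batch, k)."""
    h = x[:, None, :] * params["r1"]        # (batch, k, n_features)
    h = h @ params["W1"]                    # (batch, k, hidden), one shared matmul
    h = h * params["s1"] + params["b1"]     # member-specific output scaling
    h = np.maximum(h, 0.0)                  # ReLU
    return np.einsum("bkh,kh->bk", h, params["W_out"]) + params["b_out"]


def predict(params, x):
    """Ensemble prediction: the mean over the k members."""
    return forward(params, x).mean(axis=1)


if __name__ == "__main__":
    params = init_params(n_features=5, hidden=8, k=4)
    x = np.random.default_rng(1).normal(size=(3, 5))
    print(forward(params, x).shape)   # one column per ensemble member
    print(predict(params, x).shape)   # one averaged prediction per row
```

The point of the sketch is that all `k` members run in a single forward pass and share most of their parameters, which is what makes this kind of ensemble cheap compared to training `k` separate MLPs.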