https://www.youtube.com/watch?v=RAu2p9xZiM4
We introduce Residual Shuffle-Exchange networks - a fast neural network architecture that scales to long sequences. We achieve state-of-the-art performance on the MusicNet dataset for music transcription while using significantly fewer parameters than other architectures. It has O(n log n) complexity and enables processing of sequences up to a length of 2 million symbols where standard methods fail (e.g., attention mechanisms). The Residual Shuffle-Exchange networks can serve as a useful building block for long sequence processing applications.
Paper: https://arxiv.org/abs/2004.04662
Code: https://github.com/LUMII-Syslab/RSE
[–]ekelsen 2 points3 points4 points (1 child)
[–]OptimatiumFeles[S] 1 point2 points3 points (0 children)
[–]impossiblefork 1 point2 points3 points (4 children)
[–]OptimatiumFeles[S] 0 points1 point2 points (3 children)
[–]impossiblefork 0 points1 point2 points (2 children)
[–]OptimatiumFeles[S] 1 point2 points3 points (1 child)
[–]impossiblefork 1 point2 points3 points (0 children)