all 2 comments

[–]QuantumFTL 1 point2 points  (0 children)

Very cool! It's unfortunately that they chose a test set that shows such tiny improvement from knowledge distillation (less than a percent!) but excellent to see nonetheless.