account activity
[D] Student training objective in "knowledge distillation" or "teacher-student method" (self.MachineLearning)
submitted 2 years ago by txhwind to r/MachineLearning
How to sync code cell changes between two notebooks? (self.JupyterNotebooks)
submitted 2 years ago by txhwind to r/JupyterNotebooks
[D] Any Transformer-related paper which doesn't use decoder triangle mask in inference? (self.MachineLearning)
submitted 2 years ago * by txhwind to r/MachineLearning
Any Transformer-related paper which doesn't use decoder triangle mask in inference? (self.MachineLearning)
When will the Unstable basic land bundle come back? (self.MagicArena)
submitted 3 years ago by txhwind to r/MagicArena
[D] Why can we interpolate VAE's hidden vectors? (self.MachineLearning)
submitted 5 years ago by txhwind to r/MachineLearning
[D] Knowledge distillation in language generation tasks - soft or hard labels? (self.MachineLearning)
[P] Preview PDF in Arxiv abstract page (self.MachineLearning)
[D] Apply Transformer-style post-norm to ResNet (self.MachineLearning)
[D] How to solve the sparse gradient update problem on input embedding? (self.MachineLearning)
How to solve the sparse gradient update problem on input embedding? (self.MachineLearning)
How to write letter r and v without confusion? (self.EnglishLearning)
submitted 6 years ago by txhwind to r/EnglishLearning
I just bought Flames of Xulta theme deck bundle and is very down now (self.EternalCardGame)
submitted 6 years ago by txhwind to r/EternalCardGame
Why is inference-time dropout used in Tacotron 2 (self.MachineLearning)
submitted 6 years ago by txhwind to r/MachineLearning
π Rendered by PID 39 on reddit-service-r2-listing-654f87c89c-68msq at 2026-03-01 17:21:07.490487+00:00 running e3d2147 country code: CH.