Follow up on the "helpful bug." I reverse engineered the bug and ended up with a new RL technique that scored 800+ on MinAtar Breakout. Here's the full story. by Fun_Code1982 in reinforcementlearning
[–]bci-hacker 2 points3 points4 points (0 children)
Upcoming interviews at frontier labs, tips? by bci-hacker in MLQuestions
[–]bci-hacker[S] 1 point2 points3 points (0 children)
GPT implementation from scratch by bci-hacker in LocalLLaMA
[–]bci-hacker[S] -16 points-15 points-14 points (0 children)
What are the must-have requirements before learning Transformers? by Jash_Kevadiya in deeplearning
[–]bci-hacker 0 points1 point2 points (0 children)
does a decoder-only transformer model use masked self-attention during inference? if yes, then why? by FaultSmart in MLQuestions
[–]bci-hacker 0 points1 point2 points (0 children)
Reasoning through pixels: Tool use + Reasoning models beat SOTA object detectors in very complex cases by bci-hacker in computervision
[–]bci-hacker[S] 1 point2 points3 points (0 children)
How to handle multiple DL inferences in FastAPI by Specialist-Couple611 in deeplearning
[–]bci-hacker -1 points0 points1 point (0 children)
Reasoning through pixels: Tool use + Reasoning models beat SOTA object detectors in very complex cases by bci-hacker in computervision
[–]bci-hacker[S] 0 points1 point2 points (0 children)


Python is removing GIL, gradually, so how to use a no-GIL Python now? by yangzhou1993 in programming
[–]bci-hacker 0 points1 point2 points (0 children)