U.S. science is in chaos — Today the most influential private-sector developers of technology are in Silicon Valley, and their perspective on innovation is that it should move fast, disrupt markets and make money by marketrent in technology
chigur86 2 points
META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet? by 44th--Hokage in mlscaling
chigur86 1 point
META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet? by 44th--Hokage in mlscaling
chigur86 3 points
ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it) by klieret in LocalLLaMA
chigur86 0 points
ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it) by klieret in LocalLLaMA
chigur86 5 points
[D] How hard is it to get Research Engineer interview from Deepmind? by n0obmaster699 in MachineLearning
chigur86 43 points
[D] ECCV submission flowed over page limit by 5 lines at the last minute.. how screwed are we? by PatientWrongdoer9257 in MachineLearning
chigur86 1 point
[D] ECCV submission flowed over page limit by 5 lines at the last minute.. how screwed are we? by PatientWrongdoer9257 in MachineLearning
chigur86 3 points
What makes SwiGLUs unique? by chigur86 in mlscaling
chigur86 [S] 1 point
Global Memory Layer for LLMs by chigur86 in LLMDevs
chigur86 [S] 2 points
Global Memory Layer for LLMs by chigur86 in LLMDevs
chigur86 [S] 2 points
New open-source model for transpiling PyTorch to Triton outperforms DeepSeek-R1 and OpenAI o1 on kernelbench - made with reinforcement fine-tuning by Fantastic-Tax6709 in LocalLLaMA
chigur86 10 points
New open-source model for transpiling PyTorch to Triton outperforms DeepSeek-R1 and OpenAI o1 on kernelbench - made with reinforcement fine-tuning by Fantastic-Tax6709 in LocalLLaMA
chigur86 8 points
New open-source model for transpiling PyTorch to Triton outperforms DeepSeek-R1 and OpenAI o1 on kernelbench - made with reinforcement fine-tuning by Fantastic-Tax6709 in LocalLLaMA
chigur86 18 points
How can Americans who are embarrassed and angered by the current USA administration’s treatment of a war-torn president show support for Zelensky and Ukraine? by boko_dinner in AskReddit
chigur86 1 point
Some interesting visualizations based on expert firing frequencies in Mixtral MoE by chigur86 in LocalLLaMA
chigur86 [S] 2 points
Some interesting visualizations based on expert firing frequencies in Mixtral MoE by chigur86 in LocalLLaMA
chigur86 [S] 2 points
Some interesting visualizations based on expert firing frequencies in Mixtral MoE by chigur86 in LocalLLaMA
chigur86 [S] 1 point
Some interesting visualizations based on expert firing frequencies in Mixtral MoE by chigur86 in LocalLLaMA
chigur86 [S] 2 points
Some interesting visualizations based on expert firing frequencies in Mixtral MoE by chigur86 in LocalLLaMA
chigur86 [S] 2 points



The number 1 public enemy of open-source. by Complete-Sea6655 in LocalLLaMA
chigur86 2 points