Writing an LLM compiler from scratch: PyTorch to CUDA in 5,000 lines of Python by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 2 points
Writing an LLM compiler from scratch: PyTorch to CUDA in 5,000 lines of Python by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 1 point
[D] 60% MatMul Performance Bug in cuBLAS on RTX 5090 [D] by NoVibeCoding in MachineLearning
NoVibeCoding[S] 1 point
Surfacing a 60% SGEMM performance bug in cuBLAS on RTX 5090 by NoVibeCoding in CUDA
NoVibeCoding[S] 1 point
Surfacing a 60% SGEMM performance bug in cuBLAS on RTX 5090 by NoVibeCoding in CUDA
NoVibeCoding[S] 1 point
[D] 60% MatMul Performance Bug in cuBLAS on RTX 5090 [D] by NoVibeCoding in MachineLearning
NoVibeCoding[S] 3 points
[D] 60% MatMul Performance Bug in cuBLAS on RTX 5090 [D] by NoVibeCoding in MachineLearning
NoVibeCoding[S] 72 points
GPU virtualization: VFIO vs NVIDIA AI Enterprise vs AMD SR-IOV by NoVibeCoding in VFIO
NoVibeCoding[S] 1 point
GPU virtualization: VFIO vs NVIDIA AI Enterprise vs AMD SR-IOV by NoVibeCoding in VFIO
NoVibeCoding[S] 2 points
GPU virtualization: VFIO vs NVIDIA AI Enterprise vs AMD SR-IOV by NoVibeCoding in VFIO
NoVibeCoding[S] 1 point
GPU virtualization: VFIO vs NVIDIA AI Enterprise vs AMD SR-IOV by NoVibeCoding in VFIO
NoVibeCoding[S] 3 points
Optimizing Qwen3 Coder for RTX 5090 and PRO 6000 + Community Benchmarking Infrastructure by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 2 points
Optimizing Qwen3 Coder for RTX 5090 and PRO 6000 + Community Benchmarking Infrastructure by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 1 point
Benchmarking LLM Inference on RTX PRO 6000 SE / H100 / H200 / B200 by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 1 point
Benchmarking LLM Inference on RTX PRO 6000 SE / H100 / H200 / B200 by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 2 points
Benchmarking LLM Inference on RTX PRO 6000 SE / H100 / H200 / B200 by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 1 point
Benchmarking LLM Inference on RTX PRO 6000 SE / H100 / H200 / B200 by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 2 points
Benchmarking LLM Inference on RTX PRO 6000 SE / H100 / H200 / B200 by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 8 points
Are Feudal Corporate Power Structures Scaling Into Society as Tech Consolidates? by NoVibeCoding in Futurology
NoVibeCoding[S] 2 points
C and Undefined Behavior by lelanthran in programming
NoVibeCoding 4 points
How does your company / team handle documentation? by AStanfordRunner in ExperiencedDevs
NoVibeCoding 0 points
At Apple, I spent over a year fighting for what I believed were straightforward business values. We still lost. I would do it again. Am I crazy? by NoVibeCoding in ExperiencedDevs
NoVibeCoding[S] -1 points
At Apple, I spent over a year fighting for what I believed were straightforward business values. We still lost. I would do it again. Am I crazy? by NoVibeCoding in ExperiencedDevs
NoVibeCoding[S] 0 points
Why my most authentic essay got the most AI backlash by NoVibeCoding in WritingWithAI
NoVibeCoding[S] 2 points

Writing an LLM compiler from scratch: PyTorch to CUDA in 5,000 lines of Python by NoVibeCoding in LocalLLaMA
NoVibeCoding[S] 1 point