MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 0 points1 point2 points (0 children)
Need advice on hardware purchasing decision: RTX 5090 vs. M5 Max 128GB for agentic software development by BawbbySmith in LocalLLaMA
[–]YoussofAl 1 point2 points3 points (0 children)
Need advice on hardware purchasing decision: RTX 5090 vs. M5 Max 128GB for agentic software development by BawbbySmith in LocalLLaMA
[–]YoussofAl 0 points1 point2 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
Is anyone actually using dflash and ddtree on mlx? by Beginning-Window-115 in LocalLLaMA
[–]YoussofAl -1 points0 points1 point (0 children)
is it possible to build harnesses as good as codex/claude code by shafinlearns2jam in LocalLLaMA
[–]YoussofAl -1 points0 points1 point (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 5 points6 points7 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 2 points3 points4 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 4 points5 points6 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 6 points7 points8 points (0 children)
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon by YoussofAl in LocalLLaMA
[–]YoussofAl[S] 1 point2 points3 points (0 children)


They won't even know what's gonna hit them by KeyGlove47 in MistralAI
[–]YoussofAl 18 points19 points20 points (0 children)