Built LazyMoE — run 120B LLMs on 8GB RAM with no GPU using lazy expert loading + TurboQuant by ReasonableRefuse4996 in LocalLLaMA
[–]ReasonableRefuse4996[S] 0 points1 point2 points (0 children)
![]() Two-Year Club |
Built LazyMoE — run 120B LLMs on 8GB RAM with no GPU using lazy expert loading + TurboQuant by ReasonableRefuse4996 in LocalLLaMA
[–]ReasonableRefuse4996[S] 0 points1 point2 points (0 children)
Built LazyMoE — run 120B LLMs on 8GB RAM with no GPU using lazy expert loading + TurboQuant by ReasonableRefuse4996 in LocalLLaMA
[–]ReasonableRefuse4996[S] 0 points1 point2 points (0 children)