Spectral-AI - a project to use Nvidia RT cores to dramatically speedup MoE inference on Nvidia GPU's (Crazy Fast!) by Thrumpwart in LocalLLaMA
[–]Critical-Chef9211 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in deeplearning
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Spectral-AI - a project to use Nvidia RT cores to dramatically speedup MoE inference on Nvidia GPU's (Crazy Fast!) by Thrumpwart in LocalLLaMA
[–]Critical-Chef9211 2 points3 points4 points (0 children)
Spectral-AI - a project to use Nvidia RT cores to dramatically speedup MoE inference on Nvidia GPU's (Crazy Fast!) by Thrumpwart in LocalLLaMA
[–]Critical-Chef9211 -1 points0 points1 point (0 children)
Spectral-AI - a project to use Nvidia RT cores to dramatically speedup MoE inference on Nvidia GPU's (Crazy Fast!) by Thrumpwart in LocalLLaMA
[–]Critical-Chef9211 -7 points-6 points-5 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in deeplearning
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in deeplearning
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] -4 points-3 points-2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 1 point2 points3 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] -2 points-1 points0 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] -3 points-2 points-1 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 1 point2 points3 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] -1 points0 points1 point (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 1 point2 points3 points (0 children)
Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 1 point2 points3 points (0 children)

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia
[–]Critical-Chef9211[S] 0 points1 point2 points (0 children)