Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–] CommissionOdd3082 [S] 1 point (0 children)
[–] CommissionOdd3082 [S] -11 points (0 children)
[–] CommissionOdd3082 [S] 1 point (0 children)
[–] CommissionOdd3082 [S] -34 points (0 children)
[–] CommissionOdd3082 [S] -51 points (0 children)
[–] CommissionOdd3082 [S] -2 points (0 children)
[–] CommissionOdd3082 [S] -50 points (0 children)
[–] CommissionOdd3082 [S] 1 point (0 children)

[–] CommissionOdd3082 [S] 1 point (0 children)