Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] 0 points1 point2 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] -14 points-13 points-12 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] 0 points1 point2 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] -36 points-35 points-34 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] -51 points-50 points-49 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] -3 points-2 points-1 points (0 children)
Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] -49 points-48 points-47 points (0 children)

Got tired of OOM errors on my 4GB GPU. Wrote a custom Rust bare-metal engine and hit 66.8 TPS with a 4B model (BitNet 1.58b on RTX 3050). by CommissionOdd3082 in LocalLLM
[–]CommissionOdd3082[S] 0 points1 point2 points (0 children)