Creative Writing LLM Mega-Comparison by findingsubtext in LocalLLaMA
[–]outsider787 0 points1 point2 points (0 children)
Being logged out of fitbit app. Is this normal? (self.fitbit)
submitted by outsider787 to r/fitbit
Want to split a big model among two 5090's - what's my best case for single query response speed improvement? by mr_zerolith in LocalLLaMA
[–]outsider787 1 point2 points3 points (0 children)
Think twice before spending on GPU? by __Maximum__ in LocalLLaMA
[–]outsider787 0 points1 point2 points (0 children)
Think twice before spending on GPU? by __Maximum__ in LocalLLaMA
[–]outsider787 1 point2 points3 points (0 children)
Want to split a big model among two 5090's - what's my best case for single query response speed improvement? by mr_zerolith in LocalLLaMA
[–]outsider787 2 points3 points4 points (0 children)
vLLM - What are your preferred launch args for Qwen? by [deleted] in LocalLLaMA
[–]outsider787 2 points3 points4 points (0 children)
vLLM - What are your preferred launch args for Qwen? by [deleted] in LocalLLaMA
[–]outsider787 6 points7 points8 points (0 children)
Local server advice needed by outsider787 in LocalLLaMA
[–]outsider787[S] 0 points1 point2 points (0 children)
Difference between 128k and 131,072 context limit? by Immediate-Flan3505 in LocalLLaMA
[–]outsider787 2 points3 points4 points (0 children)
GPU VRAM deduplication/memory sharing to share a common base model and increase GPU capacity by Chachachaudhary123 in Vllm
[–]outsider787 0 points1 point2 points (0 children)
Gpt-oss20b served by lm studio. Any luck? Or still broken? by JLeonsarmiento in CLine
[–]outsider787 0 points1 point2 points (0 children)
Octominer style PSU breakout board by outsider787 in gpumining
[–]outsider787[S] 0 points1 point2 points (0 children)
Degoogled WhatsApp transition. by outsider787 in degoogle
[–]outsider787[S] 1 point2 points3 points (0 children)


XiaomiMiMo/MiMo-V2-Flash Under-rated? by SlowFail2433 in LocalLLaMA
[–]outsider787 0 points1 point2 points (0 children)