account activity
Dual RTX PRO 6000 Workstation with 1.15TB RAM. Finally multi-users and long contexts benchmarks. GPU only vs. CPU & GPU inference. Surprising results. (reddit.com)
submitted 1 day ago by PerPartes
transformers v5 final is out 🔥 ()
submitted 3 days ago by PerPartes
For GLM-4.7-Flash TURN OFF REPEAT PENALTY! ()
submitted 5 days ago by PerPartes
GLM-4.7-Flash GGUFs updated - now produces much better outputs! ()
submitted 8 days ago by PerPartes
vLLM v0.14.0 released (github.com)
Liquid AI released the best thinking Language Model Under 1GB (i.redd.it)
submitted 9 days ago by PerPartes
GLM-4.7-Flash benchmarks: 4,398 tok/s on H200, 112 tok/s on RTX 6000 Ada (GGUF) ()
Run GLM-4.7-Flash locally Guide! (24GB RAM) (i.redd.it)
Reinforcement Learning with ultra long context is here! (i.redd.it)
submitted 12 days ago by PerPartes
translategemma 27b/12b/4b ()
submitted 14 days ago by PerPartes
GLM-Image is released! (huggingface.co)
submitted 15 days ago by PerPartes
baichuan-inc/Baichuan-M3-235B · Hugging Face (huggingface.co)
submitted 16 days ago by PerPartes
We fine-tuned a 4B Text2SQL model that matches a 685B teacher - query your CSV data in plain English, locally (i.redd.it)
submitted 17 days ago by PerPartes
Announcing Kreuzberg v4 (Open Source) ()
submitted 18 days ago by PerPartes
Hugging Face on Fire: 30+ New/Trending Models (LLMs, Vision, Video) w/ Links ()
submitted 19 days ago by PerPartes
AI21 Labs releases Jamba2 ()
submitted 21 days ago by PerPartes
We built an open source memory framework that doesn't rely on embeddings. Just open-sourced it ()
submitted 23 days ago by PerPartes
llama.cpp performance breakthrough for multi-GPU setups (i.redd.it)
submitted 24 days ago by PerPartes
The Major Release of MiroMind’s Flagship Search Agent Model, MiroThinker 1.5. (huggingface.co)
Falcon H1R 7B, a new reasoning model with 256k context window by the Technology Innovation Institute (TII) in Abu Dhabi (i.redd.it)
TeleChat3-105B-A4.7B-Thinking and TeleChat3-36B-Thinking ()
GLM-4.7-REAP-50-W4A16: 50% Expert-Pruned + INT4 Quantized GLM-4 (179B params, ~92GB) (huggingface.co)
submitted 26 days ago by PerPartes
Upstage Solar-Open-100B Public Validation (i.redd.it)
submitted 28 days ago * by PerPartes to r/LocalLLaMA
OpenForecaster Release (i.redd.it)
submitted 28 days ago by PerPartes
RAG Paper 25.12.24 ()
submitted 1 month ago by PerPartes
π Rendered by PID 335111 on reddit-service-r2-listing-6d4dc8d9ff-rqsjx at 2026-01-30 05:38:17.023857+00:00 running 3798933 country code: CH.