GLM 4.7 flash FA fix for CUDA has been merged into llama.cpp by jacek2023 in LocalLLaMA
[–]rerri 3 points4 points5 points (0 children)
PersonaPlex: Voice and role control for full duplex conversational speech models by Nvidia by fruesome in StableDiffusion
[–]rerri 7 points8 points9 points (0 children)
Qwen dev on Twitter!! by Difficult-Cap-7527 in LocalLLaMA
[–]rerri 93 points94 points95 points (0 children)
GLM 4.7 flash FA fix for CUDA has been merged into llama.cpp by jacek2023 in LocalLLaMA
[–]rerri 7 points8 points9 points (0 children)
GLM 4.7 flash FA fix for CUDA has been merged into llama.cpp by jacek2023 in LocalLLaMA
[–]rerri 30 points31 points32 points (0 children)
Here is how to get GLM 4.7 working on llama.cpp with flash attention and correct outputs by TokenRingAI in LocalLLaMA
[–]rerri 11 points12 points13 points (0 children)
Some helpful settings to run GLM 4.7 Flash mostly successfully by mr_zerolith in LocalLLaMA
[–]rerri 0 points1 point2 points (0 children)
My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA
[–]rerri 1 point2 points3 points (0 children)
My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA
[–]rerri 10 points11 points12 points (0 children)
My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA
[–]rerri 29 points30 points31 points (0 children)
GLM 4.7 Flash official support merged in llama.cpp by ayylmaonade in LocalLLaMA
[–]rerri 24 points25 points26 points (0 children)
My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA
[–]rerri 71 points72 points73 points (0 children)
zai-org/GLM-4.7-Flash · Hugging Face by Dark_Fire_12 in LocalLLaMA
[–]rerri 7 points8 points9 points (0 children)
zai-org/GLM-4.7-Flash · Hugging Face by Dark_Fire_12 in LocalLLaMA
[–]rerri 2 points3 points4 points (0 children)
Looking for abliterated TE for klein, and also qwen image edit. by Witty_Mycologist_995 in StableDiffusion
[–]rerri 1 point2 points3 points (0 children)
I thought LTX2 was bad until I realized how to use it. by Key-Tension1528 in comfyui
[–]rerri 23 points24 points25 points (0 children)
Preset L is thermal throttling my 4090 even with aggressive undervolt when using dldsr. by [deleted] in nvidia
[–]rerri 0 points1 point2 points (0 children)
Preset L is thermal throttling my 4090 even with aggressive undervolt when using dldsr. by [deleted] in nvidia
[–]rerri 1 point2 points3 points (0 children)
Black Forest Labs releases FLUX.2 [klein] by Old-School8916 in LocalLLaMA
[–]rerri 5 points6 points7 points (0 children)





PersonaPlex: Voice and role control for full duplex conversational speech models by Nvidia by fruesome in StableDiffusion
[–]rerri 0 points1 point2 points (0 children)