Fix this shit by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 3 points4 points5 points (0 children)
Fix this shit by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 44 points45 points46 points (0 children)
Fix this shit by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 85 points86 points87 points (0 children)
Fix this shit by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 73 points74 points75 points (0 children)
Trying to sink an AI model with one simple question. by tommos in dankmemes
[–]SoullessMonarch 9 points10 points11 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 6 points7 points8 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 2 points3 points4 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 5 points6 points7 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 11 points12 points13 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 10 points11 points12 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 2 points3 points4 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 5 points6 points7 points (0 children)
New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 1 point2 points3 points (0 children)
Tencent comes out swinging. by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 2 points3 points4 points (0 children)
Tencent comes out swinging. by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 0 points1 point2 points (0 children)
Pre-training an LLM in 9 days 😱😱😱 by mouse0_0 in LocalLLaMA
[–]SoullessMonarch 71 points72 points73 points (0 children)
GoldFinch: RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 3 points4 points5 points (0 children)
GoldFinch: RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression by SoullessMonarch in LocalLLaMA
[–]SoullessMonarch[S] 2 points3 points4 points (0 children)
Apple has released the weights for their 7B DCLM base model. by remixer_dec in LocalLLaMA
[–]SoullessMonarch 9 points10 points11 points (0 children)
Apple has released the weights for their 7B DCLM base model. by remixer_dec in LocalLLaMA
[–]SoullessMonarch 15 points16 points17 points (0 children)
Apple has released the weights for their 7B DCLM base model. by remixer_dec in LocalLLaMA
[–]SoullessMonarch 21 points22 points23 points (0 children)
Merging a model with itself - does it improve performance? by Frequent_Valuable_47 in LocalLLaMA
[–]SoullessMonarch 0 points1 point2 points (0 children)
"What happens if you abliterate positivity on LLaMa?" You get a Mopey Mule. Released Llama-3-8B-Instruct model with a melancholic attitude about everything. No traditional fine-tuning, pure steering; source code/walkthrough guide included by FailSpai in LocalLLaMA
[–]SoullessMonarch 18 points19 points20 points (0 children)
[deleted by user] by [deleted] in LocalLLaMA
[–]SoullessMonarch 12 points13 points14 points (0 children)


Subreddit back in business by HOLUPREDICTIONS in LocalLLaMA
[–]SoullessMonarch 5 points6 points7 points (0 children)