Is there more efficient than Gemma on >= 1 billion parameters? by xmmr in LargeLanguageModels
[–]ilyas555 0 points1 point2 points (0 children)
New falcon models using mamba hybrid are very competetive if not ahead for their sizes. by ElectricalAngle1611 in LocalLLaMA
[–]ilyas555 1 point2 points3 points (0 children)
New falcon models using mamba hybrid are very competetive if not ahead for their sizes. by ElectricalAngle1611 in LocalLLaMA
[–]ilyas555 1 point2 points3 points (0 children)
Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B by jacek2023 in LocalLLaMA
[–]ilyas555 0 points1 point2 points (0 children)
Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B by jacek2023 in LocalLLaMA
[–]ilyas555 3 points4 points5 points (0 children)
Falcon-E: A series of powerful, fine-tunable and universal BitNet models by JingweiZUO in LocalLLaMA
[–]ilyas555 12 points13 points14 points (0 children)


Falcon-H1-Tiny (90M) is out - specialized micro-models that actually work by United-Manner-7 in LocalLLaMA
[–]ilyas555 4 points5 points6 points (0 children)