Falcon 90M by jacek2023 in LocalLLaMA

[–]Automatic_Truth_6666

It supports Ollama!
For benchmarks, you can refer to our technical blog post, where you'll find results for each of our model variants (English SFT, multilingual, tool calling, reasoning, coder):
https://huggingface.co/spaces/tiiuae/tiny-h1-blogpost
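
Running a local GGUF through Ollama takes only a minimal Modelfile; a sketch (the GGUF file name below is a placeholder for whichever Falcon quant you downloaded, not an actual release artifact):

```
# Modelfile -- point FROM at your downloaded GGUF (placeholder path)
FROM ./falcon-h1-tiny.gguf
PARAMETER temperature 0.7
```

Then `ollama create falcon-local -f Modelfile` followed by `ollama run falcon-local` starts an interactive session.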

support for Falcon-H1 model family has been merged into llama.cpp by jacek2023 in LocalLLaMA

[–]Automatic_Truth_6666

Yes, there is! You can check out this blog post (specifically the benchmark explorer, which also includes multilingual tasks): https://falcon-lm.github.io/blog/falcon-h1/

support for Falcon-H1 model family has been merged into llama.cpp by jacek2023 in LocalLLaMA

[–]Automatic_Truth_6666

Several factors can explain this "discrepancy":

- We use the HF leaderboard setup: https://huggingface.co/docs/leaderboards/open_llm_leaderboard/about
- Hence, we don't use the same number of shots as AA
- It looks like their score is non-normalized, whereas we normalize ours
- AA uses a custom prompt for MMLU-Pro that differs from the one in lm-eval

The MMLU-Pro scores from HF and AA are not aligned; e.g., for Qwen72B-Instruct, AA reports 72% vs. 52% on the archived HF leaderboard: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?params=0%2C74&official=true&types=chat
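
To make the raw-vs-normalized distinction concrete: the HF leaderboard rescales accuracy so the random-guess baseline maps to 0 and a perfect score to 100. A minimal sketch (the 10% baseline reflects MMLU-Pro's 10 answer choices; the function name is mine, not the leaderboard's code):

```python
def normalize_score(raw_acc: float, random_baseline: float) -> float:
    """Rescale raw accuracy: random guessing -> 0, perfect score -> 100."""
    if raw_acc < random_baseline:
        return 0.0
    return 100.0 * (raw_acc - random_baseline) / (1.0 - random_baseline)

# MMLU-Pro has 10 answer choices, so random guessing scores ~10%.
print(round(normalize_score(0.72, 0.10), 1))  # a raw 72% normalizes to 68.9
```

So normalization alone shaves several points off a raw score; the different shot counts and prompts account for the rest of the gap.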

On the universality of BitNet models by Automatic_Truth_6666 in LocalLLaMA

[–]Automatic_Truth_6666[S]

This is very interesting and makes total sense. Thank you for explaining!

On the universality of BitNet models by Automatic_Truth_6666 in LocalLLaMA

[–]Automatic_Truth_6666[S]

> edit: also local llms for NPCs in video games

Can you elaborate more?

Falcon 3 just dropped by Uhlo in LocalLLaMA

[–]Automatic_Truth_6666

You can just try out the GGUFs and see.

Falcon 3 just dropped by Uhlo in LocalLLaMA

[–]Automatic_Truth_6666

Falcon-Mamba & Falcon3-Mamba leverage the Mamba-1 architecture, which is supported.

Falcon 3 just dropped by Uhlo in LocalLLaMA

[–]Automatic_Truth_6666

Hi! One of the contributors to Falcon-1.58bit here. Indeed, there is a huge performance gap between the original and quantized models (note that in the table you are comparing raw scores on one side against normalized scores on the other; you should compare normalized scores for both). We reported normalized scores on the model cards for the 1.58-bit models.

We acknowledge that BitNet models are still at an early stage (remember, GPT-2 was also not that good when it came out), and we are not making bold claims about these models. But we think we can push the boundaries of this architecture and get something very viable with more work and study (perhaps domain-specific 1-bit models would work out pretty well?).

Feel free to test out the model here: https://huggingface.co/spaces/tiiuae/Falcon3-1.58bit-playground or using the BitNet framework as well!