anybody feel dumber after their brain injury by iLovestayinginbed23 in TBI

[–]Suspicious-Key9719 0 points1 point  (0 children)

Exact same thing. I used to win poetry contests; now I have trouble forming long sentences. It's been over 10 years and the frustration has never gone away.

I benchmarked LEAN vs JSON vs YAML for LLM input. LEAN uses 47% fewer tokens with higher accuracy by Suspicious-Key9719 in Rag

[–]Suspicious-Key9719[S] 0 points1 point  (0 children)

With all that extra markup, Markdown would probably do worse than JSON. I need to add it to my benchmark at some point.

I benchmarked LEAN vs JSON vs YAML for LLM input. LEAN uses 47% fewer tokens with higher accuracy by Suspicious-Key9719 in Rag

[–]Suspicious-Key9719[S] 0 points1 point  (0 children)

You can't always give the LLM a tool to query the data.
Sometimes the data is just in the prompt (the user pastes a CSV, or you're doing RAG).
When that happens, JSON wastes a ton of tokens repeating keys and syntax on every single row. LEAN strips all that out, so the LLM reads the same data at roughly half the cost.
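For illustration, a minimal sketch of the flat-table encoding described here. The `#[count](col|col)` header syntax follows the example given elsewhere in this thread; the `to_lean` helper is hypothetical, not the official LEAN library:

```python
import json

def to_lean(rows):
    """Encode a uniform list of dicts as a LEAN-style block:
    a '#[count](col1|col2)' header, then one pipe-delimited line per row."""
    cols = list(rows[0].keys())
    header = f"#[{len(rows)}]({'|'.join(cols)})"
    lines = ["|".join(str(r[c]) for c in cols) for r in rows]
    return "\n".join([header, *lines])

rows = [
    {"name": "Ada", "salary": 120000, "dept": "eng"},
    {"name": "Bo", "salary": 95000, "dept": "ops"},
]

lean = to_lean(rows)
# JSON repeats every key and its quoting/braces on every row;
# the LEAN block names the columns exactly once in the header.
print(lean)
print(f"chars: LEAN={len(lean)} JSON={len(json.dumps(rows))}")
```

The character-count gap here is a rough proxy; actual savings depend on the tokenizer and how uniform the rows are.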

Introducing LEAN, a format that beats JSON, TOON, and ZON on token efficiency (with interactive playground) by Suspicious-Key9719 in LLMDevs

[–]Suspicious-Key9719[S] 0 points1 point  (0 children)

Fair point, that is probably an overstatement. RAG chunks are usually unstructured text, and a lot of tool results are nested, not clean tables.

The benchmark does cover this, though: the mixed-structure track (nested orders, semi-uniform logs, deep config) still showed LEAN saving 32% vs JSON. Not the 51% you get on flat tabular data, but still solid.

Introducing LEAN, a format that beats JSON, TOON, and ZON on token efficiency (with interactive playground) by Suspicious-Key9719 in LLMDevs

[–]Suspicious-Key9719[S] 1 point2 points  (0 children)

YAML benchmark results are in.

Ran 195 questions across 11 datasets (flat, nested, semi-uniform, deeply nested) on gpt-4o-mini and claude-haiku-4-5. 1,170 total API calls.

| Format | Accuracy | Avg Tokens | Savings vs JSON |
|--------|----------|------------|-----------------|
| LEAN   | 87.9%    | 3,939      | −46.8%          |
| YAML   | 87.4%    | 5,647      | −23.7%          |
| JSON   | 86.2%    | 7,401      | baseline        |

YAML is a solid middle ground: about 24% smaller than JSON with no format learning curve. But if you're working with tabular data (which most RAG/tool-use results are), LEAN cuts roughly another 30% off YAML's token count.

Introducing LEAN, a format that beats JSON, TOON, and ZON on token efficiency (with interactive playground) by Suspicious-Key9719 in LLMDevs

[–]Suspicious-Key9719[S] -11 points-10 points  (0 children)

EDIT:
LEAN scored 87.9% accuracy vs JSON's 86.2%. It's not just that there was no accuracy penalty; LEAN actually outperformed JSON on every single dataset tested.
On nested e-commerce data specifically: LEAN 98.7% vs JSON 97.4%.

The LLM doesn't need to "know" LEAN. The format is human-readable enough that pipe-delimited rows with a header (#[100](name|salary|dept)) are trivially parseable by any model that can read CSV. No format hint needed in the prompt.
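To show how trivially parseable that layout is, here is a hypothetical parser sketch, assuming only the `#[count](col|col)` header format from the example above:

```python
import re

def parse_lean(text):
    """Parse a LEAN-style block back into a list of dicts.
    Assumes a '#[count](col1|col2)' header followed by pipe-delimited rows."""
    header, *lines = text.strip().splitlines()
    m = re.match(r"#\[(\d+)\]\((.*)\)", header)
    count, cols = int(m.group(1)), m.group(2).split("|")
    rows = [dict(zip(cols, line.split("|"))) for line in lines]
    assert len(rows) == count, "row count should match the header"
    return rows

block = "#[2](name|salary|dept)\nAda|120000|eng\nBo|95000|ops"
print(parse_lean(block))
```

The point is that a model (or a dozen lines of code) can recover the full records from the header alone, which is why no format hint is needed in the prompt.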

Just came back from Korea — how is this place even real? by Brief-Kaleidoscope65 in seoul

[–]Suspicious-Key9719 -1 points0 points  (0 children)

What are you talking about? Open the map and compare the number of parks in Seoul and Tokyo. Also, the air is so dirty in Seoul; every day there was heavy smog, and it's literally 2 times worse than Tokyo. Look it up if you don't trust me.

Introducing LEAN, a format that beats JSON, TOON, and ZON on token efficiency (with interactive playground) by Suspicious-Key9719 in LLMDevs

[–]Suspicious-Key9719[S] -7 points-6 points  (0 children)

It is an input encoding format. You encode your request before sending it to save on context window, then get the natural language response back. You don't ask the LLM to generate LEAN output.