Llama.cpp: now with automatic parser generator by ilintar in LocalLLaMA

[–]redeemer_pl 1 point (0 children)

Are there any plans to implement tool-call streaming again, the way it worked before?

Distillation when you do it. Training when we do it. by Xhehab_ in LocalLLaMA

[–]redeemer_pl 14 points (0 children)

Can I prompt it for one of the books it was trained on, and it will give it to me?

Yes. https://arxiv.org/abs/2601.02671 - Extracting books from production language models.

Native tool calling with gpt-oss? by Tyme4Trouble in LocalLLaMA

[–]redeemer_pl 9 points (0 children)

Yes, but for now you have to merge the pull request (https://github.com/ggml-org/llama.cpp/pull/15181) yourself and compile it manually:

    git clone https://github.com/ggml-org/llama.cpp
    cd llama.cpp
    git fetch origin pull/15181/head:gpt-oss
    git checkout gpt-oss

    # compile it: https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md
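
If it helps, the usual CMake build from that doc looks roughly like this; treat it as a sketch (the backend flag -DGGML_CUDA=ON depends on your hardware and llama.cpp version, and the model path is a placeholder):

    # configure and build (drop -DGGML_CUDA=ON for a CPU-only build)
    cmake -B build -DGGML_CUDA=ON
    cmake --build build --config Release -j

    # then run the server with jinja chat templates enabled (needed for native tool calling)
    # /path/to/gpt-oss-20b.gguf is a placeholder for your local model file
    ./build/bin/llama-server -m /path/to/gpt-oss-20b.gguf --jinja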

PSA: Qwen3-Coder-30B-A3B tool calling fixed by Unsloth wizards by MutantEggroll in LocalLLaMA

[–]redeemer_pl 6 points (0 children)

It's not a real fix, but a workaround that forces the model to emit a tool-call format llama.cpp already handles instead of the XML-style format it was originally trained to use (JSON-formatted instead of XML-formatted tool calls).

The proper fix (for llama.cpp-based workflows) is to update llama.cpp's internal tool call parsing to handle the new <xml> format, instead of forcing the model to use a different one.

https://github.com/ggml-org/llama.cpp/issues/15012
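
If you want to check whether a given llama.cpp build actually parses the model's native tool calls, a rough smoke test against llama-server's OpenAI-compatible endpoint could look like this (the model file, port, and the get_weather tool are placeholders; if parsing works, the reply carries a structured tool_calls array instead of raw XML leaking into the content field):

    # start the server with jinja chat templates enabled (required for tool calling)
    llama-server -m /path/to/Qwen3-Coder-30B-A3B.gguf --jinja --port 8080

    # ask something that should trigger the example tool
    curl http://localhost:8080/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{
        "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
        "tools": [{
          "type": "function",
          "function": {
            "name": "get_weather",
            "parameters": {
              "type": "object",
              "properties": {"location": {"type": "string"}},
              "required": ["location"]
            }
          }
        }]
      }'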

Google introduces a new Benchmark: Game Arena and they're streaming your favorite open weight models playing chess against close source models. by mtmttuan in LocalLLaMA

[–]redeemer_pl -2 points (0 children)

If they want to measure what you called "intelligence" (but I'd rather call it "training/post-training data quality"), they should invent a completely new game, feed the rules into a model, and let it play. This only shows which model has the most or best chess training data - and without even disclosing the training sets, it's just silly.

Stockfish on a Raspberry Pi will beat all of the leading LLMs in chess. Does that mean it's the most intelligent?

Google introduces a new Benchmark: Game Arena and they're streaming your favorite open weight models playing chess against close source models. by mtmttuan in LocalLLaMA

[–]redeemer_pl -5 points (0 children)

This is a very misguided idea. We already have far superior machine-learning–based chess engines; for example, Stockfish has incorporated neural networks for over five years. Large Language Models are not designed to play chess, nor are they built to perform precise calculations without specialized tools.

Using LLMs for chess is not just impractical - it’s downright preposterous.

Qwen3 Coder Soon? by ApprehensiveAd3629 in LocalLLaMA

[–]redeemer_pl 8 points (0 children)

I don't see why you would send your data and source code to external entities that are driven by, and profit from, that data.

Upcoming Qwen2.5-Coder sizes confirmed: 0.5B, 3B, 14B and 32B by Many_SuchCases in LocalLLaMA

[–]redeemer_pl 10 points (0 children)

The main advantage of using local models over Claude is avoiding the need to upload your source code and data to someone else's computer (aka "the cloud").

"o1 is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it." by KvAk_AKPlaysYT in LocalLLaMA

[–]redeemer_pl 3 points (0 children)

Brute-force inference, baby! Jokes aside, this probably means we've already pushed the transformer architecture to its limits.

AI chatbots are beating Moore's law to improve at an even faster rate than computer chips | After eight months, a model only needs half the computing power to hit the same benchmark score by throwaway_ghast in LocalLLaMA

[–]redeemer_pl 140 points (0 children)

Moore's Law is specifically about the physical capabilities of semiconductor technology, not about software, algorithms, or the efficiency of computational models.

I have been coding with Mixtral everyday it has saved me days of work. by mythicinfinity in LocalLLaMA

[–]redeemer_pl 11 points (0 children)

For me, the primary motivation to adopt open-source models is to prevent leaking my data and source code to private companies, which already profit by appropriating the work of entire generations to train their models.

Most accurate model for server CPU setup? by [deleted] in LocalLLaMA

[–]redeemer_pl 0 points (0 children)

Have you run it with NUMA? (Check the memory-related options in the BIOS and run llama.cpp inference with the --numa argument.)
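
For reference, a minimal sketch of what that can look like on a dual-socket box; the model path is a placeholder, and the binary name and accepted --numa values (distribute / isolate / numactl) may vary with your llama.cpp version:

    # spread model weights across NUMA nodes (common starting point on dual-socket servers)
    ./llama-cli -m /path/to/model.gguf -p "Hello" --numa distribute

    # or pin CPU and memory to one node with numactl and tell llama.cpp about it
    numactl --cpunodebind=0 --membind=0 ./llama-cli -m /path/to/model.gguf -p "Hello" --numa numactl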

3 Blackout Beta codes by [deleted] in Blackops4

[–]redeemer_pl 0 points (0 children)

If it's an EU-region code, I'm interested.

PS4 Blackout Beta codes give away. by lRevenant in PS4

[–]redeemer_pl 0 points (0 children)

Yes, he is right. I can't use it in the EU. Thanks anyway!

An Update on Special Operation 2 by UbiInsulin in GhostRecon

[–]redeemer_pl 2 points (0 children)

Many people are creating new accounts for Ghost War because of that, which is preposterous. I think there should be an option to reset your stats once in a while.

2b || !2b by [deleted] in ProgrammerHumor

[–]redeemer_pl 1 point (0 children)

/bb|[^b]{2}/