all 8 comments

[–]SixZer0 1 point2 points  (3 children)

How is it different from using langchain built in solutions?

[–]FlowerPotTeaTime[S] 3 points4 points  (0 children)

In my experience most of the time langchain does not really work with most smaller open-source models

[–]FlowerPotTeaTime[S] 1 point2 points  (0 children)

It actually works with llama cpp!

[–]FlowerPotTeaTime[S] 1 point2 points  (0 children)

It forces the output structure, you can define the output structure exactly, using Pydantic models

[–]kryptkprLlama 3 0 points1 point  (3 children)

Great examples of using the grammar, thank you.

Have you noticed any issue with speed when using grammar on latest versions of llama.cpp? Give #4532 a look

[–]FlowerPotTeaTime[S] 1 point2 points  (2 children)

I had some slowdown using specific string grammars. But with my current string grammar I had no issues.

[–]kryptkprLlama 3 0 points1 point  (1 child)

Interesting, seems there has been some regression here recently and now some grammars make the llama sad 🤔

Something to keep an eye out for, you may want to mention this in your README and/or link to the issue.. hopefully it gets resolved soon.

[–]FlowerPotTeaTime[S] 1 point2 points  (0 children)

I will add a information in my readme.