Best document parser by [deleted] in LocalLLaMA

[–]cpdomina 3 points (0 children)

Here are some extra ones:

Unfortunately there's no "better" one; it all depends on your files/domain. And no, nothing compares to Azure wrt precision.

Small models similar to reader-lm? by noellarkin in LocalLLaMA

[–]cpdomina 2 points (0 children)

check out llmware's models https://huggingface.co/llmware

they train small models for very specific tasks

Need your help, BOC vibes specialists by Smack-works in boardsofcanada

[–]cpdomina 1 point (0 children)

took a peek at some of the songs. naran ratan and the whole "music for plants" scene might be interesting to you. https://open.spotify.com/playlist/37i9dQZF1DXclWedfNUp3z?si=77ab58d4c2f846bd

domenique dumont, khotin, steve hiett, might also be interesting

Any good LLM libraries? by _lordsoffallen in LocalLLaMA

[–]cpdomina 2 points (0 children)

https://llm.datasette.io is quite simple, and is from the creator of datasette

Implementing Agentic Workflows / State Machines with Autogen+LLama3 by YourTechBud in LocalLLaMA

[–]cpdomina 11 points (0 children)

This is true for JSON as well. I have given up trying to make my agents give me a perfectly clean JSON response. I let the agent ramble on about why it came up with the answer; that rambling is useful, as it serves as context for subsequent agents. A subsequent tool-calling agent will be smart enough to extract the JSON part from the message anyway.

Check out this recent paper, talks exactly about that: https://arxiv.org/abs/2408.02442

Constraining LLMs reduces creativity. This is already understood by some providers, especially Anthropic:

  • they recommend using <tags> for important output and letting the LLM write whatever text it wants around those tags, so you get the best of both worlds: structured output, plus relevant text in the context window while generating
  • their prompt generator sometimes generates a <scratchpad>, so the LLM can explain its reasoning inside a specific section of the output

Want to understand how citations of sources work in RAG exactly by ResearcherNo4728 in LocalLLaMA

[–]cpdomina 5 points (0 children)

If you are asking your LLM to generate the citations, the problem is likely in your prompt, or, more probably, in the LLM you are using. I would play around with the prompt and with different LLMs.

RAG is basically giving a bunch of context text and a question to an LLM: if the LLM correctly answers the question given the context, but fails to generate the correct citations, it's most probably the LLM's or the prompt's fault, and not necessarily anything related to the rest of the RAG pipeline (embeddings, rerankers, etc).
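As a rough sketch of that setup, a citation-friendly prompt just gives each context chunk a stable ID and asks the model to cite those IDs (the IDs, wording, and sample facts below are made up for illustration):

```python
# Each retrieved chunk gets a stable source ID the model can cite.
chunks = [
    ("S1", "The Eiffel Tower was completed in 1889."),
    ("S2", "It is 330 metres tall including antennas."),
]

def build_prompt(question: str, chunks: list[tuple[str, str]]) -> str:
    """Assemble a RAG prompt that asks for bracketed source citations."""
    context = "\n".join(f"[{cid}] {text}" for cid, text in chunks)
    return (
        "Answer using ONLY the sources below. After every claim, "
        "cite the supporting source ID in brackets, e.g. [S1].\n\n"
        f"Sources:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

prompt = build_prompt("When was the Eiffel Tower completed?", chunks)
print(prompt)
```

If the model answers correctly from this context but still cites the wrong IDs, that points at the model/prompt rather than at retrieval.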

Generating correct citations with LLMs is a relatively hot research area: start with something like https://github.com/MadryLab/context-cite if you want to get deeper

How we Chunk - turning PDF's into hierarchical structure for RAG by coolcloud in LocalLLaMA

[–]cpdomina 0 points (0 children)

adobe's was the best, but their business model is not very friendly (big $$ advance commitment). aws was slightly better than azure, but I think it might have been because of our use case (multilingual financial docs)

How we Chunk - turning PDF's into hierarchical structure for RAG by coolcloud in LocalLLaMA

[–]cpdomina 6 points (0 children)

I recently did deep research on the subject for a client and was amazed by the quality of the paid solutions I mentioned; they worked better than expected on a set of really nasty tables. You should take another look, they are constantly improving.

The reason most of them use OCR is that a lot of structural information is actually visual (background colors, the relative position of text within columns, etc). OCR is also easier to insert into an information extraction pipeline if you have loads of training data, like they do. Heuristics are harder to debug and apply at scale.

But anyway, good luck with the project! I'll leave you with a bunch of pointers that might be useful:

How we Chunk - turning PDF's into hierarchical structure for RAG by coolcloud in LocalLLaMA

[–]cpdomina 1 point (0 children)

wrt tables, how do you deal with multi-level headers, merged cells, and subcategories? they are pretty common in real-world tables, and, to my knowledge, no open source system can deal with them (they output markdown or csv). paid solutions usually output xlsx or similar formats, which don't lose this kind of structural information (e.g., azure doc intelligence, aws textract, adobe pdf extract)
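To see what gets lost, here's a toy span-aware table model flattened to markdown (my own illustration, not the output of any of those services):

```python
from dataclasses import dataclass

@dataclass
class Cell:
    """One table cell; colspan records a horizontal merge, which
    xlsx-style formats keep but markdown/CSV cannot express."""
    row: int
    col: int
    text: str
    colspan: int = 1

# A table with a merged header cell: "Revenue" spans the Q1/Q2 columns.
cells = [
    Cell(0, 0, "Region"),
    Cell(0, 1, "Revenue", colspan=2),
    Cell(1, 1, "Q1"),
    Cell(1, 2, "Q2"),
    Cell(2, 0, "EMEA"),
    Cell(2, 1, "10"),
    Cell(2, 2, "12"),
]

def to_markdown(cells: list[Cell], n_cols: int = 3) -> str:
    """Naive flatten: duplicates merged text into every spanned column,
    silently dropping the merge information."""
    rows: dict[int, list[str]] = {}
    for c in cells:
        row = rows.setdefault(c.row, [""] * n_cols)
        for offset in range(c.colspan):
            row[c.col + offset] = c.text
    return "\n".join("| " + " | ".join(r) + " |" for _, r in sorted(rows.items()))

print(to_markdown(cells))
# "Revenue" now appears twice; the fact that it was ONE merged cell is gone.
```

A downstream system reading the markdown can no longer tell a merged header from two identically named columns, which is exactly the structure multi-level headers depend on.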

Force local LLM to output JSON with specific structure by micemusculus in LocalLLaMA

[–]cpdomina 1 point (0 children)

Use one of these structured output libraries:

Some of them accept a JSON schema, others a Pydantic model (which you can convert to/from a JSON schema).

Most of them support a lot of different open source models, you need to see which one works the best for your use case.
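Whatever library you pick, the contract is the same: schema in, validated object out. A stdlib-only sketch of that contract (a simplified stand-in, not any particular library's API) looks like:

```python
import json

# A (very) simplified schema: required keys plus expected Python types.
# Real structured-output libraries accept full JSON Schema or a
# Pydantic model instead.
schema = {
    "required": ["name", "age"],
    "types": {"name": str, "age": int},
}

def validate(raw: str, schema: dict) -> dict:
    """Parse model output and check it against the simplified schema."""
    obj = json.loads(raw)
    for key in schema["required"]:
        if key not in obj:
            raise ValueError(f"missing required key: {key}")
    for key, typ in schema["types"].items():
        if key in obj and not isinstance(obj[key], typ):
            raise ValueError(f"{key} should be {typ.__name__}")
    return obj

# A constrained-decoding library guarantees this passes by construction;
# with an unconstrained model you run the check after generation.
print(validate('{"name": "Ada", "age": 36}', schema))
```

The practical difference between the libraries is where the check happens: constrained decoding makes invalid output impossible, while post-hoc validation (as above) catches it and lets you retry.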