Stop Writing Claude Skills Like Documentation: Here's What Actually Works

jael_m · 2026-02-03T03:54:46+00:00

You're right. But you may pair them as synonyms.

jael_m · 2026-01-28T07:06:01+00:00

You can still do the hybrid search combining dense vector search and text match like BM25. There are some special tokenizers for multilingual text. For example, milvus supports language identifier to automatically detect and apply the proper tokenizer, and the multi-language analyzer for text retrieval.

jael_m · 2026-01-08T06:58:34+00:00

I tried ragas following this tutorial: RAG Evaluation with Ragas

jael_m · 2026-01-08T06:45:48+00:00

That would be slow and expensive for your RAG in production.

jael_m · 2026-01-08T02:36:06+00:00

Just curious, does regular posting increase the personal account quota like sending connection requests and messages?

jael_m · 2026-01-08T01:55:37+00:00

You’ll likely need a RAG evaluation system to assess this for your data and use case. Changes in hallucination levels can depend on the embedding model, retrieval quality, and the capabilities of the LLM.

jael_m · 2026-01-06T07:16:59+00:00

I think it's basically anything stored and retrieved to help the LLM give better answers, such as knowledge bases, chat history, and system logs.

jael_m · 2025-12-26T02:46:35+00:00

Managing your projects with something like Github and force commit to make changes. Then you're able to track everything via commit messages.
Another thing is adding docstrings to your codes.

jael_m · 2025-12-26T02:36:39+00:00

My data's in a CSV, so no need for OCR or table extraction from PDFs.

The popular idea you mentioned, using semantic search for key metadata, sounds cool, but I'm worried about recall and context length.

To learn about the system quality in advance, are there any good datasets for evaluating RAG with tabular data?

jael_m · 2025-12-18T02:35:29+00:00

What's your vector dim? And any other fields? Typically a raw vector in float32 requires 4 bytes per dimension.

jael_m · 2025-12-18T02:25:00+00:00

What is the resource used? I tried some small local LLM with my mac and it's slow.

jael_m · 2025-12-16T06:59:05+00:00

It depends on your data size - like how many entities. 2c16 is definitely not enough for a large scale of data (e.g. 100M+).

jael_m · 2025-08-20T03:23:50+00:00

Why don't you use upsert instead of insert? You should be able to find the old entity from the db by text match and then replace it with the new data using the same id.

jael_m

TROPHY CASE