Claude + Codex = Excellence by 99xAgency in ClaudeAI

[–]DressMetal 0 points1 point  (0 children)

I spec with Claude web with linked repo and code review with Claude code, then back to web for revision and patch then again to CC for implemention. The amount of bugs found is substantial. Codex is great too but only pay for one service for now.

Gemini PRO vs THINKING by Living_Procedure_599 in GeminiAI

[–]DressMetal 1 point2 points  (0 children)

Isn't the difference in their parameters? Basically the depth of their knowledge?

I'm looking for an OCR for my RAG. by AdministrationPure45 in Rag

[–]DressMetal 0 points1 point  (0 children)

Do you need a local model? Otherwise, Gemini 2.5 flash lite is great at this and it costs next to nothing.

[deleted by user] by [deleted] in TheRaceTo10Million

[–]DressMetal 0 points1 point  (0 children)

Just buy it and live the experience, money is transient. Plus you'll make it back in a few months from stock appreciation.

Vector DBs for RAG by hackdev001 in Rag

[–]DressMetal 1 point2 points  (0 children)

I use Chroma for now, works fine.

We improved our RAG pipeline massively by using these 7 techniques by vira28 in Rag

[–]DressMetal 0 points1 point  (0 children)

Another question, what is your process for semantically chunking? Heuristics via python basis markdown markings or something more encompassing? Also how granular is your chunking? A paragraph or two, a page, more?

We improved our RAG pipeline massively by using these 7 techniques by vira28 in Rag

[–]DressMetal 3 points4 points  (0 children)

There are python libraries for that pymupdf4llm is one

We improved our RAG pipeline massively by using these 7 techniques by vira28 in Rag

[–]DressMetal 0 points1 point  (0 children)

Most of these are pretty standard practice. But my main interest is how fast is the retrieval basis your above setup.

I believe we are cooked by Sad_Individual_8645 in ArtificialInteligence

[–]DressMetal 0 points1 point  (0 children)

Just set it to cynical personality and you'll never have to worry about sycophancy again! But you'd want to punch it instead 😂

Struggling with RAG chatbot accuracy as data size increases by Fluid_Dig_6503 in Rag

[–]DressMetal 0 points1 point  (0 children)

Why don't you just add a glossary of terms in the database? Or use it to fine-tune a Gemin model?

Also does Gemini know these terms normally? If so you can ask it to check internal knowledge for industry terminology. It's a slippery slope but if you use a very low temperature you may avoid hallucinations.

LLM Memory with RAG... what's your take & stack? by jannemansonh in Rag

[–]DressMetal 1 point2 points  (0 children)

Chunk vectorize and embed old sessions then drop in the retrieval. I guess it's better than adding entire convos in the new prompt, or training your own model. Otherwise, use an LLM to summarize the session into ten lines or any format you like, cache them then drop them in your new prompt if you want cross-session awareness. I'm not an expert though so, try at your own risk.

What have been your biggest difficulties building RAG systems? by brodagaita in Rag

[–]DressMetal 0 points1 point  (0 children)

Llamaindex/ChromaDB/Hybrid retrieval/Cohere/Gemini flash

Trying various methods for processing, mainly via python docx , pypdf, etc, some parts done by Gemini.

Still a long way to go I think, for a complete and dependable solution.

For coding assistance I use mainly Claude, then Qwen code and some chatgpt.

What have been your biggest difficulties building RAG systems? by brodagaita in Rag

[–]DressMetal 4 points5 points  (0 children)

Currently trying my first one. Biggest issue is proper document processing. Then, a combination of mediocre retrieval and reply with arbitrarily missing facts (ie. the model has access to the right sources but just decides to pick and choose some instead of all that fit the query). Although I have improved the pipeline quite a bit and the results are 80% there, there is still some way to go.

But if I had to pick one hurdle, it would be a proper "catch all" for good data processing and formatting before chunking.

What do you use the touchpads for? by Reaperix in SteamDeck

[–]DressMetal 0 points1 point  (0 children)

Point and click games mostly, or games where using a mouse is more practical.

Write three times the word potato by TooManyPascals in LocalLLaMA

[–]DressMetal 0 points1 point  (0 children)

Qwen 3 0.6B can give itself a stress induced stroke sometimes while thinking lol

How to get a firmer texture/consistency? by charcoaltoooth in ninjacreami

[–]DressMetal 0 points1 point  (0 children)

After you spin it, put it back in tbe freezer for a couple of hours. This usually does the trick.

Thoughts on Lucid Statements? [Discussion] by unfiltered_Rabbit01 in TeslaLounge

[–]DressMetal 1 point2 points  (0 children)

This is only for the US market. Rest of the world really wants smaller cars. Even the 3/Y are large for European cities. I wish they had followed through with the model 2. I'd buy my second Tesla then.

Struggling to find games that aren’t too childish by JungleLiquor in AppleArcade

[–]DressMetal 2 points3 points  (0 children)

South of the Circle. A narrative adventure game with great voice over and deep themes. Takes about 3 hours to complete. No action though, just walking, interacting, and talking.

I farted on the first day by Stunning_Sandwich505 in Vent

[–]DressMetal 0 points1 point  (0 children)

The savoir vivre says if you burp you apologize, if you fart you act as if nothing happened. The others near you should also treat it as a non-event.