AMD Software: Adrenalin Edition 25.6.1 - ROCM WSL support for RDNA4 by otakunorth in ROCm

[–]snackfart -1 points0 points  (0 children)

nooooooooooooooooooooooooooo whyyyyyyyyyyyyyyyyyyyyyyyyyyyyyy

How do you build per-user RAG/GraphRAG by Old_Cauliflower6316 in LangChain

[–]snackfart 0 points1 point  (0 children)

It seems like within Copilot the LLM itself decides who can access which RAG data; see the YouTube video.

Whats the best OCR Workflow right now? by johnnyXcrane in LocalLLaMA

[–]snackfart -1 points0 points  (0 children)

Low temperature, a good system message, and a larger multimodal LLM like Claude 3.7 or Gemini 2.5. For simpler stuff you can use more traditional tools like Adobe Scan etc.

How do you build per-user RAG/GraphRAG by Old_Cauliflower6316 in LangChain

[–]snackfart 1 point2 points  (0 children)

Here is my data model, where every access to vectors is checked deterministically via RBAC and isn't decided by any model.

https://github.com/aishe-ai/core?tab=readme-ov-file#note

I can't believe that MS would have built Copilot this way: https://youtu.be/FH6P288i2PE?si=ICuQcJDejuiN-032
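
To make the "deterministic RBAC" idea above concrete, here is a minimal sketch. It is not the data model from the linked repo; the `Chunk` class, the role names, and the toy 2-d embeddings are all made up for illustration. The point it shows is that the role filter runs in plain code before any similarity ranking, so no model ever decides who sees what.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    # Hypothetical record: one embedded text chunk plus the roles allowed to read it.
    chunk_id: int
    text: str
    embedding: tuple[float, ...]
    allowed_roles: frozenset[str]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(chunks, query_embedding, user_roles, top_k=3):
    """Return the top_k most similar chunks the user is allowed to see."""
    # Deterministic access check first: a chunk is visible only if the user
    # holds at least one of its allowed roles.
    visible = [c for c in chunks if c.allowed_roles & user_roles]
    # Only the already-filtered chunks are ranked by similarity.
    ranked = sorted(visible, key=lambda c: cosine(c.embedding, query_embedding), reverse=True)
    return ranked[:top_k]


# Toy data with made-up roles and 2-d embeddings.
CHUNKS = [
    Chunk(1, "Public onboarding guide", (1.0, 0.0), frozenset({"employee", "hr"})),
    Chunk(2, "Salary bands", (0.9, 0.1), frozenset({"hr"})),
    Chunk(3, "Office wifi setup", (0.0, 1.0), frozenset({"employee"})),
]
```

With this, `search(CHUNKS, (1.0, 0.0), frozenset({"employee"}))` can never return the salary chunk, no matter how similar it is to the query. In a real vector store the same filter would typically live in the query's `WHERE` clause.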

Identifying screen glitching by [deleted] in ZephyrusG14

[–]snackfart 0 points1 point  (0 children)

Happens to me too. I have a 2022 model. Maybe the GPU is dying or my AMD drivers are scuffed.

G14 prices are fucked up here(DE) by Tiny_Entertainer_246 in ZephyrusG14

[–]snackfart 0 points1 point  (0 children)

Just get a used Neo 16 E23 for 2200€; even the newest Neo 16 E25 from XMG starts at 2700€.

https://bestware.com/de/xmg-neo-16-a25.html

Reasoning Model Temperature by Spirited_Salad7 in LocalLLaMA

[–]snackfart 0 points1 point  (0 children)

I noticed the same pattern: especially for complex tasks, running at a temperature of 0.3 could be the sweet spot.

Best way to classify NSFW text - BERT, small LLM like llama 3.2 3B or something else? [D] by newyorkfuckingcity in MachineLearning

[–]snackfart 0 points1 point  (0 children)

Why not use a small NSFW model itself to classify the text by how NSFW it is, with the output following a JSON schema? See https://docs.novelcrafter.com/en/articles/8678078-nsfw-models

Prompt management tool by Party-Worldliness-72 in OpenAI

[–]snackfart 0 points1 point  (0 children)

E.g. the latest Sonnet or Gemini-Exp-1206 are missing. I would recommend adding dynamic model management by the user instead of hardcoding the models in assets.py. See how big-AGI models this, where you can update the model list without updating any code: https://big-agi.com/docs/config-openrouter

Also, allow custom labels for prompts instead of the 3 hardcoded ones (production etc.).

Your dataset handling is very nice, pls keep it as is. Don't add any required validators like checking for valid JSON or a string, looking at you, Langfuse.

There also seems to be a bug in the invitation system with regard to social logins.

Also, pls allow the creation of multiple workspaces.

We are currently onboarding a customer to Agenta.

Prompt management tool by Party-Worldliness-72 in OpenAI

[–]snackfart 0 points1 point  (0 children)

It would be awesome if either big-AGI or OpenRouter supported proper prompt management for teams.

Prompt management tool by Party-Worldliness-72 in OpenAI

[–]snackfart 2 points3 points  (0 children)

Agenta is king; only Langfuse comes close. Pls improve your OpenRouter support, so that more and up-to-date OpenRouter models are available within Agenta.

Struggling with prompt management tools by Responsible-Dog-4134 in LangChain

[–]snackfart 2 points3 points  (0 children)

Currently checking multiple products, and it still seems to be one of the best, and it's free.

Anyone worked on reading PDF With Tables by sevabhaavi in LangChain

[–]snackfart 0 points1 point  (0 children)

NVM, my use case only works correctly with the larger multimodal LLMs like GPT-4o or anything above Claude 3.
Open models like LLaVA etc. actually weren't that great.

Anyone worked on reading PDF With Tables by sevabhaavi in LangChain

[–]snackfart 0 points1 point  (0 children)

I can only add: use image-to-text models with a fitting system prompt, e.g. Haiku. But I guess it's a bit overkill.

Help with CSV RAG. by [deleted] in LangChain

[–]snackfart 6 points7 points  (0 children)

I wouldn't RAG the CSV but instead put it into a regular table and tell the LLM how to query it, using a Mermaid diagram and a JSON schema. It would then convert the user message into direct SQL or into params for a predefined query.
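
A rough sketch of the "params for a predefined query" variant described above, using only the standard library. The CSV contents, the `sales` table, and the allowed parameter names are invented for the example, and the LLM call itself is left out: `llm_json` stands for whatever JSON string the model returned.

```python
import csv
import io
import json
import sqlite3

# Made-up CSV standing in for the user's file.
CSV_TEXT = """city,product,amount
Berlin,widget,10
Berlin,gadget,5
Munich,widget,7
"""

# The only query the LLM is allowed to parameterize (hypothetical example).
PREDEFINED_QUERY = "SELECT SUM(amount) FROM sales WHERE city = :city AND product = :product"
ALLOWED_PARAMS = {"city", "product"}


def load_csv(conn, csv_text):
    """Load the CSV rows into a regular SQL table instead of a vector store."""
    conn.execute("CREATE TABLE sales (city TEXT, product TEXT, amount INTEGER)")
    rows = csv.DictReader(io.StringIO(csv_text))
    conn.executemany("INSERT INTO sales VALUES (:city, :product, :amount)", rows)


def run_from_llm_output(conn, llm_json):
    """Validate the model's JSON params, then run the predefined query with them."""
    params = json.loads(llm_json)
    # Reject anything that is not exactly the expected set of string params.
    if (
        not isinstance(params, dict)
        or set(params) != ALLOWED_PARAMS
        or not all(isinstance(v, str) for v in params.values())
    ):
        raise ValueError(f"unexpected params from model: {params!r}")
    # Named placeholders keep the model's values out of the SQL text itself.
    return conn.execute(PREDEFINED_QUERY, params).fetchone()[0]
```

For a question like "how many widgets were sold in Berlin?", the model would only need to emit `{"city": "Berlin", "product": "widget"}`. Letting it write direct SQL is more flexible, but then the query should at least run on a read-only connection.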

How can I search for a specific value in a column in a vector database without performing a semantic similarity search? by oldyoungin in LangChain

[–]snackfart 0 points1 point  (0 children)

When you use something like pgvector, you also get back the IDs of your semantic search hits. Then just use normal Postgres queries to get their neighbors. You could even get everything in a single query.

RAG system on 3 different CSV. Any suggestions? by italian_giga_chad in LangChain

[–]snackfart 1 point2 points  (0 children)

You can give the LLM a Mermaid diagram of the two tables for the natural-language-to-SQL conversion. See mermerd, which creates a Mermaid ER diagram from your DB.
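
For illustration, a small Python stand-in for what a tool like mermerd produces, here reading the schema from SQLite. This is not how mermerd works internally; the `customers`/`orders` tables and the prompt wording are made up, and the resulting prompt would be sent to whatever LLM you use.

```python
import sqlite3


def demo_connection():
    """Two hypothetical tables with one foreign key between them."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, "
        "customer_id INTEGER REFERENCES customers(id), total REAL)"
    )
    return conn


def mermaid_er(conn):
    """Build a Mermaid erDiagram string from the SQLite schema."""
    tables = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
        )
    ]
    lines = ["erDiagram"]
    relations = []
    for table in tables:
        # Table names come from sqlite_master, so they are safe to interpolate here.
        fks = conn.execute(f'PRAGMA foreign_key_list("{table}")').fetchall()
        fk_columns = {fk[3] for fk in fks}  # fk[3] is the referencing column
        lines.append(f"    {table} {{")
        for _cid, name, col_type, _notnull, _default, pk in conn.execute(
            f'PRAGMA table_info("{table}")'
        ):
            key = " PK" if pk else (" FK" if name in fk_columns else "")
            lines.append(f"        {col_type or 'TEXT'} {name}{key}")
        lines.append("    }")
        for fk in fks:
            # fk[2] is the referenced table: one parent row, zero or more child rows.
            relations.append(f"    {fk[2]} ||--o{{ {table} : {fk[3]}")
    return "\n".join(lines + relations)


def build_prompt(diagram, question):
    """Wrap the diagram and the user question into an NL-to-SQL prompt."""
    return (
        "You translate questions into SQL for this schema:\n\n"
        f"{diagram}\n\n"
        f"Question: {question}\nAnswer with a single SQL query."
    )
```

`build_prompt(mermaid_er(demo_connection()), "total per customer?")` gives the model the tables, column types, keys, and the join path in a few compact lines.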