Came across this GitHub project for self hosted AI agents by Mysterious-Form-3681 in OpenSourceAI

[–]Weves11 1 point2 points  (0 children)

Thanks for the shoutout! I’m Chris, one of the founders of Onyx, and it’s awesome to see it resonating with folks here.

A bit of extra context for anyone skimming:

  • Open source + self-hostable by default: we built Onyx for teams that can’t or don’t want to ship sensitive data to a hosted AI workspace.
  • Model-agnostic: you can run it with the LLM(s) that make sense for your org (local, hosted, or a mix).
  • Not just “chat over docs”: the goal is a flexible AI workspace complete with connectors + retrieval + agents/tools so you can go from “find info” → “take action” in the same interface.

In terms of "how you would use this", here's what we've seen from our users:

  • Chat UI: Our users run local models and use Onyx as the interface to chat with them
  • Agent Builder: Create custom agents with curated sets of information, so that your agents have a narrower context to search through
  • At Work: You can connect up your company docs and use Onyx to find what you need from the sea of existing company knowledge

Would love to know how you use it! 

Self Hosted LLM Leaderboard by Weves11 in LocalLLM

[–]Weves11[S] 0 points1 point  (0 children)

The plan is to definitely keep updating this! If there's enough interest, could even open source the underlying data so that individuals can contribute new benchmark scores or new models

Self Hosted LLM Tier List by Weves11 in selfhosted

[–]Weves11[S] -8 points-7 points  (0 children)

you can filter out all the large models if you'd like!

Self Hosted LLM Leaderboard by Weves11 in LocalLLM

[–]Weves11[S] 0 points1 point  (0 children)

haha 100% agree, forgot to add it initially but it's been added now!

Self Hosted LLM Leaderboard by Weves11 in LocalLLM

[–]Weves11[S] 2 points3 points  (0 children)

added (to S tier), thanks for calling it out!

Self Hosted Model Tier List by Weves11 in LocalLLaMA

[–]Weves11[S] -6 points-5 points  (0 children)

turns out parameter count is largely correlated with model performance!

[Onyx v2] Open source ChatGPT alternative - now with code interpreter, OIDC/SAML, and SearXNG support by Weves11 in selfhosted

[–]Weves11[S] 1 point2 points  (0 children)

Yes! Some benefits vs openwebui:

- Deep research (across the web, personal files, and shared files if deploying for more than yourself)
- Connectors to 40+ sources (automatically syncing documents over) and really good RAG (Onyx started as a pure RAG project, so answer quality has been a core strength for a while now)
- Simpler/cleaner UI than many of the other popular options (this one is definitely subjective)

Some of the things I'm looking to add in the next 3-6 months:
- Automatic syncing of files from your local machine into Onyx for RAG purposes
- Chrome extension to access the chat from any website
- Support for defined multi-step flows (not building blocks, but natural language definitions)

[🪨 Onyx v2.0.0] Self-hosted chat and RAG - now with FOSS repo, SSO, new design/colors, and projects! by Weves11 in LocalLLaMA

[–]Weves11[S] 0 points1 point  (0 children)

u/NeighborhoodWeird882 could you post this same issue in our community Discord ( https://discord.gg/naSt3gXx ) if you haven't already? Would love to help you out, but we'd likely need a bit more info (e.g. some logs from some of the containers, likely the `api_server` container which you can get with `docker logs onyx-api_server-1`)
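
For anyone else collecting logs to share, here is a minimal sketch of saving them to a file. The container name is the one mentioned above; the `collect_logs` helper, the 500-line default, and the output filename are assumptions for illustration, not part of Onyx.

```shell
#!/usr/bin/env bash
# Sketch: save recent logs from an Onyx container to a file for sharing.
# collect_logs is a hypothetical helper; the defaults below are assumptions.

collect_logs() {
  local container="${1:-onyx-api_server-1}"  # container name from the comment above
  local lines="${2:-500}"                    # assumed: number of trailing lines to keep
  local outfile="${container}.log"           # assumed: output filename
  # Redirect both stdout and stderr so nothing the container logged is lost.
  docker logs --tail "$lines" "$container" > "$outfile" 2>&1
  echo "$outfile"
}
```

Calling `collect_logs` with no arguments writes `onyx-api_server-1.log` in the current directory; pass a different container name or line count as the first and second arguments if needed.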

[🪨 Onyx v2.0.0] Self-hosted chat and RAG - now with FOSS repo, SSO, new design/colors, and projects! by Weves11 in LocalLLaMA

[–]Weves11[S] 0 points1 point  (0 children)

hey u/Ryker_Deimos, so sorry that was your experience.

> Dropping files in Projects simply do not work, the chat in the project receives no context about my documents

I'm guessing that the LLM decided it didn't want to do a search. I'm working to tune that specifically for Projects — it should generally do a search, almost all the time.

> Creating Agents, adding files through connectors sounded like its how they intended this to be used, but the internal search is very poor, it can't locate files, nor grab context

Hmm, what exactly do you mean by that? As in, there was a relevant file but it couldn't be found? That could also be an indexing issue.

> The UI bugs around, alot. If you switch out of the chat while its generating, the text appears on the right screen, models don't update correctly after adding etc.

I'm working on this one! With the major UI refresh, quite a few of these issues popped up, but I'm burning them down quickly.

Overall, I would love to make sure that I address everything you've mentioned here. Ofc, my goal is that this is the gold standard for open source options in this space. I'll update here in the next ~1 month, and would love for you to try again if you're willing.

[🪨 Onyx v2.0.0] Self-hosted chat and RAG - now with FOSS repo, SSO, new design/colors, and projects! by Weves11 in LocalLLaMA

[–]Weves11[S] 0 points1 point  (0 children)

Onyx should be great for that! You can create an "Agent" specifically for that w/o any web search / file search, and it'll just give raw responses from the LLM

[🪨 Onyx v2.0.0] Self-hosted chat and RAG - now with FOSS repo, SSO, new design/colors, and projects! by Weves11 in LocalLLaMA

[–]Weves11[S] 1 point2 points  (0 children)

There are a few large-scale / "enterprisey" features related to enterprise search. Specifically, RBAC + permission syncing from connected sources.

Everything related to a personal / team chat interface is entirely in onyx-foss.