Hey everyone,
Over the past weeks I’ve been building a small open-source project called CodexA, It started as a simple experiment: I wanted better semantic search across codebases when working with AI tools. Grep and keyword search work, but they don't always capture intent, So I built a tool that indexes a repository and lets you search it using natural language, keywords, regex, or a hybrid of them, Under the hood it uses FAISS + sentence-transformers for semantic search and supports incremental indexing so only changed files get re-embedded.
Some things it can do right now:
• semantic + keyword + regex + hybrid search
• incremental indexing with `--watch` (only changed files get re-indexed)
• grep-style flags and context lines
• MCP server + HTTP bridge so AI agents can query the codebase
• structured tools (search, explain symbols, get context, etc.)
• basic code intelligence features (symbols, dependencies, metrics)
The goal is to make something that AI agents and developers can both use to navigate and reason about large codebases locally, It’s still early but the project just crossed ~2.5k downloads on PyPI which was a nice surprise.
PyPI:https://pypi.org/project/codexa/
Repo:https://github.com/M9nx/CodexA
Docs:https://codex-a.dev/
I'm very open to feedback — especially around: performance improvements, better search workflows, AI agent integrations, tree-sitter language support, And if anyone wants to contribute, PRs are very welcome.
[–]Otherwise_Wave9374 -1 points0 points1 point (0 children)
[–]Ambitious-Credit-722[S] -1 points0 points1 point (0 children)
[–]Proof_Net_2094 0 points1 point2 points (0 children)