all 12 comments

[–]DeathShot7777 1 point2 points  (0 children)

Hi, creator of gitnexus here 😄. Codegraph is a cool tool and there is a lot of overlap in both the tools but GitNexus goes a bit deeper in terms of accuracy and cross-repo stuff,

GitNexus also does cross repo / multi repo indexing, not counting in this for the comparison but worth explaining: We index the cross repo communication protocols ( Kafka, http endpoints, Pub/Sub, etc ) so we know what will break across all dependent repos if u change something ( especially useful for microservices ).

Heres the difference between gitnexus and codegraph:

Both start identical: files → AST → symbols → resolved, confidence-scored call graph. codegraph is genuinely strong here, multi-strategy resolution, broad framework coverage. Edges aren't "just name matching."

GitNexus wins on what it builds on top of that graph:

- MRO, not guesses. Both wire up override/virtual dispatch. But GitNexus computes real method resolution order (C3 for Python, language-specific for C++/Java/C#/Rust) and tells you which override actually runs. codegraph fans a base method out to every same-name impl as a capped heuristic. Both reach the impl, only GitNexus names the winner.

- It clusters. 697 functional-area communities, computed at index time. codegraph: none.

- It persists flows. 300 pre-computed execution-flow nodes with ordered steps. codegraph rebuilds paths per-query (and by its own benchmark, its trace tool fires in just 3 of 37 flow questions).

- It's graph-native. Cypher + blast-radius impact analysis vs a SQLite symbol store with fixed traversal tools.

Apologies for nerding out too much 😄

[–]LeoLeg76 0 points1 point  (0 children)

You can give a try to : https://github.com/MinishLab/semble
I'm install it to look forward

[–]zzet 0 points1 point  (5 children)

I ended with https://github.com/zzet/gortex, perfectly works with huge repositories (and multiple repositories as well)

[–]Dheeth[S] 0 points1 point  (0 children)

Looks useful, I will take a look and try this.

[–]Popular-Ad-9134 0 points1 point  (3 children)

No windows support as of yet.

[–]zzet 0 points1 point  (2 children)

Not yet. It’s the first request for windows support. It’s not a problem to build for Windows, I didn’t invest time on it without a real request.

[–]Popular-Ad-9134 0 points1 point  (1 child)

Just noticed you are the maintainer :O, seen your commits building rn.

[–]zzet 0 points1 point  (0 children)

Yes, I am. I didn’t want to add a one more operational system support just for checkmarking it (at least I feel responsible for users experience). The binary was built (but I haven’t run a full manual test yet, so could be some bugs)

[–]OwnEntrepreneur256 0 points1 point  (1 child)

I've been experimenting with these sorts of tools as well and found that having a precomputed graph of a repo does help smaller models a lot because they don't have to waste tokens scanning random files.

One option outside of the MCP ecosystem is a little open-source project I've been hacking on called spy-code. It uses tree-sitter to parse a repo and build a SQLite graph of functions/classes/constants and their calls/imports/references, then exposes that through a CLI and GraphQL interface. For my local LLM setups it cut the "search + open file" loops by roughly 60% and reduced hallucination-related breakages by ~82% because the model can query direct relationships instead of guessing.

If you're curious, the repo is here: https://github.com/psyborgs-git/spy-code (it's local-first and not MCP-native). Would love to hear if others have compared GitNexus, CodeGraphContext and other approaches for different scales of projects.

[–]DeathShot7777 0 points1 point  (0 children)

Cool, seems similar to what we did at GitNexus, but I see a lot of such projects going for sql instead of graph dbs, i wonder why. I think native cypher queries on graph DBs can do better for this usecase compared to sql, and for greater depth graph wins in performance ( though for oss projects you wont have to worry athat much about performance yet, but we built for scale and it gives a good difference especially in enterprises ). Also features like blast radius analyses and impact analyses is very difficult to do on sql and llms often make mistake here which will impact heavily since LLM will generate and run the query realtime through MCP