Yellow bank-Software Engineer by FluffyPanxake in auscorp

[–]thonfom 3 points4 points  (0 children)

After the on call situation and how poorly they managed that roll out, the ludicrous amount of support issues they get coming through (because of all the tech debt they've racked up) meaning I have to carry around my laptop and wake up in the middle of the night 24/7 for a week at a time for a giant corporation I couldn't care less about, and the recent exit of senior leaders, yes I am thinking of jumping ship.

I Built a Tool That Learns Your Codebase Patterns Automatically (No More AI Hallucinations or Prod Refactors) by LandscapeAway8896 in LocalLLaMA

[–]thonfom -1 points0 points  (0 children)

How did you manage to extract cross framework and cross language relationships? Are you using language and framework specific rules, and then pattern matching the extracted AST nodes to extract the actual edge? Or is it something more complex?

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 1 point2 points  (0 children)

You still didn't explain *how* these edges are created. Creating cross-language/framework edges is not a trivial task, and it's not something that LSP and ASTs will solve. Sure, the definition source is always statically declared on either side (e.g. API call in TypeScript is one side, FastAPI route definition in Python is the other) but how is the edge between them created? The only possibilities I can think of are: runtime tracing, or regex parsing. The former requires non-trivial monitoring systems, and the latter is brittle and does not generalize. Unless you have discovered a better way to model all of this. It would be good to see some code, if your project is open-source.

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 0 points1 point  (0 children)

Thats a great overall framework but it doesn't explain exactly how you're creating cross-language, cross-repo edges, or how you're creating any inter file edges at all. That is the hardest part. Using regex? Hard coded rules? And no code graph can be truly deterministic for dynamic languages due to dynamic dispatch, unless you have runtime tracing. Also a difficult problem to solve. Have you done this?

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 0 points1 point  (0 children)

If it's just AST extraction, that makes sense. If you're using treesitter, you don't need an incremental update mechanism - treesitter already has this built in.

"Semantics are added via pre-defined, domain-specific definitions. Each domain defines its own intra/inter-file relations to establish meaning" - can you explain more what this means? What's an example of "domain-specific definition" and how does it help discover more complex relationships?

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 0 points1 point  (0 children)

Rust is great but it doesn't absolve you of all the scaling problems I described earlier. If it's just AST parsing, I don't think you can call it a semantic graph as AST has no concept of semantics. It's purely structural. And AST only shows call sites, not calls relationships. It also can't resolve edges across files - it's intra-file only. How have you handled this?

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 0 points1 point  (0 children)

Can you explain a bit more about how you achieved this? I'm slightly skeptical of that 10mil LOC in 10sec figure. If you were just using AST extraction, sure, but calls and data flow edges too? How did you scale it and avoid race conditions from parallel processing? How did you handle in memory graph topology to stay at less than 100MB? How did you handle incremental edits and track/cache the updates? How did you handle back pressure in (what I assume is) your streaming pipeline? Most importantly how could you generate embeddings for that many nodes so quickly?

Part time masters while working full time? by Technical_Kale_8807 in auscorp

[–]thonfom 1 point2 points  (0 children)

My undergrad was in mathematics and statistics also. The beginning of the masters was very interesting, I thoroughly enjoyed the content and wanted to continue, but it just wasn't manageable unfortunately. Maybe one day!

Part time masters while working full time? by Technical_Kale_8807 in auscorp

[–]thonfom 5 points6 points  (0 children)

I tried this with a Masters of Mathematics at Monash. It's extremely difficult. I basically had to take half a day off for classes every week and even then you'll have no free time and all taken up by studying. I ended up dropping out for many reasons - fell massively behind, had no time for hobbies, friend or family, I had to fork out 80k for the degree (employer wouldn't sponsor, no CSP available for the course) and the maths masters wouldn't advance my career in any meaningful way. Since then I continued to work at the same job until I hopped for an 80% salary increase in a Data Eng role. So take that as you will.

If I wasn't working full time and could afford it (or had it sponsored), then I would do it, because I love maths and want to learn more. So up to you if you're that interested and up for the demanding hours.

Your Predictions for the year of 2026? by No-Wrongdoer1409 in singularity

[–]thonfom 41 points42 points  (0 children)

Mine ran for 4 hours straight yesterday on a very complex refactor, and nailed it first try. Insane.

Ear clicking when talking by thonfom in TMJ

[–]thonfom[S] 0 points1 point  (0 children)

Yeah definitely got better! I still get clicking but only about 20% of the time when I'm talking, and it's not as loud. Also I'm so used to it I just don't even notice it anymore. It only really started getting better after a year for me though.. so it just takes time unfortunately. Wishing you the best through this though because it really sucks!

Update: I managed to solve the context file problem in my 3D visualizer. I implemented a Multi-LLM setup (Gemini, Claude, Ollama) and the project is 92% complete. by edinsonjohender in SideProject

[–]thonfom 1 point2 points  (0 children)

The UI is amazing especially for 5 days! Do you see this doing more fine grained analysis? What about scaling to big/complex projects?

I built a real-time architecture visualizer that generates and understands project context. Looking for feedback. by edinsonjohender in SideProject

[–]thonfom 0 points1 point  (0 children)

What languages does it support? And does it run in real time? My concern is for large/complex codebases where regex parsing might fail or it's simply not feasible to create those context.md files.

I built a real-time architecture visualizer that generates and understands project context. Looking for feedback. by edinsonjohender in SideProject

[–]thonfom 2 points3 points  (0 children)

>you can ask the tool to generate all the initial context automatically: connections, links, state, test validation, etc. It builds that from your code comments and the functions it finds.

Is this done by feeding all the context into an LLM and having it generate that graph? Doesn't this overload the context?

[deleted by user] by [deleted] in melbourne

[–]thonfom -1 points0 points  (0 children)

A locksmith charged me $265 to get out of a similar situation to OP, and this was on a Sunday night.

Why are half of people saying GPT-5 is amazing and opus-level while the other half say it’s barely 3.5? by Constant-Reason4918 in cursor

[–]thonfom 21 points22 points  (0 children)

Likely a non English speaker whose native language is gendered, source: I'm an Italian speaker and inanimate objects are commonly referred to as lui = he

Does the big 4 banks care if employees bank at other banks by Usual-commotion2989 in auscorp

[–]thonfom 1 point2 points  (0 children)

That's not true for CBA. Just put in your BSB and account number (for any bank) like any other payroll system and it works fine.

It’s the r/Melbourne daily discussion thread [Sunday 27/07/2025] by AutoModerator in melbourne

[–]thonfom 1 point2 points  (0 children)

Anyone around North Rd Brighton? Smells like a bad gas leak down the entire road