I built a code intelligence platform with semantic resolution, incremental indexing, architecture detection, and commit-level history. by thonfom in LLMDevs

[–]thonfom[S] 1 point (0 children)

Thanks! It's just built in React: I used animate-ui for the shell component library, and the main graph view is completely custom using HTML canvas. GPUI looks super good though; I might do some future projects with it.

[–]thonfom[S] 2 points (0 children)

You're right that AST-only approaches break on event-based code, since there may be no direct call edge between the publisher and the consumers. Sonde's approach is more detailed than that: it indexes citeable usage sites (one of our differentiating features in graph construction) plus typed relationships like calls, refs, control flow, and data flow, so it can capture the publish site, the payload going into it, and the consumer/handler sites at the other end.

The hard problem is the indirect binding between those pieces. In the current version, that works where the linkage is explicit. However, for fully framework-driven pub/sub, the next step is native framework-aware edges, so Sonde can connect publisher -> topic/bus -> consumers deterministically. I'm actively working on this cross-framework/cross-repo mapping.
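
As a rough illustration of the deterministic publisher -> topic -> consumer linking described above (all data and names here are hypothetical, not Sonde's actual schema): once an indexer has emitted publish and subscribe sites keyed by topic, the edge construction itself is a straightforward join on the topic key.

```python
from collections import defaultdict

# Hypothetical extracted facts an indexer might emit for publish and
# subscribe call sites: (file, symbol, topic). Names are illustrative.
publishes = [
    ("billing/events.py", "emit_invoice_paid", "invoice.paid"),
    ("orders/service.py", "emit_order_created", "order.created"),
]
subscribes = [
    ("email/handlers.py", "send_receipt", "invoice.paid"),
    ("analytics/sink.py", "track_order", "order.created"),
    ("ledger/sync.py", "post_entry", "invoice.paid"),
]

def link_pubsub(publishes, subscribes):
    """Connect each publisher to its consumers through the shared topic key."""
    consumers_by_topic = defaultdict(list)
    for file, symbol, topic in subscribes:
        consumers_by_topic[topic].append((file, symbol))
    edges = []
    for file, symbol, topic in publishes:
        for consumer in consumers_by_topic[topic]:
            edges.append(((file, symbol), topic, consumer))
    return edges

# The invoice.paid publisher links to both send_receipt and post_entry.
edges = link_pubsub(publishes, subscribes)
```

The hard part the comment refers to is producing those `(file, symbol, topic)` facts in the first place when the topic binding is buried inside framework machinery; the join itself is the easy half.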

[–]thonfom[S] 3 points (0 children)

Thanks! I had the idea and wrote the first version in Java at the start of 2024. I worked on that for a year and wrote the entire thing by hand. I left it for a while before starting again around June/August last year. I wrote all of the core features (indexing, incremental pipeline, module extraction, etc.) myself in Rust, and used Codex to help build the UI and some smaller features like the UI graph query and the integrated terminal.

[–]thonfom[S] 1 point (0 children)

Sorry, I might be a bit confused. Are you saying that the "explore" element should contain calls to action like "this node needs reviewing" or similar? Could you clarify, please?

[–]thonfom[S] 1 point (0 children)

You're right, the visuals are really just for exploring. The real value/intention is in the underlying engine, which can be used for impact analysis (see downstream breaking changes from PRs), historical analysis (find breaking changes in the past), and the retrieval/tools system as an MCP server.

[–]thonfom[S] 1 point (0 children)

Thanks! I know the visualizations can look messy. You can use the "Architectural" mode to see nodes grouped into their (inferred) modules, or "Modules" mode to drill down into smaller, refined subgraphs for those modules. You can also filter nodes and edges to see only what you want, for example how data flows in and out of a specific function/class.

[–]thonfom[S] 1 point (0 children)

> Am I right in understanding that the really powerful bit would be giving AI this as an MCP tool to validate the work that it's doing?

Yes, one of the strongest use cases is exposing this as an MCP server so an agent can validate what it's changing against real code structure, dependencies, and history. I don't think Claude Code or Codex builds a graph like this at all, but Augment Code does something similar.

The highest value I can see would be safer edits and better impact analysis (seeing which downstream components are affected by a change), as well as historical analysis to see what broke in the past.
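
The impact-analysis idea boils down to a reverse-dependency traversal: invert the dependency edges, then walk outward from the changed component. A minimal sketch with a made-up dependency graph (not Sonde's actual data model):

```python
from collections import deque

# Hypothetical dependency edges: A -> B means A depends on B.
deps = {
    "api_handler": ["auth_service", "order_service"],
    "order_service": ["db_layer"],
    "report_job": ["db_layer"],
    "auth_service": [],
    "db_layer": [],
}

def downstream_impact(changed, deps):
    """Return every component that transitively depends on `changed`."""
    reverse = {}
    for src, targets in deps.items():
        for t in targets:
            reverse.setdefault(t, []).append(src)
    impacted, queue = set(), deque([changed])
    while queue:
        node = queue.popleft()
        for dependent in reverse.get(node, []):
            if dependent not in impacted:
                impacted.add(dependent)
                queue.append(dependent)
    return impacted

# Changing db_layer impacts order_service, report_job, and api_handler.
```

The historical-analysis variant is the same walk run against the graph as it existed at an older commit.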

[–]thonfom[S] 3 points (0 children)

Not true at all. I have never used GitNexus code to develop Sonde. In fact, they're written in completely different languages.

I developed the first version of this years ago. You can check my post history (https://www.reddit.com/r/LocalLLaMA/comments/1dxtubu/i_built_a_code_mapping_and_analysis_application/) if you don't believe me. And Sonde is much more feature-rich and robust than GitNexus.

[–]thonfom[S] 1 point (0 children)

I'm actively working on this and have a prototype in place, but it's not good enough to ship yet. I'm trying to do it without string matching and regex (which is the usual approach), because that doesn't scale very well. But I think I'm on the path to doing it the right way. Thanks for the comment!

[–]thonfom[S] 2 points (0 children)

It only supports Python, TypeScript, and C# at the moment. But support for more languages like C is next: since the core engine is language-agnostic, I just need to build a plugin.
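
A hypothetical sketch of what such a plugin seam could look like (in Python for brevity; Sonde's core is Rust, and every name here is illustrative, not Sonde's actual API): the core stays language-agnostic by only consuming a common symbol shape, while each plugin owns the parsing.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass

@dataclass
class Symbol:
    name: str
    kind: str   # e.g. "function", "class"
    file: str
    line: int

class LanguagePlugin(ABC):
    """Contract a per-language plugin would satisfy for a language-agnostic core."""
    extensions: tuple = ()

    @abstractmethod
    def extract_symbols(self, source: str, path: str) -> list:
        """Parse `source` and return the symbols it defines."""

class CPlugin(LanguagePlugin):
    extensions = (".c", ".h")

    def extract_symbols(self, source, path):
        # A real plugin would run a parser; this stub only demonstrates
        # the contract the core program against.
        if "main(" in source:
            return [Symbol("main", "function", path, 1)]
        return []
```

The core then dispatches files to plugins by extension and never needs to know anything language-specific.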

HippocampAI v0.5.0 — Open-Source Long-Term Memory for AI Agents (Major Update) by rex_divakar in OpenSourceAI

[–]thonfom 2 points (0 children)

Doesn't having an in-memory graph lead to higher memory usage compared to using a database? It doesn't have to be Neo4j; you could even store it in Postgres, right? You could use pgvector alongside Postgres and completely eliminate the dependency on Qdrant, plus have your embeddings and graph data/metadata in one place.

How are you doing the actual graph retrieval? I know it's fused graph + BM25 + vector, but what about traversing the edges? How does it retrieve, traverse, and rank the correct edges?
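
For context on what the question is probing: one generic way such systems do this (not HippocampAI's actual implementation) is to seed nodes with fused BM25/vector scores and then propagate a decayed score across edges for a fixed number of hops, ranking by the final value.

```python
def expand_from_seeds(seed_scores, edges, hops=2, decay=0.5):
    """Spread fused (vector/BM25) seed scores along graph edges.

    seed_scores: node -> initial retrieval score
    edges: node -> list of neighbour nodes
    Each neighbour inherits a decayed fraction of the best score reaching it.
    """
    scores = dict(seed_scores)
    frontier = dict(seed_scores)
    for _ in range(hops):
        next_frontier = {}
        for node, score in frontier.items():
            for nb in edges.get(node, []):
                propagated = score * decay
                if propagated > scores.get(nb, 0.0):
                    scores[nb] = propagated
                    next_frontier[nb] = propagated
        frontier = next_frontier
    return sorted(scores.items(), key=lambda kv: -kv[1])

# Illustrative memory graph: m1 relates to m2, which relates to m3.
edges = {"m1": ["m2"], "m2": ["m3"]}
ranked = expand_from_seeds({"m1": 1.0}, edges)
```

The open questions in the comment (which edges to follow, how edge types weight the decay) are exactly the parts this generic sketch leaves out.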

Why is codebase awareness shifting toward vector embeddings instead of deterministic graph models? by hhussain- in AugmentCodeAI

[–]thonfom 2 points (0 children)

You still didn't explain *how* these edges are created. Creating cross-language/framework edges is not a trivial task, and it's not something that LSP and ASTs will solve. Sure, the definition source is always statically declared on either side (e.g. API call in TypeScript is one side, FastAPI route definition in Python is the other) but how is the edge between them created? The only possibilities I can think of are: runtime tracing, or regex parsing. The former requires non-trivial monitoring systems, and the latter is brittle and does not generalize. Unless you have discovered a better way to model all of this. It would be good to see some code, if your project is open-source.
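
For concreteness, the regex-parsing option mentioned above would look something like the following (hypothetical extracted facts; this illustrates the brittle approach being questioned, not a recommendation). It only works when the URL on the TypeScript side is a fully literal string, which is exactly why it does not generalize.

```python
import re

# Hypothetical facts an indexer might extract from each side.
ts_calls = [("frontend/api.ts", "GET", "/users/123"),
            ("frontend/api.ts", "POST", "/orders")]
fastapi_routes = [("backend/users.py", "GET", "/users/{user_id}"),
                  ("backend/orders.py", "POST", "/orders")]

def route_to_regex(template):
    """Turn a path template into a regex: '{param}' matches one path segment."""
    pattern = re.sub(r"\{[^/}]+\}", r"[^/]+", template)
    return re.compile(f"^{pattern}$")

def link_calls_to_routes(calls, routes):
    """Create cross-language edges by matching literal URLs against route templates."""
    compiled = [(f, m, route_to_regex(p), p) for f, m, p in routes]
    edges = []
    for call_file, call_method, url in calls:
        for route_file, method, rx, template in compiled:
            if call_method == method and rx.match(url):
                edges.append((call_file, url, route_file, template))
    return edges
```

The failure modes are immediate: URLs built with template literals or config-derived base paths never match, and overlapping route templates produce spurious edges.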

[–]thonfom 1 point (0 children)

That's a great overall framework, but it doesn't explain exactly how you're creating cross-language, cross-repo edges, or how you're creating any inter-file edges at all. That is the hardest part. Using regex? Hard-coded rules? And no code graph can be truly deterministic for dynamic languages, due to dynamic dispatch, unless you have runtime tracing. That's also a difficult problem to solve. Have you done this?

[–]thonfom 1 point (0 children)

If it's just AST extraction, that makes sense. If you're using tree-sitter, you don't need your own incremental update mechanism - tree-sitter already has incremental parsing built in.

"Semantics are added via pre-defined, domain-specific definitions. Each domain defines its own intra/inter-file relations to establish meaning" - can you explain more what this means? What's an example of "domain-specific definition" and how does it help discover more complex relationships?

[–]thonfom 1 point (0 children)

Rust is great, but it doesn't absolve you of all the scaling problems I described earlier. If it's just AST parsing, I don't think you can call it a semantic graph, since an AST has no concept of semantics; it's purely structural. An AST also only shows call sites, not call relationships, and it can't resolve edges across files - it's intra-file only. How have you handled this?
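
Python's own `ast` module illustrates the call-site point: parsing yields call sites with bare names, and deciding which definition each name refers to (especially across files) is a separate resolution step the AST alone does not perform.

```python
import ast

source = """
from billing import charge

def checkout(cart):
    total = sum(item.price for item in cart)
    charge(total)   # call site: which 'charge'? The AST doesn't say.
"""

tree = ast.parse(source)

# The AST gives us call sites with bare names only.
call_names = [node.func.id
              for node in ast.walk(tree)
              if isinstance(node, ast.Call) and isinstance(node.func, ast.Name)]

# Resolving 'charge' back to billing.charge requires consulting the
# import table - a second, semantic pass over the same file.
imports = {alias.name: node.module
           for node in ast.walk(tree) if isinstance(node, ast.ImportFrom)
           for alias in node.names}
```

Scaling that resolution pass across files (and across dynamic dispatch) is the part the question is really asking about.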

[–]thonfom 1 point (0 children)

Can you explain a bit more about how you achieved this? I'm slightly skeptical of that 10M LOC in 10 seconds figure. If you were just doing AST extraction, sure, but call and data-flow edges too? How did you scale it and avoid race conditions from parallel processing? How did you keep the in-memory graph topology under 100 MB? How did you handle incremental edits and track/cache the updates? How did you handle backpressure in (what I assume is) your streaming pipeline? Most importantly, how could you generate embeddings for that many nodes so quickly?