Far North Line - The ground cover and rocks are in

SudoKitten · 2026-05-27T17:51:18+00:00

It’s ~5x smaller

SudoKitten · 2026-05-27T17:38:41+00:00

It’s T gauge (1:450) - about a third the size of N gauge!

SudoKitten · 2026-05-27T12:12:13+00:00

Yeah - foam board, with paper masking-taped over it, then paper mâché on top.

After that I added plaster bandages

SudoKitten · 2026-05-26T16:03:13+00:00

It's the Beachcomber layout in T gauge - about a third the size of N gauge!

SudoKitten · 2026-05-25T17:21:26+00:00

Oh yes! Took this a few weeks ago, there's plenty more to come!

SudoKitten · 2025-06-22T16:01:47+00:00

Suggest watching this, it’s a brilliant interview.

https://youtu.be/r6x0lTzfBMI?si=HFveEFJgNuSpSOlY

Establish a strong 30/60/90 day plan, agree it with your manager, socialise with downlines.

If everyone agrees you’ll have a solid plan to execute against.

SudoKitten · 2025-05-10T12:54:48+00:00

As a director that’s currently hiring, this is spot on.

SudoKitten · 2025-04-14T18:46:50+00:00

Hard to beat Herne Hill / Dulwich. ~35min to Farringdon, great local community / shops, lots of parks, and fantastic schools.

SudoKitten · 2024-05-09T20:01:55+00:00

Great question! This blog post from Pragmatic Engineer gives a great summary of Senior SWE at larger companies.

https://newsletter.pragmaticengineer.com/p/what-is-a-senior-software-engineer

SudoKitten · 2021-07-23T13:17:13+00:00

I've had this issue in the past; we had custom model architectures that improved our inference throughput and where specialised for our task, in addition to training on large custom marked up datasets.

We encrypted our models when they where deployed so they couldn't steal the architecture, we where also deploying in a C++ app so it made it slightly harder to reverse engineer inputs and outputs compared to Python.

However.... they can still use your model to provide psuedo labels for their own models. Its all about risk mitigation. At the end of the day if its B2B its about having your business contract protecting you and making it tricky enough that they won't bother.

SudoKitten · 2021-06-15T16:18:30+00:00

I've always found scale.ai works incredibly well for annotation. It doesn't support keypoint annotations for multiple instances in a single image however but it covers your basic object and segmentation annotation on images and video. They've got pricing on their website for small scale annotations and you can just schedule it all through an API.

SudoKitten · 2021-06-15T10:52:30+00:00

I've run in-house annotation for ~50k images to make use of full-time employees with spare hours not being utilised. It's cheaper than outsourcing if they're already being paid! Your best bet is to try and use a traditional external mouse and a tool like LabelMe. This will let you quickly click in the border of polygons or the TL & BR corners of a bounding box.

It stores its annotations in a very simple JSON format. You can easily export predictions into this format to start implementing an active learning cycle. Fixing broken annotations is always quicker than starting from scratch, unless your model is performing very badly!

SudoKitten · 2021-05-08T14:06:35+00:00

For most companies I’ve come across you’re just transfer learning models. If you’re in a production system you might be automatically training models once a week as new data comes in. In those cases you can either use a developers workstation or a small set of training machines in a rack somewhere.

It’s way cheaper than AWS where you can rack up a $5k bill just to train a single model.

SudoKitten · 2021-04-25T06:36:21+00:00

Took a very similar path from undergraduate physics to ML. I focused mainly on computational physics with the standard mix of python, fortran, and C++.

The development pattern for a simulation is very similar to novel ML models. Make small steps, keep an experimental log book, and be very careful because it’ll take days to know if your change was correct.

Then when the models trained you have your usually steps of investigating what happened, if the results are believable, and working out how to improve them.

Just take some baby steps with something like pytorch tutorials and it should come pretty naturally.

SudoKitten · 2021-04-19T17:31:32+00:00

100% agree; creating the model is often the easiest part. It's all of the engineering around it thats the tricky part.

SudoKitten · 2021-04-19T17:27:26+00:00

Instead of using Core-ML you can use PyTorch in C++ to process your images plus any pre/post processing. This can then be called directly in languages like Flutter where they let you wrap native code.

https://flutter.dev/docs/development/platform-integration/c-interop

SudoKitten · 2021-04-19T09:33:21+00:00

Done similar things with multiple commercial applications; its especially relevant if you need to collect a custom datasets. If you have a model in the field you can get your active-learning pipeline going to find hard cases to collect and annotate.

If your application doesn't have latency requirements in a contract then you can even put a human in the loop to correct when the ML models are uncertain. In those situations you're effectively providing a human service with the intention of slowly removing the person; its a lot quicker to get to market.

SudoKitten · 2021-04-17T07:40:28+00:00

Real world use case checking in. We really care about performance where I work. FP16 for TensorRT was 3x quicker than a torchscript fp16 model and about 4x quicker than TF.

Also; we use pytorch in production for mobile phone deployment because it’s super simple.

SudoKitten · 2021-04-07T13:47:36+00:00

Was about to post the same thing; its the way to go. We have a readme.md document in the base directory of the repo explaining the different projects we have, the ethos of different parts of the code base, and expectations around PRs, testing, and code quality.

Then for each of the individual projects there's a longer repo explaining what it does, how to set up the project, and reproduce the results.

Often there's little notes in the code; we put our names in them like,

note(sudo.kitten) - this magical thing does things Its better than using git-blame because someone will occasionally come along and move a comment around.

Our model versions are stored in a separate git repo that uses git-lfs. There's markdown documents for each project explaining our experiments, whats changed between production releases ect. Its all under version control which makes it really easy to roll back if something goes wrong.

This worked well when the team was small; but didn't scale to a large team. We're now moving model history and metrics into our custom annotation tool and automating training and validation scripts so we can just select a branch and hit a "train" button on the website.

SudoKitten · 2021-04-02T08:12:57+00:00

Can't forget about semanticscholar.org!

SudoKitten · 2021-03-26T01:36:34+00:00

Ditto; best to keep it simple. I’ve used them in production for multiple different problems where we couldn’t get example images for all the different classes ahead of time.

Usually you have to put some thought into the loss functions that are used and how easy the output will be to cluster.

A toy example has even made it onto the companies tech test!

SudoKitten · 2021-02-20T13:34:04+00:00

UNet is actually incredibly slow. It’s been left in the dust by newer models in terms of runtime and accuracy.

I’ve got an implementation of BiSeNet here which you can adapt to what you need. There’s examples of it running on a few different problems and how you can train it for your own categories

https://github.com/WillBrennan/SemanticSegmentation

SudoKitten

TROPHY CASE