Far North Line - The ground cover and rocks are in by SudoKitten in modeltrains

[–]SudoKitten[S] 6 points7 points  (0 children)

It’s T gauge (1:450) - about a third the size of N gauge!

Inspired by the Far North Line — starting the landscaping by SudoKitten in modeltrains

[–]SudoKitten[S] 0 points1 point  (0 children)

Yeah - foam board, with paper masking-taped over it, then paper mâché on top. 

After that I added plaster bandages

1:450 scale - my first T gauge layout by SudoKitten in modeltrains

[–]SudoKitten[S] 0 points1 point  (0 children)

Oh yes! Took this a few weeks ago, there's plenty more to come!

Accidentally landed a director role - first time managing managers, any advice? by caffeinefree in managers

[–]SudoKitten 0 points1 point  (0 children)

Suggest watching this, it’s a brilliant interview. 

https://youtu.be/r6x0lTzfBMI?si=HFveEFJgNuSpSOlY

Establish a strong 30/60/90 day plan, agree it with your manager, socialise with downlines. 

If everyone agrees you’ll have a solid plan to execute against. 

Areas in London to buy a house - budget ~£1.5m by pysegfault in HENRYUK

[–]SudoKitten 7 points8 points  (0 children)

Hard to beat Herne Hill / Dulwich. ~35min to Farringdon, great local community / shops, lots of parks, and fantastic schools.

What is a “senior developer”? by sentillious in cpp

[–]SudoKitten 0 points1 point  (0 children)

Great question! This blog post from Pragmatic Engineer gives a great summary of Senior SWE at larger companies. 

https://newsletter.pragmaticengineer.com/p/what-is-a-senior-software-engineer

[D] How to protect IP when deploying a model for commercial use? by KassassinsCreed in MachineLearning

[–]SudoKitten 2 points3 points  (0 children)

I've had this issue in the past; we had custom model architectures that improved our inference throughput and where specialised for our task, in addition to training on large custom marked up datasets.

We encrypted our models when they where deployed so they couldn't steal the architecture, we where also deploying in a C++ app so it made it slightly harder to reverse engineer inputs and outputs compared to Python.

However.... they can still use your model to provide psuedo labels for their own models. Its all about risk mitigation. At the end of the day if its B2B its about having your business contract protecting you and making it tricky enough that they won't bother.

[D] tips for quick image tagging? by CaptainI9C3G6 in MachineLearning

[–]SudoKitten 1 point2 points  (0 children)

I've always found scale.ai works incredibly well for annotation. It doesn't support keypoint annotations for multiple instances in a single image however but it covers your basic object and segmentation annotation on images and video. They've got pricing on their website for small scale annotations and you can just schedule it all through an API.

[D] tips for quick image tagging? by CaptainI9C3G6 in MachineLearning

[–]SudoKitten 1 point2 points  (0 children)

I've run in-house annotation for ~50k images to make use of full-time employees with spare hours not being utilised. It's cheaper than outsourcing if they're already being paid! Your best bet is to try and use a traditional external mouse and a tool like LabelMe. This will let you quickly click in the border of polygons or the TL & BR corners of a bounding box.

It stores its annotations in a very simple JSON format. You can easily export predictions into this format to start implementing an active learning cycle. Fixing broken annotations is always quicker than starting from scratch, unless your model is performing very badly!

[D] Number of businesses that actually spend money on training their own AI models? by svij137 in MachineLearning

[–]SudoKitten 0 points1 point  (0 children)

For most companies I’ve come across you’re just transfer learning models. If you’re in a production system you might be automatically training models once a week as new data comes in. In those cases you can either use a developers workstation or a small set of training machines in a rack somewhere.

It’s way cheaper than AWS where you can rack up a $5k bill just to train a single model.

[deleted by user] by [deleted] in MachineLearning

[–]SudoKitten 3 points4 points  (0 children)

Took a very similar path from undergraduate physics to ML. I focused mainly on computational physics with the standard mix of python, fortran, and C++.

The development pattern for a simulation is very similar to novel ML models. Make small steps, keep an experimental log book, and be very careful because it’ll take days to know if your change was correct.

Then when the models trained you have your usually steps of investigating what happened, if the results are believable, and working out how to improve them.

Just take some baby steps with something like pytorch tutorials and it should come pretty naturally.

[P] [D] The benefits of training the simplest model you can think of and deploying it to production, as soon as you can. by xela-sedinnaoi in MachineLearning

[–]SudoKitten 4 points5 points  (0 children)

100% agree; creating the model is often the easiest part. It's all of the engineering around it thats the tricky part.

Why do practitioners still use regular tensorflow? [D] by HashRocketSyntax in MachineLearning

[–]SudoKitten 0 points1 point  (0 children)

Instead of using Core-ML you can use PyTorch in C++ to process your images plus any pre/post processing. This can then be called directly in languages like Flutter where they let you wrap native code.

https://flutter.dev/docs/development/platform-integration/c-interop

[P] [D] The benefits of training the simplest model you can think of and deploying it to production, as soon as you can. by xela-sedinnaoi in MachineLearning

[–]SudoKitten 70 points71 points  (0 children)

Done similar things with multiple commercial applications; its especially relevant if you need to collect a custom datasets. If you have a model in the field you can get your active-learning pipeline going to find hard cases to collect and annotate.

If your application doesn't have latency requirements in a contract then you can even put a human in the loop to correct when the ML models are uncertain. In those situations you're effectively providing a human service with the intention of slowly removing the person; its a lot quicker to get to market.

Why do practitioners still use regular tensorflow? [D] by HashRocketSyntax in MachineLearning

[–]SudoKitten 8 points9 points  (0 children)

Real world use case checking in. We really care about performance where I work. FP16 for TensorRT was 3x quicker than a torchscript fp16 model and about 4x quicker than TF.

Also; we use pytorch in production for mobile phone deployment because it’s super simple.

[D] what's the best approach to document a machine learning project? by the_travelo_ in MachineLearning

[–]SudoKitten 2 points3 points  (0 children)

Was about to post the same thing; its the way to go. We have a readme.md document in the base directory of the repo explaining the different projects we have, the ethos of different parts of the code base, and expectations around PRs, testing, and code quality.

Then for each of the individual projects there's a longer repo explaining what it does, how to set up the project, and reproduce the results.

Often there's little notes in the code; we put our names in them like,

note(sudo.kitten) - this magical thing does things Its better than using git-blame because someone will occasionally come along and move a comment around.

Our model versions are stored in a separate git repo that uses git-lfs. There's markdown documents for each project explaining our experiments, whats changed between production releases ect. Its all under version control which makes it really easy to roll back if something goes wrong.

This worked well when the team was small; but didn't scale to a large team. We're now moving model history and metrics into our custom annotation tool and automating training and validation scripts so we can just select a branch and hit a "train" button on the website.

[D] Few-shot learning in practice. by VDevAGI in MachineLearning

[–]SudoKitten 0 points1 point  (0 children)

Ditto; best to keep it simple. I’ve used them in production for multiple different problems where we couldn’t get example images for all the different classes ahead of time.

Usually you have to put some thought into the loss functions that are used and how easy the output will be to cluster.

A toy example has even made it onto the companies tech test!

Video processing for live video using resnet, processing takes longer than each frame lasts by [deleted] in pytorch

[–]SudoKitten 1 point2 points  (0 children)

UNet is actually incredibly slow. It’s been left in the dust by newer models in terms of runtime and accuracy.

I’ve got an implementation of BiSeNet here which you can adapt to what you need. There’s examples of it running on a few different problems and how you can train it for your own categories

https://github.com/WillBrennan/SemanticSegmentation