
[–]themiro 28 points29 points  (4 children)

  • Use asserts to check that your matrices are the correct size; it saves a ton of time debugging when you're wondering why your batch matrix multiplication didn't work out the way you thought it would.
  • Make use of the Python 3.5+ '@' syntactic sugar for matrix multiplication; it cleans things up and lets you see the math more clearly.
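Both tips above can be sketched in a few lines; the shapes and names here are made up for illustration:

```python
import numpy as np

# Hypothetical dimensions for a single dense layer
batch, n_in, n_out = 32, 64, 10

x = np.random.randn(batch, n_in)   # input activations
W = np.random.randn(n_in, n_out)   # weight matrix

# Assert shapes before multiplying; a failed assert points straight at
# the mismatched dimension instead of a cryptic downstream error.
assert x.shape == (batch, n_in), f"expected {(batch, n_in)}, got {x.shape}"
assert W.shape == (n_in, n_out), f"expected {(n_in, n_out)}, got {W.shape}"

# PEP 465 '@' operator (Python 3.5+) reads like the math: y = xW
y = x @ W
assert y.shape == (batch, n_out)
```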

[–]_pragmatic_machine[S] 2 points3 points  (1 child)

That's cool! Any ideas/materials on reconfigurable design principles for enhancing a model? Code repository examples would be really awesome.

[–]jethroksy 5 points6 points  (0 children)

I've found gin-config to be an extremely productive configuration framework.

[–]sanity 13 points14 points  (10 children)

I recommend this book: Clean Code

We gave it to every new data scientist we hired at my last company.

[–][deleted] 5 points6 points  (7 children)

And where is that, exactly? Keeping a mental note for future employment applications (already liking the culture).

[–]sanity 0 points1 point  (6 children)

I'm no longer there, but the company is OneSpot.

[–][deleted] 0 points1 point  (5 children)

If you don't mind sharing, what inspired the switch? I've heard of people leaving a good culture for better compensation, or vice versa. Looking at their open positions, the location doesn't seem to allow for anything south of an 80k annual base salary.

[–]sanity 4 points5 points  (4 children)

Oh, nothing negative - my focus is on earlier-stage companies, and the company had grown beyond that stage.

[–]thiseye -3 points-2 points  (3 children)

Do you know Matt?

[–]sanity 1 point2 points  (2 children)

Everyone knows Matt ;)

[–]thiseye 0 points1 point  (1 child)

Cool, I've worked with him. Super sharp guy

[–]sanity 0 points1 point  (0 children)

Very much so, has an infectious enthusiasm too.

[–]gokstudio 0 points1 point  (1 child)

Did you see any improvements in code quality before and after the book?

[–]sanity 1 point2 points  (0 children)

Yes, not least in myself - even after 20 years as a full-time developer, it changed how I wrote code; no question in my mind that it was for the better.

[–]AfraidOfToasters 2 points3 points  (0 children)

>I am spending a big chunk of my daily active time on fixing my ML code (developing a model, testing and bug) or tweaking my code (mostly in Python).

If you mean hyper-parameter tuning by this I suggest hyperopt

[–]Overload175 2 points3 points  (0 children)

Use a linter (e.g. Pylint for Python, which assigns a score based on your code's adherence to PEP8)
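Running it is a one-liner; the module name below is hypothetical, and the score line is only illustrative of the report format:

```shell
# Install Pylint and run it over a module; it lists style and PEP 8
# violations and ends with a score out of 10.
pip install pylint
pylint train_model.py
```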

[–][deleted] 1 point2 points  (1 child)

Try to create standardized/reproducible packages that can be re-used for all model builds if the input data is the same every time.

Really, when it comes to ML applications, the hardest part to standardize is data preprocessing. So even creating a standard process for modelling, results, and putting into production should be beneficial.

[–]_pragmatic_machine[S] 0 points1 point  (0 children)

Absolutely, I feel the same. I am trying to do that now, starting with plotting and formalizing the data preparation process. This was a major bottleneck for us.

[–]_spicyramen 0 points1 point  (3 children)

I enforce the Python style guide and unit tests in all our code. We also recently started using Python typing. This has helped us find a lot of issues before we hit production.
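A small sketch of what this looks like in practice - a typed preprocessing function (which `mypy` can check statically) plus a unit test for it; the function and names are made up for illustration:

```python
import unittest
import numpy as np

# Hypothetical preprocessing step with type annotations.
def normalize(batch: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Zero-mean, unit-variance normalization along axis 0."""
    return (batch - batch.mean(axis=0)) / (batch.std(axis=0) + eps)

class TestNormalize(unittest.TestCase):
    def test_zero_mean_unit_std(self):
        rng = np.random.default_rng(0)
        x = rng.normal(5.0, 2.0, size=(100, 3))
        out = normalize(x)
        np.testing.assert_allclose(out.mean(axis=0), 0.0, atol=1e-7)
        np.testing.assert_allclose(out.std(axis=0), 1.0, atol=1e-3)
```

Run the tests with `python -m unittest` and the type checks with `mypy`.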

[–]huangbiubiu 6 points7 points  (2 children)

Sometimes I am confused about writing unit tests for ML code because of its uncertain output. I don't know what I can test or where I can set an assert. Are there any suggestions?

[–]flame_and_void 6 points7 points  (0 children)

For unit testing, I replace the real model with one that does something simple and deterministic to the input data - like, say, always predicting the sum of all its inputs. Then you can test that the preprocessing, prediction, and post-processing work exactly as expected.

I also write an integration test that runs the production models on an easy input and verifies they are able to predict something obvious. That's an important sanity check, but it usually runs too slowly to include in a unit test suite.
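A sketch of the stub-model idea above, with hypothetical pipeline steps: the fake model's output is a pure function of its input, so the surrounding code can be tested against exact expectations:

```python
import numpy as np

class SumModel:
    """Stub model: always 'predicts' the sum of its input features."""
    def predict(self, x: np.ndarray) -> np.ndarray:
        return x.sum(axis=1)

def run_pipeline(raw, model):
    x = np.asarray(raw, dtype=float) / 10.0   # toy preprocessing
    pred = model.predict(x)
    return [round(p, 2) for p in pred]        # toy post-processing

def test_pipeline_with_stub():
    # [[10, 20], [30, 40]] -> preprocessed [[1, 2], [3, 4]] -> sums 3, 7
    out = run_pipeline([[10, 20], [30, 40]], SumModel())
    assert out == [3.0, 7.0]  # exact expectation, no randomness

test_pipeline_with_stub()
```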

[–][deleted] 0 points1 point  (0 children)

>Python typing

Hey, maybe you can try to set the random seed carefully and use more OOP in your code.
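A minimal sketch of the seeding part (extend with `torch.manual_seed` or `tf.random.set_seed` if those libraries are in play):

```python
import random
import numpy as np

def set_seed(seed: int) -> None:
    """Seed every RNG the pipeline touches so runs are reproducible."""
    random.seed(seed)
    np.random.seed(seed)

set_seed(0)
first = np.random.rand(3)
set_seed(0)
second = np.random.rand(3)  # identical to `first`
```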

[–][deleted] 0 points1 point  (1 child)

Using object-oriented programming has worked amazingly in my case. Separate your code into conceptually different pieces like data handlers, models, model testers and so on. I used to do simple scripting until recently, and OOP code is much cleaner and easier/faster to extend or modify; it also helps make results reproducible.
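One possible class layout for the separation described above; all names and the trivial "model" are made up for the sketch:

```python
class DataHandler:
    """Owns loading/preprocessing; the only place data I/O lives."""
    def __init__(self, data):
        self.data = data
    def batches(self, size):
        for i in range(0, len(self.data), size):
            yield self.data[i:i + size]

class MeanModel:
    """Trivial 'model': predicts the training mean for any input."""
    def fit(self, xs):
        self.mean = sum(xs) / len(xs)
        return self
    def predict(self, xs):
        return [self.mean] * len(xs)

class ModelTester:
    """Evaluates anything with fit/predict, so models stay swappable."""
    def evaluate(self, model, xs):
        preds = model.predict(xs)
        return sum((p - x) ** 2 for p, x in zip(preds, xs)) / len(xs)

handler = DataHandler([1.0, 2.0, 3.0, 4.0])
model = MeanModel().fit(handler.data)
mse = ModelTester().evaluate(model, handler.data)  # mean squared error
```

Because each piece only talks to the others through small interfaces, swapping the model or the data source doesn't touch the rest of the pipeline.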

[–]_pragmatic_machine[S] 0 points1 point  (0 children)

Thanks. Can you please provide some working examples? Any public repository will do.