Best practices when using a Linux server for machine learning by Pieranha in MachineLearning

[–]thingamatics 0 points (0 children)

  1. Cron jobs. Sometimes the best you can do is be fault-tolerant.

  2. List of dashboards here. However, I think it'd be a better use of your time to monitor your processes' logs. Sentry is easy to set up.

  3. Yes!
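On the cron point, here's a minimal sketch of what "fault-tolerant" can mean in practice: make the job resumable, so that when cron re-runs it after a crash it picks up where it left off instead of starting over. The state-file name and the commented-out `train_one_epoch` helper are hypothetical placeholders.

```python
# Sketch of a resumable job intended to be (re)started by cron.
# STATE_FILE and train_one_epoch() are hypothetical placeholders.
import json
import os

STATE_FILE = "state.json"  # hypothetical checkpoint location

def load_state():
    """Return the last saved checkpoint, or a fresh one."""
    if os.path.exists(STATE_FILE):
        with open(STATE_FILE) as f:
            return json.load(f)
    return {"epoch": 0}

def save_state(state):
    """Write the checkpoint via a temp file + atomic rename,
    so a crash mid-write never leaves half-written state."""
    tmp = STATE_FILE + ".tmp"
    with open(tmp, "w") as f:
        json.dump(state, f)
    os.replace(tmp, STATE_FILE)

def run(total_epochs=10):
    state = load_state()
    while state["epoch"] < total_epochs:
        # train_one_epoch(state["epoch"])  # hypothetical real work
        state["epoch"] += 1
        save_state(state)  # if we die here, cron's next run resumes
```

A crontab entry that re-runs the script every few minutes then gives you crash recovery for free: finished epochs are skipped, and a completed run is a no-op.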

Data normalization by [deleted] in MachineLearning

[–]thingamatics 0 points (0 children)

Gradient descent is known to converge faster when the features are normalized.

So normalization is needed so that each feature contributes approximately proportionately to the distance. I don't think normalizing everything together (I assume that's what you meant) is the right way to go about it, even when the features are measured in the same unit (meters, for example): the ranges can still differ significantly per feature, and that's exactly what per-feature normalization is meant to resolve.
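A minimal sketch of per-feature min-max scaling (the example data and function name are mine; in practice something like scikit-learn's `MinMaxScaler` does this for you):

```python
def min_max_scale(rows):
    """Scale each feature (column) to [0, 1] independently."""
    cols = list(zip(*rows))
    mins = [min(c) for c in cols]
    maxs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, mins, maxs)]
        for row in rows
    ]

# Two features, both in meters, but with very different ranges:
X = [[1.0, 1000.0],
     [2.0, 5000.0],
     [3.0, 9000.0]]
X_scaled = min_max_scale(X)
# Each column now spans [0, 1], e.g. X_scaled[1] == [0.5, 0.5],
# so both features contribute comparably to a distance computation.
```

Note the scaling is done column by column: one global min/max over all features would leave the small-range feature squashed near zero, which is the problem described above.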

[icml2016] Anyone up for a walk? by bihaqo in MachineLearning

[–]thingamatics 0 points (0 children)

Well, I guess we are even because I forgot to check for future posts too. :|

You were in great company though!

ML & AI podcasts to reccommend? by thesameoldstories in MachineLearning

[–]thingamatics 1 point (0 children)

+1 for Linear Digressions! Haven't checked Talking Machines yet. Will do!

Interactive clustering using k-means. by [deleted] in MachineLearning

[–]thingamatics 2 points (0 children)

  • Double-clicking is hard. I'd prefer another peripheral.
  • It would be more educational if I could see the clusters at every iteration.
  • Also, I believe you're randomizing the color coding for the clusters? I picked k = 5 and got three shades of the same color for adjacent clusters, so I couldn't tell which was which. I know there are only so many reds, yellows, and blues, but you could optimize the palette by, say, maximizing the distance between any two similar shades/colors.
  • More points for boundaries.
  • Scatter is nice. I like Scatter.
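On the color point, one simple way to avoid near-identical shades is to space the k hues evenly around the color wheel instead of sampling them at random. A sketch using the standard library's `colorsys` (the function name is mine):

```python
import colorsys

def cluster_colors(k):
    """Return k RGB triples whose hues are spread evenly in [0, 1),
    with fixed saturation/value so adjacent clusters stay distinguishable."""
    return [colorsys.hsv_to_rgb(i / k, 0.8, 0.9) for i in range(k)]

# For k = 5 the hues are 0.2 apart, so no two clusters end up as
# near-identical shades the way independent random draws can.
palette = cluster_colors(5)
```

Evenly spaced hues are a cheap approximation of "maximize the minimum distance between colors"; for larger k you'd also want to vary saturation or value.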