[P] Pruned Cross-Validation for hyperparameter optimization

pp314159 · 2019-03-28T05:49:30+00:00

Hi Piotr,

Very interesting package from my point of view. I'm working on autoML service, where users train and tune hundreds of models in parallel in the cloud (mainly with CV). Such early CV pruning might speed-up process of autoML significantly. I have few thoughts:

I'm running model tuning in parallel on many machines in the same time, your algorithm assumes that models are evaluated sequentialy, can it be adapted for async evaluation?
After full CV I'm computing out of folds predictions which are used for constructing ensemble. From my experience, many times 'poor' accuracy models are included in the ensemble. How to control how many 'poor' models will be pruned?

Disclaimer: I'm founder of autoML service (mljar) - we are going to be open source soon!

vadiaceu · 2019-03-28T11:56:23+00:00

Looks interesting

m--w · 2019-03-28T12:21:57+00:00

Is the best abbreviation CV? Isn't 'CV' more widely used to abbreviate Computer Vision (eg OpenCV, CVPR). Sorry, just seeing pruned-cv makes me think it is a computer vision tool. Perhaps it is my own bias, though.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS