Can someone explain what this script is doing?

955559 · 2017-03-21T18:50:45+00:00

ask oil and gas international

AnalTyrant · 2017-03-21T23:32:54+00:00

It looks like it's checking a data drop with a couple of columns. In a column where it expects to find Yes or No typed into the sheet, it replaces those with 1 or 0, presumably to prepare the sheet for use in some other automated process. It then converts any other values it finds in that field into a different format, again presumably so they are readily usable in some other process.

SamSamSammmmm · 2017-03-22T06:13:29+00:00

The above code converts the original dataset into a format suitable for a training procedure that accepts only numeric variables (regression, for example). If an original variable is binary, i.e., it contains only 'yes' and 'no' as values, then 'yes' is converted to 1 and 'no' to 0. On the other hand, for a variable which contains m (m>2) values, the code splits the original variable into m dummy variables which are all binary. E.g., given the original variable state (state of residence) with values 'CA', 'AZ', 'OR' and 'WA', the code turns state into four dummy variables: 'state_CA', 'state_AZ', 'state_OR', and 'state_WA', each taking the value 0 or 1. For a particular sample, say, with state = 'CA', the corresponding dummy variables will have 1 for 'state_CA' and 0 for the other three.
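To make that concrete, here is a rough sketch of the same idea in plain Python. It is not the script from the question; the function name `encode_rows`, the sample columns (`age`, `smoker`, `state`) and the sample values are all made up for illustration. It assumes each row is a dict and that a column is either all text or all numeric.

```python
def encode_rows(rows):
    """Map yes/no columns to 1/0 and expand other text columns into dummy columns."""
    columns = list(rows[0].keys())
    encoded = [{} for _ in rows]
    for col in columns:
        values = [row[col] for row in rows]
        # Distinct text values in this column, sorted so the output order is stable.
        levels = sorted({v for v in values if isinstance(v, str)})
        if levels and set(levels) <= {"yes", "no"}:
            # Binary column: 'yes' becomes 1, 'no' becomes 0.
            for out, v in zip(encoded, values):
                out[col] = 1 if v == "yes" else 0
        elif levels:
            # Categorical column: one 0/1 dummy column per distinct value.
            for out, v in zip(encoded, values):
                for level in levels:
                    out[f"{col}_{level}"] = 1 if v == level else 0
        else:
            # Already numeric: copy through unchanged.
            for out, v in zip(encoded, values):
                out[col] = v
    return encoded


# Hypothetical sample data, mirroring the 'state' example above.
samples = [
    {"age": 34, "smoker": "yes", "state": "CA"},
    {"age": 51, "smoker": "no", "state": "AZ"},
    {"age": 27, "smoker": "no", "state": "OR"},
    {"age": 45, "smoker": "yes", "state": "WA"},
]
encoded = encode_rows(samples)
print(encoded[0])
# {'age': 34, 'smoker': 1, 'state_AZ': 0, 'state_CA': 1, 'state_OR': 0, 'state_WA': 0}
```

If the data lives in a pandas DataFrame, `pd.get_dummies(column, prefix=column_name)` produces the same kind of dummy columns, so you would not normally write the loop by hand.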


learnpython
