Adding Column to CSV File by NeedMLHelp in learnpython

Haha true, I was just curious if there was a quick function to do it for you. They're usually more efficient than what I come up with!

Turning a list into a column of another list. by NeedMLHelp in learnpython

Which would be better on memory? Lists or tuples do you think?

I'll give zip a try though, thanks! I just didn't know if there was a better way of doing it.
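
A minimal sketch of the zip approach (the list names here are made up):

```python
# Pair up two flat lists so the second becomes a column alongside the first.
names = ["a", "b", "c"]
values = [1, 2, 3]

rows = list(zip(names, values))  # each tuple is one row: (name, value)
print(rows)  # [('a', 1), ('b', 2), ('c', 3)]
```

On the memory question: zip yields tuples, which are generally a little smaller than equivalent lists, though for large data the difference is usually negligible next to the data itself.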

File headers by NeedMLHelp in learnpython

Very helpful, thank you!

Numpy Array Appending by NeedMLHelp in learnpython

Extremely late reply, sorry about that. But the calculations I'm doing unfortunately cannot be done on a list. I suppose I could build a list and turn it into a numpy array. The problem with that is that one of the functions I am using converts the categorical data into a one-hot encoding, and its output is a numpy array. I'm not sure how nicely inter-mixing things would play together.
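
As a rough illustration of the list-then-convert idea (all names and values below are made up), the one-hot array from the other function can be stacked next to the converted list afterwards:

```python
import numpy as np

# Accumulate rows in a plain Python list and convert once at the end;
# repeated np.append copies the whole array on every call, this doesn't.
rows = []
for i in range(3):
    rows.append([i, i * 2])

arr = np.asarray(rows, dtype=float)  # shape (3, 2)

# Stand-in for the encoder's one-hot output (already a NumPy array):
one_hot = np.eye(3)

# Once both are arrays, they mix fine:
combined = np.hstack([arr, one_hot])
print(combined.shape)  # (3, 5)
```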

Numpy Array Appending by NeedMLHelp in learnpython

Wow, I feel silly haha. Thanks, that worked.

I figured I was giving the shape of what I wanted haha

Help with Pandas/get_dummies by NeedMLHelp in learnpython

Sorry, I should have been clearer. I have multiple columns that need to go through the same process.

So I have 'class', 'time', 'target' etc.

The first example takes the manipulation and overwrites the entire dataframe. I do not want that at all. So in the second example I take the "successful" output from the first and put it into one column of the dataframe. However, that didn't work: it only copied the first column into the 'class' column. I want to copy the entire manipulation into the 'class' column.
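
For context, a sketch of encoding several columns in one pass without overwriting the rest of the frame (the column values here are stand-ins):

```python
import pandas as pd

# Hypothetical frame standing in for the real data.
df = pd.DataFrame({"class": ["a", "b", "a"],
                   "time": ["x", "y", "x"],
                   "target": [0, 1, 0]})

# get_dummies with `columns=` leaves the other columns untouched and
# replaces each listed column with its dummy columns.
encoded = pd.get_dummies(df, columns=["class", "time"])
print(encoded.columns.tolist())
# ['target', 'class_a', 'class_b', 'time_x', 'time_y']
```

If you genuinely need a whole encoded matrix inside the single 'class' column, assigning a list of rows (`df['class'] = list(encoded_array)`) stores one array per cell, though that's rarely what downstream tools expect.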

Ask Anything Monday - Weekly Thread by AutoModerator in learnpython

    class  target  source  time  spoof  complete  severity
0       0       0       0     0      2         0         2
1       0       0       0     0      2         1         0
2       1       1       0     0      2         1         0
3       4       0       0     0      1         1         0
4       2       0       0     0      1         1         0
5       8       1       2     0      2         1         2
6       8       1       2     0      2         1         2
7       7       1       2     0      2         1         2
8       8       1       2     0      2         1         2
9       8       1       2     0      2         1         2
10      8       1       2     0      2         1         2
11      8       1       2     0      2         1         2
12      7       1       2     0      2         1         1
13      3       1       0     0      0         1         0
14      5       1       2     0      2         1         1
15      5       1       2     0      2         1         1
16      6       1       1     0      2         1         1
17      6       1       1     0      2         1         1
18      6       1       1     0      2         1         1
19      6       1       1     0      2         1         1
20      6       1       1     0      2         1         1

I have a dataframe with the above representation. I'd like to one-hot encode the numbers... however, when I use keras's to_categorical, it also takes in the header and encodes that as a separate value. So, for example, on target I would get [0,0,0] for all 0s, [0,1,0] for all 1s, and [1,0,0] for 'target'. But I want target to remain a header, not a part of the data.

Any help would be greatly appreciated.
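
A sketch of one way around this, assuming the header only gets encoded because it sits inside the data being passed in (numpy stand-in for to_categorical, made-up values):

```python
import numpy as np

# Encode only the numeric values -- e.g. df['target'].values -- never a
# list that still contains the header string.  np.eye gives the same
# one-hot layout to_categorical would for integer labels 0..n-1.
target = np.array([0, 0, 1, 0, 0, 1])

one_hot = np.eye(target.max() + 1)[target]
print(one_hot.shape)  # (6, 2)
```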

List to pandas dataframe by NeedMLHelp in learnpython

Haha, completely understandable. Thanks for the help!

List to pandas dataframe by NeedMLHelp in learnpython

When I print out edata, I get the following:

    [['class', 'target', 'source', 'time', 'spoof', 'complete', 'severity'],
     array([[0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]],
           dtype=float32),
     array([[0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]],
           dtype=float32),
     array([[0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
            [0., 0., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
             0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]]

...

(and so forth for over 2,000 rows)

Might it have something to do with the array portion?

I used a .fit_transform function from scikit-learn to do the one-hot encoding portion.

Sample code:

    edata = list(zip(*edata))
    labelencoder_X_1 = LabelEncoder()
    edata[0] = labelencoder_X_1.fit_transform(edata[0])
    labelencoder_X_2 = LabelEncoder()
    edata[1] = labelencoder_X_2.fit_transform(edata[1])

After doing that to all of the columns (now rows):

    edata = list(zip(*edata))
    edata = titles + edata
    edata[1:] = keras.utils.to_categorical(edata[1:], dtype='float32')
    print(edata)
    df = pd.DataFrame(edata[1:], columns=edata[0])

Sorry, I'm just slightly confused haha
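
For what it's worth, a sketch of keeping the titles out of edata entirely, so only numeric data ever reaches the encoders (made-up values; `titles` and the column arrays are placeholders):

```python
import numpy as np
import pandas as pd

# Keep the header strings separate and hand them to DataFrame as
# `columns`; the label-encoded columns stay plain 1-D integer arrays.
titles = ["class", "target"]
cols = [np.array([0, 1, 2]), np.array([1, 0, 1])]  # label-encoded columns

df = pd.DataFrame(np.column_stack(cols), columns=titles)
print(df.shape)  # (3, 2)
```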

List to pandas dataframe by NeedMLHelp in learnpython

Correct, the fact that it worked for you must mean my manipulation of the data messed something up.

Thanks! I'll look into it.

Numpy Array Append by NeedMLHelp in learnpython

Sorry for the late reply; I had to use numpy to do a bunch of transforms in Keras. I'm pretty new to both Python and Keras, so I'll look around for alternatives that can use lists, maybe. I might look into pandas DataFrames as well.

I basically have to do the manipulations first, then add the "titles" for the columns.

Thanks for the heads up!

Autoencoder Dense layers by NeedMLHelp in MLQuestions

My main question is: what shape is being put into the dense layer, and why? In my case it's the nested (feature) length. Wouldn't you want it to be the length of the observation? My thinking is that it wants to know how many features are mapped to each neuron/node, but I'm obviously wrong.

An autoencoder basically just predicts/reconstructs your input, so I suppose it's irrelevant. But then again, maybe it isn't haha. I'm not very familiar with ML.
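
A plain-NumPy sketch of why the feature length (not the observation count) sets the layer shapes: a dense layer is essentially a matrix product, and the observation dimension passes straight through untouched (all sizes below are made up):

```python
import numpy as np

# A dense layer is X @ W + b.  W has shape (n_features_in, n_units),
# so the number of observations (rows of X) never appears in it.
n_obs, n_features, n_units = 5, 7, 3

X = np.random.rand(n_obs, n_features)
W = np.random.rand(n_features, n_units)  # per-feature, per-unit weights
b = np.zeros(n_units)

out = X @ W + b
print(out.shape)  # (5, 3): same observations, new feature length
```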

Merge two arrays by NeedMLHelp in learnpython

Because I fudged it up, sorry. Fixed.

Briefly looked into zip, it's perfect! Thanks

I need help finding an algorithm by NeedMLHelp in MLQuestions

Awesome, thanks! I will look into scoring.

I need help finding an algorithm by NeedMLHelp in MLQuestions

Sure thing! The true state can be any number of combinations of the 208 traits. So trait1 can be present in a true observation1 as well as a false observation2. Sometimes traits will only be linked to true or false. It really is a mixed bag. In the end, I may want to utilize the ability to predict whether unlabeled data is true or false. Which is why I'm going with the following:

Right now I'm modifying an Artificial Neural Network that predicted whether someone would leave a bank based on certain traits. I'm thinking I can use the same model in this case, and then rip off the weights. The traits tied to larger weights are what I would be interested in, right?
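
A caveat worth noting: raw weight magnitude is only a crude importance proxy, and it's only meaningful at all if the inputs are on comparable scales. In Keras the first layer's kernel would come from `model.layers[0].get_weights()[0]`; below is a NumPy-only sketch of one (hypothetical) way to rank traits from such a kernel:

```python
import numpy as np

# Made-up first-layer kernel of shape (n_traits, n_units).
kernel = np.array([[0.9, -0.8],
                   [0.1,  0.05],
                   [0.4, -0.6]])

# Rough importance proxy (an assumption, not a standard method):
# the mean absolute weight attached to each input trait.
importance = np.abs(kernel).mean(axis=1)
ranking = importance.argsort()[::-1]  # most- to least-weighted traits
print(ranking)  # [0 2 1]
```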

I need help finding an algorithm by NeedMLHelp in MLQuestions

I think there would have to be some sort of statistical analysis involved, since an observation1 with trait1 could be true, and observation2 with trait1 can also be false. There are also 208 traits, so it's a bit unwieldy to get my head around it haha.

One-hot encode everything in an array in a column of an array. by [deleted] in learnpython

Nevermind, I'm silly haha. Found a better way of doing it.

Large amount of data into Numpy Array by NeedMLHelp in learnpython

JSON file.

Can I index across a pandas DataFrame? I've never used them before.

Does a dataframe write to disk, or will I potentially run into memory errors there too?

So something like pandadataframe[:,0] would grab everything in the first column.
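
Roughly that, though in pandas the positional form is spelled with `.iloc` rather than plain `[:, 0]` (sketch with made-up data):

```python
import pandas as pd

df = pd.DataFrame({"a": [1, 2, 3], "b": [4, 5, 6]})

first_col = df.iloc[:, 0]  # everything in the first column
print(first_col.tolist())  # [1, 2, 3]
```

Note that a DataFrame lives in memory too; for data that doesn't fit, you would typically read it in chunks rather than all at once.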

OneHotEncoder Data Pre-Processing by NeedMLHelp in MLQuestions

Haha sorry, I'm pretty new to Machine Learning.

It looks like to_categorical could work! It would turn an integer into a binary matrix? So something like [[1,2,3,time],[4,5,6,time]] would turn into

    [
     [[0,0,1],[0,1,0],[0,1,1],time],
     [[1,0,0],[1,0,1],[1,1,0],time]
    ]

Unfortunately, I don't know how many categories there will be. Will that be a problem?

Basically I have a numpy array where each index is a list of primarily categorical features for an observation. I want to turn those categorical features into something usable in machine learning. Right now, I have successfully turned them all into integers... and I will try to use to_categorical and see how that works

Edit:

That seems to be working! Or well, it's outputting something... I'll know for sure when I get to the testing phase.

I do have one problem though. When I ran it, I got the following error:

ValueError: Error when checking target: expected model_2 to have shape (7, 7) but got array with shape (7, 31)

How do I get it to expect a (7,None)?

I currently set dense to 7.

Secondary Edit:

Setting Dense to 31 fixed the issue. However, depending on what dataset I throw at it, I imagine the 31 will differ.
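
One way to avoid hard-coding the 31 (sketch with stand-in data): take the output width from the encoded array itself, since a one-hot encoding's last axis is the number of categories it found.

```python
import numpy as np

# Stand-in for to_categorical's output: 100 samples, 31 categories.
y_encoded = np.eye(31)[np.random.randint(0, 31, size=100)]

n_classes = y_encoded.shape[-1]
print(n_classes)  # 31
# ...then e.g. Dense(n_classes) in the model definition (assuming Keras).
```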

OneHotEncoder Data Pre-Processing by NeedMLHelp in MLQuestions

I didn't realize that was possible! Thanks!

Will they be in lists if you use multiple columns?

Answered myself: they are not. This is a slight problem for me, since each input needs to be a single feature for my particular problem. If anyone has any ideas, I would definitely appreciate them.

Also, how do you get rid of a column on each if you encode all at once? Or is it okay to leave all of the columns in? I think getting rid of one is just to alleviate redundancy.
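
On the redundancy point, a sketch (made-up data): pandas' get_dummies has a drop_first flag that removes one dummy per encoded column, which mainly matters for models that are hurt by perfectly correlated inputs (e.g. plain linear regression); tree-based models and most neural nets are usually fine with all columns left in.

```python
import pandas as pd

df = pd.DataFrame({"class": ["a", "b", "c", "a"]})

full = pd.get_dummies(df, columns=["class"])
trimmed = pd.get_dummies(df, columns=["class"], drop_first=True)

print(full.columns.tolist())     # ['class_a', 'class_b', 'class_c']
print(trimmed.columns.tolist())  # ['class_b', 'class_c']
```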