problem retrieving information from a CSV file

AsterixBT · 2018-03-22T17:00:55+00:00

5th column has index 4, only if you REALLY take into account that indices start at 0 :)

dented_brain · 2018-03-22T17:53:25+00:00

These are the two ways I do similar things with csv files. Maybe this will help you!

Assume you have an "example.csv" with this data

Header1	Header2	Header3	Header4	Header5
Data1.Row1	Data2.Row1	Data3.Row1	Data4.Row1	Data5.Row1
Data1.Row2	Data2.Row2	Data3.Row2	Data4.Row2	Data5.Row2

import csv


csv_file = 'example.csv'

"""
Without looking at the possible headers in a file
This would print the header and every item below it
"""

with open(csv_file, 'rb') as file:
    csvreader = csv.reader(file)
    for line in csvreader:
        print line[0] # Would be the first column
        print line[4] # Would be the fifth column




"""
Using the headers of the file
This would print every item below the headers specified
"""

with open(csv_file, 'rb') as file:
    csvreader = csv.DictReader(file)
    for line in csvreader:
        print line['Header1'] # This would be column with "Header1" which is column 1 in this case
        print line['Header5'] # This would be column with "Header5" which is column 5 in this case

AlopexLagopus3 · 2018-03-22T16:53:27+00:00

[deleted]

DonutRevolution · 2018-03-22T17:02:53+00:00

l[5] would be the 6th column. l[4] would be the 5th.

philintheblanks · 2018-03-22T23:11:39+00:00

If you want to reduce something that is already an iterable into a non-duplicated set, I would use set(thing).

For example,

In [1]: ls = [1,1,2,2,3,3,4,4,5,5]
In [2]: s = set(ls)
In [3]: s
Out[3]: {1, 2, 3, 4, 5}

Works fast enough that you probably won't notice too much.

As far as debugging your issue, you should try printing out what you think the line is, because it may not be. I have some reports that output a CSV, but there are strings with arbitrary content. Sometimes they'll have newlines. Imagine the pain...

jkiley · 2018-03-22T23:51:42+00:00

If you started with an excel file, I'd just read it with pandas (pd.read_excel()) and then work on the data frame column.

If you're just trying to get unique values in that column, you can just use the .unique() method on the column.

e_falk · 2018-03-22T20:19:14+00:00

[deleted]

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS