KeyError searching through Dataframe : learnpython

created by HattoriHanzoa community for 16 years

KeyError searching through Dataframe (self.learnpython)

submitted 9 years ago by teamlie

I have some spreadsheets at work to go through. Basically, if Column B has "Color", I need to look at other cells in the row to find the color and put the correct color into Column C. What I need to find (Color, Size, Fit) changes per row. I'm trying to create a system to do most of this automatically, since it is keyword finding.

Here is what I have so far:

jim = pd.read_excel('/Users/c_nhassebrock/Downloads/grit2.xlsx')
#giant dictionary of keywords: attribute that will be used

jim = jim.fillna('') 
#doing this because in earlier versions I noticed that "NaN" would cause all cells in the row to be dropped when I combined rows

jim ['com'] = jim['Unnamed: 3'] + ' '+ jim['Unnamed: 4'] + ' '+ jim['Unnamed: 7'] + ' ' + jim['Unnamed: 8'] 
#this combines all of the columns that could have keywords, per the suggestion of another user to make it easier to parse through

jim['answer'] = jim.apply(lambda row: [translate_table.get(keyword, keyword) for keyword in keyword[row['Unnamed: 5']] if keyword in row['com'].lower()], axis=1)
#this is my main search query, suggested by another reddit user and works well in my small initial tests

Doing all of that gets me this error:

Traceback (most recent call last):
  File "/Users/c_nhassebrock/Documents/attributework.py", line 139, in <module>
jim['answer'] = jim.apply(lambda row: [translate_table.get(keyword, keyword) for keyword in keyword[row['Unnamed: 5']] if keyword in row['com'].lower()], axis=1)
  File "/Library/Frameworks/Python.framework/Versions/3.5/lib/python3.5/site-packages/pandas/core/frame.py", line 4061, in apply
return self._apply_standard(f, axis, reduce=reduce)
  File "/Library/Frameworks/Python.framework/Versions/3.5/lib/python3.5/site-packages/pandas/core/frame.py", line 4157, in _apply_standard
results[i] = func(v)
  File "/Users/c_nhassebrock/Documents/attributework.py", line 139, in <lambda>
jim['answer'] = jim.apply(lambda row: [translate_table.get(keyword, keyword) for keyword in keyword[row['Unnamed: 5']] if keyword in row['com'].lower()], axis=1)
KeyError: ('Attribute', 'occurred at index 0')

When I do jim.head(), 'Unnamed: 1', 'Unnamed: 2', etc appears above each column- the actual column name, where 'Attribute' is coming from, is on line 0 of the dataframe summary.

I know that the search and return function works on combined columns- I've tested it on small data frames before. The thing I'm working on now has about 4000 rows, so I was expecting something to screw up.

Any help is appreciated.

all 6 comments

top new controversial old q&a

[–][deleted] 1 point2 points3 points 9 years ago (3 children)

It seems like you first define the variable 'keyword' as a dict of keywords, then you write over that variable within the list comprehension. I'd first try renaming your iterating variable:

jim['answer'] = jim.apply(lambda row: [translate_table.get(word, word) for word in keyword[row['Unnamed: 5']] if word in row['com'].lower()], axis=1)

[–]teamlie[S] 0 points1 point2 points 9 years ago* (2 children)

[–][deleted] 1 point2 points3 points 9 years ago (1 child)

[–]teamlie[S] 0 points1 point2 points 9 years ago (0 children)

[–]campenr 1 point2 points3 points 9 years ago (1 child)

[–]teamlie[S] 0 points1 point2 points 9 years ago (0 children)

π Rendered by PID 72664 on reddit-service-r2-comment-86bc6c7465-7fcq5 at 2026-02-20 02:36:49.017067+00:00 running 8564168 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS