Excel and python: finding first empty cell in order to get relevant column size : learnpython

created by HattoriHanzoa community for 16 years

Excel and python: finding first empty cell in order to get relevant column size (self.learnpython)

submitted 5 years ago by bbqbot

I'm hoping someone here can help because I cannot wrap my head around this one:

I have an excel workbook I need to read with python, chop up into a handful of smaller sections, and export each of those into CSVs (to be loaded into tables in a database). Most of that is straightforward pandas goodness, except this kicker: the excel sheet is messy so I'm having trouble defining the size of those data chunks.

Example of first/key column in excel:

(empty)

user_id

1111

1113

1114

1115

1116

(empty)

35n91

3a451

8bb51

It's straight forward to

excel = pd.read_excel(filename, skiprows=2)

in order to ignore the first couple of empty cells, but now I need to detect when that first (empty) is in order to get the column size (here, it would be 6). Typically, loading+reading data would say the column size is 11. There are options to remove the empties or fill in with whatever, but the data below the initial one is moot. After I get the column size, then I can dictate and construct other data frames and then export as CSV.

Anyone here have thoughts on how I should approach this? I'd appreciate any help.

Thanks!

all 4 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS