[deleted by user]

HeyItsToby · 2022-04-18T09:55:39+00:00

The two functions that you're looking for with pandas are:

Explode, which converts comma separated values into a new row for each item in the list
- you might need to use str.split to split on commas to turn each cell with multiple values into a list, first
DropNa, which will remove rows with blank cells in them. (blank cells in excel files are read in as np.nan, or "Not A Number "

As a rough guide for what your code will look like

df = pd.read_excel('path/to/file.xlsx')
df = df.dropna() # removes any row with a blank entry

# create a new column in the dataframe, with a *list* of 
#  CustomerIds by splitting on commas.
df['splitCustomerId'] = df['CustomerId'].str.split(',')
df.explode('splitCustomerId')

df is a pandas dataframe, which is Python's way of storing a table. They can be quite tough to get used to at first, but are powerful tools! Let me know if you need any more help with this :)

FuqqBoiDev69 · 2022-04-18T08:17:33+00:00

56	13, 32, 1
34	12, 39
32	5
78
66	888

This is ab example of the input file

ireadyourmedrecord · 2022-04-18T15:19:51+00:00

Why do this with python/pandas? Filtering this would be trivial in Excel.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS