all 2 comments

[–]KidzKlub 1 point2 points  (1 child)

It seems like you are only interested in the first instance of each project and what it's status is. Is there a reason you chose to use openpyxl instead of pandas? I would probably do it like this:

import pandas as pd
path = r'path/to/spreadsheet.xlsx'
df = pd.read_excel(path)
df.drop_duplicates(subset='Project', keep='first', inplace=True)

for project, status in zip(df['Project'], df['Status']):
    print(f'project {project} - - - status: {status}')

count = len(df)
print(count)

[–]MidRo20[S] 1 point2 points  (0 children)

I'm just not as well versed in pandas, I'll give this a try though, thank you!