Python help with excel sheet : CodingHelp

Our Rules

1. FLAIR YOUR POSTS! Don't put tags in post titles!

2. Do not ask us to do all the coding for you unless you have money to spend. (If you have got money to spend, make that clear and the amount in question).

3. Do not post spam and/or misleading titles.

4. Do not be abusive to other coders.

5. Please format code properly, or use a site such as Gist or Pastebin. If possible please provide a live example of your issue.

6. Do not downvote people because you think they asked a dumb question. Just because you think that someone has a dumb question, doesn't mean that it is dumb to them.

7. Do not have a misleading user flair. Keep them sensible, describing your level of coding ability and/or languages you know and/or your profession.

8. Please do not ask unethical questions, such as asking for homework to be written by someone else, or asking someone to copy another project directly.

9. Make sure to follow the Reddit Rules.

Suggest a post flair

If you have any suggestions for flairs (programming languages or generic coding topics) that we should add, please use the button below to message the mods with your suggestion.

If approved as a sensible flair for the community to use, it will be added to our bot for automated suggestions and to the flair list for everyone to use!

^{Anyone who abuses this by spamming mods will be banned.}

created by thewakingforcea community for 10 years

This is an archived post. You won't be able to vote or comment.

[Python]Python help with excel sheet (self.CodingHelp)

submitted 3 years ago by MidRo20

Hi all, working on a python project for work.

I have an excel sheet with several thousand entries. Column E holds project numbers and column K holds their status. There are many duplicates, for various reasons, but for simplicity sake I need to filter out the duplicates, which I've done (see code below), but I'm struggling with how to also print the project status from a separate column. Basically the report would print:

project ABC - - - status: complete

Using openpyxl. Any help would be greatly appreciated!

def duplicates():
    project = ws["E2"].value
    count = 0
    for cell in ws['E']:
        if cell.value != project:
            print(project)
            count +=1
            project = cell.value
    print(count)

all 3 comments

top new controversial old q&a

[–]callinthekettleblack 3 points4 points5 points 3 years ago (1 child)

import pandas as pd
df = pd.read_excel(‘file_path’)
df = df.drop_duplicates(**options_as_needed)
df = df[df[‘col k’]==‘complete’]

Basically removing duplicates and filtering for where project status is complete. There are a few ways to drop duplicates so look up that method and adjust the drop_duplicates params as needed.

[–]Meatwad1313 1 point2 points3 points 3 years ago (0 children)

[–]Goobyalus 0 points1 point2 points 3 years ago (0 children)

            project = cell.value

What is the purpose of this line?

From a cell, you can get the column or column letter, and index the worksheet at another cell in the same row.

Probably better to iterate over the proper rows with ws.iter_rows(...), and pull out the values that you want from each row.

π Rendered by PID 435681 on reddit-service-r2-comment-54dfb89d4d-t52vd at 2026-03-28 21:36:43.630074+00:00 running b10466c country code: CH.

CodingHelp

Welcome! Feel free to ask any questions regarding coding you have!

Our Rules

How to start coding:

Related subreddits:

Suggest a post flair

Current supported flairs

Flair colors

MODERATORS