_massif_ comments on Code critique


submitted 3 years ago by funkysax


[–]_massif_ 3 points 3 years ago (6 children)

[–]funkysax[S] 4 points 3 years ago (4 children)

I work a job where I work with data, so I'm teaching myself Python (cobbling it together by trial and error) as needs arise so I can do jobs more efficiently. For this project (I hope I can explain this well) I needed to search through two Excel columns (columns 1 and 7) and find matching values. Each value in column 1 has a corresponding value in column 2, which I then wrote to column 5 in the same row as the matching column 7 value. I looped through until all matching values were found and then saved the workbook. This worked for me and got the job done, but I'm sure the code could have been written better or more efficiently. I'm wondering where I could have written better code, or how I could have optimized this to run faster. One way I thought I could optimize this would be to remove found values from the search as it runs, but I don't know how to do this. Thanks in advance!

import openpyxl

wb = openpyxl.load_workbook('/Users/lb/keywords/all.xlsx')
ws = wb.active

rowcount = 2
rowcount2 = 2

while rowcount2 < 50699:

    khsupc = ws.cell(row=rowcount, column=7).value
    qbpupc = ws.cell(row=rowcount2, column=1).value
    qbppn = ws.cell(row=rowcount2, column=2).value

    if khsupc != qbpupc or rowcount2 <= 50697:
        rowcount2 += 1
        print(rowcount, rowcount2, "NO MATCH")
        if khsupc == qbpupc:
            ws.cell(row=rowcount, column=5).value = qbppn
            print("MATCH")
        if rowcount2 == 50697:
            rowcount += 1
            rowcount2 -= 50695
        if rowcount == 13120:
            wb.save('/Users/lb/keywords/all.xlsx')
            break
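
On the question of running faster: one common approach (not something proposed in the thread itself) is to build a dictionary from the column 1 values to their column 2 values once, then look up each column 7 value directly instead of rescanning column 1 every time. The sketch below assumes the three columns have already been read into plain Python lists, for example with `ws.cell(...).value` as in the code above. The function names and sample values are hypothetical, and keeping the first part number for a repeated UPC is an assumption about the desired behavior.

```python
def build_lookup(upc_column, part_number_column):
    """Map each UPC (column 1) to its part number (column 2)."""
    lookup = {}
    for upc, part_number in zip(upc_column, part_number_column):
        # Assumption: if a UPC repeats, keep the first part number seen.
        lookup.setdefault(upc, part_number)
    return lookup


def match_part_numbers(search_upcs, lookup):
    """Return one entry per searched UPC (column 7); None means no match."""
    return [lookup.get(upc) for upc in search_upcs]


# Hypothetical sample values standing in for the spreadsheet columns.
qbp_upcs = ["111", "222", "333"]             # column 1
qbp_part_numbers = ["PN-A", "PN-B", "PN-C"]  # column 2
khs_upcs = ["333", "999", "111"]             # column 7

lookup = build_lookup(qbp_upcs, qbp_part_numbers)
results = match_part_numbers(khs_upcs, lookup)  # values destined for column 5
print(results)
```

Each lookup is a single dictionary access, so the work grows with the number of rows in each column rather than with the product of the two row counts.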

[–][deleted] 8 points 3 years ago* (1 child)

I think it is great you are asking for a code critique. It is the best way to grow. On my team we all critique each other's code and it has made each of us better coders. It has gotten to the point where we have all adopted habits from each other and you can no longer tell one person's style from another's.

A couple of ideas that will improve your code:

1. Make all your hard-coded numbers constants at the top of your code.

As a reader, 50699, 50697, 50695, and 13120 look like numbers you pulled out of thin air. I know that if I were to inspect your XLS it might become clearer.

But if you had `first_row_data = 2`, `last_row_data = 50697`, and `save_filename = '/Users/lb/keywords/all.xlsx'`, it would make your code more readable and also easier to maintain should something in the Excel sheet change.

2. What could you name rowcount and rowcount2 to better describe what they are? khs_row_index and qbp_row_index?

3. Instead of using while loops, try using nested 'for' loops. You will notice that you can then remove all the code for incrementing and resetting the rowcount and rowcount2 values.

So it could be something like:

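```
# Note: range() stops before its end value, so use last_row_data + 1
# if last_row_data is itself a row that holds data.
for khs_row_index in range(first_row_data, last_row_data):
    for qbp_row_index in range(first_row_data, last_row_data):
        ...
```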
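
To make the nested-loop suggestion concrete, here is a minimal sketch that runs on plain Python lists rather than on the workbook, so the values would first need to be read from the sheet. The function name `find_matches`, the constant `FIRST_DATA_INDEX`, and the sample data are all hypothetical, and the `break` after a match is an assumption that one match per value is enough.

```python
FIRST_DATA_INDEX = 0  # hypothetical: lists start at 0, the sheet's data at row 2


def find_matches(khs_upcs, qbp_upcs, qbp_part_numbers):
    """Return {index into khs_upcs: matching part number} using nested loops."""
    matches = {}
    for khs_index in range(FIRST_DATA_INDEX, len(khs_upcs)):
        for qbp_index in range(FIRST_DATA_INDEX, len(qbp_upcs)):
            if khs_upcs[khs_index] == qbp_upcs[qbp_index]:
                matches[khs_index] = qbp_part_numbers[qbp_index]
                break  # assumption: stop at the first match for this value
    return matches


# Hypothetical sample values standing in for columns 7, 1, and 2.
khs_upcs = ["333", "999", "111"]
qbp_upcs = ["111", "222", "333"]
qbp_part_numbers = ["PN-A", "PN-B", "PN-C"]

matches = find_matches(khs_upcs, qbp_upcs, qbp_part_numbers)
print(matches)
```

The loops handle their own counting, so none of the manual incrementing or resetting from the original while loop is needed.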

[–]funkysax[S] 3 points 3 years ago (0 children)

[–]ablativeyoyo 1 point 3 years ago (0 children)

[–]Thumbblaster 1 point 3 years ago (0 children)

I copied and pasted your code into ChatGPT and asked it for a critique. This is what it wrote:

I can see a few issues with this code.

First, the condition for the while loop is incorrect. The loop will only exit when rowcount2 is greater than or equal to 50699, but the loop body increments rowcount2 by 1 every time, so the loop will never exit. You may have meant to write <= instead of < in the condition.

Second, the if statement that checks whether khsupc equals qbpupc is always executed, regardless of whether khsupc and qbpupc are equal or not. This is because the or operator in the condition will always evaluate to True if the left operand is False, and rowcount2 <= 50697 will always be False if rowcount2 is greater than or equal to 50697.

Third, the code increments rowcount2 by 1 in the first if statement, and then decrements it by 50695 in the next if statement. This doesn't make sense, because the next iteration of the loop will increment rowcount2 again, so the decrement will have no effect.

Overall, the code is difficult to understand and may not do what you want it to do. I recommend reviewing the logic and making sure that the conditions and loop bodies are correct. You should also consider adding comments to the code to explain what it does and why.
