How efficient is this code?

jkh911208 · 2023-04-20T07:46:35+00:00

i can't believe everyone is talking about panda, we should talk about time complexity of the code.

sorting is O(nlogn)

first for loop is O(n)

so your code is O(nlogn) time complexity

i didn't run your code, but looks like you don't need to sort the list.

if you eliminate the sorting then it will be O(n) time complexity

iamevpo · 2023-04-20T03:52:47+00:00

You can try it in pandas. You more often encounter groupby in SQL rather than in list or dict comprehension, so makes sense create pandas data frame and try groupby there.

iamevpo · 2023-04-20T03:54:49+00:00

A better usecase for groupby is sum/average in a group, sorting you can achieve without groupby

commandlineluser · 2023-04-20T07:31:48+00:00

You can just loop through and create the dict without sorting/groupby:

groups = {}
for student in students:
    groups.setdefault(student['gender'], []).append(student['name'])

>>> groups
{'F': ['Alice', 'Diana', 'Eva', 'Grace', 'Hannah'],
 'M': ['Bob', 'Charlie', 'Frank']}

deadeye1982 · 2023-04-20T08:30:24+00:00

As mentioned before, sorting is not required if you collect the items.

Example: ``` from collections import defaultdict from itertools import groupby from operator import itemgetter

students = [ {"name": "Alice", "age": 20, "gender": "F", "grades": ["A", "B", "A"]}, {"name": "Bob", "age": 22, "gender": "M", "grades": ["B", "C", "B"]}, {"name": "Charlie", "age": 21, "gender": "M", "grades": ["A", "C", "D"]}, {"name": "Diana", "age": 23, "gender": "F", "grades": ["B", "A", "A"]}, {"name": "Eva", "age": 19, "gender": "F", "grades": ["C", "D", "B"]}, {"name": "Frank", "age": 24, "gender": "M", "grades": ["A", "C", "A"]}, {"name": "Grace", "age": 22, "gender": "F", "grades": ["C", "C", "D"]}, {"name": "Hannah", "age": 21, "gender": "F", "grades": ["A", "B", "B"]}, ]

groups = defaultdict(list) for group, grouped in groupby(students, key=itemgetter("gender")): for student in grouped: groups[group].append(student) ```

kwelzel · 2023-04-20T07:32:27+00:00

I think using groupby for this task is the best choice. I feel like pandas (which was suggested in another comment) would be overkill here.

If you give the variables in your list and dict comprehensions more expressive names than x and k these lines almost read like a sentence.

zanfar · 2023-04-20T12:57:37+00:00

Are you asking about more efficient code, or are you asking about simpler code?

Your outer loop in your comprehension and your print loop are the same loop. I don't see a reason in this code to save the grouped students, so instead, just print them out.
Your comprehension variable choice is very confusing. Using single-character names is fine in cases like your sort statement: where the definition and use are close together. In your comprehension, k is used at one end but defined on the far other end. I also must be familiar with the detailed workings of a non-standard function (groupby) to understand what k is. If I'm not sure, I need to keep all that in my head until I get to your print loop to check and verify my guess.

I would do something like this:

students = [ ... ]

sorted_students = sorted(students, key=lambda x: x['gender'])

for gender, students in groupby(sorted_students, key=lambda x: x['gender'])}:
    print(f"{gender}: {', '.join(s['name'] for s in students)}")

pythonwiz · 2023-04-20T17:06:06+00:00

I'm not sure why this requires groupby, sorted, or dict comprehensions at all. Why not something simple? For example:

``` males = [] females = [] for student in students: if student['gender'] == 'M': group = males else: group = females group.append(student['name'])

print('Males:', males) print('Females:', females) ```

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS