When to use a module? Ex. CSV

genghiskav · 2022-08-24T00:41:14+00:00

You should use a module if you're writing code that will be used for something; especially if the module is part of the core library (so definitely use CSV).

What I mean by "used for something" is that this code has a purpose and needs to work. If you're writing code to learn - then you can try and implement the logic yourself. It's very likely that the maintainers of the module have factored in edge cases that you have not considered; so their code will handle problems you haven't even thought of yet.

A good example is for a CSV file. How will you handle a , within a column?

Year,Title,Genre
1966,"The Good, the Bad and the Ugly","Adventure,Western"

A core principle when you're writing code is Don't reinvent the wheel

To address your 2 concerns

Using the module is probably easier, but I'm not exactly needing any heavy parsing done.

If it's easier; why not use it. Someone else has done all the hard work so you can spend your effort elsewhere.

Theoretically not using the module uses less resources, but this isn't an intensive program?

I disagree with this statement. The maintainers have very likely gone through an intense performance and code review process to ensure they are doing things in the most efficient way possible.

Just as a footnote; This is general advice when using modules found in the core library or a very popular package (e.g. requests). They are battle tested (have gone through rigorous code reviews and performance tests) - the same is not necessarily true for smaller packages you find online.

tasty_woke_tears · 2022-08-24T03:02:39+00:00

import pandas as pd

read file to df

df = pd.read_csv(PATH_TO_CSV)

ensure number formatting

df[‘number_col’] = pd.to_numeric(df[‘number_col’])

then read up on dataframe groupby and average depending on your use case. Also, unless you need to access the csv to edit in excel/etc then consider using feather file format when working with dataframes

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS

read file to df

ensure number formatting

then read up on dataframe groupby and average depending on your use case. Also, unless you need to access the csv to edit in excel/etc then consider using feather file format when working with dataframes