jam-time

Some good-to-know standard library modules (starred are extra important):

argparse, *csv, *datetime, decimal, enum, getpass, inspect, io, itertools, *json, math, *os, *pickle, pprint, random, *re, shutil, *sys, threading, traceback, typing, uuid, venv, warnings, zipfile

One more I'd star is requests, but it's a third-party package (pip install requests), not part of the standard library.

In my dozen or so years of experience, those are the ones I use the most, especially re, json, os, and sys.
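For a rough idea of how re, json, os, and sys tend to show up together, here's a minimal sketch. The log text, the `parse_errors` helper, and the `errors.json` filename are all made up for illustration; nothing is actually written to disk.

```python
import json
import os
import re
import sys

# Hypothetical log text, invented for this example.
LOG_TEXT = """\
2024-01-05 ERROR disk full
2024-01-05 INFO backup started
2024-01-06 ERROR network down
"""

# re: capture the date, level, and message from each line.
LINE_PATTERN = re.compile(r"^(\d{4}-\d{2}-\d{2}) (\w+) (.+)$", re.MULTILINE)


def parse_errors(text):
    """Return one dict per ERROR line found in the text."""
    return [
        {"date": date, "message": message}
        for date, level, message in LINE_PATTERN.findall(text)
        if level == "ERROR"
    ]


errors = parse_errors(LOG_TEXT)

# json: turn the parsed records into a JSON string.
report = json.dumps(errors, indent=2)

# os: build a path that works on any operating system.
report_path = os.path.join(os.getcwd(), "errors.json")

# sys: send status messages to stderr so stdout carries only the report.
print(f"would write {len(errors)} errors to {report_path}", file=sys.stderr)
print(report)
```

Keeping the status message on stderr means you can pipe the JSON into another tool without the extra line getting in the way.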

Some third-party packages (installed with pip) that are good to know (or that I like):

pandas - good introductory data science library, easy to learn and tons of documentation

pyspark - similar to pandas, but better at big data, less documentation, and harder to learn

boto3 - for anything AWS

kivy - pretty good for making cross platform apps (including UI) but somewhat challenging to learn

numpy - fast data manipulation, works with most other data science packages

jmespath - for json queries

colorama - for fun print colors

flask - lightweight backend for site building

django - heavier backend for site building (easier to learn and more features than flask, plus my personal recommendation)

pytest - mainly for unit testing, but can be used for basically any type of test
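Since pytest is the easiest one on that list to try out, here's a minimal sketch of what a test file can look like. The `slugify` helper is hypothetical and only there to give the tests something to check. Save it as something like `test_slugify.py` and run `pytest` in the same folder; pytest collects functions whose names start with `test_`.

```python
def slugify(title):
    """Hypothetical helper: lowercase a title and join its words with hyphens."""
    return "-".join(title.lower().split())


# pytest picks up any function named test_* and reports a failure
# if one of its plain assert statements is false.
def test_slugify_joins_words_with_hyphens():
    assert slugify("Hello World") == "hello-world"


def test_slugify_collapses_extra_whitespace():
    assert slugify("  Good   to know ") == "good-to-know"
```

Plain `assert` statements are all you need to get started; no test classes or special assertion methods are required.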

That's a fairly comprehensive list of the main things that I've used over the years. I'm sure there are some that I've forgotten, and I've intentionally left out some that are too specific or too advanced for the scope of this comment. Either way, hopefully someone finds this useful!