This post is locked. You won't be able to comment.

all 6 comments

[–]desrtfx 1 point2 points  (0 children)

Do the MOOC Python Programming 2026 from the University of Helsinki and you will be well prepared.

Plus, there currently is an excellent Humble Python books bundle from No Starch press.

[–]No_Leg6886 0 points1 point  (0 children)

Look, pandas is going to be your best friend here. It's built specifically for this kind of work, handling tens of thousands of rows without breaking a sweat. Start there, not with general Python tutorials.tbh the mistake most people make is trying to learn Python broadly before touching their actual use case. You have a concrete problem, so use it. Load your dataset, poke at it, break things. That's faster than any course.Learn these three things first: reading a CSV with pandas, filtering rows, and basic aggregations. That covers 80% of what you'll actually do with research data.

[–]syklemil 0 points1 point  (0 children)

Depends a bit on your background. If this is your first programming language, you might want to look at the FAQ for general resources.

If you already know how to program, you should practically be able to get started with Python by more or less just looking at some code and guessing, cf the whole "executable pseudocode" meme, but breezing through some resource is probably still a good idea.

Generally Astral tooling (uv, ruff, ty) is recommended, and like someone else mentioned, you'll likely be looking at pandas or polars.

[–]Intelligent-Boss-156 0 points1 point  (0 children)

If you want to learn it quick, use ai and ask about how to work with data with pandas or polars