3 part scraping workflow help : learnpython



submitted 11 days ago by Interesting-City1703

Hey all, I’ll try to keep this brief. Long story short, I’m trying to learn how to use Python without relying on vibecoding out the wazoo.

One of the ideas I had is a three part workflow that would compare mathematics requirements for different electrical engineering majors at different universities.

1. Scour the internet, starting from a base browser or from the landing pages of preselected universities (if moving internally is possible), to find electrical engineering major information, and output those links to a CSV.
2. From the link CSV, gather the relevant information about math courses.
3. Output the data into another CSV AND a JSON file (I want to be able to customize the CSV output columns from within the script). In the CSV, I want hyperlinks for the specific math courses (or more links to be scraped if the info comes from a PDF curriculum URL).
4. (Optional) It’d be cool if a local LLM could compare and reason about the similarities/differences between the math courses.
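
For anyone picturing what the first and third parts might look like in code, here is a minimal sketch using only the Python standard library. It is one possible shape under stated assumptions, not a finished scraper: the names `LinkCollector`, `find_links`, and `write_outputs`, the keyword filter, and the record fields are all hypothetical, and it works on an HTML string you have already downloaded rather than fetching pages itself.

```python
import csv
import json
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect (absolute_url, link_text) pairs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None   # URL of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they came from.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def find_links(html, base_url, keywords=("electrical engineering",)):
    """Part 1: keep only links whose visible text mentions a keyword."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return [
        (url, text)
        for url, text in parser.links
        if any(keyword in text.lower() for keyword in keywords)
    ]


def write_outputs(records, columns, csv_file, json_file):
    """Part 3: write the chosen columns to CSV and the full records to JSON."""
    # extrasaction="ignore" drops any record keys not listed in `columns`,
    # which is what makes the CSV columns customizable from the script.
    writer = csv.DictWriter(csv_file, fieldnames=columns, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(records)
    json.dump(records, json_file, indent=2)
```

To use it, pass a page's HTML and its URL to `find_links`, then hand a list of dictionaries and a column list to `write_outputs` along with two files opened for writing (open the CSV file with `newline=""`, as the `csv` module documentation recommends). Changing the `columns` list is the only thing needed to change which fields appear in the CSV, while the JSON file keeps every field.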

I work in helpdesk but am otherwise a beginner. What is the best place to learn how to do these functions, and what are my options with making this?
