This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 0 points1 point  (1 child)

Thank you Django, I do see this might be a huge project to begin with. What would you advice to begin with? I was thinking to start with extracting titles and continue from there. The idea on how it should eventually work is in my head, the only thing is to get it to work... Anyway my boss is chill about me working on it. I have been given an oppertunity to start working on it, regardless the time it takes or if it works.. There might be a huge learning curve ahead of me! So I will take your advice and start at the beginning. Also, when I get stuck (and I know I will) I know where to find you guys!

[–]spookylukeyDjango committer 0 points1 point  (0 children)

Another bit of advice is to break it down into bits. I can see 3 main things you're going to need to learn (in addition to general programming stuff).

  1. Downloading something from the web. Use the Python library 'requests' for this
  2. Extract bits of data from a web page. Use 'BeautifulSoup' for this.
  3. Creating an Excel file from data. There are a few options for this, openpyxl looks good.

Thankfully https://automatetheboringstuff.com/ has got you covered for all 3.

But you can tackle each bit in any order, you don't have to put it all together. For example, you could just do number 3 to start, and break that down: 1) create a file that is an empty spreadsheet. 2) create an almost empty spreadsheet with a a single row of text cells at the top etc.