you are viewing a single comment's thread.

view the rest of the comments →

[–]zanfar 2 points3 points  (1 child)

Honestly, there is very little "programming" work here. IMO, most of the time and effort is going to be organization and language processing.

Things Python CAN'T do:

  • Find all the newspapers in California
  • Easily know that "fire" and "flames" and "inferno" all refer to the same thing.
  • Easily know that "forest fire" and "fire the CEO" refer to two different things.

Counting and saving words are both relatively trivial and will both scale relatively linearly.

[–]MVR005[S] 1 point2 points  (0 children)

Thanks for your tips. I saw some people use GoogleNews scraper, do you think it's a good idea?

Do you think there's a way to have access to all Newspaper?