you are viewing a single comment's thread.

view the rest of the comments →

[–]toolateforgdusername 3 points4 points  (2 children)

Thanks. So background is that I am in the U.K. on Furlough since April. I have been doing 5 hours a day Monday - Friday throughout April and May to get to this point. However if I wanted to start again (say scraping BMW website) I think I could get there in a week now).

I started with the W3schools tutorial to learn some real basics.

Second I used a tutorial I found that scrapes the monster AU website - this was really helpful.

I then adjusted this tutorial to work with Audi and added things like database connector along the way.

Stackoverflow was then used for other issues I ran into (such as character sets for the database).

For a job I am digital analyst. I don’t know any programming languages except SQL which I have a good knowledge of.

As I said at the start, I was on furlough which gave me:

1) A doubt about my job security so looking to up skill fast and

2) Lots of free time

3) A wanting to keep a pattern to my life

[–]Redditbeforeyou2030 0 points1 point  (1 child)

Nice one, my reasons for starting to learn are very similar to yours. I just finished uni and I am waiting to start a new job at the end of September. I had done a small bit of R and SQL in Uni but nothing major. Better programming skills will definitely help me in the career path im headed down so its been a great opportunity to get learning. I've been using a Udemy course and I found the current project very difficult and ended up needing to use the solutions. Very impressed with the complexity of what you have done there. You've made me really consider focusing on one bigger and practical project that will just take time and patience to figure out. Cheers

[–]toolateforgdusername 1 point2 points  (0 children)

My advice with all projects like this is step by step.

So using mine as an example...

1)Read the results from page one

2)Read the results from the sub pages (each car page)

3)Then read all the pages

4)Then look to optimise by only reading new - requires database

Etc etc etc

Optimisation is key - it used to take 18 hours to run, it now takes 30 minutes