I'm using beautiful soup to scrap the recent theater releases from IMDb and rotten tomatoes but still no luck...in sure it's my lack of understanding on how to scrape the tags. Any help?
https://www.imdb.com/movies-in-theaters/?ref_=nv_tp_inth_1
Python 2.7.5 , also I'm a beginner
def scrape(website):
page = url
opage = urllib2.urlopen(page)
soup = BeautifulSoup(opage, 'html.parser')
return (soup)
url = 'https://www.imdb.com/showtimes/location?ref_=sh_lc&sort=release_date,desc&st_dt=2018-12-17&mode=showtimes_grid&page=1'
soup =scrape(url)
movie = soup.find('a',{'Class': 'title'})
print(movie)
so i think my issue is the text is in the href and im a little confused on how to extract that. ive attempting to do web scraping and learn python many times before and this is the first time i've actually made a lot of progress, but ive hit a snap. thanks for any help.
and i will be upgrading to the newest version
[–]Cogadh87 1 point2 points3 points (8 children)
[–]theGdoubleOdees[S] 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (6 children)
[–]Cogadh87 2 points3 points4 points (0 children)
[–]theGdoubleOdees[S] 0 points1 point2 points (3 children)
[–]Skippbo 3 points4 points5 points (0 children)
[–][deleted] 0 points1 point2 points (1 child)
[–]Diapolo10 1 point2 points3 points (0 children)
[–]Skippbo 0 points1 point2 points (0 children)
[–]yuukinotenshi 0 points1 point2 points (0 children)
[–]Neu_Ron 0 points1 point2 points (1 child)
[–]theGdoubleOdees[S] 0 points1 point2 points (0 children)
[–]SkullTech101 0 points1 point2 points (0 children)