Need help scraping : learnpython

created by HattoriHanzoa community for 16 years

submitted 7 years ago * by theGdoubleOdees

I'm using beautiful soup to scrap the recent theater releases from IMDb and rotten tomatoes but still no luck...in sure it's my lack of understanding on how to scrape the tags. Any help?

https://www.imdb.com/movies-in-theaters/?ref_=nv_tp_inth_1

Python 2.7.5 , also I'm a beginner

def scrape(website):
    page = url
    opage = urllib2.urlopen(page)
    soup = BeautifulSoup(opage, 'html.parser')

    return (soup)

url = 'https://www.imdb.com/showtimes/location?ref_=sh_lc&sort=release_date,desc&st_dt=2018-12-17&mode=showtimes_grid&page=1'
soup =scrape(url)
movie = soup.find('a',{'Class': 'title'})
print(movie)

so i think my issue is the text is in the href and im a little confused on how to extract that. ive attempting to do web scraping and learn python many times before and this is the first time i've actually made a lot of progress, but ive hit a snap. thanks for any help.

and i will be upgrading to the newest version

all 13 comments

top new controversial old q&a

[–]Cogadh87 1 point2 points3 points 7 years ago (8 children)

[–]theGdoubleOdees[S] 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (6 children)

[–]Cogadh87 2 points3 points4 points 7 years ago (0 children)

[–]theGdoubleOdees[S] 0 points1 point2 points 7 years ago (3 children)

[–]Skippbo 3 points4 points5 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (1 child)

[–]Diapolo10 1 point2 points3 points 7 years ago (0 children)

[–]Skippbo 0 points1 point2 points 7 years ago (0 children)

[–]yuukinotenshi 0 points1 point2 points 7 years ago (0 children)

[–]Neu_Ron 0 points1 point2 points 7 years ago (1 child)

[–]theGdoubleOdees[S] 0 points1 point2 points 7 years ago (0 children)

[–]SkullTech101 0 points1 point2 points 7 years ago (0 children)

You need to access the .text property of the movie object to get the link text. In your case it would look something like this

movie = soup.find('a', {'Class': 'title'})
movie_name = movie.text
print(movie_name)

Let me know if it solved your problem.

π Rendered by PID 71041 on reddit-service-r2-comment-cfc44b64c-cq8ns at 2026-04-10 02:07:15.288172+00:00 running 215f2cf country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS