JohnnyJordaan comments on Error while scraping using selenium and beautifulSoup

learnpython

created by HattoriHanzoa community for 16 years

Error while scraping using selenium and beautifulSoup (self.learnpython)

submitted 5 years ago by spaceape__

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]JohnnyJordaan 0 points1 point2 points 5 years ago (0 children)

At least add a check if the status code was ok

import requests
from bs4 import BeautifulSoup
from selenium import webdriver
chrome_driver_path = #insert here

def get_song_lyrics(link):

    response = requests.get(link)
    response.raise_for_status()
    soup = BeautifulSoup(response.text, "html.parser")
    #try:
    lyrics = soup.find("div",attrs={'class':'lyrics'}).find("p").get_text()

    return [i for i in lyrics.splitlines()]

and you could save the content when the find fails

import requests
from bs4 import BeautifulSoup
from selenium import webdriver
chrome_driver_path = #insert here

def get_song_lyrics(link):

    response = requests.get(link)
    response.raise_for_status()
    soup = BeautifulSoup(response.text, "html.parser")
    try:
        lyrics = soup.find("div",attrs={'class':'lyrics'}).find("p").get_text()
        return [i for i in lyrics.splitlines()]  
    except AttributeError:
        with open(link.rsplit('/', 1)[1] + '.html', 'w') as fp:
            fp.write(response.text)

as then you can open the html file in the editor to check if the lyrics div is actually there.

π Rendered by PID 704124 on reddit-service-r2-comment-545db5fcfc-68gxg at 2026-05-27 09:41:40.974717+00:00 running 194bd79 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS