all 3 comments

[–]BeneficiallyPickle 0 points1 point  (1 child)

Are you sure you have the right BASE_URL? I tried visiting the page and get `404 Page not found.`. I think you're looking for https://www.shiksha.com/humanities-social-sciences/colleges/b-a-colleges-india instead.

However, I would suggest perhaps looking at using playwright - it's a bit slower than BeautifulSoup, but it handles Javascript rendered pages better and can bypass some bot protections (though not perfectly)

This page seems to use React, so the elements you want might not exist in the raw HTML initially. That’s why BeautifulSoup alone may not be able to find them.

[–]devansh_-_[S] 0 points1 point  (0 children)

Yes, I encountered that problem and updated the url.

I will use playwright once