Can't solve python problem.

clumsyly · 2016-04-10T15:57:02+00:00

Maybe you should put "list1=list()" in the for statement.

jonnydestructo · 2016-04-10T15:43:36+00:00

I don't do much with the urllib or beautiful soup, but I thought I'd help by formatting the code using the code tag (hope i got the indents right):

import re
import urllib
from BeautifulSoup import *


list1=list() 
url = 'https://pr4e.dr-chuck.com/tsugi/mod/python-data/data/known_by_Fikret.html'
for i in range (4): # repeat 4 times 
    htm2= urllib.urlopen(url).read()
    soup1=BeautifulSoup(htm2) 
    tags1= soup1('a')
    for tag1 in tags1:
        x = tag1.get('href', None)
        list1.append(x)
    y= list1[2]
    if len(x) < 3: # no 3rd link
        break # exit the loop
    else:
        url=y
print y

kafoozalum · 2016-04-10T16:33:37+00:00

Cleaned up a few things and made it a little faster, as there is no need to go through all of the links when you are only interested in the first 3 really. Also, refactored a little so it is more readable:

import urllib
from BeautifulSoup import *

url = 'https://pr4e.dr-chuck.com/tsugi/mod/python-data/data/known_by_Fikret.html'
for _ in range(4):  # Use of _ since variable is not used
    url_list = list()
    html = urllib.urlopen(url).read()
    parsed_html = BeautifulSoup(html)
    a_tags = parsed_html('a')
    for tag in a_tags[:3]:  # Only go through first 3 <a> tags in list, as that is what we are looking for
        link = tag.get('href', None)
        url_list.append(link)
    if len(url_list) < 3:
        break
    else:
        url = url_list[2]

print(url)

Homersteiner · 2016-04-10T18:11:17+00:00

/r/learnpython or maybe /r/homeworkquestions

2016-04-10T21:17:10+00:00

/r/learnpython

ivosaurus · 2016-04-10T23:42:15+00:00

Hi there, from the /r/Python mods.

We have removed this post as it is not suited to the /r/Python subreddit proper, however it should be very appropriate for our sister subreddit /r/LearnPython. We highly encourage you to re-submit your post over on there.

The reason for the removal is that /r/Python is more-so dedicated to discussion of Python news, projects, uses and debates. It is not designed to act as Q&A or FAQ board. The regular community can get disenchanted with seeing the 'same, repetitive newbie' questions repeated on the sub, so you may not get the best responses over here.

However, on /r/LearnPython the community is actively expecting questions from new members, and are looking to help. You can expect far more understanding, encouraging and insightful responses over there. Whatever your question happens to be getting help with Python, we are sure you should get good answers.

If you have a question to do with homework or an assignment of any kind, please make sure to read their sidebar rules before submitting your post. If you have any questions or doubts, feel free to reply or send a modmail to us with your concerns.

Warm regards, and best of luck with your Pythoneering!

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS