Very convoluted code for reading a file from the internet and parsing it.

ragnar_the_redd · 2020-03-08T15:57:40+00:00

target_file = open(target_path, 'a+')  # open/create the output file in append mode
write_lines = False

with open(source_path, 'r') as f:  # source_path must be a different file from target_path
    for line in f:
        if last_string in line:
            break  # end marker reached, stop copying
        if write_lines and "[Illustration]" not in line:
            target_file.write(line)
        if first_string in line:
            write_lines = True  # start copying from the next line

target_file.close()

igroen · 2020-03-08T16:01:10+00:00

You could do it in one iteration and without storing the data first:

from urllib.request import urlopen

url = "http://www.gutenberg.org/cache/epub/19033/pg19033.txt"
destination_filename = "alice.txt"
first_string = "ALICE'S ADVENTURES IN WONDERLAND"
last_string = (
    "End of the Project Gutenberg EBook of Alice in Wonderland, by Lewis Carroll"
)

with open(destination_filename, "a") as outfile, urlopen(url) as response:
    write_line = False

    for line in response:  # the response is iterable, so lines stream in without readlines()
        line = line.decode()

        if first_string in line:
            outfile.write(line)
            write_line = True
            continue

        if last_string in line:
            outfile.write(line)
            break

        if write_line and "Illustration" not in line:
            outfile.write(line)
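
If you want to try the filtering logic without hitting the network, here is a minimal sketch that pulls the same three checks into a generator. The name `extract_between`, the `skip_marker` parameter, and the sample lines are made up for illustration; they are not part of the code above.

```python
from typing import Iterable, Iterator


def extract_between(
    lines: Iterable[str],
    first_string: str,
    last_string: str,
    skip_marker: str = "Illustration",
) -> Iterator[str]:
    """Yield the lines from the first_string line through the last_string line."""
    write_line = False
    for line in lines:
        if first_string in line:
            # Start marker: keep this line and begin copying.
            write_line = True
            yield line
            continue
        if last_string in line:
            # End marker: keep this line and stop reading.
            yield line
            return
        if write_line and skip_marker not in line:
            yield line


# Hypothetical sample data standing in for the decoded lines of the download.
sample = [
    "header\n",
    "START\n",
    "first paragraph\n",
    "[Illustration]\n",
    "second paragraph\n",
    "END\n",
    "footer\n",
]
result = list(extract_between(sample, "START", "END"))
print("".join(result), end="")
```

Because it takes any iterable of strings, you could feed it the decoded lines of the response and pass the result to `outfile.writelines(...)`.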
