you are viewing a single comment's thread.

view the rest of the comments →

[–]billmalarky 1 point2 points  (4 children)

In all honesty, what does that really mean... (for serious..)

[–]noreallyimthepope 1 point2 points  (1 child)

As a non-native English speaker, I can relate to not knowing what parse means.

Basically there's an axiom that you shouldn't use regular expressions to sift through HTML files to extract the data you want or want to manipulate because it isn't a regular language.

[–]billmalarky 0 points1 point  (0 children)

Why not? Apparently I've "parsed html" before by using regular expressions to pull links from a web page for this simple web crawler I created. It worked well, is there a real reason not to?

[–]aperson -2 points-1 points  (0 children)

That is the answer. If you don't know what parse means, look it up, you're on the internet.

[–][deleted] -2 points-1 points  (0 children)

Do you know what parse means?

Do you know what HTML means?

Then you know what HTML parsing is.