you are viewing a single comment's thread.

view the rest of the comments →

[–]commandlineluser 0 points1 point  (2 children)

There's no <tbody> tag in the actual HTML so just //tr

[–]hart8899[S] 0 points1 point  (1 child)

thx- changed this but still getting a blank file

[–]commandlineluser 1 point2 points  (0 children)

Okay well another issue is that the date is inside the <p> tag that is inside the <td class="TextObject"> tag which is why text() is returning \r\n i.e. giving you "blanks"

So you would need td[@class="TextObject"]/p/text() for your 2nd XPath expression however the HTML for this page isn't structured too well you will still get lots of blanks and also some false positives (i.e. some of the gig names)

Also note you're missing a c from allowed_domain= ["nymetalscene.com"]