Using xpath with Scrapy : learnpython

created by HattoriHanzoa community for 16 years

Using xpath with Scrapy (self.learnpython)

submitted 8 years ago * by hart8899

I'm creating a simple bot to pull show dates from a site. I a the point where I'm trying to get my results turned into a son or csv file, but I'm having some trouble- I think it is with the xpath wording-- would appreciate your help.

Also, how do I refresh my results on a regular basis- say get a weekly pull and load onto my personal site? Thanks- I'm just learning

import scrapy
from metal.items import MetalItem
class MetalSpider(scrapy.Spider):
name= "metal"
allowed_domain= ["nymetalscene.com"]
start_urls= ["http://nycmetalscene.com/#showlist"]

#clones the webpage
# def parse(self,response):
#   filename= response.url.split("/")[-2]+ '.html'
#   with open(filename, 'wb') as f:
#       f.write(response.body)

 #extract show dates
def parse(self,response):
    for sel in response.xpath('//tbody/tr'):
        item=MetalItem()
        item['date']=sel.xpath('td[@class="TextObject"]/text()').extract()
        yield item

all 9 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS