DOCS removed .tld suffix for spider names for the sake of consistency.

Parent c2ced9a2
@@ -137,7 +137,7 @@ Finally, here's the spider code::
 class MininovaSpider(CrawlSpider):
-    name = 'mininova.org'
+    name = 'mininova'
     allowed_domains = ['mininova.org']
     start_urls = ['http://www.mininova.org/today']
     rules = [Rule(SgmlLinkExtractor(allow=['/tor/\d+']), 'parse_torrent')]
@@ -161,7 +161,7 @@ Run the spider to extract the data
Finally, we'll run the spider to crawl the site an output file
``scraped_data.json`` with the scraped data in JSON format::
-scrapy crawl mininova.org -o scraped_data.json -t json
+scrapy crawl mininova -o scraped_data.json -t json
This uses :ref:`feed exports <topics-feed-exports>` to generate the JSON file.
You can easily change the export format (XML or CSV, for example) or the
......
@@ -56,7 +56,7 @@ Settings attribute.
Example::
-scrapy crawl domain.com -s LOG_FILE=scrapy.log
+scrapy crawl myspider -s LOG_FILE=scrapy.log
2. Project settings module
--------------------------
......
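The rename treats the spider name as a plain identifier, separate from the domains the spider may crawl. A minimal sketch of that separation follows. The `Spider` base class and `find_spider` helper are hypothetical stand-ins written for this illustration, not Scrapy's real classes or its spider loader.

```python
# Hypothetical stand-ins for illustration only; this is not Scrapy's API.
class Spider:
    name = None
    allowed_domains = []


class MininovaSpider(Spider):
    name = 'mininova'                    # short identifier passed to "scrapy crawl"
    allowed_domains = ['mininova.org']   # the crawl domain keeps its .org suffix


def find_spider(name, spiders):
    """Return the spider class whose ``name`` matches; raise KeyError otherwise."""
    by_name = {cls.name: cls for cls in spiders}
    return by_name[name]


spider_cls = find_spider('mininova', [MininovaSpider])
```

Under these assumptions, looking up the old name `'mininova.org'` raises `KeyError`, which is why the documented `scrapy crawl` commands change together with the `name` attribute.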