scrapy的基本使用步骤(二)

一.创建项目

scrapy startproject example 命令创建example项目

1
2
3
4
5
6
7
8
(venv) ╭─[email protected] ~/PycharmProjects/python_reptiles/teach/scrapy/basic  
╰─➤ scrapy startproject example
New Scrapy project 'example', using template directory '/home/lzq/PycharmProjects/python_reptiles/venv/lib/python3.6/site-packages/scrapy/templates/project', created in:
/home/lzq/PycharmProjects/python_reptiles/teach/scrapy/basic/example

You can start your first spider with:
cd example
scrapy genspider example example.com

二.生成spider

到生成的example项目下,执行命令scrapy genspider dmoz_spider dmoz.org

命令解释:生成一个dmoz_spider域名为dmoz.org

1
2
3
4
5
6
7

(venv) ╭─[email protected] ~/PycharmProjects/python_reptiles/teach/scrapy/basic
╰─➤ cd example 2 ↵
(venv) ╭─[email protected] ~/PycharmProjects/python_reptiles/teach/scrapy/basic/example
╰─➤ scrapy genspider dmoz_spider dmoz.org
Created spider 'dmoz_spider' using template 'basic' in module:
example.spiders.dmoz_spider