ScrapyMySQL爬取链家网中北京地区租房信息


此爬虫主要基于Scrapy MySQL爬取链家网中,北京地区的租房信息。 Python版本为Python3.6
资源截图
代码片段和文件信息
# -*- coding=utf-8 -*-
import MySQLdb
account = input(‘请输入MySQL用户名
 >‘)
password = input(‘请输入MySQL密码
 >‘)
table = input(‘请输入数据库名称
 >‘)
# 连接数据库
db = MySQLdb.connect(‘localhost‘account str(password) table)
# 获取游标
cursor = db.cursor()
sql = “““CREATE TABLE lianjia(
    title VARCHAR(30)
    LOCATION VARCHAR(100)
    ZONE VARCHAR(10)
    METERS VARCHAR(10)
    DIRECTION VARCHAR(10)
    MONEY VARCHAR(10)

“““
# 执行sql语句
cursor.execute(sql)
# 关闭数据库
db.close()

 属性            大小     日期    时间   名称
----------- ---------  ---------- -----  ----
     目录           0  2018-01-08 04:02  LianjiaSpider-master
     文件          65  2018-01-08 04:02  LianjiaSpider-master.gitattributes
     文件        1063  2018-01-08 04:02  LianjiaSpider-masterLICENSE
     文件         435  2018-01-08 04:02  LianjiaSpider-masterREADME.md
     文件         543  2018-01-08 04:02  LianjiaSpider-masterdatabase.py
     目录           0  2018-01-08 04:02  LianjiaSpider-masterscreenshots
     文件     1452678  2018-01-08 04:02  LianjiaSpider-masterscreenshotsmysql.png
     目录           0  2018-01-08 04:02  LianjiaSpider-mastersrc
     目录           0  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider
     文件           0  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__init__.py
     目录           0  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__
     文件         152  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__\__init__.cpython-36.pyc
     文件         456  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__items.cpython-36.pyc
     文件        1590  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__middlewares.cpython-36.pyc
     文件        1164  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__pipelines.cpython-36.pyc
     文件         795  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__proxy_middleware.cpython-36.pyc
     文件         688  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__settings.cpython-36.pyc
     文件        2857  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpider\__pycache__useragent_middleware.cpython-36.pyc
     文件         656  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpideritems.py
     文件        1911  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpidermiddlewares.py
     文件         917  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderpipelines.py
     文件         392  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderproxy_middleware.py
     文件        3704  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpidersettings.py
     目录           0  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders
     文件        1705  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspidersLianjiaSpider.py
     文件         161  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__init__.py
     目录           0  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__pycache__
     文件        1480  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__pycache__LianjiaSpider.cpython-36.pyc
     文件         160  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__pycache__\__init__.cpython-36.pyc
     文件         173  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__pycache__downloader_middleware.cpython-36.pyc
     文件         932  2018-01-08 04:02  LianjiaSpider-mastersrcLianjiaSpiderspiders\__pycache__middlewares.cpython-36.pyc
............此处省略4个文件信息

版权声明:本文内容由互联网用户自发贡献,该文观点仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容, 请发送邮件举报,一经查实,本站将立刻删除。

发表评论

评论列表(条)