python 第二周（第九天）我的python成长记一个月搞定python数据挖掘！(16) -scrapy框架

scrapy 框架

response的解析

>>> response.css('title::text').extract()
['Quotes to Scrape']

There are two things to note here:
　　(1)one is that we’ve added ::text to the CSS query, to mean we want to select only the text elements directly inside <title> element. If we don’t specify ::text, we’d get the full title element, including its tags:　　
　　(2)the other thing is that the result of calling .extract() is a list, because we’re dealing with an instance of SelectorList. When you know you just want the first result, as in this case, you can do:
When you know you just want the first result, as in this case, you can do:

>>> response.css('title::text').extract_first()
'Quotes to Scrape'

Besides the extract() and extract_first() methods, you can also use the re() method to extract using regular expressions:

>>> response.css('title::text').re(r'Quotes.*')
['Quotes to Scrape']
>>> response.css('title::text').re(r'Qw+')
['Quotes']
>>> response.css('title::text').re(r'(w+) to (w+)')
['Quotes', 'Scrape']

相关阅读:
java 代码添加控件修改位置 View
获取整个Activity的layout
线程加锁同步
应用内悬浮按钮可吸附展开有动画 mini播放器
svg 动画
动画之二：属性动画 Property Animation
ButterKnife 免去findviewby的麻烦
ImageView 控件的宽高随图片变化
python pip使用国内镜像安装第三方库：命令行或PyCharm
pycharm安装pika提示CondaHTTPError: HTTP 000 CONNECTION FAILED for url <https://repo.anaconda.com>

原文地址：https://www.cnblogs.com/yugengde/p/7270696.html

python 第二周（第九天） 我的python成长记 一个月搞定python数据挖掘！(16) -scrapy框架

python 第二周（第九天）我的python成长记一个月搞定python数据挖掘！(16) -scrapy框架