Python urllib2爬虫豆瓣小说名称和评分

#-*- coding:utf-8 -*-
import urllib2
import re

url = 'https://book.douban.com/tag/%E5%B0%8F%E8%AF%B4'
request = urllib2.Request(url)
urlopen = urllib2.urlopen(request)
content = urlopen.read()
reg_0 = re.findall(r'title.+"s*on', content)
reg_1 = re.findall(r'rating_nums">.*<', content)
for title,score in zip(reg_0,reg_1):
    title = re.split(r'"',title)
    score = re.split(r'>|<',score)
    print title[1],score[1]



#<span class="rating_nums">8.6</span>

相关阅读:
创建用户自定义函数 SQL
关于“该列没有包含在聚合函数或 GROUP BY 子句中”
转Oracle性能参数—经典常用
The server committed a protocol violation. Section=ResponseHeader Detail=CR must be followed by LF 错误
js定时刷新
用户获取mac地址的方法
聚集索引和非聚集索引的区别
WCF启动报错：“进程不具有此命名空间的访问权限”的解决方法
利用js文件加载js文件的方法
C#下载的几种方法

原文地址：https://www.cnblogs.com/lovephysics/p/7262282.html