python scrapy

import requests
res=requests.get('http://www.baidu.com')
res.encoding='utf-8'
print(res.text)

from bs4  import BeatifulSoup
html = """
... <html><head>head title</head><p>history</p></html>"""
soup=BeautifulSoup(html)

print(soup.prettify())
print(soup.select('p'))
print(soup.select('p')[0])
print(soup.select('p')[0].text)
print(soup.p)
print(soup.p.attr)

print(soup.find_all('p'))

print(soup.find_all(id='dwww'))

////////////////++++++++++////////
names = soup.find_all('td', class_="job")
re.findAll(">(.{2,5})</a>", names) //正则表达式匹配a链接中任意2到5个字符

soup re组合使用

////////////////++++++++++////////


links=soup.select('p')
for link in links:
　　print(link.text)

相关阅读:
RAC安装时,报The specified nodes are not clusterable 的解决方法
Unix sar 命令
Linux 修改 IP地址和网关
Oracle ASM 详解
RAC安装时需要执行4个脚本及意义
RAC 的一些概念性和原理性的知识
Oracle 10g RAC 启动与关闭
Oracle RAC 修改 IP 地址
Linux 时间同步配置
RAC安装时,报The specified nodes are not clusterable 的解决方法

原文地址：https://www.cnblogs.com/agang-php/p/9685584.html