HTML解析器BeautifulSoup - 润新知

HTML解析器BeautifulSoup
BeautifulSoup是Python的一个库，可解析用urllib2抓取下来的HTML

1.Beautiful Soup 安装

可以利用 pip 来安装，在Python程序中导入
```
pip install beautifulsoup4
```
2.在Python中导入
```
from BeautifulSoup import BeautifulSoup
```
3.创建 beautifulsoup 对象
```
soup = BeautifulSoup(html)
```
4.beautifulsoup 的使用方法

拿到第一个标签的内容：.title()

想要获取的内容为utf-8格式需要使用.decode方法
```
print str(soup.title).decode('utf-8')
```
获取某标签的某属性值：find_all( name , attrs , recursive , text , **kwargs )
```
p_detail = soup.find("p")  
```
相关阅读:
感觉每天打开自己的博客园, 想编程的心情就多了起来~~~
算法图解相关代码整理
 github cli
What's WebFlux ? And how to use it ? 一股有咖喱味的WebFlux简介
 style
gradle 1
gradle打包可运行jar
外面下着雨
 天晴朗看花儿多多开放
 Full Stack Reactive with React and Spring WebFlux
原文地址：https://www.cnblogs.com/corolcorona/p/6668695.html

Copyright © 2020-2023 润新知