• 25、Python之禅


    要求:
    爬取网页你好,蜘蛛侠!中的Python之禅中英文版本,并且打印。
     
    目的:
    练习使用selenium爬取动态网页的信息。
    练习selenium与BeautifulSoup的搭配使用。
     
     
     
    方法一: 用selenium
     
     1 from selenium import webdriver
     2 import time
     3 
     4 driver = webdriver.Chrome()
     5 
     6 driver.get('https://localprod.pandateacher.com/python-manuscript/hello-spiderman/')
     7 time.sleep(2)
     8 
     9 button = driver.find_element_by_class_name('sub')
    10 button.click()
    11 time.sleep(1)
    12 
    13 python_zens = driver.find_elements_by_class_name('content')
    14 
    15 for python_zen in python_zens:
    16     print(python_zen.find_element_by_tag_name('h1').text,end='
    
    ')
    17     print(python_zen.find_element_by_tag_name('p').text,end='
    
    ')
    18 
    19 driver.close()
     1 The Zen of Python
     2 
     3 Beautiful is better than ugly.
     4 Explicit is better than implicit.
     5 Simple is better than complex.
     6 Complex is better than complicated.
     7 Flat is better than nested.
     8 Sparse is better than dense.
     9 Readability counts.
    10 Special cases aren't special enough to break the rules.
    11 Although practicality beats purity.
    12 Errors should never pass silently.
    13 Unless explicitly silenced.
    14 In the face of ambiguity, refuse the temptation to guess.
    15 There should be one-- and preferably only one --obvious way to do it.
    16 Although that way may not be obvious at first unless you're Dutch.
    17 Now is better than never.
    18 Although never is often better than *right* now.
    19 If the implementation is hard to explain, it's a bad idea.
    20 If the implementation is easy to explain, it may be a good idea.
    21 Namespaces are one honking great idea -- let's do more of those!
    22 
    23 Python之禅
    24 
    25 优美胜于丑陋
    26 明了胜于晦涩
    27 简洁胜于复杂
    28 复杂胜于凌乱
    29 扁平胜于嵌套
    30 间隔胜于紧凑
    31 可读性很重要
    32 即便假借特例的实用性之名,也不可违背这些规则
    33 不要包容所有错误,除非你确定需要这样做
    34 当存在多种可能,不要尝试去猜测
    35 而是尽量找一种,最好是唯一一种明显的解决方案
    36 虽然这并不容易,因为你不是 Python 之父
    37 做也许好过不做,但不假思索就动手还不如不做
    38 如果你无法向人描述你的方案,那肯定不是一个好方案;反之亦然
    39 命名空间是一种绝妙的理念,我们应当多加利用
     
    方法二:用selenium 和 BeautifulSoup
     
     1 from selenium import webdriver
     2 from bs4 import BeautifulSoup
     3 import time
     4 
     5 driver = webdriver.Chrome()
     6 
     7 driver.get('https://localprod.pandateacher.com/python-manuscript/hello-spiderman/')
     8 time.sleep(2)
     9 
    10 button = driver.find_element_by_class_name('sub')
    11 button.click()
    12 time.sleep(1)
    13 
    14 pagesource = driver.page_source
    15 
    16 soup = BeautifulSoup(pagesource,'html.parser')
    17 items = soup.find_all(class_='content')
    18 for item in items:
    19     print('
    	'+item.find('h1').text)
    20     print(item.find('p').text)
    21 
    22 driver.close()
     1         The Zen of Python
     2 
     3             Beautiful is better than ugly.
     4             Explicit is better than implicit.
     5             Simple is better than complex.
     6             Complex is better than complicated.
     7             Flat is better than nested.
     8             Sparse is better than dense.
     9             Readability counts.
    10             Special cases aren't special enough to break the rules.
    11             Although practicality beats purity.
    12             Errors should never pass silently.
    13             Unless explicitly silenced.
    14             In the face of ambiguity, refuse the temptation to guess.
    15             There should be one-- and preferably only one --obvious way to do it.
    16             Although that way may not be obvious at first unless you're Dutch.
    17             Now is better than never.
    18             Although never is often better than *right* now.
    19             If the implementation is hard to explain, it's a bad idea.
    20             If the implementation is easy to explain, it may be a good idea.
    21             Namespaces are one honking great idea -- let's do more of those!
    22 
    23         Python之禅
    24 
    25             优美胜于丑陋
    26             明了胜于晦涩
    27             简洁胜于复杂
    28             复杂胜于凌乱
    29             扁平胜于嵌套
    30             间隔胜于紧凑
    31             可读性很重要
    32             即便假借特例的实用性之名,也不可违背这些规则
    33             不要包容所有错误,除非你确定需要这样做
    34             当存在多种可能,不要尝试去猜测
    35             而是尽量找一种,最好是唯一一种明显的解决方案
    36             虽然这并不容易,因为你不是 Python 之父
    37             做也许好过不做,但不假思索就动手还不如不做
    38             如果你无法向人描述你的方案,那肯定不是一个好方案;反之亦然
    39             命名空间是一种绝妙的理念,我们应当多加利用

     

  • 相关阅读:
    云服务器Ubuntu更改默认python版本
    例题4-1-3-古老的密码、刽子手的游戏,救济金发放
    Github pages和Hexo搭建自己的博客
    Python字典基本操作介绍
    python win32api win32gui win32con PyUserInput实现自动化脚本
    spring--分类索引
    目录-java并发基础知识
    【转】集群单点故障的应对策略
    CnBlogs自定义博客样式
    读书笔记——《redis入门指南(第2版)》第七章 持久化
  • 原文地址:https://www.cnblogs.com/www1707/p/10850638.html
Copyright © 2020-2023  润新知