• The Requests module


    1. The headers keyword argument

    import requests
    from urllib.parse import urlencode
    
    keyword = input('>>:').strip()
    res = urlencode({'wd': keyword}, encoding='utf-8')
    url = 'https://www.baidu.com/s?' + res
    print(url)
    response = requests.get(url,
                            headers={
                                'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/61.0.3163.79 Safari/537.36'
                            },
                            )
    print(response.status_code)
    with open('a.html','w',encoding='utf-8') as f:
        f.write(response.text)
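
To see what urlencode contributes to the URL in the snippet above, here is a small offline sketch. The helper name build_search_url and the sample keyword are illustrative assumptions, not part of the original code.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_search_url(keyword):
    """Build the search URL the same way the snippet above does."""
    query = urlencode({'wd': keyword}, encoding='utf-8')
    return 'https://www.baidu.com/s?' + query


url = build_search_url('python requests')
print(url)  # the space in the keyword is encoded as '+'

# Decoding the query string recovers the original keyword.
print(parse_qs(urlsplit(url).query)['wd'][0])
```

Non-ASCII keywords are percent-encoded as UTF-8 bytes, which is why the encoding argument matters for Chinese search terms.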

    2. The params argument: a cleaner way to do the same thing, since requests builds and encodes the query string itself.

    import requests
    
    keyword = input('>>:').strip()
    response = requests.get('https://www.baidu.com/s',
                            params={
                                'wd':keyword,
                                'pn':20
                            },
                            headers={
                                'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/61.0.3163.79 Safari/537.36'
                            },
                            )
    print(response.status_code)
    with open('a.html','w',encoding='utf-8') as f:
        f.write(response.text)
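
One way to check what params does to the URL, without sending anything over the network, is to prepare the request and inspect it. This is only a sketch: the keyword is a made-up sample value, and it assumes the requests package is installed.

```python
import requests

# Build the request object but do not send it, so no network access is needed.
prepared = requests.Request(
    'GET',
    'https://www.baidu.com/s',
    params={'wd': 'python requests', 'pn': 20},
).prepare()

# The params dict has been URL-encoded and appended as the query string.
print(prepared.url)
```

Printing prepared.url shows that the manual urlencode step from section 1 is no longer needed.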

    3. User-Agent in headers: this site only returns the page content when the request carries a User-Agent header.

    import requests
    
    response = requests.get('https://www.zhihu.com/explore',
                            headers={
                                'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36',
                            },
                            )
    print(response.status_code)
    print(response.text)
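
A likely reason the header matters is that, by default, requests identifies itself as python-requests, which some sites reject. The sketch below shows the default value and one way to set a browser-like User-Agent once on a Session so every request reuses it. The use of a Session is an addition for illustration, not something the original post does, and nothing is sent over the network.

```python
import requests

BROWSER_UA = ('Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 '
              '(KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36')

session = requests.Session()

# The value requests sends when no User-Agent is supplied.
default_ua = session.headers['User-Agent']
print(default_ua)

# Override it once; every request made through this session inherits it.
session.headers.update({'User-Agent': BROWSER_UA})

# Prepare (but do not send) a request to confirm the header is applied.
prepared = session.prepare_request(
    requests.Request('GET', 'https://www.zhihu.com/explore'))
print(prepared.headers['User-Agent'])
```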

    4. Cookies (sent as part of the request headers): the content is only returned when the cookie is included.

    import requests

    response = requests.get(
        url='https://github.com/settings/emails',
        headers={
            'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36',
        },
        cookies={
            "k1":"v1",
        },
    )
    print('378533872@qq.com' in response.text)
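
The cookies argument ends up as an ordinary Cookie request header. A minimal offline sketch, reusing the placeholder pair k1/v1 from the example above (a real login would need the cookies from an authenticated browser session), and assuming the requests package is installed:

```python
import requests

# Prepare the request without sending it.
prepared = requests.Request(
    'GET',
    'https://github.com/settings/emails',
    cookies={'k1': 'v1'},  # placeholder values, not a real login cookie
).prepare()

# The dict has been serialized into a single Cookie header.
print(prepared.headers.get('Cookie'))
```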

    5. allow_redirects: redirects are followed by default; set it to False to stop requests from following them.

    import requests
    response=requests.get(
        url='https://github.com/settings/emails',
        headers={
            'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36',
        },
        cookies={
            "k1":"v1",
        },
        allow_redirects=False,
    )
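
The effect of allow_redirects is easiest to see against a server whose behavior is known. The sketch below starts a throwaway local HTTP server (the RedirectHandler class and the /old and /new paths are hypothetical, invented for this demo) and fetches the same URL with and without redirect following.

```python
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

import requests


class RedirectHandler(BaseHTTPRequestHandler):
    """Hypothetical local handler: /old redirects to /new."""

    def do_GET(self):
        if self.path == '/old':
            self.send_response(302)
            self.send_header('Location', '/new')
            self.send_header('Content-Length', '0')
            self.end_headers()
        else:
            body = b'final page'
            self.send_response(200)
            self.send_header('Content-Length', str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the demo output quiet


# Port 0 asks the OS for any free port.
server = HTTPServer(('127.0.0.1', 0), RedirectHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
url = 'http://127.0.0.1:%d/old' % server.server_port

session = requests.Session()
session.trust_env = False  # ignore proxy settings from the environment

# Default behavior: the 302 is followed and recorded in response.history.
followed = session.get(url)
print(followed.status_code, [r.status_code for r in followed.history])

# With allow_redirects=False the 302 itself is returned.
not_followed = session.get(url, allow_redirects=False)
print(not_followed.status_code, not_followed.headers['Location'])

server.shutdown()
server.server_close()
```

With redirects disabled, the Location header tells you where the server wanted to send the client, which is useful for detecting a redirect to a login page.
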
  • Original article: https://www.cnblogs.com/zhongbokun/p/8296980.html