• selumium 中 xpath获取文本、属性正确写法


    报错“The result of the xpath expression is: [object Attr]. It should be an element”

    yutube爬虫动态加载,需要用到selenium-webdriver,使用过程中,首先使用

    find_elements_by_xpath进行批量标签的定位选取,之后

    使用find_element_by_xpath精细筛选选标签的时候出现上面错误提示,

    原因是这个webdriver的定位方法和浏览器xpath不一样,不能直接定位到标签的属性

    需要首先定位到webelement,之后get到属性

    正确

                try:
                    temp['host_url'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a').get_attribute('href')
                except Exception as e:
                    print(e)
                try:
                    temp['show_url'] = node.find_element_by_xpath('./div/ytd-thumbnail/a').get_attribute('href')
                except Exception as e:
                    print(e)
                try:
                    temp['title'] = node.find_element_by_xpath('./div/div/div[1]/div/h3/a').get_attribute('title')
                except Exception as e:
                    print(e)
                try:
                    temp['user'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a').text
                except Exception as e:

    错误:

    try:
                    temp['host_url'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a/@href')
                except Exception as e:
                    print(e)
                try:
                    temp['show_url'] = node.find_element_by_xpath('./div/ytd-thumbnail/a/@href')
                except Exception as e:
                    print(e)
                try:
                    temp['title'] = node.find_element_by_xpath('./div/div/div[1]/div/h3/a/@title')
                except Exception as e:
                    print(e)
                try:
                    temp['user'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a/text()')
                except Exception as e:
                    print(e)
  • 相关阅读:
    PAT乙级:1083 是否存在相等的差 (20分)
    PAT乙级:1082 射击比赛 (20分)
    PAT乙级:1084 外观数列 (20分)
    PAT乙级:1070 结绳 (25分)
    PAT乙级:1064 朋友数 (20分)
    JavaScript实现拖放效果
    Lesson2 Thirteen equals one
    PAT乙级:1076 Wifi密码 (15分)
    条件查询
    根据配置生成创始块文件和通道文件
  • 原文地址:https://www.cnblogs.com/chenxi188/p/11640729.html
Copyright © 2020-2023  润新知