selenium获取html源代码

# 执行js得到整个HTML

　　　　html = driver.execute_script("return document.documentElement.outerHTML")

获得整个文档的HTML

　　　　html = driver.find_element_by_xpath("//*").get_attribute("outerHTML")
　　　　# 不要用 driver.page_source，那样得到的页面源码不标准

获取单个元素具体的HTML源文件

　　　　webElement.getAttribute("outerHTML")

获取元素的所有属性

Object[] attr = ((JavascriptExecutor)seleniumdriver).executeScript("return arguments[0].attributes);", webElement);

String source=driver.findElement(By.xpath("/html/body/script[6]")).getAttribute("innerHTML");

分隔的方法

If we have this:

<a href="#" class="ui-dialog-titlebar-close ui-corner-all" role="button"
style="position: absolute; border-radius: 0px 0px 4px 4px;">
<span class="ui-icon ui-icon-closethick">close</span></a>

and we need to get all attributes of "a" which will be this:

href="#" class="ui-dialog-titlebar-close ui-corner-all" role="button"
    style="position: absolute; border-radius: 0px 0px 4px 4px;"

We can use this code:

webElement.getAttribute("outerHTML").split(">")[0]

where webElement is "a".

Or more precisely:

String s = we.getAttribute("outerHTML");

s = s.substring(2, s.indexOf(">"));

相关阅读:
python zip()函数转置表，操作列
python 集合无序非重容器的操作
前行
工作读书放松：做其他事情 1.运动（如焦），2.闭眼睡觉休息（如蔡），3.选择读其他书
心在哪时间在哪路在哪
vi全部替换
选择
知识代码算法
code algorithm
绕过CND查找真是IP地址

原文地址：https://www.cnblogs.com/hushaojun/p/5985673.html