• Jericho 3.3 released (HTML parser)


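    Jericho 3.3 contains important bug fixes and improvements. Jericho HTML Parser is a Java library for analysing and manipulating parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions.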

    Release notes:

    3.3   (2012-10-31)
      - Bug Fixes:
        - [3581664] CharacterReference.decode() does not decode entities
          containing digits - ½ ¼ ¾ ¹ ² ³ ∴
        - [3311286] SourceCompactor does not respect TEXTAREA
        - [3519131] Renderer output incorrect when constructed with an
          Element object.
        - [3538829] Renderer output of font decoration on block boundaries
          incorrect.
        - Segment.getAllStartTags(name) and Segment.getFirstElement(name)
          do not work if the argument contains upper case characters.
        - The end delimiter of a common server tag inside an escaped server
          tag is falsely recognised as the end delimiter of the escaped tag.
      - CHANGES THAT COULD AFFECT THE BEHAVIOUR OF EXISTING PROGRAMS:
        - [3427073] Segment.getStyleURISegments() now includes style element
          content as well as style attribute values.
        - [3427927] Segment.getURIAttributes() now includes the archive
          attributes of object and applet elements.
        - Comments no longer recognised inside script elements during full
          sequential parse. Previously they were recognised for compatibility
          with major browsers but modern browser behaviour has changed.
        - Changed the log level of all parsing errors from INFO to ERROR, and
          the log level of the Source.fullSequentialParse() advisory message
          from WARN to INFO. The previous levels gave the advisory message a
          higher severity than the parsing errors, preventing logging systems
          from hiding the advisory message while showing parsing errors.
          Character encoding warnings remain unchanged at WARN level.
        - Changed the behaviour of the Renderer.renderHyperlinkURL(StartTag)
          method so that relative URLs are not rendered.
        - Changed the behaviour of the Renderer so that hyperlink element
          content is not rendered if it is the same as the hyperlink URL,
          ignoring any http:// prefix or / suffix.
        - EndTag.tidy() now removes whitespace before the closing bracket.
      - Added Source(File) constructor.
      - Added OutputDocument.getSegment() method.
      - Added OutputDocument.remove(int begin, int end) method.
      - Added Renderer.setHRLineLength() method.
      - Added RenderToText.jsp webapp sample.
      - Added Segment.getRowColumnVector() method.
      - Encoding detection now ignores common encodings specified in meta tags
        that have a code unit size incompatible with the preliminary encoding.
      - Upgraded to the following logger APIs:
        slf4j-api-1.7.2, log4j-1.2.17
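
The Renderer change for hyperlinks is the one most likely to alter existing text output, so here is a rough sketch of the comparison rule as the release notes describe it. This is not Jericho's source code: the class `HyperlinkTextRule` and its methods are hypothetical names made up for illustration, and details such as trimming whitespace and matching the `http://` prefix case-sensitively are assumptions.

```java
/** Illustrative only: approximates the rule in the release notes, not Jericho's implementation. */
final class HyperlinkTextRule {
    private HyperlinkTextRule() {}

    /** Drops a leading "http://" and one trailing "/" so the two strings can be compared. */
    static String normalise(String s) {
        String t = s.trim(); // assumption: surrounding whitespace is not significant
        if (t.startsWith("http://")) {
            t = t.substring("http://".length());
        }
        if (t.endsWith("/")) {
            t = t.substring(0, t.length() - 1);
        }
        return t;
    }

    /** True when the link text merely repeats the URL, in which case only one copy needs rendering. */
    static boolean contentDuplicatesUrl(String content, String url) {
        return normalise(content).equals(normalise(url));
    }

    public static void main(String[] args) {
        // Link text that repeats the URL apart from the prefix and suffix.
        System.out.println(contentDuplicatesUrl("example.com", "http://example.com/"));
        // Link text that differs from the URL.
        System.out.println(contentDuplicatesUrl("Example site", "http://example.com/"));
    }
}
```

Under these assumptions, a link such as `<a href="http://example.com/">example.com</a>` would count as a duplicate, while a link whose text is a descriptive title would not.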
  • Original article: https://www.cnblogs.com/shihao/p/2749138.html