• 1410. HTML Entity Parser


    HTML entity parser is the parser that takes HTML code as input and replace all the entities of the special characters by the characters itself.

    The special characters and their entities for HTML are:

    • Quotation Mark: the entity is " and symbol character is ".
    • Single Quote Mark: the entity is ' and symbol character is '.
    • Ampersand: the entity is & and symbol character is &.
    • Greater Than Sign: the entity is > and symbol character is >.
    • Less Than Sign: the entity is &lt; and symbol character is <.
    • Slash: the entity is &frasl; and symbol character is /.

    Given the input text string to the HTML parser, you have to implement the entity parser.

    Return the text after replacing the entities by the special characters.

    Example 1:

    Input: text = "&amp; is an HTML entity but &ambassador; is not."
    Output: "& is an HTML entity but &ambassador; is not."
    Explanation: The parser will replace the &amp; entity by &
    

    Example 2:

    Input: text = "and I quote: &quot;...&quot;"
    Output: "and I quote: "...""
    

    Example 3:

    Input: text = "Stay home! Practice on Leetcode :)"
    Output: "Stay home! Practice on Leetcode :)"
    

    Example 4:

    Input: text = "x &gt; y &amp;&amp; x &lt; y is always false"
    Output: "x > y && x < y is always false"
    

    Example 5:

    Input: text = "leetcode.com&frasl;problemset&frasl;all"
    Output: "leetcode.com/problemset/all"
    

    Constraints:

    • 1 <= text.length <= 10^5
    • The string may contain any possible characters out of all the 256 ASCII characters.
       public String entityParser(String text) {
            StringBuilder sb=new StringBuilder(), cur=new StringBuilder();
            Map<String, String> dic=new HashMap<>();
            dic.put("&quot;", """);
            dic.put("&apos;", "'");
            dic.put("&amp;", "&");
            dic.put("&gt;", ">");
            dic.put("&lt;", "<");
            dic.put("&frasl;", "/");
            char[] txt=text.toCharArray();
            for(int i=0;i<txt.length;i++) {
                if(txt[i]=='&') {
                    sb.append(cur);
                    cur.setLength(0);
                    cur.append("&");
                }
                else if(txt[i]==';') {
                    cur.append(";");
                    String s=cur.toString();
                    if(dic.containsKey(s)) sb.append(dic.get(s));
                    else sb.append(cur);
                    cur.setLength(0);
                }
                else cur.append(txt[i]);
            }
            return sb.append(cur).toString();
        }
  • 相关阅读:
    1. 二分查找
    Filezilla使用
    正则表达式regex
    TCP的三次握手和四次挥手
    Pycharm 更换源
    寒假学习进度(十四)
    寒假学习进度(十)
    寒假学习进度(九)
    寒假学习进度(八)
    寒假学习进度(七)
  • 原文地址:https://www.cnblogs.com/wentiliangkaihua/p/13097281.html
Copyright © 2020-2023  润新知