• String 字符串中含有 Unicode 编码时,转为UTF-8


    1、单纯的Unicode 转码

    String a = "u53efu4ee5u6ce8u518c";
    a = new String(a.getBytes("UTF-16"),"Unicode");

    2、String 字符串中含有 Unicode 编码时,转为UTF-8

    public static String decodeUnicode(String theString) {    
            char aChar;    
            int len = theString.length();    
            StringBuffer outBuffer = new StringBuffer(len);    
            for (int x = 0; x < len;) {    
                aChar = theString.charAt(x++);    
                if (aChar == '\') {    
                    aChar = theString.charAt(x++);    
                    if (aChar == 'u') {    
                        // Read the xxxx    
                        int value = 0;    
                        for (int i = 0; i < 4; i++) {    
                            aChar = theString.charAt(x++);    
                            switch (aChar) {    
                            case '0':    
                            case '1':    
                            case '2':    
                            case '3':    
                            case '4':    
                            case '5':    
                            case '6':    
                            case '7':    
                            case '8':    
                            case '9':    
                                value = (value << 4) + aChar - '0';    
                                break;    
                            case 'a':    
                            case 'b':    
                            case 'c':    
                            case 'd':    
                            case 'e':    
                            case 'f':    
                                value = (value << 4) + 10 + aChar - 'a';    
                                break;    
                            case 'A':    
                            case 'B':    
                            case 'C':    
                            case 'D':    
                            case 'E':    
                            case 'F':    
                                value = (value << 4) + 10 + aChar - 'A';    
                                break;    
                            default:    
                                throw new IllegalArgumentException(    
                                        "Malformed   \uxxxx   encoding.");    
                            }    
            
                        }    
                        outBuffer.append((char) value);    
                    } else {    
                        if (aChar == 't')    
                            aChar = '	';    
                        else if (aChar == 'r')    
                            aChar = '
    ';    
                        else if (aChar == 'n')    
                            aChar = '
    ';    
                        else if (aChar == 'f')    
                            aChar = 'f';    
                        outBuffer.append(aChar);    
                    }    
                } else    
                    outBuffer.append(aChar);    
            }    
            return outBuffer.toString();    
        }
  • 相关阅读:
    Solr 删除数据的几种方式
    velocity 随笔
    LOG4J.PROPERTIES配置详解(转载)
    转 如何使用velocity模板引擎开发网站
    通过pinyin4j将汉字转换为全拼 和 拼音首字母
    去除数组中的重复数据
    java 转义字符
    多重背包(学习笔记)
    Team Queue
    [HAOI2008]糖果传递
  • 原文地址:https://www.cnblogs.com/lemon-flm/p/9531250.html
Copyright © 2020-2023  润新知