• Python 3, Unicode, UTF-8, and Encoding


    # The u prefix is optional in Python 3: every str is already Unicode.
    text = u'你好,今天天气不错'
    text
    print(text)
    
    # \uXXXX escapes in an ordinary string literal are decoded by the parser.
    text = '\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    text
    print(text)
    
    # The same literal with the redundant u prefix.
    text = u'\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    text
    print(text)
    
    # Doubled backslashes keep the escapes as literal text: backslash, u, four hex digits.
    text = '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    text
    print(text)
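    # Round-trip through bytes so unicode_escape can decode the literal escapes.
    # This is safe here only because the text is pure ASCII.
    text = text.encode('utf-8').decode('unicode_escape')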
    text
    print(text)
    
    # Mixed text: literal escapes followed by characters that are already decoded.
    text = '\\u4f60\\u597d\\uff0c今天天气不错'
    text
    print(text)
    import re
    # Decode each literal \uXXXX match on its own, leaving the other characters untouched.
    text = re.sub(r'(\\u[0-9a-fA-F]{4})', lambda matched: matched.group(1).encode('utf-8').decode('unicode_escape'), text)
    text
    print(text)

The code above was run in the interactive interpreter, with the following results:

    >>> text = u'你好,今天天气不错'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = u'\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    >>> text
    '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    >>> print(text)
    \u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519
    >>> text = text.encode('utf-8').decode('unicode_escape')
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\\u4f60\\u597d\\uff0c今天天气不错'
    >>> text
    '\\u4f60\\u597d\\uff0c今天天气不错'
    >>> print(text)
    \u4f60\u597d\uff0c今天天气不错
    >>> import re
    >>> text = re.sub(r'(\\u[0-9a-fA-F]{4})', lambda matched: matched.group(1).encode('utf-8').decode('unicode_escape'), text)
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错

  • Original article: https://www.cnblogs.com/jacen789/p/9401877.html