Python正则re.sub使用自定义替换方法

在使用正则替换时，有时候需要将匹配的结果做对应处理，便可以使用自定义替换方法。
re.sub的用法为：

re.sub('匹配表达式'，替换字符或替换函数，替换次数，其他参数)

例如，加入将字符串中的$开头的变量替换为上下文context中对应的值，可以操作如下。

import re
context = {'a': 1, 'b': 2, 'except': 3}
text = '$a + $b = $except'

def repl_func(matched):
    if matched:
        text = matched.group(1)
        return str(context.get(text))

result = re.sub('$(w+)', repl_func, text)
print(result)

打印结果为：

1 + 2 = 3

下面是一个用于替换字符串，字典，列表，元祖类型中特定变量的方法。

import re
import json


def parser(origin, *args, delimiter="$", **kwargs):  # 支持修改delimiter定界符

    patten = r'{}(?P<var>.+?)'.format(delimiter)

    def repl_func(matched):   # 自定义re.sub使用的替换方法
        var = matched.group('var')
        if var.isdigit():   # 如果是数字, 则从args中替换
            index = int(var) - 1
            if index < len(args):
                return args[index]
            else:
                return "{}{}".format(delimiter, var)   # 无替换参数则返回原值
        else:
            return kwargs.get(var, None) or "{}{}".format(delimiter, var)   # 返回kwargs参数中值 or 原值

    if isinstance(origin, str):
        return re.sub(patten, repl_func, origin, re.M)
    elif isinstance(origin, (dict, list)):  # 使用json.dumps转为字符串, 替换,然后重新转为dict/list
        return json.loads(re.sub(patten, repl_func, json.dumps(origin), re.M))
    else:
        if isinstance(origin, tuple):
            return tuple(json.loads(re.sub(patten, repl_func, json.dumps(origin), re.M)))  # 转换后重新转为tuple


if __name__ == '__main__':
    s = ['性别: $2  年龄: $3
$a', '$1', {"say": "$a"}]
    print(parser(s, 'kevin', 'male', '20', a="hello, world!"))

相关阅读:
Scala（二）——基础语法（与Java的区分）和函数式编程
LeetCode3.无重复字符的最大子串
LeetCode2.两数相加
LeetCode1.两数之和
Scala（一）——基本类型
图论算法总结（一）——图的遍历
Hutool强大的工具类
Jdk8新特性之接口新增方法
mybatis的xml配置中if text判断
Jdk8新特性目录

原文地址：https://www.cnblogs.com/superhin/p/12763906.html