• shell 脚本大文件处理


    shell  脚本大文件处理

    字符串处理

    s='{"_id":{"$oid":"59b73d80930c17474f9f050d"},"address":"01ny.cn/xinxi/17571162.html"}'
    s_sub='{"_id":{"$oid":"59b73d80930c17474f9f050d"},"address":"'
    s_sub_sub='"}'
    r=${s//$s_sub/}
    r=${r//$s_sub_sub/}
    echo $r
    
    expr length '{"_id":{"$oid":"59b73d80930c17474f9f050d"},"address":"'
    
    r=${s:55:100}
    r=${r//$s_sub_sub/}
    echo $r
    

      

    shell变量里的字符替换 - CSDN博客 http://blog.csdn.net/augusdi/article/details/41010041

    linux shell 字符串操作详解 (长度,读取,替换,截取,连接,对比,删除,位置 ) - gaomatlab - 博客园 https://www.cnblogs.com/gaochsh/p/6901809.html

    大文件处理

    awk分割字符串

     awk '{split(substr($1,55,100),arr,""") ;print arr[1]}' kwaddress_address_20180227.json

    awk '{split(substr($1,55,100),arr,"""); split(arr[1],arr_b,"/");print arr_b[1]}' kwaddress_address_20180227.json

    [root@hadoop3 kwaddress]# cat  extract.2g.sh 
    s_sub_sub='"}'
    r=''
    s=`awk '{split(substr($1,55,100),arr,"""); split(arr[1],arr_b,"/");print arr_b[1]}' kwaddress_address_20180227.json`
    #for LINE in `cat /home/data/kwaddress/kwaddress_address_20180227.json`
    #for LINE in `awk '{print substr($1,55,100)}' kwaddress_address_20180227.json`
    for LINE in $s
    do 
     echo $LINE
     #r=${LINE:55:100}
    # r=${r//$s_sub_sub/}
    # echo $r
    done
    
    echo
    exit 0
    
    [root@hadoop3 kwaddress]# 
    

      

     awk '{split(substr($1,55,100),arr,"""); split(arr[1],arr_b,"/");print arr_b[1]}' kwaddress_address_20180227.json >> url.pool.txt 

  • 相关阅读:
    PCB封装分配
    1.26-CAD异形封装的制作
    1.40-CAE封装6个实例单门制作
    简单的URL解析
    冒泡排序和简单的递归
    作用域的提升
    数组的去重,以及随机一个验证码
    数组和字符串的相互转换。
    substring,substr,和slice的区别详解。
    去除Input输入框中边框的方式
  • 原文地址:https://www.cnblogs.com/rsapaper/p/8578646.html
Copyright © 2020-2023  润新知