• Word Frequency


    Write a bash script to calculate the frequency of each word in a text file words.txt.

    For simplicity sake, you may assume:

    • words.txt contains only lowercase characters and space ' ' characters.
    • Each word must consist of lowercase characters only.
    • Words are separated by one or more whitespace characters.

    For example, assume that words.txt has the following content:

    the day is sunny the the
    the sunny is is
    

    Your script should output the following, sorted by descending frequency:

    the 4
    is 3
    sunny 2
    day 1
    

    Note:
    Don't worry about handling ties, it is guaranteed that each word's frequency count is unique.

    tr -s "	| " "
    " <words.txt|sort |uniq -c | sort -k1 -r|awk '{print $2,$1}'

    tr -s: 使用指定字符串替换出现一次或者连续出现的目标字符串(把一个或多个连续空格用换行符代替)

    sort: 将单词从小到大排序

    uniq -c: uniq用来对连续出现的行去重,-c参数为计数

    sort -r: -r 倒序排列

    awk '{ print $2, $1 }': 格式化输出,将每一行的内容用空格分隔成若干部分,$i为第i个部分。

  • 相关阅读:
    mysql DCL(数据控制语句)
    sybase 脚本建表和删除表
    mysql DML(数据操纵语句)
    delphi fastmm4 调试
    delphi class of 类引用
    DELPHI SetLocaleInfo 设置本地时间
    DELPHI 去字符串中所有空格
    SUSE 11 安装MongoDB
    suse下 登录mongodb
    DELPHI 的 {$M +} 和{$M -}
  • 原文地址:https://www.cnblogs.com/qionghua520/p/4369808.html
Copyright © 2020-2023  润新知