• 9.26


    1.读入待分析的字符串

    fo=open('test.txt','w')
    fo.write('''All the times that you rain on my parade
    And all the clubs you get in using my name
    You think you broke my heart
    Ohhh girl for goodness sake
    You think I'm crying
    Oh my ohhh, well I ain't!
    And I didn't wanna write a song
    'Cause I didn't want anyone thinking I still care, I don't
    But, you still hit my phone up
    And baby I be moving on
    And I think you should be somethin' I don't wanna hold back
    Maybe you should know that
    My mama don't like you and she like's everyone''')
    fo=close()

    2.分解提取单词

    news=news.lower()
    for i in ',!'':
        news=news.replace(i,' ')
    words=sorry.split(' ')
    print(words)

    3.计数字典

    4.排除语法型词汇

    dic={}
    words.sort()
    d=set(words)
    d=d-exc
    for i in d:
        dic[i]=words.count(i)

    5.排序

    6.输出TOP(20)

    wc=list(dic.items())
    wc.sort(key=lambda x:x[1],reverse=True)#排序
    for i in range(20):
        print(wc[i])
  • 相关阅读:
    re.sub函数的深入了解
    xpath
    改变评分查询
    Boolean Query
    固定分数查询
    Unicode编码的原型
    java中基本类型占用字节数
    Java Socket网络编程的经典例子(转)
    (转)工厂模式
    (转)java垃圾回收机制
  • 原文地址:https://www.cnblogs.com/chenyanxi123/p/7597321.html
Copyright © 2020-2023  润新知