• hive基本操作与应用


    通过hadoop上的hive完成WordCount

    启动hadoop

    Hdfs上创建文件夹

    上传文件至hdfs

    启动Hive

    创建原始文档表

    导入文件内容到表docs并查看

    用HQL进行词频统计,结果放在表word_count里

    查看统计结果

    ssh localhost
    cd /usr/local/hadoop
    ./sbin/start-dfs.sh
    cd /usr/local/hive/lib
    service mysql start
    start-all.sh
    
    hdfs dfs -mkdir test1
    hdfs dfs -ls /user/hadoop
    
    hdfs dfs -put ./123.txt test1
    hdfs dfs -ls /user/hadoop/test1
    
    hive
    
    create table docs(line string)
    
    load data inpath '/user/hadoop/tese1/123.txt' overwrite into table docs
    select * from docs
    
    create table word_count as select word,count(1) as count from (select explode(split(line," ")) as word from docs) word group by word order by word;
    
    show tables;
    select * from word_count;
    

      

  • 相关阅读:
    AOP概述
    AOP-动态代理
    IOC容器和Bean的配置
    Spring框架概述
    异常
    Optional 类
    Stream API
    方法引用(Method References)
    函数式(Functional)接口
    stm8笔记1-搭建工程+孤独的小灯闪烁
  • 原文地址:https://www.cnblogs.com/shadows24/p/9047594.html
Copyright © 2020-2023  润新知