• linux、hdfs、hive、hbase经常使用的命令


    linux经常使用命令

    pwd
    查看当前工作文件夹的绝对路径

    cat input.txt
    查看input.txt文件的内容

    ls
    显示当前文件夹下全部的文件及子文件夹

    rm recommender-dm-1.0-SNAPSHOT-lib.jar
    删除当前文件夹下recommender-dm-1.0-SNAPSHOT-lib.jar文件

    cp /home/deploy/pctr/recommender-dm_fat.jar ./
    把/home/deploy/pctr/文件夹下的recommender-dm_fat.jar拷贝到当前文件夹下

    rm -rf 0000*
    强行直接删除(不作不论什么提示)全部前缀为0000的文件

    rm -rf
    删除目录
    rm
    -r:向下递归,无论有多少级文件夹,一并删除
    -f:直接强行删除。不做不论什么提示的意思

    rz
    从client向server上传文件(receive:server接收)

    sz
    从server下载文件(send:server发送)



    hadoop hdfs经常使用命令

    hadoop fs -ls /user/deploy/recsys/workspace/ouyangyewei
    查看ouyangyewei文件夹文件

    hadoop fs -mkdir /user/deploy/recsys/workspace/ouyangyewei/input
    在ouyangyewei文件夹下创建input文件夹

    hadoop fs -rm /user/deploy/recsys/workspace/ouyangyewei/input/input.txt
    删除input.txt文件

    hadoop fs -rmr /user/deploy/recsys/workspace/ouyangyewei/input
    删除input文件夹以及文件夹下的全部文件

    hadoop fs -put ./input.txt /home/deploy/recsys/workspace/ouyangyewei/input
    把当前文件夹下的input.txt文件拷贝到input文件夹下

    hadoop fs -dus /data/share/trackinfo/ds=2014-05-12
    查看文件 “/data/share/trackinfo/ds=2014-05-12”的大小(以字节为单位)

    hadoop jar recommender-dm-1.0-SNAPSHOT-lib.jar com.yhd.ml.statistics.click.WordCount /home/deploy/recsys/workspace/ouyangyewei/input /home/deploy/recsys/workspace/ouyangyewei/output
    执行Job,指定的jar包是recommender-dm-1.0-SNAPSHOT-lib.jar。主类是com.yhd.ml.statistics.click.WordCount,输入文件夹是input,输出文件夹是output

    hadoop job -kill job_201403291618_274044
    杀掉hadoop的job



    hbase经常使用命令

    /usr/local/cloud/hbase/bin/hbase shell
    用shell来连接hbase

    exit
    退出hbase shell

    version
    查看hbase版本号

    hbase(main):045:0> is_enabled 't1'
    true                                                                                                                                                                            
    0 row(s) in 0.0020 seconds
    測试表t1是否有效

    hbase(main):046:0> is_disabled 't1'
    false                                                                                                                                                                           
    0 row(s) in 0.0010 seconds
    測试表t1是否无效

    hbase(main):044:0> exists 't1'
    Table t1 does exist                                                                                                                                                             
    0 row(s) in 0.0270 seconds
    測试表t1是否存在

    scan 'full_user_profile', {LIMIT=>1}
    输出数据表full_user_profile中的1个RowKey

    list
    列出全部数据表

    describe 'full_user_profile'
    列出full_user_profile数据表的结构

    hbase(main):003:0> disable 'score'
    0 row(s) in 2.1080 seconds
    使数据表score无效

    hbase(main):004:0> drop 'score'
    0 row(s) in 10.6740 seconds
    删除数据表score(注意在删除表之前要使表无效

    -------------------------------------------------------------------------
    hbase(main):013:0> create 'score', 'name', 'course'
    0 row(s) in 5.1050 seconds
    创建数据表score,当中name是Row Key,course是列族

    hbase(main):014:0> put 'score', 'xiaowen', 'course:China', '95'
    0 row(s) in 33.4270 seconds
    在列族course下创建列China,值为95

    hbase(main):015:0> put 'score', 'xiaowen', 'course:Math', '99'
    0 row(s) in 0.0130 seconds
    在列族course下创建列Math,值为99

    hbase(main):016:0> put 'score', 'xiaowen', 'course:English', '98'
    0 row(s) in 0.0040 seconds
    在列族course下创建列English,值为98

    hbase(main):017:0> scan 'score'
    ROW                                           COLUMN+CELL                                                                                                                       
    xiaowen                                      column=course:China, timestamp=1400141524101, value=95                                                                            
    xiaowen                                      column=course:English, timestamp=1400141591123, value=98                                                                          
    xiaowen                                      column=course:Math, timestamp=1400141579107, value=99                                                                             
    1 row(s) in 0.0250 seconds
    查看score整张表的数据

    hbase(main):018:0> get 'score', 'xiaowen'
    COLUMN                                        CELL                                                                                                                              
    course:China                                 timestamp=1400141524101, value=95                                                                                                 
    course:English                               timestamp=1400141591123, value=98                                                                                                 
    course:Math                                  timestamp=1400141579107, value=99                                                                                                 
    3 row(s) in 0.0110 seconds
    查看score表的xiaowen行数据

    hbase(main):019:0> get 'score', 'xiaowen', 'course:Math'
    COLUMN                                        CELL                                                                                                                              
    course:Math                                  timestamp=1400141579107, value=99                                                                                                 
    1 row(s) in 0.0070 seconds
    查看score表的xiaowen行,course列族上的Math列的数据

    hbase(main):008:0> scan 'score'
    ROW                                           COLUMN+CELL                                                                                                                       
    xiaowen                                      column=course:China, timestamp=1400141524101, value=95                                                                            
    xiaowen                                      column=course:English, timestamp=1400141591123, value=98                                                                          
    xiaowen                                      column=course:Math, timestamp=1400141579107, value=99                                                                             
    xiaoye                                       column=course:China, timestamp=1400143888087, value=85                                                                            
    xiaoye                                       column=course:English, timestamp=1400143921395, value=85                                                                          
    xiaoye                                       column=course:Math, timestamp=1400143907407, value=85                                                                             
    2 row(s) in 0.0240 seconds
    查看score表的全部值

    hbase(main):013:0> scan 'score', {COLUMNS=>'course'}
    ROW                                           COLUMN+CELL                                                                                                                       
    xiaowen                                      column=course:China, timestamp=1400141524101, value=95                                                                            
    xiaowen                                      column=course:English, timestamp=1400141591123, value=98                                                                          
    xiaowen                                      column=course:Math, timestamp=1400141579107, value=99                                                                             
    xiaoye                                       column=course:China, timestamp=1400143888087, value=85                                                                            
    xiaoye                                       column=course:English, timestamp=1400143921395, value=85                                                                          
    xiaoye                                       column=course:Math, timestamp=1400143907407, value=85                                                                             
    2 row(s) in 0.0230 seconds
    查看score表的course列的全部值

    hbase(main):014:0> scan 'score', {COLUMNS=>'course:Math'}
    ROW                                           COLUMN+CELL                                                                                                                       
    xiaowen                                      column=course:Math, timestamp=1400141579107, value=99                                                                             
    xiaoye                                       column=course:Math, timestamp=1400143907407, value=85                                                                             
    2 row(s) in 0.0270 seconds
    查看score表course:Math列的全部值

    hbase(main):021:0> count 'score'
    2 row(s) in 0.1880 seconds
    统计score表有多少行

    --------------------------------------------------------------------
    test表的实践

    hbase(main):022:0> create 'test', 'c1', 'c2'
    0 row(s) in 1.1260 seconds

    hbase(main):023:0> put 'test', 'r1', 'c1:1', 'value1-1/1'
    0 row(s) in 0.0360 seconds

    hbase(main):024:0> put 'test', 'r1', 'c1:2', 'value1-1/2'
    0 row(s) in 0.0210 seconds

    hbase(main):025:0> put 'test', 'r1', 'c1:3', 'value1-1/3'
    0 row(s) in 0.0170 seconds

    hbase(main):026:0> put 'test', 'r1', 'c2:1', 'value1-2/1'
    0 row(s) in 0.0100 seconds

    hbase(main):027:0> put 'test', 'r1', 'c2:2', 'value1-2/2'
    0 row(s) in 0.0060 seconds

    hbase(main):028:0> put 'test', 'r2', 'c1:1', 'value2-1/1'
    0 row(s) in 0.0110 seconds

    hbase(main):029:0> put 'test', 'r2', 'c2:1', 'value2-2/1'
    0 row(s) in 0.0080 seconds

    hbase(main):030:0> scan 'test'
    ROW                                           COLUMN+CELL                                                                                                                        
    r1                                           column=c1:1, timestamp=1400152716678, value=value1-1/1                                                                             
    r1                                           column=c1:2, timestamp=1400152749600, value=value1-1/2                                                                             
    r1                                           column=c1:3, timestamp=1400152770555, value=value1-1/3                                                                             
    r1                                           column=c2:1, timestamp=1400152793839, value=value1-2/1                                                                             
    r1                                           column=c2:2, timestamp=1400152811436, value=value1-2/2                                                                             
    r2                                           column=c1:1, timestamp=1400152843148, value=value2-1/1                                                                             
    r2                                           column=c2:1, timestamp=1400152858073, value=value2-2/1                                                                             
    2 row(s) in 0.0490 seconds

    hbase(main):031:0> describe 'test'
    DESCRIPTION                                                                                                        ENABLED                                                       
    {NAME => 'test', FAMILIES => [{NAME => 'c1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'NONE', REPLICATION_SC true                                                          
    OPE => '0', VERSIONS => '3', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS                                                                
    => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', ENCODE_ON_DISK => 'true', BLOCKCACHE => 'true'}, {NAME =>                                                               
      'c2'
    , DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'NONE', REPLICATION_SCOPE => '0', VERSIONS => '3', COMPRESSI                                                               
    ON => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_M                                                               
    EMORY => 'false', ENCODE_ON_DISK => 'true', BLOCKCACHE => 'true'}]}                                                                                                             
    1 row(s) in 0.2560 seconds

    从describe命令可见test表有两个列族



    hive经常使用命令

    show tables;
    列出hive里面全部数据表名

    desc userProfile;
    显示数据表userProfile的基本表字段及字段type

    desc extended trackinfo;
    显示数据表trackinfo的具体信息,包含字段说明,数据表等

    /usr/local/cloud/hive/bin/hive
    进入hive数据库

    select attribute_name from pms_attribute where attribute_id=21000 and attribute_value_id=105991;
    hive的select操作

    select user_id, category_id, catgory_pref, attribute_id, attribute_pref, attribute_value_id, attribute_value_pref from userProfile limit 10;
    hive的select操作。仅仅显示10行

    /usr/local/cloud/hive/bin/hive -e "select category_id, attribute_id, count(user_id) from userProfile group by category_id, attribute_id" >> /home/deploy/recsys/workspace/ouyangyewei/statistics_data/number_attention_of_attribute_for_mobilePhone.csv;
    将sql查询的数据导出到csv文件里


    版权声明:本文博主原创文章,博客,未经同意不得转载。

  • 相关阅读:
    一个Web文件上传的C#源代码
    DataSets and Serialization 数据集和序列化 (英文版)
    如何在Unity中播放影片
    靠边伸缩菜单的做法(类似QQ,碰到就会伸出来)
    Lightmapper
    Unity官方教學專案 Character Customization (紙娃娃系統)
    unity3d用鼠标拖动物体的一段代码
    [unity3d程序] 颜色渐变效果
    C# typeof()实例详解
    XNA Billboard(公告板技术)
  • 原文地址:https://www.cnblogs.com/zfyouxi/p/4878753.html
Copyright © 2020-2023  润新知