• HBase Performance Testing


    HBase PerformanceEvaluation:

    Options:
     nomapred        Run multiple clients using threads (rather than use mapreduce)
     rows            Rows each client runs. Default: One million
     size            Total size in GiB. Mutually exclusive with --rows. Default: 1.0.
     sampleRate      Execute test on a sample of total rows. Only supported by randomRead. Default: 1.0
     traceRate       Enable HTrace spans. Initiate tracing every N rows. Default: 0
     table           Alternate table name. Default: 'TestTable'
     multiGet        If >0, when doing RandomRead, perform multiple gets instead of single gets. Default: 0
     compress        Compression type to use (GZ, LZO, ...). Default: 'NONE'
     flushCommits    Used to determine if the test should flush the table. Default: false
     writeToWAL      Set writeToWAL on puts. Default: True
     autoFlush       Set autoFlush on htable. Default: False
     oneCon          all the threads share the same connection. Default: False
     presplit        Create presplit table. Recommended for accurate perf analysis (see guide).  Default: disabled
     inmemory        Tries to keep the HFiles of the CF inmemory as far as possible. Not guaranteed that reads are always served from memory.  Default: false
     usetags         Writes tags along with KVs. Use with HFile V3. Default: false
     numoftags       Specify the no of tags that would be needed. This works only if usetags is true.
     filterAll       Helps to filter out all the rows on the server side, thereby not returning anything back to the client.
     latency         Set to report operation latencies. Default: False
     bloomFilter     Bloom filter type, one of [NONE, ROW, ROWCOL]
     valueSize       Pass value size to use: Default: 1024
     valueRandom     Set if we should vary value size between 0 and 'valueSize'; set on read for stats on size: Default: Not set.
     valueZipf       Set if we should vary value size between 0 and 'valueSize' in zipf form: Default: Not set.
     period          Report every 'period' rows: Default: opts.perClientRunRows / 10
     multiGet        Batch gets together into groups of N. Only supported by randomRead. Default: disabled
     addColumns      Adds columns to scans/gets explicitly. Default: true
     replicas        Enable region replica testing. Defaults: 1.
     splitPolicy     Specify a custom RegionSplitPolicy for the table.
     randomSleep     Do a random sleep before each get between 0 and entered value. Defaults: 0
     columns         Columns to write per row. Default: 1
     caching         Scan caching to use. Default: 30
    
     Note: -D properties will be applied to the conf used. 
      For example: 
       -Dmapreduce.output.fileoutputformat.compress=true
       -Dmapreduce.task.timeout=60000
    
    Command:
     filterScan      Run scan test using a filter to find a specific row based on its value (make sure to use --rows=20)
     randomRead      Run random read test
     randomSeekScan  Run random seek and scan 100 test
     randomWrite     Run random write test
     scan            Run scan test (read every row)
     scanRange10     Run random seek scan with both start and stop row (max 10 rows)
     scanRange100    Run random seek scan with both start and stop row (max 100 rows)
     scanRange1000   Run random seek scan with both start and stop row (max 1000 rows)
     scanRange10000  Run random seek scan with both start and stop row (max 10000 rows)
     sequentialRead  Run sequential read test
     sequentialWrite Run sequential write test
    
    Args:
     nclients        Integer. Required. Total number of clients (and HRegionServers)
                     running: 1 <= value <= 500

    hbase pe --table=htest --nomapred --rows=20000 --presplit=100 randomWrite 100
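
Because `rows` is the number of rows each client runs, the command above drives more data than `--rows=20000` suggests. The following is a rough sizing sketch, assuming the defaults listed in the usage text (`valueSize` 1024, `columns` 1) and ignoring row-key, qualifier, and timestamp overhead; the variable names are illustrative only.

```shell
# Rough workload sizing for:
#   hbase pe --table=htest --nomapred --rows=20000 --presplit=100 randomWrite 100
rows_per_client=20000   # --rows (rows each client runs)
nclients=100            # final nclients argument
value_size=1024         # --valueSize default, in bytes
columns=1               # --columns default

# Total rows written across all client threads.
total_rows=$((rows_per_client * nclients))

# Approximate payload: value bytes only, without key or metadata overhead.
total_bytes=$((total_rows * columns * value_size))

echo "total rows: $total_rows"
echo "approx value bytes: $total_bytes"
```

With these numbers the run writes 2,000,000 rows and roughly 2,048,000,000 bytes of values (about 1.9 GiB), spread over the 100 presplit regions.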

    Yahoo! Cloud Serving Benchmark (YCSB):

    ./ycsb load hbase10 -P ../workloads/workloadb -p threads=10 -p columnfamily=f1 -p recordcount=20000 -s
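
YCSB separates a benchmark into a `load` phase, which inserts the records, and a `run` phase, which executes the workload's operation mix against them. The sketch below only prints the two command lines (a dry run) so they can be reviewed before execution. The helper name `ycsb_cmd` is hypothetical, and reusing the same properties for the `run` phase is an assumption rather than something the original post shows.

```shell
# Dry-run helper: print a YCSB command for the given phase (load or run).
# Binding, workload file, and properties are copied from the load command above.
ycsb_cmd() {
  phase=$1
  echo "./ycsb $phase hbase10 -P ../workloads/workloadb -p threads=10 -p columnfamily=f1 -p recordcount=20000 -s"
}

# Print the load phase first, then the run phase.
ycsb_cmd load
ycsb_cmd run
```

The target table and the `f1` column family are expected to exist before the load phase starts; creating them is not covered here.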

  • Original article: https://www.cnblogs.com/rilley/p/6340229.html