• Getting Started with Elasticsearch Installation


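    About Elasticsearch:
      Elasticsearch is a search server built on Lucene; it wraps Lucene behind a higher-level interface.
      Lucene is an open-source library for full-text indexing and search.
    Installation is simple:
    unpack the archive and it is ready to use, with almost zero configuration.
    It is a distributed full-text search server.
    A cluster should have at least two nodes, preferably three or more.
    Installation steps:
    ------------------------------------------Environment preparation-----------------------------------------------
    ### [Run the following commands on every machine] ###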
    # Elasticsearch must be started as a non-root user, so create a bigdata user:
    useradd bigdata
    # Set a password for the bigdata user:
    echo 123456 | passwd --stdin bigdata
    # Add bigdata to sudoers
    echo "bigdata ALL = (root) NOPASSWD:ALL" | tee /etc/sudoers.d/bigdata
    chmod 0440 /etc/sudoers.d/bigdata
     
    # Fix the "sudo: sorry, you must have a tty to run sudo" error: in /etc/sudoers, replace the "Defaults requiretty" line with an exemption for the bigdata user
    sudo sed -i 's/Defaults requiretty/Defaults:bigdata !requiretty/' /etc/sudoers
     
    # Create the /bigdata and /data directories
    mkdir /{bigdata,data}
    # Give the bigdata user ownership of those directories
    chown -R bigdata:bigdata /{bigdata,data}
     
    ------------------------------------------ES installation-----------------------------------------------
    We recommend installing the Java 8 update 20 or later, or Java 7 update 55 or later.
    Previous versions of Java 7 are known to have bugs that can cause index corruption and data loss.
    Elasticsearch will refuse to start if a known-bad version of Java is used.
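    ### [Switch to the bigdata user for the installation] ###
    1. Install the JDK (Java 8 update 20 or later, or Java 7 update 55 or later)
    2. Upload the Elasticsearch tarball
    3. Unpack it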
    tar -zxvf elasticsearch-2.3.1.tar.gz -C /bigdata/
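    4. Edit the configuration
    vi /bigdata/elasticsearch-2.3.1/config/elasticsearch.yml
    # Cluster name: nodes with the same cluster name join the same cluster (discovery in 2.x is unicast by default, see the host list below)
    cluster.name: bigdata
    # Node name, must be unique within the cluster
    node.name: es-1
    # Where data is stored
    path.data: /data/es/data
    # Where logs are stored
    path.logs: /data/es/logs
    # IP address this node binds to
    network.host: 172.16.0.14
    # Hosts contacted at startup to discover the cluster and take part in master election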
    discovery.zen.ping.unicast.hosts: ["node-4.itcast.cn", "node-5.itcast.cn", "node-6.itcast.cn"]
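    5. Copy the installation to the other nodes with scp (run from /bigdata so that $PWD resolves to the same path on the target)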
    scp -r elasticsearch-2.3.1/ node-5.itcast.cn:$PWD
    scp -r elasticsearch-2.3.1/ node-6.itcast.cn:$PWD
    6. On each of the other nodes, edit the Elasticsearch configuration: node.name and network.host must be changed
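Step 6 can be scripted instead of edited by hand. The following is a minimal sketch, assuming the two keys are uncommented and start at column 0 in elasticsearch.yml; `set_node_identity` is a hypothetical helper name, not part of Elasticsearch.

```shell
# Sketch: rewrite node.name and network.host in a copied elasticsearch.yml.
# Assumption: both keys are uncommented and start at the beginning of a line.
set_node_identity() {
    conf_file=$1    # path to elasticsearch.yml
    node_name=$2    # unique node name, for example es-2
    bind_host=$3    # address this node binds to
    # Replace the two lines; every other line is copied unchanged.
    # A temporary file is used instead of "sed -i" for portability.
    sed -e "s|^node\.name:.*|node.name: ${node_name}|" \
        -e "s|^network\.host:.*|network.host: ${bind_host}|" \
        "$conf_file" > "${conf_file}.tmp" && mv "${conf_file}.tmp" "$conf_file"
}
```

On the second node this could be called as `set_node_identity /bigdata/elasticsearch-2.3.1/config/elasticsearch.yml es-2 <ip-of-that-node>`, where the node name and address are placeholders to fill in.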
    7. Start Elasticsearch (run /bigdata/elasticsearch-2.3.1/bin/elasticsearch -h to see the help text)
    /bigdata/elasticsearch-2.3.1/bin/elasticsearch -d
    If startup fails with:
    	max file descriptors [4096] for elasticsearch process is too low, increase to at least [65535]
    	max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
    Fix (as root):
    	1. Edit /etc/security/limits.conf and append the following, then log in again so the new limits apply:
    		* soft nofile 65536
    		* hard nofile 65536
    	2. Add this line at the end of /etc/sysctl.conf, then run sysctl -p to apply it:
    		vm.max_map_count=262144
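Before restarting, it may help to confirm that the new limits are visible to the bigdata user. This is a rough sketch for Linux; `meets_minimum` and `check_es_limits` are hypothetical helper names, and the thresholds are the ones quoted in the error messages above.

```shell
# Sketch: compare the current limits with the minimums from the error messages.
meets_minimum() {
    # Succeeds when the actual value ($1) is "unlimited" or >= the required value ($2).
    [ "$1" = "unlimited" ] || [ "$1" -ge "$2" ] 2>/dev/null
}

check_es_limits() {
    nofile=$(ulimit -n)                          # open-file limit of the current shell
    map_count=$(cat /proc/sys/vm/max_map_count)  # Linux kernel setting
    meets_minimum "$nofile" 65535 || echo "max file descriptors too low: $nofile"
    meets_minimum "$map_count" 262144 || echo "vm.max_map_count too low: $map_count"
}
```

Running `check_es_limits` in a fresh login shell of the bigdata user prints one line per limit that is still too low, and nothing when both are sufficient.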
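    8. Open port 9200 on any Elasticsearch host in a browser
    Elasticsearch has a built-in master election that does not depend on ZooKeeper; an odd number of master-eligible nodes is recommended to avoid split brain.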
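The reason for preferring an odd node count shows up in the majority arithmetic. In Elasticsearch 2.x the setting `discovery.zen.minimum_master_nodes` is usually set to a majority of the master-eligible nodes, (N / 2) + 1. The sketch below only computes that number; `quorum` is a hypothetical helper name.

```shell
# Sketch: majority quorum for N master-eligible nodes (integer division).
quorum() {
    echo $(( $1 / 2 + 1 ))
}

quorum 3   # prints 2: a 3-node cluster tolerates losing 1 node
quorum 4   # prints 3: a 4-node cluster still tolerates losing only 1 node
quorum 5   # prints 3: a 5-node cluster tolerates losing 2 nodes
```

For the three-node cluster in this guide, that would mean adding `discovery.zen.minimum_master_nodes: 2` to elasticsearch.yml on every node. The fourth node adds no extra fault tolerance, which is why odd sizes are the usual choice.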
    Web management UI:
    # Install the head plugin (downloaded from GitHub)
    /bigdata/elasticsearch-2.3.1/bin/plugin install mobz/elasticsearch-head
     
    [hadoop@master es]$ bin/plugin install mobz/elasticsearch-head
    -> Installing mobz/elasticsearch-head...
    Trying https://github.com/mobz/elasticsearch-head/archive/master.zip ...
    Downloading ....................DONE
    Verifying https://github.com/mobz/elasticsearch-head/archive/master.zip checksums if available ...
    NOTE: Unable to verify checksum for downloaded plugin (unable to find .sha1 or .md5 file to verify)
    Installed head into /export/servers/es/plugins/head
     
    # Install the head plugin from a local file
    ./plugin install file:///home/bigdata/elasticsearch-head-master.zip
     
    # Open the head management page
    http://172.16.0.14:9200/_plugin/head/
     
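    Basic Elasticsearch usage:
    Format of the RESTful API URL:
    http://localhost:9200/<index>/<type>/[<id>]
    index and type are required.
    id is optional; if it is omitted (send the request with POST instead of PUT), Elasticsearch generates one.
    index and type organize the data in layers, which makes it easier to manage.
    An index is roughly a database and a type a table; the id is like the primary key of a row and is unique.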
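The URL layout above can be captured in a small function so that the curl commands do not repeat the host and index. This is only an illustrative sketch; `doc_url` is a hypothetical helper name.

```shell
# Sketch: build a document URL from base address, index, type and optional id.
doc_url() {
    base=$1
    index=$2
    type=$3
    id=${4:-}    # optional; empty when the caller passes only three arguments
    if [ -n "$id" ]; then
        echo "${base}/${index}/${type}/${id}"
    else
        echo "${base}/${index}/${type}"
    fi
}
```

For example, `curl -XGET "$(doc_url http://172.16.0.14:9200 store books 1)"` addresses document 1 of type books in the store index. The three-argument form gives the URL to POST to when Elasticsearch should generate the id.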
     
    # Add some books to the store index
    curl -XPUT 'http://172.16.0.14:9200/store/books/1' -d '{
      "title": "Elasticsearch: The Definitive Guide",
      "name" : {
        "first" : "Zachary",
        "last" : "Tong"
      },
      "publish_date":"2015-02-06",
      "price": 49.99
    }'
    
    # Query it in a browser
    http://172.16.0.14:9200/store/books/1
    
    # Query it from Linux with curl
    curl -XGET 'http://172.16.0.14:9200/store/books/1'
    
    # Add another book
    curl -XPUT 'http://172.16.0.14:9200/store/books/2' -d '{
      "title": "Elasticsearch Blueprints",
      "name" : {
        "first" : "Vineeth",
        "last" : "Mohan"
      },
      "publish_date":"2015-06-06",
      "price": 35.99
    }'
    
    
    # Get a document by ID
    curl -XGET 'http://172.16.0.14:9200/store/books/1'
    
    # View it in a browser
    http://172.16.0.14:9200/store/books/1
    
    # Use _source to return only the listed fields (a bare ?_source returns the whole document)
    curl -XGET 'http://172.16.0.14:9200/store/books/1?_source=title'
    curl -XGET 'http://172.16.0.14:9200/store/books/1?_source=title,price'
    curl -XGET 'http://172.16.0.14:9200/store/books/1?_source'
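The three requests above differ only in the comma-separated field list after `_source`. A small sketch of building that query string from separate field names follows; `with_source` is a hypothetical helper name.

```shell
# Sketch: append a _source field filter to a document URL.
with_source() {
    url=$1
    shift
    if [ "$#" -eq 0 ]; then
        # No fields given: same as the bare ?_source form above.
        echo "${url}?_source"
    else
        # Join the remaining arguments with commas.
        fields=$(IFS=,; echo "$*")
        echo "${url}?_source=${fields}"
    fi
}
```

`curl -XGET "$(with_source http://172.16.0.14:9200/store/books/1 title price)"` is then equivalent to the second request above.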
    
    # Update by overwriting the whole document
    curl -XPUT 'http://172.16.0.14:9200/store/books/1' -d '{
      "title": "Elasticsearch: The Definitive Guide",
      "name" : {
        "first" : "Zachary",
        "last" : "Tong"
      },
      "publish_date":"2016-02-06",
      "price": 99.99
    }'
    
    # Or use the _update API to change only the fields you want to update
    curl -XPOST 'http://172.16.0.14:9200/store/books/1/_update' -d '{
      "doc": {
         "price" : 88.88
      }
    }'
    
    curl -XGET 'http://172.16.0.14:9200/store/books/1'
    
    # Delete a document (index document 1 again with the PUT above if you want it back for the queries below)
    curl -XDELETE 'http://172.16.0.14:9200/store/books/1'
    
    
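    # Simplest filter query
    # SELECT * FROM books WHERE price = 35.99
    # filtered query for books priced at 35.99 (the filtered query is deprecated since 2.0 but still accepted by 2.3.1)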
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
        "query" : {
            "filtered" : {
                "query" : {
                    "match_all" : {}
                },
                "filter" : {
                    "term" : {
                        "price" : 35.99
                      }
                  }
            }
        }
    }'
    
    # Match any of several values
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
        "query" : {
            "filtered" : {
                "filter" : {
                    "terms" : {
                        "price" : [35.99, 99.99]
                      }
                  }
            }
        }
    }'
    
    
    # SELECT * FROM books WHERE publish_date = "2015-02-06"
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
      "query" : {
        "filtered" : {
            "filter" : {
               "term" : {
                  "publish_date" : "2015-02-06"
                }
              }
          }
      }
    }'
    
    
    
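    # bool filter: combines several filter conditions
    # SELECT * FROM books WHERE (price = 35.99 OR price = 99.99) AND (publish_date != "2016-02-06")
    # Like SQL, Elasticsearch can combine conditions with and, or and not
    # The format is:
    #  {
    #    "bool" : {
    #      "must" :     [],
    #      "should" :   [],
    #      "must_not" : []
    #    }
    #  }
    #
    # must: the condition must match, like AND
    # should: the condition may or may not match; with no must clause, at least one should clause has to match, like OR
    # must_not: the condition must not match, like NOT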
    
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
      "query" : {
        "filtered" : {
          "filter" : {
            "bool" : {
              "should" : [
                { "term" : {"price" : 35.99}},
                { "term" : {"price" : 99.99}}
              ],
              "must_not" : {
                "term" : {"publish_date" : "2016-02-06"}
              }
            }
          }
        }
      }
    }'
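Hand-written nested JSON is easy to get wrong, so the same request body can be assembled from small pieces. The sketch below is one possible way to do that in plain shell. All four function names are hypothetical, and values are inserted as-is, so string values must be passed with their JSON quotes and no escaping is done.

```shell
# Sketch: assemble the bool filter request body from small printf helpers.

# term_clause FIELD JSON_VALUE -> {"term":{"FIELD":JSON_VALUE}}
term_clause() {
    printf '{"term":{"%s":%s}}' "$1" "$2"
}

# join_clauses CLAUSE... -> the clauses joined with commas
join_clauses() {
    (IFS=,; printf '%s' "$*")
}

# bool_filter SHOULD_CLAUSES MUST_NOT_CLAUSES -> a bool filter with both arrays
bool_filter() {
    printf '{"bool":{"should":[%s],"must_not":[%s]}}' "$1" "$2"
}

# filtered_query FILTER -> request body wrapping the filter in a filtered query
filtered_query() {
    printf '{"query":{"filtered":{"filter":%s}}}' "$1"
}

# (price = 35.99 OR price = 99.99) AND publish_date != "2016-02-06"
should=$(join_clauses "$(term_clause price 35.99)" "$(term_clause price 99.99)")
must_not=$(term_clause publish_date '"2016-02-06"')
body=$(filtered_query "$(bool_filter "$should" "$must_not")")
printf '%s\n' "$body"
```

The resulting `$body` can be passed to the request above with `-d "$body"` in place of the inline JSON.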
    
    
    # Nested bool filter
    # SELECT * FROM books WHERE price = 35.99 OR ( publish_date = "2016-02-06" AND price = 99.99 )
    
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
      "query" : {
        "filtered" : {
          "filter" : {
            "bool" : {
              "should" : [
                  { "term" : {"price" : 35.99}},
                  { "bool" : {
                  "must" : [
                    {"term" : {"publish_date" : "2016-02-06"}},
                    {"term" : {"price" : 99.99}}
                  ]
                }}
              ]
            }
          }
        }
      }
    }'
    
    # range filter
    # SELECT * FROM books WHERE price > 20 AND price < 100
    # gt :  > greater than
    # lt :  < less than
    # gte :  >= greater than or equal to
    # lte :  <= less than or equal to
    
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
      "query" : {
        "filtered" : {
          "filter" : {
            "range" : {
              "price" : {
                "gt" : 20.0,
                "lt" : 100
              }
            }
          }
        }
      }
    }'
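Whether a bound is inclusive depends only on which operator name is used, so it can help to make the operators explicit parameters. The following is a minimal sketch; `range_filter` is a hypothetical helper name, and the bounds are inserted as raw JSON values.

```shell
# Sketch: build a range filter with explicit lower and upper operators.
# range_filter FIELD LOWER_OP LOWER UPPER_OP UPPER
range_filter() {
    field=$1; op_low=$2; low=$3; op_high=$4; high=$5
    # Accept only the operator names listed above.
    case "$op_low" in
        gt|gte) ;;
        *) echo "lower operator must be gt or gte" >&2; return 1 ;;
    esac
    case "$op_high" in
        lt|lte) ;;
        *) echo "upper operator must be lt or lte" >&2; return 1 ;;
    esac
    printf '{"range":{"%s":{"%s":%s,"%s":%s}}}' "$field" "$op_low" "$low" "$op_high" "$high"
}
```

`range_filter price gt 20 lt 100` produces the exclusive range used in the request above, while `range_filter price gte 20 lt 100` would include books priced at exactly 20.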
    
    
    # Another way to write and, or, not
    # No bool: use the and, or, not filters directly
    # Note: filters written this way, without bool, cannot use the filter cache
    # Find documents whose price is 35.99 and whose publish_date is "2015-02-06"
    curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{
      "query": {
        "filtered": {
          "filter": {
            "and": [
              {
                "term": {
                  "price": 35.99
                }
              },
              {
                "term": {
                  "publish_date": "2015-02-06"
                }
              }
            ]
          },
          "query": {
            "match_all": {}
          }
        }
      }
    }'
    
    http://172.16.0.14:9200/store/books/_search
    
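    # Install the Marvel monitoring plugin
    # Note: this command and URL follow the Marvel 1.x layout; Marvel 2.x is installed as the license and marvel-agent plugins and viewed through Kibana
    /bigdata/elasticsearch-2.3.1/bin/plugin install elasticsearch/marvel/latest
    # Open the Marvel page
    http://172.16.0.14:9200/_plugin/marvel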
    
  • Original article: https://www.cnblogs.com/gentle-awen/p/10000801.html