• (Repost) Elasticsearch aggregation fails with "Fielddata is disabled on text fields by default"


    Following the official Elasticsearch documentation, I ran this example:

    GET /megacorp/employee/_search
    {
      "aggs": {
        "all_interests": {
          "terms": { "field": "interests" }
        }
      }
    }

    The request failed with the following error:

    {
      "error": {
        "root_cause": [
          {
            "type": "illegal_argument_exception",
            "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
          }
        ],
        "type": "search_phase_execution_exception",
        "reason": "all shards failed",
        "phase": "query",
        "grouped": true,
        "failed_shards": [
          {
            "shard": 0,
            "index": "megacorp",
            "node": "-Md3f007Q3G6HtdnkXoRiA",
            "reason": {
              "type": "illegal_argument_exception",
              "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
            }
          }
        ],
        "caused_by": {
          "type": "illegal_argument_exception",
          "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
        }
      },
      "status": 400
    }
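
    The actual cause sits in error.root_cause, and the offending field is named in square brackets inside the reason string. As a minimal sketch (the helper name fielddata_error_field is made up for illustration, and the regular expression assumes the message wording shown above), the field could be pulled out of such a response like this:

```python
import json
import re

# Assumption: matches the wording of the message shown above;
# other Elasticsearch versions may phrase it differently.
_FIELD_PATTERN = re.compile(r"Set fielddata=true on \[([^\]]+)\]")


def fielddata_error_field(response_text):
    """Return the field named in a 'Fielddata is disabled' error, or None."""
    error = json.loads(response_text).get("error", {})
    for cause in error.get("root_cause", []):
        if cause.get("type") != "illegal_argument_exception":
            continue
        match = _FIELD_PATTERN.search(cause.get("reason", ""))
        if match:
            return match.group(1)
    return None
```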

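    Some searching suggests the cause: since 5.x, fielddata, the in-memory data structure used for sorting and aggregating on text fields, is disabled by default and has to be enabled explicitly per field. The official explanation is on the fielddata documentation page.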
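
    Because fielddata is loaded into heap memory, the Elasticsearch documentation generally suggests aggregating on a keyword field rather than enabling fielddata on a text field. A minimal sketch of such a request body, assuming the index has an interests.keyword sub-field (the default dynamic mapping for strings in 5.x creates one, but an explicit mapping may not) and using a made-up helper name:

```python
import json


def terms_agg_body(field, agg_name="all_interests", use_keyword=True):
    """Build a terms aggregation body, optionally on the .keyword sub-field."""
    # Assumption: "<field>.keyword" exists in the mapping; check with
    # GET megacorp/_mapping/employee/ before relying on it.
    target = field + ".keyword" if use_keyword else field
    return {"aggs": {agg_name: {"terms": {"field": target}}}}


# Body to send with GET /megacorp/employee/_search
print(json.dumps(terms_agg_body("interests"), indent=2))
```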

    In short, run the following before aggregating:

    PUT megacorp/_mapping/employee/
    {
      "properties": {
        "interests": { 
          "type":     "text",
          "fielddata": true
        }
      }
    }
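
    The same mapping update can be prepared from a script. A minimal sketch using only the Python standard library; the base URL http://localhost:9200 and the helper name are assumptions, the path follows the typed 5.x-style endpoint used in this post, and the request is only built here, not sent:

```python
import json
import urllib.request


def enable_fielddata_request(index, doc_type, field,
                             base_url="http://localhost:9200"):
    """Build (but do not send) the PUT _mapping request shown above."""
    body = {"properties": {field: {"type": "text", "fielddata": True}}}
    return urllib.request.Request(
        url="{}/{}/_mapping/{}".format(base_url, index, doc_type),
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


req = enable_fielddata_request("megacorp", "employee", "interests")
# To actually apply it: urllib.request.urlopen(req)
```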
    

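    PS: Before running the command above, first run GET megacorp/_mapping/employee/ to inspect the existing mapping, then apply the update. Here are the steps I used to aggregate on the clientip field of tomcat.log data that Logstash loaded into ES: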

    1. First, run GET logstash-apacheaccesslog*/_mapping/logs/ to inspect the mapping, then enable fielddata:

    PUT logstash-apacheaccesslog*/_mapping/logs/
    {
      "properties": {
        "verb": {
          "type": "text",
          "norms": false,
          "fielddata": true
        }
      }
    }

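    2. Aggregate on the clientip field. The mapping above enables fielddata on verb; clientip needs the same setting if it is mapped as text.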
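
    Step 2 is a terms aggregation like the first example. A minimal sketch of what it could look like, assuming fielddata has been enabled on clientip; the aggregation name clientip_counts, the bucket size of 10, and both helper names are made up for illustration:

```python
def clientip_agg_body(size=10):
    """Build a terms aggregation on clientip; "size": 0 skips the hits."""
    return {
        "size": 0,
        "aggs": {
            "clientip_counts": {
                "terms": {"field": "clientip", "size": size}
            }
        },
    }


def bucket_counts(response, agg_name="clientip_counts"):
    """Turn a terms aggregation response into (key, doc_count) pairs."""
    buckets = response.get("aggregations", {}).get(agg_name, {}).get("buckets", [])
    return [(b["key"], b["doc_count"]) for b in buckets]
```

    The body would be sent with GET logstash-apacheaccesslog*/_search, and bucket_counts applied to the parsed JSON response.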

     
  • Original article: https://www.cnblogs.com/zhangmingcheng/p/7590743.html