（转）es进行聚合操作时提示Fielddata is disabled on text fields by default

根据es官网的文档执行

GET /megacorp/employee/_search
{
  "aggs": {
    "all_interests": {
      "terms": { "field": "interests" }
    }
  }
}

这个例子时，报错

{
  "error": {
    "root_cause": [
      {
        "type": "illegal_argument_exception",
        "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
      }
    ],
    "type": "search_phase_execution_exception",
    "reason": "all shards failed",
    "phase": "query",
    "grouped": true,
    "failed_shards": [
      {
        "shard": 0,
        "index": "megacorp",
        "node": "-Md3f007Q3G6HtdnkXoRiA",
        "reason": {
          "type": "illegal_argument_exception",
          "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
        }
      }
    ],
    "caused_by": {
      "type": "illegal_argument_exception",
      "reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory."
    }
  },
  "status": 400
}

搜了一下应该是5.x后对排序，聚合这些操作用单独的数据结构(fielddata)缓存到内存里了，需要单独开启，官方解释在此fielddata

简单来说就是在聚合前执行如下操作

PUT megacorp/_mapping/employee/
{
  "properties": {
    "interests": { 
      "type":     "text",
      "fielddata": true
    }
  }
}

PS:执行上面操作前，先GET megacorp/_mapping/employee/查看mapping结构，然后执行上述命令，贴一下我聚合logstash读取tomcat.log到es里cilentip字段的步骤：

1.首先先GET logstash-apacheaccesslog*/_mapping/logs/查看mapping结构

PUT logstash-apacheaccesslog*/_mapping/logs/

{

　　"properties": {
　　 "verb": {
　　　　"type": "text",
　　　　"norms": false,
　　　　"fielddata": true
　　　 }
　　}

}

2、对clientip字段进行聚合

转自：http://blog.csdn.net/u011403655/article/details/71107415

相关阅读:
linux —— 学习笔记（汇总）
linux —— ubuntu 初次安装问题
更改CMD默认的初始路径
深入浅出理解linux inode结构
重拾简单的linux指令之info 【转】
Python 中数据的序列化和反序列化（json处理）
day07
Python 的反射机制
Python 的 __new__()方法与实例化
Classes as objects

原文地址：https://www.cnblogs.com/zhangmingcheng/p/7590743.html