先看下面es查询语句
{ "size": 0, "aggs" : { "all_articleId" : { "terms" : { "field" : "articleId" } } } }
得到的结果:
该索引下有2w多条数据,经过聚合分桶后,也绝对不仅仅只是10个bucket,很显然,这似乎不是我想要的结果,经过查官方API发现下面一段话:
Edit Updating the answer based on @PhaedrusTheGreek comment. setting size:0 is deprecated in 2.x onwards, due to memory issues inflicted on your cluster with high-cardinality field values. You can read more about it in the github issue here . It is recommended to explicitly set reasonable value for size a number between 1 to 2147483647.
意思就是说:在聚合中设置 size:0 在2.x之后已被弃用,建议为 size 明确设置合理的值,一个介于1到2147483647之间的数字。之后我将语句改为如下
{ "size": 0, "aggs" : { "all_articleId" : { "terms" : { "field" : "articleId", "size": 20000 } } } }
所查询到的bucket有200多,正式自己要的结果