prometheus的坑

prometheus是一个用于监控k8s集群状态的工具．今天在主机上配置这个东西，遇到了一个坑，调查了一段时间才解决，记之．

首先，根据网上的教程，利用helm安装这个东西很方便，只要三条指令（ref:https://itnext.io/kubernetes-monitoring-with-prometheus-in-15-minutes-8e54d1de2e13）

$ helm repo add coreos https://s3-eu-west-1.amazonaws.com/coreos-charts/stable/

$ helm install coreos/prometheus-operator --name prometheus-operator --namespace monitoring

$ helm install coreos/kube-prometheus --name kube-prometheus --set global.rbacEnable=true --namespace monitoring

但是，监控系统却没有正确的启动．经过一番调查，发现是有两个pod挂了，切到他们的container里面，进一步发现挂掉的container的

log信息是相同的：

再经过一番调查，在prometheus的文档中发现下面这段话：

github.com/coreos/prometheus-operator/vendor/github.com/fsnotify/fsnotify/README.md

How many files can be watched at once?

There are OS-specific limits as to how many watches can be created:

Linux: /proc/sys/fs/inotify/max_user_watches contains the limit, reaching this limit results in a "no space left on device" error.
BSD / OSX: sysctl variables "kern.maxfiles" and "kern.maxfilesperproc", reaching these limits results in a "too many open files" error.

原来是要达到了系统所允许的watch文件数目的上限．修改文件/proc/sys/fs/inotify/max_user_watches contains的值，再次部署，成功．

相关阅读:
四种访问修饰符详解（推荐）
三层架构中DAL层Sqlhelper怎样快速掌握？（常用）
ASP.NET中最常用的验证控件使用方法（推荐）
.NetFrom验证方便的webconfig 配置及前台使用（推荐）
CefSharp访问需要认证网页或接口(在Request的Headers中添加认证Token)
CentOS7中配置vsftpd
CentOS7下安装RabbitMQ
CentOS7下让Asp.Net Core的网站自动运行
Winform下的Combox根据值来选中项
golang简单实现jwt验证(beego、xorm、jwt)

原文地址：https://www.cnblogs.com/elnino/p/9707890.html