• Monitoring CoreDNS with Prometheus Operator


    1. Create a headless Service that exposes the CoreDNS metrics port (9153)

    apiVersion: v1
    kind: Service
    metadata:
      labels:
        app: prometheus-operator-coredns
        chart: prometheus-operator-7.4.0
        heritage: Tiller
        jobLabel: coredns
        release: pharos-prometheus-operator
      name: pharos-prometheus-operator-coredns
      namespace: kube-system
    spec:
      clusterIP: None
      ports:
      - name: http-metrics
        port: 9153
        protocol: TCP
        targetPort: 9153
      selector:
        k8s-app: kube-dns
      sessionAffinity: None
      type: ClusterIP
    

    2. Create the ServiceMonitor that selects this Service in kube-system

    apiVersion: monitoring.coreos.com/v1
    kind: ServiceMonitor
    metadata:
      labels:
        app: prometheus-operator-coredns
        chart: prometheus-operator-7.4.0
        heritage: Tiller
        release: pharos-prometheus-operator
      name: pharos-prometheus-operator-coredns
      namespace: cattle-prometheus
    spec:
      endpoints:
      - bearerTokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
        port: http-metrics
      jobLabel: jobLabel
      namespaceSelector:
        matchNames:
        - kube-system
      selector:
        matchLabels:
          app: prometheus-operator-coredns
          release: pharos-prometheus-operator
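
The ServiceMonitor only produces scrape targets if its selector, namespace list, and port name line up with the Service from step 1. The sketch below is a simplified model of that matching, not the operator's actual code; the dictionaries are hand-copied from the two manifests, and the helper names `selects` and `job_name` are hypothetical.

```python
# Simplified model of how the ServiceMonitor above relates to the Service.
# The field values are copied from the manifests; the dict layout and the
# helper names are assumptions made for this illustration only.
service = {
    "name": "pharos-prometheus-operator-coredns",
    "namespace": "kube-system",
    "labels": {
        "app": "prometheus-operator-coredns",
        "chart": "prometheus-operator-7.4.0",
        "heritage": "Tiller",
        "jobLabel": "coredns",
        "release": "pharos-prometheus-operator",
    },
    "port_names": ["http-metrics"],
}

service_monitor = {
    "namespace_match_names": ["kube-system"],
    "match_labels": {
        "app": "prometheus-operator-coredns",
        "release": "pharos-prometheus-operator",
    },
    "endpoint_ports": ["http-metrics"],
    "job_label": "jobLabel",
}


def selects(monitor: dict, svc: dict) -> bool:
    """Return True if the monitor would find a scrapeable port on the Service."""
    in_namespace = svc["namespace"] in monitor["namespace_match_names"]
    # Every matchLabels entry must be present on the Service with the same value.
    labels_match = all(
        svc["labels"].get(key) == value
        for key, value in monitor["match_labels"].items()
    )
    # Endpoints refer to Service ports by name.
    ports_exist = all(p in svc["port_names"] for p in monitor["endpoint_ports"])
    return in_namespace and labels_match and ports_exist


def job_name(monitor: dict, svc: dict) -> str:
    """jobLabel names a Service label whose value becomes the job name;
    if that label is missing, the Service name is used instead."""
    return svc["labels"].get(monitor["job_label"], svc["name"])


print(selects(service_monitor, service))
print(job_name(service_monitor, service))
```

If either the `release` label or the port name `http-metrics` differed between the two manifests, this check would fail, which is a common reason for a target not showing up in Prometheus.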
    

    3. Create the PrometheusRule resources

    apiVersion: monitoring.coreos.com/v1
    kind: PrometheusRule
    metadata:
      annotations:
        prometheus-operator-validated: "true"
      labels:
        app: prometheus-operator
        release: pharos-prometheus-operator
      name: coredns-panic
      namespace: cattle-prometheus
    spec:
      groups:
      - name: coredns-panic
        rules:
        - alert: coredns-panic
          annotations:
            description: CoreDNS panic count is {{ $value }}, please check!
          expr: sum by(instance, job, namespace) (coredns_panic_count_total) >= 10
          for: 15m
          labels:
            alertname: coredns-panic
            severity: warning
    ---
    apiVersion: monitoring.coreos.com/v1
    kind: PrometheusRule
    metadata:
      annotations:
        prometheus-operator-validated: "true"
      labels:
        app: prometheus-operator
        release: pharos-prometheus-operator
      name: coredns-responses
      namespace: cattle-prometheus
    spec:
      groups:
      - name: coredns-responses
        rules:
        - alert: coredns-responses
          annotations:
            description: CoreDNS response time is high, please check!
          expr: histogram_quantile(0.99, sum by(le, job) (rate(coredns_dns_request_duration_seconds_bucket[5m]))) * 10 >= 3
          for: 15m
          labels:
            alertname: coredns-responses
            severity: warning
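
The `coredns-responses` expression multiplies the 99th-percentile latency by 10 and compares it with 3, which is the same as asking whether p99 is at least 0.3 seconds. The sketch below works through that arithmetic with a simplified version of the linear interpolation that `histogram_quantile` performs. The bucket bounds and counts are made-up illustration values, not real CoreDNS data, and the function is an approximation of the PromQL behaviour rather than a reimplementation of it.

```python
import math


def histogram_quantile(q: float, buckets: list) -> float:
    """Estimate the q-quantile (0 < q <= 1) from cumulative histogram buckets.

    buckets is a list of (upper_bound, cumulative_count) pairs sorted by
    upper_bound, with the last upper bound being +Inf.
    """
    total = buckets[-1][1]
    if total == 0:
        return math.nan
    rank = q * total
    # First bucket whose cumulative count reaches the rank.
    idx = next(i for i, (_, count) in enumerate(buckets) if count >= rank)
    if idx == len(buckets) - 1:
        # Quantile falls in the +Inf bucket: report the highest finite bound.
        return buckets[-2][0]
    upper, count = buckets[idx]
    lower, below = (0.0, 0.0) if idx == 0 else buckets[idx - 1]
    # Linear interpolation inside the selected bucket.
    return lower + (upper - lower) * (rank - below) / (count - below)


# Hypothetical per-second request rates per bucket (cumulative), in seconds.
buckets = [(0.1, 90.0), (0.25, 95.0), (0.5, 100.0), (math.inf, 100.0)]

p99 = histogram_quantile(0.99, buckets)  # about 0.45 seconds for this data
fires = p99 * 10 >= 3                    # same condition as p99 >= 0.3
print(p99, fires)
```

With these sample buckets the condition holds, so the alert would become pending and fire only if it stayed true for the full `for: 15m` window.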
    
  • Original article: https://www.cnblogs.com/Wshile/p/13268510.html