Configuring node_exporter collectors with blacklists and whitelists

zhanjie.me
2024-08-06

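node_exporter ships with some collectors enabled by default and others disabled by default, mainly for the following reasons:

  • Resource usage and performance: some collectors can consume a noticeable amount of CPU, memory, or network bandwidth. Leaving them off by default avoids an unnecessary performance cost when they are not needed.
    For example, detailed hardware-monitoring collectors that most deployments do not depend on can stay disabled to save resources.
  • Generality versus specificity: collectors that are on by default expose information that is useful in most node-monitoring scenarios, such as system load and memory usage. Collectors that are off by default target particular environments or requirements that not every user has.
    For example, a storage server may need a collector for detailed disk information, while an ordinary compute node may not.
  • Data volume and storage: some collectors produce a large number of time series. Enabling them by default would raise storage costs and make queries and analysis more complex.
  • Security and privacy: some collectors may expose security- or privacy-sensitive information. Leaving them off by default ensures this data is not collected unless it is needed.

Each collector can be turned on or off to match the requirements of a given environment, so that a node exposes only the metrics that are actually needed.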

Project page

Startup log

Check the startup log to see which collectors are enabled by default:

Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:106 msg="Enabled collectors"
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=arp
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=bcache
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=bonding
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=btrfs
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=conntrack
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=cpu
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=cpufreq
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=diskstats
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=edac
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=entropy
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=fibrechannel
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=filefd
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=filesystem
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=hwmon
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=infiniband
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=ipvs
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=loadavg
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=mdadm
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=meminfo
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=netclass
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=netdev
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=netstat
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=nfs
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=nfsd
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=powersupplyclass
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=pressure
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=rapl
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=schedstat
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.525Z caller=node_exporter.go:113 collector=sockstat
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=softnet
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=stat
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=textfile
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=thermal_zone
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=time
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=timex
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=udp_queues
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=uname
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=vmstat
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=xfs
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:113 collector=zfs
Aug  6 15:01:42 prome-master node_exporter: level=info ts=2023-08-06T07:01:42.526Z caller=node_exporter.go:195 msg="Listening on" address=:9100
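
With dozens of lines in the log, it can help to reduce it to just the collector names. The following is a minimal sketch, assuming the logfmt layout shown above, where each relevant line carries a collector=<name> field. The function name enabled_collectors is made up for illustration and is not part of node_exporter.

```shell
# Read node_exporter startup log lines on stdin and print one collector name per line.
# Assumption: each relevant line contains a " collector=<name>" field, as in the log above.
enabled_collectors() {
    # -n with the p flag prints only the lines where the substitution matched,
    # so the 'msg="Enabled collectors"' and 'msg="Listening on"' lines are skipped.
    sed -n 's/.*[[:space:]]collector=\([A-Za-z0-9_]*\).*/\1/p' | sort -u
}
```

If the log lines are saved in a file (the name startup.log is hypothetical), the function can be used as `enabled_collectors < startup.log`.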

Fetching metrics through the HTTP API

[root@prome-master ~]# curl -s localhost:9100/metrics|grep node_|head -20
# HELP node_arp_entries ARP entries by device
# TYPE node_arp_entries gauge
node_arp_entries{device="ens192"} 6
# HELP node_boot_time_seconds Node boot time, in unixtime.
# TYPE node_boot_time_seconds gauge
node_boot_time_seconds 1.722892555e+09
# HELP node_context_switches_total Total number of context switches.
# TYPE node_context_switches_total counter
node_context_switches_total 7.148688e+06
# HELP node_cooling_device_cur_state Current throttle state of the cooling device
# TYPE node_cooling_device_cur_state gauge
node_cooling_device_cur_state{name="0",type="Processor"} 0
node_cooling_device_cur_state{name="1",type="Processor"} 0
node_cooling_device_cur_state{name="2",type="Processor"} 0
node_cooling_device_cur_state{name="3",type="Processor"} 0
# HELP node_cooling_device_max_state Maximum throttle state of the cooling device
# TYPE node_cooling_device_max_state gauge
node_cooling_device_max_state{name="0",type="Processor"} 7
node_cooling_device_max_state{name="1",type="Processor"} 7
node_cooling_device_max_state{name="2",type="Processor"} 7
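
The grep above matches every sample line as well as the comments, so the first 20 lines cover only a handful of metrics. A more compact overview is to list each metric family once, using the `# TYPE` lines of the text exposition format. This is a small sketch; the function name metric_families is hypothetical.

```shell
# Read Prometheus text exposition on stdin and print "<metric name> <type>"
# for every metric family whose name starts with node_.
metric_families() {
    # A TYPE line has the form: # TYPE <name> <type>
    awk '$1 == "#" && $2 == "TYPE" && $3 ~ /^node_/ { print $3, $4 }'
}
```

For example: `curl -s localhost:9100/metrics | metric_families`.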

Collectors enabled by default

[Image: image-abrobkun.png]

Blacklist: disabling a collector that is enabled by default

Use the --no-collector.<name> flag.

# Before disabling the cpu collector
[root@prome_master_01 node_exporter]# curl  -s  localhost:9100/metrics |grep node_cpu
# HELP node_cpu_guest_seconds_total Seconds the CPUs spent in guests (VMs) for each mode.
# TYPE node_cpu_guest_seconds_total counter
node_cpu_guest_seconds_total{cpu="0",mode="nice"} 0
node_cpu_guest_seconds_total{cpu="0",mode="user"} 0
node_cpu_guest_seconds_total{cpu="1",mode="nice"} 0
node_cpu_guest_seconds_total{cpu="1",mode="user"} 0
node_cpu_guest_seconds_total{cpu="2",mode="nice"} 0
node_cpu_guest_seconds_total{cpu="2",mode="user"} 0
node_cpu_guest_seconds_total{cpu="3",mode="nice"} 0
node_cpu_guest_seconds_total{cpu="3",mode="user"} 0
# HELP node_cpu_seconds_total Seconds the CPUs spent in each mode.
# TYPE node_cpu_seconds_total counter
node_cpu_seconds_total{cpu="0",mode="idle"} 17691.27
node_cpu_seconds_total{cpu="0",mode="iowait"} 8.9
node_cpu_seconds_total{cpu="0",mode="irq"} 0
node_cpu_seconds_total{cpu="0",mode="nice"} 0.32
node_cpu_seconds_total{cpu="0",mode="softirq"} 0.28
node_cpu_seconds_total{cpu="0",mode="steal"} 2.7

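Disabling the cpu collector

  • Start node_exporter with the collector disabled: ./node_exporter --no-collector.cpu
  • Query again; the node_cpu_seconds_total series should no longer appear: curl -s localhost:9100/metrics |grep node_cpu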
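
To make the before/after comparison explicit, the samples for a given metric prefix can be counted instead of read by eye. The helper below is a sketch; count_samples is a hypothetical name. It takes the prefix as an argument because metric names do not always match the collector name, and other collectors (cpufreq, for instance) may export metrics that also begin with node_cpu_.

```shell
# Read Prometheus text exposition on stdin and print how many sample lines
# start with the given metric-name prefix. "# HELP" and "# TYPE" lines are
# not counted because they start with "#".
count_samples() {
    prefix=$1
    # grep -c prints 0 but exits non-zero when nothing matches; "|| true"
    # keeps the function's exit status at 0 in that case.
    grep -c "^${prefix}" || true
}
```

With the cpu collector disabled, `curl -s localhost:9100/metrics | count_samples node_cpu_seconds_total` is expected to print 0.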

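Whitelist: disable all default collectors and enable only selected ones

Use --collector.disable-defaults together with one --collector.<name> flag for each collector to enable.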

# Enable only the meminfo collector
./node_exporter --collector.disable-defaults --collector.meminfo

# Enable only the meminfo and cpu collectors
./node_exporter --collector.disable-defaults --collector.meminfo --collector.cpu
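
When the whitelist grows, typing one flag per collector becomes error-prone. The sketch below builds the flag string from a list of collector names. The helper name whitelist_flags is made up for illustration, and it does not check that the names are valid collectors.

```shell
# Print the node_exporter flags that turn off all default collectors and
# enable only the collectors named in the arguments.
whitelist_flags() {
    printf '%s' '--collector.disable-defaults'
    for name in "$@"; do
        printf ' --collector.%s' "$name"
    done
    printf '\n'
}
```

For example, `./node_exporter $(whitelist_flags meminfo cpu)` expands to the second command above. The command substitution is left unquoted on purpose so that the shell splits it into separate flags.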