Collector Type: Agent

Category: Application Monitors

Application Name: nvidiagpumonitor

G2 Monitor Name: Agent G2 - Nvidia Gpu Monitor

Global Template Name: Agent G2 - Linux - Nvidia GPU Monitoring

Supported DCGM Version: 3.1.7

Configuration Parameters

| Name | Description | Default Value |
| --- | --- | --- |
| Namespace | Namespace in which the DCGM exporter is running | gpu-operator |
| Port | Port on which metrics are exported | 9400 |
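
As a rough illustration of how the two parameters above could combine into a scrape target, the following Python sketch builds an in-cluster metrics URL. The service name `nvidia-dcgm-exporter`, the `/metrics` path, and the helper names `ExporterConfig` and `metrics_url` are assumptions made for this example; the agent's actual discovery logic is not described here and may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExporterConfig:
    """Mirrors the two configuration parameters and their documented defaults."""
    namespace: str = "gpu-operator"
    port: int = 9400


def metrics_url(config: ExporterConfig, service: str = "nvidia-dcgm-exporter") -> str:
    """Build an in-cluster URL for the exporter's metrics endpoint.

    Assumption: the exporter is exposed through a Kubernetes Service with the
    given (hypothetical) name and serves Prometheus text on /metrics.
    """
    return f"http://{service}.{config.namespace}.svc:{config.port}/metrics"


if __name__ == "__main__":
    print(metrics_url(ExporterConfig()))
    print(metrics_url(ExporterConfig(namespace="monitoring", port=9500)))
```

If the exporter runs in a different namespace or on a different port, only the two configuration values need to change.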

Collected Metrics

| Monitor Name | Display Name | Description |
| --- | --- | --- |
| nvidia_dcgm_power_usage | Nvidia Dcgm Power Usage | Power draw |
| nvidia_dcgm_mem_clock | Nvidia Dcgm Mem Clock Freq | Memory clock frequency |
| nvidia_dcgm_mem_copy_util | Nvidia Dcgm Mem Util | Memory utilization |
| nvidia_dcgm_fb_mem_used | Nvidia Dcgm Framebuffer Memory Used | Framebuffer memory used |
| nvidia_dcgm_gpu_temp | Nvidia Dcgm Gpu Temp | GPU temperature |
| nvidia_dcgm_memory_temp | Nvidia Dcgm Memory Temp | Memory temperature |
| nvidia_dcgm_gpu_util | Nvidia Dcgm Gpu Util | GPU utilization |
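
The metrics listed above could plausibly be derived from the exporter's Prometheus text output, as in the Python sketch below. The mapping from DCGM field names (such as `DCGM_FI_DEV_GPU_TEMP`) to monitor names is an assumption inferred from the descriptions and is not confirmed by this document. The sample payload and the helper `parse_dcgm_metrics` are hypothetical.

```python
import re

# Assumed mapping from DCGM exporter field names to the monitor names above.
DCGM_TO_MONITOR = {
    "DCGM_FI_DEV_POWER_USAGE": "nvidia_dcgm_power_usage",
    "DCGM_FI_DEV_MEM_CLOCK": "nvidia_dcgm_mem_clock",
    "DCGM_FI_DEV_MEM_COPY_UTIL": "nvidia_dcgm_mem_copy_util",
    "DCGM_FI_DEV_FB_USED": "nvidia_dcgm_fb_mem_used",
    "DCGM_FI_DEV_GPU_TEMP": "nvidia_dcgm_gpu_temp",
    "DCGM_FI_DEV_MEMORY_TEMP": "nvidia_dcgm_memory_temp",
    "DCGM_FI_DEV_GPU_UTIL": "nvidia_dcgm_gpu_util",
}

# One sample line: metric name, optional {labels}, then the value.
SAMPLE_RE = re.compile(
    r'^(?P<name>[A-Za-z_:][A-Za-z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)'
)
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_dcgm_metrics(text: str) -> dict:
    """Return {monitor_name: {gpu_index: value}} for the mapped metrics only."""
    readings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks, HELP and TYPE lines
            continue
        match = SAMPLE_RE.match(line)
        if match is None:
            continue
        monitor = DCGM_TO_MONITOR.get(match.group("name"))
        if monitor is None:  # metric not in the collected list
            continue
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        gpu = labels.get("gpu", "unknown")
        readings.setdefault(monitor, {})[gpu] = float(match.group("value"))
    return readings


# Hypothetical exporter output, for illustration only.
SAMPLE = """\
# TYPE DCGM_FI_DEV_GPU_TEMP gauge
DCGM_FI_DEV_GPU_TEMP{gpu="0",UUID="GPU-hypothetical-0"} 45
DCGM_FI_DEV_POWER_USAGE{gpu="0",UUID="GPU-hypothetical-0"} 61.5
DCGM_FI_DEV_SM_CLOCK{gpu="0",UUID="GPU-hypothetical-0"} 1410
"""

if __name__ == "__main__":
    print(parse_dcgm_metrics(SAMPLE))
```

Metrics outside the mapping (here `DCGM_FI_DEV_SM_CLOCK`) are ignored, so the result contains only entries that correspond to the monitors in the table.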