Hello, I followed the steps described in this repository, but I am getting the error below:
error from server (NotFound): the server could not find the metric DCGM_FI_DEV_GPU_UTIL_AVG for services gpu-api
I get this error when I run the command below. I can see this metric in the DCGM pod. I also tried the step below, but it times out.
Could you please help me with this?
If there is no value, connect to the DCGM exporter pod, and check connectivity with wget http://:
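For reference, the connectivity check in the step above could be sketched in Python roughly as follows. This is only an illustration, not something from the repository: the function names `metric_present` and `fetch_metrics` are hypothetical, and the exporter URL has to be supplied by the caller because the host and port are not shown in the step. It assumes the exporter serves the standard Prometheus text format.

```python
import urllib.request


def metric_present(exposition_text: str, metric_name: str) -> bool:
    """Return True if any sample line in Prometheus text format uses metric_name.

    Hypothetical helper for illustration only.
    """
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and # HELP / # TYPE comments
        # A sample line looks like `name{labels} value` or `name value`.
        head = line.split("{", 1)[0].split()
        if head and head[0] == metric_name:
            return True
    return False


def fetch_metrics(url: str, timeout_s: float = 5.0) -> str:
    """Fetch the exporter's metrics page.

    Raises an exception on timeout or connection failure, which would
    correspond to the timeout seen with wget. The URL is an assumption
    supplied by the caller.
    """
    with urllib.request.urlopen(url, timeout=timeout_s) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

Calling `metric_present(fetch_metrics(url), "DCGM_FI_DEV_GPU_UTIL_AVG")` from inside the cluster would separate the two possible failures: an exception from `fetch_metrics` points to a connectivity problem, while a `False` result means the endpoint is reachable but does not expose a metric with that exact name.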