Skip to main content
Filter by
Sorted by
Tagged with
1 vote
1 answer
59 views

I’m trying to set up a LocalCUDACluster on WSL2 (Ubuntu 22.04) from Windows 11 for GPU computations. The cluster starts and runs, but performance is ~10× slower than running directly on the GPU, and ...
Marek Majoch's user avatar
0 votes
0 answers
116 views

I'm invoking nvmlDeviceGetCount() on a system with 2 GPUs, and it returns a device count of 0 GPUs - with no error. Why would it say that? Additional information: CUDA version: 12.6.68 (and bundled ...
einpoklum's user avatar
  • 138k
0 votes
1 answer
127 views

We have a CCTV system where we use NVIDIA GPUs for video decoding. Our current requirement is to monitor GPU decoding and memory usage, and if the usage reaches 80%, we need to automatically switch ...
Ranjith Ram's user avatar
2 votes
1 answer
430 views

In a project I'm using the nvml lib to get info about the GPU in a system. I use it to query the GPU name and GPU UUID. This happens cyclic 6 to 8 time per minute. I noticed a small memory leak which ...
Jakob's user avatar
  • 170
0 votes
1 answer
2k views

./nvml_lib: symbol lookup error: ./nvml_lib: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v3 nvcc --version nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2022 NVIDIA ...
fizwit's user avatar
  • 11
0 votes
0 answers
200 views

I am trying to build a C-code for NVML for A5000 GPU. I got a code from internet which is as below. #include <stdio.h> #include <nvml.h> ///usr/include/hwloc/nvml.h int main() { ...
Aravind D. Chakravarti's user avatar
1 vote
1 answer
739 views

I'm writing a program that monitors how processes use the GPU and I found an API provided by nvml, nvmlDeviceGetProcessUtilization. Acordding the comment of this API, It reads recent utilization of ...
nezhababa's user avatar
1 vote
1 answer
198 views

Using VS2022 the following code snippet works in debug mode but not in release mode: nvmlInit(); nvmlDevice_t devH; auto ret = nvmlDeviceGetHandleByIndex_v2(0, &devH); if (ret != NVML_SUCCESS) ...
poby's user avatar
  • 1,786
-1 votes
1 answer
154 views

I'm having trouble with the shape of a struct to pass to an NVML Library function via pinvoke. The struct contains a fixed size array and some unsigned long long's I'm not encountering any compiler ...
codingNewb's user avatar
0 votes
1 answer
199 views

I'm attempting to pass a uint array into the NVML function nvmlDeviceGetAccountingPids(Doc here) from C#, here's a minimum working sample of what I have so far: { public const string ...
codingNewb's user avatar
0 votes
1 answer
246 views

Finding that code examples for the nvml API for nvidia cards is just really sparse. Before any nvml calls could be conducted CMAKE required: target_link_libraries(04_nvml_testing "/usr/lib/x86_64-...
user794667's user avatar
2 votes
0 answers
156 views

is there any command to reset nvidia gpu after error happened in golang through golang-nvml? i only found how to get GPU infos API in golang-nvml like: nvml.DeviceGetCount() nvml....
LibAndLab's user avatar
1 vote
0 answers
222 views

I am running an ML inference for image recognition on the GPU using onnxruntime and I am seeing an upper limit for how much performance improvement batching of images is giving me - there is reduction ...
sn710's user avatar
  • 621
3 votes
0 answers
829 views

We are using Slurm 20.02 with NVML autodetect, and on some 8-GPU nodes with NVLink, 4-GPU jobs get allocated by Slurm in a surprising way that appears sub-optimal. On a system with 8 Nvidia A40 GPUs, ...
leilu's user avatar
  • 366
2 votes
4 answers
19k views

I don't know why nvidia-smi doesn't work what I need to do for fix it? I think my library and driver version is match but nvidia-smi dosen't recognize it test
Jonghoe Ku's user avatar
5 votes
1 answer
1k views

I use some NVIDIA Management Library features to produce metrics in my application. Every 1 second I call nvmlDeviceGetMemoryInfo() in a thread, and after a few minutes, in the output of Visual Studio,...
SamT's user avatar
  • 567
1 vote
1 answer
1k views

I am working with gpu's and want to get the serial numbers of the gpu's.In NVIDIA Management Library there is a function that I can use. The function prototype is. nvmlReturn_t nvmlDeviceGetSerial ( ...
user avatar
1 vote
1 answer
853 views

I want to get some basic GPU data: name, RAM size, and do temperature monitoring. From NVIDIA docs, it's not clear which library to use. Is NVAPI a legacy API which should be avoided?
Andrei Moiseev's user avatar
3 votes
1 answer
3k views

I would like to measure the GPU usage per process as done in Windows taskmgr.exe, but I have encountered several problems when attempting to use the pyNVML library. As a result, I have a few questions....
h1d3r00t's user avatar
2 votes
2 answers
648 views

I am developing a monitoring agent for GPU cards that is capable of providing real-time telemetry using CUDA and NVML libraries. I want to understand a little more about GPU core operation vs how ...
MyronStewart's user avatar
4 votes
1 answer
1k views

I would like to know what library AMD has that mimics the NVML counterpart of nvidia. What I want is to get temperature, powerusage, etc. in c++. Best regards!
yannickwe87's user avatar
3 votes
1 answer
4k views

I tried to get current power usage with the following command in Windows 10 x64: nvidia-smi.exe --format=csv,noheader --query-gpu=power.draw And got the next result: [Not Supported] I checked it ...
Ivan Kolesnikov's user avatar
3 votes
1 answer
8k views

I compiled a software (GROMACS 2016.3) using cmake (3.5.1) with the following flags: cmake .. -DGMX_BUILD_OWN_FFTW=ON -DREGRESSIONTEST_DOWNLOAD=ON -DGMX_MPI=on -DGMX_GPU=on -DGMX_OPENMP=on -...
Irene Marzuoli's user avatar
0 votes
0 answers
722 views

I'm trying to compile Perl bindings to Nvidia Management library (NVML). When I use the makefile, following errors appear: /usr/lib/powerpc64le-linux-gnu/perl/5.22/CORE/handy.h:117:34: error: ‘bool’ ...
user3509540's user avatar
1 vote
1 answer
812 views

I am writing a code to measure the power usage of an NVIDIA Tesla K20 GPU (Kepler architecture) periodically using the NVML API. Variables: nvmlReturn_t result; nvmlEnableState_t pmmode; ...
Kajal's user avatar
  • 611
1 vote
1 answer
7k views

I am implementing an example program with nvml library as shown at https://devtalk.nvidia.com/default/topic/504951/how-to-call-nvml-apis-/ The program is as follows: #include <stdio.h> #...
Pranjal's user avatar
  • 699
0 votes
1 answer
1k views

I am using Tesla K20c and measuring power with nvidia-smi as my application is run. My problem is power consumption does not reach a steady state but keeps rising. For example, if my application runs ...
Tania's user avatar
  • 438
3 votes
0 answers
737 views

Previously, I tried NVML by using the function nvmlDeviceGetUtilizationRates(). I test it by this way, while the collection is running, I excute a DFT ( the Kernel is organised as <7,32>) on Tesla ...
Loong Draw's user avatar
0 votes
2 answers
2k views

I installed NVML on Jetson TK1 and compiled a cuda program. The compilation does not show any error but when running it shows the error /NVML-installed-path/usr/src/gdk/nvml/lib//libnvidia-ml.so: ...
Barath's user avatar
  • 1
5 votes
1 answer
3k views

I am trying to get the information about the overall utilization of a GPU (mine is an NVIDIA Tesla K20, running on Linux) during a period of time. By "overall" I mean something like, how many ...
rsm's user avatar
  • 103
0 votes
2 answers
4k views

I need gpu information for my cuda project test. I am using nvml library, and I successfully get temperature information. But, nvml reports ERROR_NOT_SUPPORTED in nvmlDeviceGetUtilizationRates(). So ...
LocoField's user avatar
1 vote
1 answer
3k views

My machine has nvidia Tesla K20m gpu. I would like to know gpu utilzation, memory utilization, temperature and fan speed. So I have used nvidia-smi to know the details. Nvidia-smi log is as follows ==...
Alvin's user avatar
  • 980
1 vote
1 answer
1k views

I'm running an application using the NVML function nvmlDeviceGetPowerUsage(). The problem is that I always get the same number for different applications I'm running using on a TESLA M2050. Any ...
user1730250's user avatar
6 votes
4 answers
25k views

Recently a colleague needed to use NVML to query device information, so I downloaded the Tesla development kit 3.304.5 and copied the file nvml.h to /usr/include. To test, I compiled the example code ...
Brian R's user avatar
  • 785
2 votes
1 answer
7k views

I have been using NVML library to get the values of graphics and memory utilization for Rodinia benchmark suite. I observe that with different frequencies, the utilization of the same application ...
Vaibhav Sundriyal's user avatar
0 votes
1 answer
3k views

I learnt than nvidia-smi -ac can be used to change the clock rate of GPU cores and memory. Is nvidia-smi built upon the NVML library? What is its equivalent in NVML since I checked the document http:/...
Vaibhav Sundriyal's user avatar
3 votes
2 answers
9k views

I am trying to execute some CUDA code which happens to have some NVML library functions like nvmlSystemGetDriverVersion. But, when I try to compile the code it says nvml.h not found. How should I ...
Vaibhav Sundriyal's user avatar