Thursday, August 1, 2013

Turning off and on ECC RAM for NVIDIA GP-GPU Cards

From NVIDIA Developer site.

Turn off ECC (C2050 and later). ECC can cost you up to 10% in performance and hurts parallel scaling. You should verify that your GPUs are working correctly, and not giving ECC errors for example before attempting this. You can turn this off on Fermi based cards and later by running the following command for each GPU ID as root, followed by a reboot:
Extensive testing of AMBER on a wide range of hardware has established that ECC has little to no benefit on the reliability of AMBER simulations. This is part of the reason it is acceptable (see recommended hardware) to use the GeForce gaming cards for AMBER simulations.

1. To Turn off the ECC RAM, just do a
# nvidia-smi -g 0 --ecc-config=0
(repeat with -g x for each GPU ID)

2. To Turn back on ECC RAM, just do
# nvidia-smi -g 0 --ecc-config=1
(repeat with -g x for each GPU ID)

No comments: