CUDA DOUBLE PRECISION
Posted: Sat May 14, 2011 6:41 pm
Hello to all,
At the moment I am running some tests on the BUDA machine, which has 4 Tesla GPUs (thanks to Jean Michel and Xavier).
I had some doubts about my CUDA code because nothing worked in DOUBLE PRECISION.
This was strange, because my last DP run passed all the RECURSION tests.
After many tests trying to debug DP, I remembered that my old DP results were obtained with the compilation option for compute capability 1.3:
(NVCCFLAGS+= --arch sm_13)
This option enables "compute capability" 1.3, which includes support for double.
Otherwise every double is demoted to float, which causes some problems.
I added this option manually and everything works fine: the tests are successful on the GPU.
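To make the rule above concrete, here is a minimal C++ sketch of the compute-capability check. The function name supportsDouble is hypothetical and is not part of our code or of the CUDA API. In a real program the major and minor numbers would come from the major and minor fields of cudaDeviceProp, filled in by cudaGetDeviceProperties.

```cpp
// Minimal sketch (hypothetical helper, not from the project sources):
// returns true when a device of compute capability major.minor
// supports double precision in hardware, i.e. capability >= 1.3.
bool supportsDouble(int major, int minor) {
    if (major > 1) {
        return true;               // 2.0 and later always include double
    }
    return major == 1 && minor >= 3;   // 1.3 is the first 1.x with double
}
```

For example, supportsDouble(1, 3) is true while supportsDouble(1, 2) is false, so a DP build targeting anything below sm_13 would silently fall back to float.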
Yann, would it be possible to do that automatically when CUDA DP is chosen?
Regards,
Marco