FFT Parallelization not working with 6.6.3 + ifort
Posted: Fri Jun 24, 2011 5:49 pm
Hi everyone-
I am having a lot of trouble compiling abinit 6.6.3 for fft parallelization and need help! I am using ifort 11.1.075 (the updated version) and I can't get fft parallelization working. I have large jobs (supercell calculations) that I need to run with limited memory, so I'm thinking I need to split the fft work over more processors. If I have any other options, aside from using fft parallelization, please let me know!
My compilation has no problems with k-point parallelizing, but using npband seems to break it. I have tried many things including updating fortran, using mkl, and adding a fortran heap-arrays flag, based on the suggestions in these posts (viewtopic.php?f=2&t=655, viewtopic.php?f=2&t=1000, viewtopic.php?f=3&t=1139&p=3665&hilit=fftw3#p3665). I have also tried using ethernet rather than myrinet, the standard on our cluster. Nothing has worked and I am feeling quite desperate.
Here is the most recent configuration command I have used:
../configure --enable-64bit-flags --enable-mpi --enable-mpi-io --with-linalg-flavor=mkl --with-linalg-incs="-I/share/apps/intel/Compiler/11.1/075/mkl/include" --with-linalg-libs="-L/share/apps/intel/Compiler/11.1/075/mkl/lib/em64t -lmkl_intel_lp64 -lmkl_blacs_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lmkl_blas95_lp64 -lmkl_lapack" --with-mpi-prefix=/share/apps/openmpi-intel11.1.075/ FCFLAGS="-heap-arrays 64"
When running with npband > 1, the code always stops on the first or second iteration, though the error can differ depending on the config setup. The config above gives this error:
ITER STEP NUMBER 1
vtorho : nnsclo_now= 2, note that nnsclo,dbl_nnsclo,istep= 0 0 1
starting lobpcg, with nblockbd,mpi_enreg%nproc_band 45 6
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
abinip-6.6.3-i11. 0000000000DDC908 Unknown Unknown Unknown
abinip-6.6.3-i11. 000000000067CD6A Unknown Unknown Unknown
abinip-6.6.3-i11. 00000000006149C6 Unknown Unknown Unknown
abinip-6.6.3-i11. 00000000005FAA33 Unknown Unknown Unknown
If anyone can help me I would greatly appreciate it! Thanks in advance.
Best,
Aaron Wolf
I am having a lot of trouble compiling abinit 6.6.3 for fft parallelization and need help! I am using ifort 11.1.075 (the updated version) and I can't get fft parallelization working. I have large jobs (supercell calculations) that I need to run with limited memory, so I'm thinking I need to split the fft work over more processors. If I have any other options, aside from using fft parallelization, please let me know!
My compilation has no problems with k-point parallelizing, but using npband seems to break it. I have tried many things including updating fortran, using mkl, and adding a fortran heap-arrays flag, based on the suggestions in these posts (viewtopic.php?f=2&t=655, viewtopic.php?f=2&t=1000, viewtopic.php?f=3&t=1139&p=3665&hilit=fftw3#p3665). I have also tried using ethernet rather than myrinet, the standard on our cluster. Nothing has worked and I am feeling quite desperate.
Here is the most recent configuration command I have used:
../configure --enable-64bit-flags --enable-mpi --enable-mpi-io --with-linalg-flavor=mkl --with-linalg-incs="-I/share/apps/intel/Compiler/11.1/075/mkl/include" --with-linalg-libs="-L/share/apps/intel/Compiler/11.1/075/mkl/lib/em64t -lmkl_intel_lp64 -lmkl_blacs_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lmkl_blas95_lp64 -lmkl_lapack" --with-mpi-prefix=/share/apps/openmpi-intel11.1.075/ FCFLAGS="-heap-arrays 64"
When running with npband > 1, the code always stops on the first or second iteration, though the error can differ depending on the config setup. The config above gives this error:
ITER STEP NUMBER 1
vtorho : nnsclo_now= 2, note that nnsclo,dbl_nnsclo,istep= 0 0 1
starting lobpcg, with nblockbd,mpi_enreg%nproc_band 45 6
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
abinip-6.6.3-i11. 0000000000DDC908 Unknown Unknown Unknown
abinip-6.6.3-i11. 000000000067CD6A Unknown Unknown Unknown
abinip-6.6.3-i11. 00000000006149C6 Unknown Unknown Unknown
abinip-6.6.3-i11. 00000000005FAA33 Unknown Unknown Unknown
If anyone can help me I would greatly appreciate it! Thanks in advance.
Best,
Aaron Wolf