Abinit 9.0.4, linalg segfault on cluster
Posted: Mon Aug 17, 2020 5:41 pm
Hello everybody,
I am trying to compile the git version abinit 9.0.4 on the cluster beluga.
I am able to configure, make and install. I can ask abinit --version or abinit --build. However, whenever I try to start a simulation, I get a Segmentation fault inside the linalg module even while using 1 proc. I've added the stackTrace at the end of the post. Also, the output, ac9 file and config log are added as an attachments.
I've tried to configure directly in the shell and from an interactive session. This is the command I use : ../configure --with-mpi -enable-openmp --with-config-file=olivier.ac9 --prefix="/path/to/Installation/folder/"
Inside the log, I'm getting 2 errors in the linalg section :
1. I don't have Elpa.
2. I don't have <lapacke.h> while trying to use LAPACKE C API support.
Does somebody have an idea where the error could be coming from and how to fix it?
Thank you,
Olivier
Note : I've removed mpi-io to help pinpoint the error.
==== backtrace ====
0 0x0000000000010e90 __funlockfile() ???:0
1 0x0000000000097201 PMPI_Comm_size() ???:0
2 0x0000000000029de9 MKLMPI_Comm_size() ???:0
3 0x0000000000027fb1 mkl_blacs_init() ???:0
4 0x0000000000027ef8 Cblacs_pinfo() ???:0
5 0x00000000000187f9 blacs_gridmap_() ???:0
6 0x00000000000181ce blacs_gridinit_() ???:0
7 0x00000000025bc394 m_slk_mp_init_scalapack_() ???:0
8 0x000000000252a26b m_abi_linalg_mp_abi_linalg_init_() ???:0
9 0x000000000041bda7 m_driver_mp_driver_() ???:0
10 0x000000000040b687 MAIN__() ???:0
11 0x000000000040a0fe main() ???:0
12 0x00000000000202e0 __libc_start_main() ???:0
13 0x000000000040a01a _start() /tmp/nix-build-glibc-2.24.drv-0/glibc-2.24/csu/../sysdeps/x86_64/start.S:120
===================
I am trying to compile the git version abinit 9.0.4 on the cluster beluga.
I am able to configure, make and install. I can ask abinit --version or abinit --build. However, whenever I try to start a simulation, I get a Segmentation fault inside the linalg module even while using 1 proc. I've added the stackTrace at the end of the post. Also, the output, ac9 file and config log are added as an attachments.
I've tried to configure directly in the shell and from an interactive session. This is the command I use : ../configure --with-mpi -enable-openmp --with-config-file=olivier.ac9 --prefix="/path/to/Installation/folder/"
Inside the log, I'm getting 2 errors in the linalg section :
1. I don't have Elpa.
2. I don't have <lapacke.h> while trying to use LAPACKE C API support.
Does somebody have an idea where the error could be coming from and how to fix it?
Thank you,
Olivier
Note : I've removed mpi-io to help pinpoint the error.
==== backtrace ====
0 0x0000000000010e90 __funlockfile() ???:0
1 0x0000000000097201 PMPI_Comm_size() ???:0
2 0x0000000000029de9 MKLMPI_Comm_size() ???:0
3 0x0000000000027fb1 mkl_blacs_init() ???:0
4 0x0000000000027ef8 Cblacs_pinfo() ???:0
5 0x00000000000187f9 blacs_gridmap_() ???:0
6 0x00000000000181ce blacs_gridinit_() ???:0
7 0x00000000025bc394 m_slk_mp_init_scalapack_() ???:0
8 0x000000000252a26b m_abi_linalg_mp_abi_linalg_init_() ???:0
9 0x000000000041bda7 m_driver_mp_driver_() ???:0
10 0x000000000040b687 MAIN__() ???:0
11 0x000000000040a0fe main() ???:0
12 0x00000000000202e0 __libc_start_main() ???:0
13 0x000000000040a01a _start() /tmp/nix-build-glibc-2.24.drv-0/glibc-2.24/csu/../sysdeps/x86_64/start.S:120
===================