Probel of memory with paral_kgb

Total energy, geometry optimization, DFT+U, spin....

Moderator: bguster

Locked
Boris
Posts: 128
Joined: Tue Feb 16, 2010 10:13 am
Location: France

Probel of memory with paral_kgb

Post by Boris » Fri Jun 17, 2011 7:25 pm

Hi all,

I have what seems a memory problem when I use paral_kgb=1. I get the following error message from abinit :

Code: Select all

 Biggest array : pawfgrtab%gylm(gr), with   4355.9783 MBytes.
-P-0000  leave_test : synchronization done...
 memana : allocated an array of   4355.978 Mbytes, for testing purposes.
-P-0000
-P-0000  memana : ERROR -
-P-0000   Test failed to allocate    8332.909 Mbytes
-P-0000   It is not worth to continue
-P-0000   Action : modify input variable to fit the available memory.
-P-0000   or increase limit on available memory.
-P-0000
-P-0000  leave_new : decision taken to exit ...


Then at the end of the LOG file:

Code: Select all

-P-0000  leave_new : calling XMPI_ABORT...
  MPI_ERROR_STRING: MPI_ERR_UNKNOWN: unknown error
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
with errorcode 14.

NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun has exited due to process rank 4 with PID 17727 on
node cja065.localdomain exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------
[cja065.localdomain:17716] 23 more processes have sent help message help-mpi-api.txt / mpi-abort
[cja065.localdomain:17716] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages


I tried different values for npkpt, npband or npfft but I always end up with the same error. I have 320 bands and 27 kpoints in my calculation. It is uranium carbide.

I have another calculation running with paral_kgb for mixed uranium/plutonium oxide on the same machine and it works perfectly fine. I have 120 bands and 36 kpoints in this calculation.

Could you please help me with that issue?

Thanks
----------------------------------------------------------
Boris Dorado
Atomic Energy Commission
France
----------------------------------------------------------

Locked