Job exiting after P000
Posted: Thu Aug 05, 2010 11:15 pm
While running abinit, I'm consistently getting jobs which exit early on larger jobs (either during the first iteration or beforehand) just after the following message.
"-P-0000 leave_test : synchronization done...
-P-0000 leave_test : exiting..."
This happened when I went from a 2x2 unit cell of graphene to a 3x3 unit cell of graphene, and when I increased the vacuum spacing on a silver slab to a large unit cell. I would have thought it was due to a memory usage issue, but it doesn't go away if I increase the number of nodes & processors (& thus the available memory). Am I doing something dumb? I've attached the input, output, and log files for the graphene unit cell where this happens (labelled as graphod.* for my own personal naming reasons). I've queued this job on 2 nodes with 20GB on each node.
"-P-0000 leave_test : synchronization done...
-P-0000 leave_test : exiting..."
This happened when I went from a 2x2 unit cell of graphene to a 3x3 unit cell of graphene, and when I increased the vacuum spacing on a silver slab to a large unit cell. I would have thought it was due to a memory usage issue, but it doesn't go away if I increase the number of nodes & processors (& thus the available memory). Am I doing something dumb? I've attached the input, output, and log files for the graphene unit cell where this happens (labelled as graphod.* for my own personal naming reasons). I've queued this job on 2 nodes with 20GB on each node.