[RegCNET] strange behaviour in RegCM paralell configuration
[BE] Ufuk Utku Turuncoglu
turuncu at be.itu.edu.tr
Thu Aug 9 16:17:47 CEST 2007
Hi,
I try to run RegCM in parallel mode but when i submit job to cluster
wrong number of process will be spawn in each node.
For example, I define the total number of cpu as 24 and i am using 8 cpu
nodes. so, i am using 3 nodes to create 24 porcess. When i check the
number of process in each node and count them, it is not exactly 24.
There are less process that i define in domain.param. I check the all
configuration again and again and i could not find any problem.
I have already run model successfully in 24 cpu using different input
data (NCEP). But in this case (using ECHAM data) it is not running. But
once time i faced same problem with NCEP case but after installing again
of the model code, it solved and i could not find the bug. Is it
possible to input data could generate error?
Also in buggy case, when i submit job, each one of the process runs like
an single/independent job and writes the information to the regcm.out
seperately. It means single processor version of RegCM runs in 24 cpu.
I am using Redhat Linux AS 4.0 and Intel MPI 3.0 to compile the code.
There is not any problem in compile stage and executable is created
without error.
Any suggestions will be helpful,
Best wishes
Ufuk Utku Turuncoglu
Istanbul Technical University
Informatics Institute
More information about the RegCNET
mailing list