[RegCNET] strange behaviour in RegCM paralell configuration

[BE] Ufuk Utku Turuncoglu turuncu at be.itu.edu.tr
Thu Aug 9 16:17:47 CEST 2007


Hi,

I try to run RegCM in parallel mode but when i submit job to cluster 
wrong number of process will be spawn in each node.

For example, I define the total number of cpu as 24 and i am using 8 cpu 
nodes. so, i am using 3 nodes to create 24 porcess. When i check the 
number of process in each node and count them, it is not exactly 24. 
There are less process that i define in domain.param. I check the all 
configuration again and again and i could not find any problem.

I have already run model successfully in 24 cpu using different input 
data (NCEP). But in this case (using ECHAM data) it is not running. But 
once time i faced same problem with NCEP case but after installing again 
of the model code, it solved and i could not find the bug. Is it 
possible to input data could generate error?

Also in buggy case, when i submit job, each one of the process runs like 
an single/independent job and writes the information to the regcm.out 
seperately. It means single processor version of RegCM runs in 24 cpu.

I am using Redhat Linux AS 4.0 and Intel MPI 3.0 to compile the code. 
There is not any problem in compile stage and executable is created 
without error.

Any suggestions will be helpful,
Best wishes

Ufuk Utku Turuncoglu
Istanbul Technical University
Informatics Institute



More information about the RegCNET mailing list