[RegCNET] request

bixq bixq at ictp.it
Wed Jul 20 17:08:57 CEST 2011


Hi, Wang Xuejia:

To run the parallel mode of RegCM3, you'd better not run regcm.x,
instead, you run thoses steps seperately. then the last command:

mpirun -np 2 ./regcm regcm.in



On Wed, 20 Jul 2011, Xuejia Wang wrote:

> Hi,
> Everybody, an unformiliar problem  are below happened to me.  My model is running under the parallel enivenment. And the operation seems not to be stop. I don't know how to deal with it. Any suggestion will be appreciated.
> Thanks !
> --------------------------------------------------------------------------------
> OUT-history written date =     1994121312.000000
> BATS variables written at    1994121312    0.000000000000000
> PGFIO/stdio: No such file or directory
> PGFIO-F-/unformatted write/unit=54/error code returned by host stdio - 2.
> File name = output/SRF.1994120100    unformatted, direct access   record = 2686
> In source file outsrf.f, at line number 43
> [node56:11963] *** Process received signal ***
> [node56:11963] Signal: Bus error (7)
> [node56:11963] Signal code:  (2)
> [node56:11963] Failing at address: 0x405000
> /apps/lsf/7.0/linux2.6-glibc2.3-x86_64/bin/openmpi_wrapper: line 320: 11963 Bus error               (core dumped) $MPIRUN_CMD --app $APP_FILE
> Job  /apps/lsf/7.0/linux2.6-glibc2.3-x86_64/bin/openmpi_wrapper ./regcm.x
> TID   HOST_NAME   COMMAND_LINE            STATUS            TERMINATION_TIME
> ===== ========== ================  =======================  ===================
> 00000 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00001 node56     ./regcm.x         Exit (127)               07/05/2011 20:38:54
> 00002 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00003 node56     ./regcm.x         Exit (127)               07/18/2011 22:28:08
> 00004 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00005 node56     ./regcm.x         Exit (127)               07/18/2011 22:28:03
> 00006 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00007 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00008 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00009 node56     ./regcm.x         Exit (127)               07/18/2011 22:29:19
> 00010 node56     ./regcm.x         Exit (127)               07/18/2011 22:29:41
> 00011 node56     ./regcm.x         Exit (127)               06/27/2011 12:19:25
> 00012 node56     ./regcm.x         Exit (127)               07/18/2011 22:27:34
> 00013 node56     ./regcm.x         Exit (127)               07/17/2011 08:49:09
> 00014 node56     ./regcm.x         Exit (127)               07/18/2011 22:29:20
> 00015 node56     ./regcm.x         Exit (127)               07/18/2011 22:28:24h
> -------------------------------------------------------------------------------
> JOBID   USER    STAT  QUEUE      FROM_HOST   EXEC_HOST   JOB_NAME   SUBMIT_TIME
> 54373   wangxj  UNKWN normal     node74      16*node56   *./regcm.x Jun 27 12:15

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   Dr. Xunqiang Bi         email:bixq at ictp.it
   Earth System Physics Group
   The Abdus Salam ICTP
   Strada Costiera, 11
   P.O. BOX 586, 34100 Trieste, ITALY
   Tel: +39-040-2240302  Fax: +39-040-2240449
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~


More information about the RegCNET mailing list