[RegCNET] request
bixq
bixq at ictp.it
Wed Jul 20 17:08:57 CEST 2011
Hi, Wang Xuejia:
To run the parallel mode of RegCM3, you'd better not run regcm.x,
instead, you run thoses steps seperately. then the last command:
mpirun -np 2 ./regcm regcm.in
On Wed, 20 Jul 2011, Xuejia Wang wrote:
> Hi,
> Everybody, an unformiliar problem are below happened to me. My model is running under the parallel enivenment. And the operation seems not to be stop. I don't know how to deal with it. Any suggestion will be appreciated.
> Thanks !
> --------------------------------------------------------------------------------
> OUT-history written date = 1994121312.000000
> BATS variables written at 1994121312 0.000000000000000
> PGFIO/stdio: No such file or directory
> PGFIO-F-/unformatted write/unit=54/error code returned by host stdio - 2.
> File name = output/SRF.1994120100 unformatted, direct access record = 2686
> In source file outsrf.f, at line number 43
> [node56:11963] *** Process received signal ***
> [node56:11963] Signal: Bus error (7)
> [node56:11963] Signal code: (2)
> [node56:11963] Failing at address: 0x405000
> /apps/lsf/7.0/linux2.6-glibc2.3-x86_64/bin/openmpi_wrapper: line 320: 11963 Bus error (core dumped) $MPIRUN_CMD --app $APP_FILE
> Job /apps/lsf/7.0/linux2.6-glibc2.3-x86_64/bin/openmpi_wrapper ./regcm.x
> TID HOST_NAME COMMAND_LINE STATUS TERMINATION_TIME
> ===== ========== ================ ======================= ===================
> 00000 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00001 node56 ./regcm.x Exit (127) 07/05/2011 20:38:54
> 00002 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00003 node56 ./regcm.x Exit (127) 07/18/2011 22:28:08
> 00004 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00005 node56 ./regcm.x Exit (127) 07/18/2011 22:28:03
> 00006 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00007 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00008 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00009 node56 ./regcm.x Exit (127) 07/18/2011 22:29:19
> 00010 node56 ./regcm.x Exit (127) 07/18/2011 22:29:41
> 00011 node56 ./regcm.x Exit (127) 06/27/2011 12:19:25
> 00012 node56 ./regcm.x Exit (127) 07/18/2011 22:27:34
> 00013 node56 ./regcm.x Exit (127) 07/17/2011 08:49:09
> 00014 node56 ./regcm.x Exit (127) 07/18/2011 22:29:20
> 00015 node56 ./regcm.x Exit (127) 07/18/2011 22:28:24h
> -------------------------------------------------------------------------------
> JOBID USER STAT QUEUE FROM_HOST EXEC_HOST JOB_NAME SUBMIT_TIME
> 54373 wangxj UNKWN normal node74 16*node56 *./regcm.x Jun 27 12:15
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Dr. Xunqiang Bi email:bixq at ictp.it
Earth System Physics Group
The Abdus Salam ICTP
Strada Costiera, 11
P.O. BOX 586, 34100 Trieste, ITALY
Tel: +39-040-2240302 Fax: +39-040-2240449
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
More information about the RegCNET
mailing list