Hello,

I am having problems running regcm with NPROC=16 on a dual-core AMD Opteron (x86-64) system, built with the PGI 7.1 compiler.
My grid has iy=232 and ix=160, with ds=40.0.

When I type:
nohup mpirun -np 16 -machinefile machine.cluster  -nolocal regcm > log.out.txt &

the run does not complete successfully.
regcm runs the first 56 days and then exits with the following error messages:

...
 BATS variables written at    2005022509    180.0000000000000    
     at day =   55.4156, ktau =      53200 :  1st, 2nd time deriv of ps =  0.13552E-04 0.93734E-07,  no. of points w/convection =  process            3 of           16
p3_26190:  p4_error: interrupt SIGFPE: 8
 process            8 of           16
 process            4 of           16
p4_23892:  p4_error: interrupt SIGx: 13
 process           12 of           16
 process            2 of           16
p2_25778:  p4_error: net_recv read:  probable EOF on socket: 1
 process           10 of           16
 process           14 of           16
rm_l_2_25789: (36773.972656) net_send: could not write to fd=5, errno = 32
 process            6 of           16
 process            9 of           16
p9_30291:  p4_error: net_recv read:  probable EOF on socket: 1
 process           13 of           16
rm_l_9_30302: (36773.339844) net_send: could not write to fd=5, errno = 32
 process            5 of           16
 process            1 of           16

...
 Writing rad fields at ktau =         53760   2005022600
 SAVTMP RESTART WRITTEN: idatex=   2005022600 ktau=        53760
 /bin/rm -f                                  SAVTMP.2005022400        
 BCs are ready from    2005022600   to    2005022606
rm_l_3_26205: (36773.968750) net_send: could not write to fd=5, errno = 32
 process           11 of           16
 process            7 of           16
 process           15 of           16
p4_23892: (36789.746094) net_send: could not write to fd=5, errno = 32
p2_25778: (36797.988281) net_send: could not write to fd=5, errno = 32
p9_30291: (36799.359375) net_send: could not write to fd=5, errno = 32


My regcm.param2 is as follows:

      INTEGER IX
      INTEGER NPROC
      INTEGER MJX
      INTEGER KX
      INTEGER NSG
      INTEGER NNSG
      INTEGER IBYTE
      INTEGER JXP
      CHARACTER*5 DATTYP
      CHARACTER*4 LSMTYP
      CHARACTER*7 AERTYP
      integer jxbb
      parameter(IX     =   232)
      parameter(NPROC  =    16)
      parameter(MJX    =   160)
      parameter(JXP    = MJX/NPROC)
      parameter(KX     =    18)
      parameter(NSG    =     1)
      parameter(NNSG   =     1)
      parameter(IBYTE  =     4)
      parameter(DATTYP='NNRP1')
      parameter(LSMTYP='BATS')
      parameter(AERTYP='AER00D0')
      parameter(jxbb=mjx-1)
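As a quick sanity check on the decomposition, here is a minimal shell sketch (my own, not part of RegCM) that recomputes JXP = MJX/NPROC from the values above and confirms that MJX divides evenly among the processes. The variable REM is just a name I introduced for the remainder.

```shell
#!/bin/sh
# Values copied from regcm.param2 above.
MJX=160
NPROC=16

# Fortran integer division truncates, so a nonzero remainder would mean
# JXP*NPROC is smaller than MJX.
JXP=$((MJX / NPROC))
REM=$((MJX % NPROC))   # REM is a helper name used only in this sketch

if [ "$REM" -ne 0 ]; then
    echo "MJX=$MJX is not divisible by NPROC=$NPROC (remainder $REM)"
else
    echo "JXP=$JXP columns per process"
fi
```

With these values it reports JXP=10, so the split itself looks even; I assume the problem lies elsewhere.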



Can somebody please suggest how to overcome this?


P.S.: The problem may be related to the "no. of points w/convection" output.
Regards,

Paulo Ricardo Teixeira

#########################################################################

CV (Currículo Lattes): http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4705902T0
or at this link: http://lattes.cnpq.br/8914320939610393
Paulo Ricardo Teixeira da Silva
Deputy Director for Academic and Scientific Affairs, UNEMET
M.Sc. in Meteorology - Solar Radiation / Solar Radiation Modeling
(Land Surface Processes)

Fellow/Researcher, NMA/LBA/INPA
Instituto Nacional de Pesquisas da Amazônia - INPA
Phone: +55 92 3643-3623
Fax: +55 92 3643 3625
Av. André Araújo, 2936 - Campus II
Bairro: Aleixo - Cx. Postal 478 / Cep 69060-001
Manaus/Amazonas


Linux Counter since 2001-11-22
N_LinuxCounter : #246599

#########################################################################