Bug #1711

syncft tests: unclear failure

Added by Phil Miller 2 months ago. Updated 2 months ago.

Status: New
Priority: Normal
Assignee:
Category: -
Target version:
Start date: 10/10/2017
Due date:
% Done: 0%


Description

http://ppl-jenkins:8080/job/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft/1338/console

../../../bin/testrun  ./jacobi 4 2 2 200 +vp16 +p8 +balancer DummyLB +isomalloc_sync +killFile kill_02.txt  
DISPLAY "(null)" invalid; disabling X11 forwarding
Charmrun> scalable start enabled. 
Charmrun> started all node programs in 1.863 seconds.
Converse/Charm++ Commit ID: d62df12
Charm++> synchronizing isomalloc memory region...
Program finished after 655.080204 seconds.
Fatal socket error: code 93610-- Timeout on socket recv!
Charmrun> error on request socket to node 6 'localhost'--
Socket closed before recv.
Socket 11 failed 
DISPLAY "(null)" invalid; disabling X11 forwarding
charmrun says Processor 6 failed on Node 6
socket_index 7 crashed_node 6 reconnected fd 11  
Charmrun finished launching new process in 1.236517s
Program finished after 1256.395664 seconds.
Fatal socket error: code 93610-- Timeout on socket recv!
Charmrun> error on request socket to node 0 'localhost'--
Socket closed before recv.
Socket 9 failed 
DISPLAY "(null)" invalid; disabling X11 forwarding
charmrun says Processor 0 failed on Node 0
socket_index 5 crashed_node 0 reconnected fd 9  
ERROR> Charmrun detected multiple crashes.
Charmrun finished launching new process in 1.281417s
make[3]: Leaving directory `/scratch/jenkins/builds/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft@1338/charm/net-linux-x86_64-syncft/tests/ampi/jacobi3d'
make[3]: *** [syncfttest] Error 1
make[2]: *** [syncfttest] Error 1


Related issues

Related to Charm++ - Bug #1710: syncft tests: warning and crash on init_checkpt (New, 10/10/2017)

History

#1 Updated by Phil Miller 2 months ago

  • Related to Bug #1710: syncft tests: warning and crash on init_checkpt added

#2 Updated by Phil Miller 2 months ago

Possibly similar / the same: http://ppl-jenkins:8080/job/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft/1304/console

../../../bin/testrun  ./jacobi 2 2 2 200 +vp8 +p8 +balancer DummyLB +isomalloc_sync +killFile kill_01.txt  
DISPLAY "(null)" invalid; disabling X11 forwarding
Charmrun> scalable start enabled. 
Charmrun> started all node programs in 1.862 seconds.
Converse/Charm++ Commit ID: 54b77a7
Charm++> synchronizing isomalloc memory region...
Fatal socket error: code 93610-- Timeout on socket recv!
Program finished after 601.856377 seconds.
Charmrun> error on request socket to node 1 'localhost'--
Socket closed before recv.
Socket 5 failed 
DISPLAY "(null)" invalid; disabling X11 forwarding
charmrun says Processor 1 failed on Node 1
socket_index 1 crashed_node 1 reconnected fd 5  
Caught SIGPIPE.
Caught SIGPIPE.
Charmrun finished launching new process in 1.244774s
Program finished after 602.148137 seconds.
Caught SIGPIPE.
Caught SIGPIPE.
Fatal socket error: code 93610-- Timeout on socket recv!
Charmrun> error on request socket to node 7 'localhost'--
Socket closed before recv.
Socket 10 failed 
DISPLAY "(null)" invalid; disabling X11 forwarding
charmrun says Processor 7 failed on Node 7
socket_index 6 crashed_node 7 reconnected fd 10  
Charmrun finished launching new process in 1.229961s
ERROR> Charmrun detected multiple crashes.
make[3]: Leaving directory `/scratch/jenkins/builds/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft@1304/charm/net-linux-x86_64-syncft/tests/ampi/jacobi3d'
make[3]: *** [syncfttest] Error 1
make[2]: *** [syncfttest] Error 1

#3 Updated by Eric Bohm 2 months ago

  • Assignee set to Juan Galvez
