Project

General

Profile

Bug #1775

chkpt test hangs for pami-linux-ppc64le-smp & pamilrts-linux-ppc64le-smp build

Added by Nitin Bhat 5 months ago. Updated about 1 month ago.

Status:
New
Priority:
Normal
Assignee:
Category:
-
Target version:
-
Start date:
01/11/2018
Due date:
% Done:

0%


Description

On Summitdev, chkpt test (tests/charm++/chkpt) with pami-linux-ppc64le-smp build hangs with the following output:


Running on 2 processors:  ./hello +restart log +ppn 2
jsrun -n2 ./hello +restart log +ppn 2
Choosing optimized barrier algorithm name I0:HybridBinomial:SHMEM:P2P
Converse/Charm++ Commit ID: v6.8.2-58-gc448343
Warning> Randomization of virtual memory (ASLR) is turned on in the kernel, thread migration may not work! Run 'echo 0 > /proc/sys/kernel/randomize_va_space' as root to disable it, or try running with '+isomalloc_sync'.
CharmLB> Load balancer assumes all CPUs are same.
Charm++> cpu affinity enabled.
Setting default affinity
Charm++> Running on 1 unique compute nodes (160-way SMP).
Charm++> cpu topology info is gathered in 0.002 seconds.
Received 1 arguments: { |./hello| }
Main's MigCtor. a=987(0x10003b9f5bc), b[0]=654(0x10003b9f5c0), b[1]=321, old PE number 4
Main's PUPer. a=123(0x10003b9f5bc), b[0]=456(0x10003b9f5c0), b[1]=789
[1] data on Group 1
[3] data on Group 3
[2] data on Group 2
CHello's PUPer. step=3.
[0] data on Group 0
[1] data on NOdeGroup 1
[0] data on NOdeGroup 0
[0]CkRestartMain done. sending out callback.

To replicate, run with make test TESTOPTS="++ppn 2"


Related issues

Related to Charm++ - Bug #1774: Thread migration fails on ppc64le builds New 01/11/2018
Related to Charm++ - Bug #1773: Zerocopy examples fail on pami-linux-ppc64le-smp due to a low level assertion failure New 01/11/2018
Related to Charm++ - Bug #1772: Programs built on pami-linux-ppc64le-async-smp fail a low level assertion at runtime. New 01/11/2018

History

#1 Updated by Nitin Bhat 5 months ago

  • Subject changed from chkpt test hangs for pami-linux-ppc64le-smp build to chkpt test hangs for pami-linux-ppc64le-smp & pamilrts-linux-ppc64le-smp build

Similar hang is seen for pamilrts-linux-ppc64le-smp build as well. (https://charm.cs.illinois.edu/gerrit/#/c/3141/)

#2 Updated by Nitin Bhat 5 months ago

  • Related to Bug #1774: Thread migration fails on ppc64le builds added

#3 Updated by Nitin Bhat 5 months ago

  • Related to Bug #1773: Zerocopy examples fail on pami-linux-ppc64le-smp due to a low level assertion failure added

#4 Updated by Nitin Bhat 5 months ago

  • Related to Bug #1772: Programs built on pami-linux-ppc64le-async-smp fail a low level assertion at runtime. added

#5 Updated by Eric Bohm about 1 month ago

  • Assignee set to Nitin Bhat

Also available in: Atom PDF