Project

General

Profile

Activity

From 09/21/2017 to 10/20/2017

10/19/2017

05:22 PM Cleanup #537 (Implemented): Data races in handler registration and assignment to global index var...
https://charm.cs.illinois.edu/gerrit/#/c/381/ Ronak Buch

10/18/2017

10:26 AM Feature #1704 (In Progress): Add a pamilrts-linux-ppc64le build target
Nitin Bhat
09:47 PM Bug #1715: 20% slowdown in ChaNGa after commit 159fd36fc
The given case doesn't have an unusual number of chares/PE. It does have a very small compute/communication ratio: I... Thomas Quinn

10/17/2017

06:40 PM Bug #1715: 20% slowdown in ChaNGa after commit 159fd36fc
Do you happen to know if that input case running on that many PEs will result in having many more chares per PE than ... Sam White
12:16 PM Bug #1715 (New): 20% slowdown in ChaNGa after commit 159fd36fc
A ChaNGa user reported a noticeable slow down when compiling with recent versions of charm. A "git bisect" session p... Thomas Quinn
11:06 AM Feature #1657: pxshm/xpmem support for nocopy sends across processes on the same host
See the following paper for a description of how to use XPMEM efficiently. The key is that you can register the entir... Sam White

10/16/2017

02:24 PM Bug #1714 (Implemented): examples/zerocopy/pingpong crashes intermittently on OFI layer SMP mode ...
Fix: https://charm.cs.illinois.edu/gerrit/#/c/3133/
The bug was due to a race condition where the message in the c...
Nitin Bhat
09:37 AM Bug #1714 (Implemented): examples/zerocopy/pingpong crashes intermittently on OFI layer SMP mode ...
Nitin Bhat

10/13/2017

01:52 PM Feature #1713 (New): DDT support for getting the addresses of contiguous parts of non-contiguous ...
When sending large buffers consisting of non-contiguous datatypes via the zero copy API, we want to perform multiple ... Sam White
11:07 AM Feature #1637 (Merged): Zero-copy send support in the OFI layer
Phil Miller
09:57 AM Bug #1708: Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
The hang occurs in NAMD with "setenv HUGETLB_MORECORE yes" but not "setenv HUGETLB_MORECORE no". Jim Phillips
09:50 AM Bug #1708: Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
The mpi-crayxc build of NAMD works fine with craype-hugepages8M loaded at Charm++ build, NAMD build, and run.
I also...
Jim Phillips
09:05 AM Feature #112: object location services: Share array element location cache above PE level
AMPI could now benefit from this, and I think many Charm applications already do their own process-level location man... Sam White

10/12/2017

02:48 PM Bug #1709 (Implemented): Need a test that uses +partitions
Gerrit patch : https://charm.cs.illinois.edu/gerrit/#/c/3125/ Karthik Senthil
10:40 AM Feature #1321: multiple communication threads per process
Ronak, do your results mean that CmiPushPE is main hotspot for communication thread?
As far as I understand there ar...
Mikhail Shiryaev

10/11/2017

05:10 PM Bug #1708 (Implemented): Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
Added a fix to check that hugepages is not loaded while building using charmc for mpi-crayxc and mpi-crayxe builds. F... Nitin Bhat
01:52 PM Bug #1708: Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
When does this issue occur?
The issue occurs presumably because of an incompatibility between using Cray MPI when C...
Nitin Bhat
10:20 AM Bug #1708: Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
I ran into this while running examples/charm++/zerocopy/pingpong/ to get numbers for mpi-crayxc build. I later found ... Nitin Bhat
03:45 PM Bug #1641: charmrun with nodelist option (++nodelist) fails on campus cluster
Would it be possible to grant me access to Golub/Taub so I can test this directly? Evan Ramos
03:39 PM Feature #1394: Node-level message aggregation for CkMulticast
This is crashing on BW with 64 nodes.
The dependency chain for building CkArray group is locMgr->mcastMgr->array. ...
Juan Galvez
03:23 PM Feature #1394: Node-level message aggregation for CkMulticast
Currently debugging this on Blue Waters. Juan Galvez
03:23 PM Feature #176: objid_t: tracing infrastructure should use objid_t
This is implicitly dependent on 64-bit ID, which is somewhat unstable according to Eric Mikida, so it's been waiting ... Ronak Buch
03:19 PM Cleanup #1059: Unify Data Collection in Charm++
Ronak, could you make the scheduling decision on this, and maybe identify incremental subtasks that could be schedule... Phil Miller
03:17 PM Feature #885: extend physical node detection across partitions
Michael, putting this on you as the the GPU support lead. Phil Miller
03:16 PM Feature #1040: support multiple InfiniBand cards per node
Nitin, please work out how critical and feasible this is, and this whether it should be a target to complete for 6.9,... Phil Miller
03:14 PM Feature #1436: trace CcdCallFnAfter() causality
Ronak or Karthik, please get whatever further details are necessary, and decide if this should be addressed in the ne... Phil Miller
03:11 PM Bug #1104 (Merged): AMPI instances may change if migrated while suspended
The above patches have fixed all known instances of this. Sam White
03:11 PM Feature #1677: improved topology-aware partitioner
Juan, please follow up with more details/discussion, and decide on scheduling this. Phil Miller
03:10 PM Bug #1214 (New): AMPI_Just_migrated callbacks break using tlsglobals/isomalloc
Sam White
03:10 PM Bug #1155 (New): AMPI's non-blocking collectives are not sequenced
Sam White
03:10 PM Feature #1321: multiple communication threads per process
I've been doing more detailed tracing on OFI. Here's a representative example of the machine state tracing (two proce... Ronak Buch
03:10 PM Bug #1325 (New): AMPI programs fail to link with Isomalloc heaps
Sam White
03:09 PM Bug #1279 (New): Proactive fault tolerance fails due to sending message to dead node.
Sam White
02:55 PM Feature #363 (Rejected): Investigate implementation of CCS on BG/Q
This is probably just not going to happen, so closing it out. Phil Miller
02:43 PM Support #1450 (Feedback): Clean up and add CUDA example programs
Jaemin Choi
02:41 PM Cleanup #537: Data races in handler registration and assignment to global index variables
Some fixes that I've been trying to do for this (editing calls to happen only once and inserting barriers to prevent ... Ronak Buch
02:37 PM Cleanup #537: Data races in handler registration and assignment to global index variables
Ronak noted that the fix ran into trouble in rebasing and cleaning up. He'll add details here and/or on Gerrit Phil Miller
02:40 PM Documentation #1611: Document network dependent rdma thresholds, above which benefits of the zero...
Not going to hold the 6.8.1 release for this. Phil Miller
02:38 PM Bug #1162 (Closed): tracing runs segfault while writing logs
No longer seems to be reproducible. Re-open or open a new issue if it is observed again or can be reproduced. Phil Miller
10:28 AM Bug #1706 (Merged): MPI LrtsAbort doesn't kill all replicas
Phil Miller
09:46 PM Feature #1637 (Implemented): Zero-copy send support in the OFI layer
Gerrit Link: https://charm.cs.illinois.edu/gerrit/#/c/3122/ Nitin Bhat

10/10/2017

02:33 PM Feature #1712 (New): Avoid intermediate ctx to scheduler in case of ULTs
If a ULT yields, we currently context switch back to the scheduler thread always, even if the next task in the schedu... Sam White
12:16 PM Bug #1710: syncft tests: warning and crash on init_checkpt
I think the flag '+restartisomalloc' may be needed here? If so we need to try to automate that or at least document it Sam White
07:55 AM Bug #1710 (New): syncft tests: warning and crash on init_checkpt
http://ppl-jenkins:8080/job/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft/1346/console... Phil Miller
07:58 AM Bug #1711: syncft tests: unclear failure
Possibly similar / the same: http://ppl-jenkins:8080/job/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft/... Phil Miller
07:57 AM Bug #1711 (New): syncft tests: unclear failure
http://ppl-jenkins:8080/job/Nightly-Build/label=trusty,platform=net-linux-x86_64-syncft/1338/console... Phil Miller

10/09/2017

08:59 AM Bug #1705 (Merged): examples/charm++/kmeans occasionally loops forever, seen on uth-linux-x86_64
Phil Miller

10/08/2017

09:48 AM Bug #1706: MPI LrtsAbort doesn't kill all replicas
How would you detect that the partition you are sending an asynchronous message to has aborted?
Also, there are many...
Jim Phillips
08:54 AM Bug #1706: MPI LrtsAbort doesn't kill all replicas
I'm kind of inclined to question the assumption that (presumably) an application-level call to @CmiAbort@ *should* br... Phil Miller
09:42 AM Bug #1709 (Implemented): Need a test that uses +partitions
Bug #1675 should have been caught much earlier with a simple @make test@ in the main repository.
One challenge is ...
Phil Miller
08:44 AM Bug #1675 (Merged): OFI replica crashes
Phil Miller
12:15 AM Bug #1675 (Implemented): OFI replica crashes
Karthik Senthil
12:14 AM Bug #1675: OFI replica crashes
Gerrit patch : https://charm.cs.illinois.edu/gerrit/#/c/3115/ Karthik Senthil

10/06/2017

06:11 PM Documentation #1611 (Implemented): Document network dependent rdma thresholds, above which benefi...
Fix: https://charm.cs.illinois.edu/gerrit/#/c/3100/
This documentation exposed two bugs: https://charm.cs.illinois...
Nitin Bhat
05:51 PM Bug #1708: Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
The bug was not caught by autobuild. My guess is that it runs mpi-crayxc tests only on 1 host. Nitin Bhat
05:46 PM Bug #1708 (Implemented): Charm++ programs hang with mpi-crayxc build on Edison when run on 2 hosts
The hang happens when a 2 process (logical node) run is made on 2 hosts. It however works on a 2 process run on 1 hos... Nitin Bhat
05:43 PM Bug #1707 (New): Nocopy Entry method API ack handling crashes on pamilrts-bluegeneq-async-smp
Nitin Bhat
04:41 PM Bug #1706: MPI LrtsAbort doesn't kill all replicas
Regarding 3, if you could get the comm thread to participate in a node-all barrier you could just have it call MPI_Ab... Jim Phillips
12:17 PM Bug #1706 (In Progress): MPI LrtsAbort doesn't kill all replicas
I'm not sure how we should go about making this safe for thread level FUNNELED in SMP mode.
1. Add a flag to the obj...
Sam White
10:46 AM Bug #1706: MPI LrtsAbort doesn't kill all replicas
Or have the comm thread call MPI_Abort. Jim Phillips
10:29 AM Bug #1706: MPI LrtsAbort doesn't kill all replicas
Assign to Sam as MPI machine layer owner.
I'm thinking @LrtsAbort@ should just call @MPI_Abort@ and not try to do ...
Phil Miller
09:48 AM Bug #1706: MPI LrtsAbort doesn't kill all replicas
Also, we have rank 0 (rather than the communication thread) making MPI calls, which is not kosher for MPI_THREAD_FUNN... Jim Phillips
09:44 AM Bug #1706 (Merged): MPI LrtsAbort doesn't kill all replicas
User reports that when one replica on Stampede2 dies the others keep running. It looks like the machine_exit code do... Jim Phillips
11:00 AM Bug #1701: Cannot have non-copyable types in constructor arguments
Yep, that's exactly correct. The only difference between my modified code and the generated is the addition of @std::... Nils Deppe
10:53 AM Bug #1701: Cannot have non-copyable types in constructor arguments
OK, so I think what you're saying is that the generated code needs to move the temporary instances, so that the recip... Phil Miller
10:49 AM Bug #1701: Cannot have non-copyable types in constructor arguments
OK, our notes crossed paths as we were writing them. I'll look further in a bit. Phil Miller
10:47 AM Bug #1701: Cannot have non-copyable types in constructor arguments
Indeed, CProxy_Foo's generated code takes its arguments by const& even when declared in the .ci file as taking them b... Phil Miller
10:44 AM Bug #1701: Cannot have non-copyable types in constructor arguments
Looking now, this ticket could be made much more helpful on my part. The serialization is not the problem, it is the ... Nils Deppe
10:32 AM Bug #1701: Cannot have non-copyable types in constructor arguments
I take it you're trying to pass an rvalue reference to such an object? It doesn't really make semantic sense for any ... Phil Miller
10:58 AM Bug #1700: Overloaded reduction targets result in compilation error
I'm not sure you'll get the correct behavior even with just one of those methods - what does the contribute call to p... Phil Miller
10:45 AM Bug #1685 (Closed): charmc Chokes on @explicit@ constructors
Phil Miller
10:31 AM Bug #1685: charmc Chokes on @explicit@ constructors
Ah good. Yes, with moving away from ci files I don't think this is worth anyone's time to fix. Nils Deppe
10:05 AM Bug #1685: charmc Chokes on @explicit@ constructors
So, it wasn't stated earlier, but it looks like having @explicit@ in the class declarations in the C++ code does work... Phil Miller
10:20 AM Support #126: Document process launching arguments with aim of cross-machine rationalization
This likely ends up being obviated by the shift to hwloc-driven launch Phil Miller
10:17 AM Projections Feature #995: Color by user supplied parameter (e.g. timestep) in non-timeline tools
Where's the code for this? Did it ever get integrated? I don't see it on master in the projections repo. Phil Miller
10:06 AM Support #1391 (Closed): Add an SMP/multicore build test to Jenkins
This has been working smoothly for a while. Phil Miller
09:59 PM Bug #1676: Replicas slower than separate jobs on GNI systems
Sorry, dynamic and static SMSG have indistinguishable performance at large replica counts, although the final WallClo... Jim Phillips
09:20 PM Bug #1676: Replicas slower than separate jobs on GNI systems
For 16 nodes:
aprun -n 496 -r 1 -N 31 -d 1 /u/sciteam/jphillip/NAMD_LATEST_CRAY-XE-ugni-BlueWaters/namd2 +pemap 0-30...
Jim Phillips

10/05/2017

06:09 PM Bug #1676: Replicas slower than separate jobs on GNI systems
And while you're at it, could you post your full command line and the runtime's startup output? Phil Miller
06:08 PM Bug #1676: Replicas slower than separate jobs on GNI systems
Could you try the same test (4 nodes per replica, increasing replica count) with @+useDynamicSmsg@? I'm kinda suspect... Phil Miller
12:54 PM Feature #1682 (In Progress): Expose Arrays' Index Type as a Type Alias
https://charm.cs.illinois.edu/gerrit/3109
I was able to add @array_index_t@ to @CProxy_@*. Adding it to the base l...
Evan Ramos
10:54 AM Feature #1682: Expose Arrays' Index Type as a Type Alias
I'd go with the obvious: @array_element_t@ :) Nils Deppe
11:04 AM Bug #1705 (Implemented): examples/charm++/kmeans occasionally loops forever, seen on uth-linux-x8...
https://charm.cs.illinois.edu/gerrit/3107 Phil Miller
10:51 AM Bug #1705: examples/charm++/kmeans occasionally loops forever, seen on uth-linux-x86_64
Looks like a floating point associativity failure in the use of the @sum_double@ reduction:... Phil Miller
10:39 AM Bug #1705 (Merged): examples/charm++/kmeans occasionally loops forever, seen on uth-linux-x86_64
We've seen this failure a few times, but never debugged it. I've added some prints and after a dozen or so runs, got ... Phil Miller
10:55 AM Bug #1686: Use a namespace for Charm++ code
Sounds like a good plan to me :) Nils Deppe
08:53 AM Feature #1704 (In Progress): Add a pamilrts-linux-ppc64le build target
pami is in some sense deprecated in favor of pamilrts already. We only support the zero-copy API on pamilrts, not pam... Sam White
09:33 PM Bug #1702: Inconsistent charm++ archives
Awesome, thank you very much! Nils Deppe
08:11 PM Bug #1702 (Closed): Inconsistent charm++ archives
I've posted a gzipped version of the same tarball.
We'll release future versions as tar.gz, mostly for convenience...
Phil Miller

10/04/2017

05:13 PM Bug #1676: Replicas slower than separate jobs on GNI systems
From some basic profiling it appears that the amount of time spent in alloc_mempool_block (but not the number of call... Jim Phillips
03:34 PM Bug #1702: Inconsistent charm++ archives
Ah okay, well I'm fine with the change, I just wanted to make sure it's intentional. We restrict ourselves to the new... Nils Deppe
03:18 PM Bug #1702: Inconsistent charm++ archives
It was intentional, to provide a smaller download. I hadn't thought about the impact on systems that would be looking... Phil Miller
02:21 PM Bug #881: Automatically determine location of nvcc when compiling programs using charmc in accel
Sam White wrote:
> Is this critical for 6.8.1? Retarget to 6.9.0 if not
After looking at the title of this bug I ...
Michael Robson
01:35 PM Bug #1162 (Feedback): tracing runs segfault while writing logs
I was never able to reproduce this. Has this been an issue for you at all as of late, Jim (or anyone else)? Ronak Buch
01:29 PM Bug #1273 (Closed): Tracemode utilization crashes in production build of Charm++
Was likely fixed a while ago, but never updated. Since it's not reproducible, I'll close it for now. Ronak Buch
01:25 PM Bug #829 (Closed): CkLoop projections tracing doesn't reflect the origin/traceback of work
As far as I know, this is a duplicate of #1437, and it's been fixed in https://charm.cs.illinois.edu/gerrit/#/c/3084/... Ronak Buch

10/03/2017

02:32 PM Bug #1676: Replicas slower than separate jobs on GNI systems
All of the replicas are uniformly slow. There is no inter-replica interaction.
I haven't looked at large node count...
Jim Phillips
02:19 PM Bug #1676: Replicas slower than separate jobs on GNI systems
Ok, so the effect grows in magnitude with replica count, and requires at least a few nodes to occur.
What about th...
Phil Miller
02:14 PM Bug #1676: Replicas slower than separate jobs on GNI systems
No and no. I've been using 4 nodes per replica, non-smp. The effect starts to be visible above noise at 16 replicas... Jim Phillips
11:47 AM Bug #1676: Replicas slower than separate jobs on GNI systems
Querying test-case reduction, since there are basically no progress notes on this issue:
* Is a 2 node, 2 replica jo...
Phil Miller
02:32 PM Bug #1510: Hang in tests/charm++/chkpt when using -tracemode perfReport
So it looks like after the restart, @t->getTraceOn()@ returns false, and on multiple paths, this means that the code ... Phil Miller
01:11 PM Bug #1510: Hang in tests/charm++/chkpt when using -tracemode perfReport
Hang happens in @traceAutoPerfExitFunction@, at @autoPerfProxy.endStepResumeCb(true, CkMyPe(), CkCallbackResumeThread... Phil Miller
01:00 PM Bug #1510: Hang in tests/charm++/chkpt when using -tracemode perfReport
https://charm.cs.illinois.edu/gerrit/3097 tracemode perfReport: don't close file in race with code that will write to... Phil Miller
12:40 PM Bug #1510: Hang in tests/charm++/chkpt when using -tracemode perfReport
OK, I'm running this now, and actually seeing a crash in the exit path in the first run of @./hello@:... Phil Miller
12:09 PM Bug #1510: Hang in tests/charm++/chkpt when using -tracemode perfReport
Taking a second look at this, are we particularly concerned with tracing support across checkpoint/restart? Do we act... Phil Miller
01:14 PM Cleanup #566 (Merged): Charm++ cell example cleanup
Phil Miller
11:48 AM Cleanup #566 (Feedback): Charm++ cell example cleanup
Phil Miller
11:48 AM Cleanup #566 (Implemented): Charm++ cell example cleanup
Sam White
12:54 PM Bug #1273: Tracemode utilization crashes in production build of Charm++
I tested this again on my lab machine (netlrts-linux-x86_64-smp) for wave2d and jacobi2d. I didn't run into any crashes. Karthik Senthil
11:55 AM Bug #1273: Tracemode utilization crashes in production build of Charm++
Karthik can you test this again? Sam White
11:50 AM Bug #1162: tracing runs segfault while writing logs
Bump. Need to reproduce the failures and address them, or retarget to 6.9.0 Sam White
11:48 AM Bug #829 (Feedback): CkLoop projections tracing doesn't reflect the origin/traceback of work
Phil Miller
11:47 AM Bug #881: Automatically determine location of nvcc when compiling programs using charmc in accel
Is this critical for 6.8.1? Retarget to 6.9.0 if not Sam White
11:40 AM Bug #1081 (Merged): Converse command line arguments produce false warnings
Phil Miller

10/02/2017

03:48 PM Cleanup #566: Charm++ cell example cleanup
Or better, just review this if indeed that's the direction to go:
https://charm.cs.illinois.edu/gerrit/3092
Phil Miller
03:46 PM Cleanup #566: Charm++ cell example cleanup
Wasn't the decision on this to just delete the code in question, since there are up-to-date replacements in the upcom... Phil Miller
03:35 PM Bug #1201 (Rejected): SMP builds segfault on NULL lock in tests/charm++/chkpt
This doesn't seem to have appeared in any of the Jenkins Nightly-Build runs for any configuration since at least June... Phil Miller
03:23 PM Bug #1081: Converse command line arguments produce false warnings
https://charm.cs.illinois.edu/gerrit/3091 Phil Miller
03:18 PM Bug #1081 (Implemented): Converse command line arguments produce false warnings
Phil Miller
03:02 PM Bug #1081: Converse command line arguments produce false warnings
I'm going to take a quick look at this, and if I can't just nail it down, we should defer it. If we've lived with it ... Phil Miller
02:36 PM Bug #1680: ci file compilation fails with no details to debug when the module name has a hyphen
Fix for charmxi was here: https://charm.cs.illinois.edu/gerrit/3090 Phil Miller
02:33 PM Bug #1680 (Merged): ci file compilation fails with no details to debug when the module name has a...
Phil Miller
12:45 PM Bug #1680 (Implemented): ci file compilation fails with no details to debug when the module name ...
Samarth Kulshreshtha
02:35 PM Bug #1683 (Merged): Charmc ends up deleting .C/.cpp file in case you forget to specify the output...
Phil Miller
12:48 PM Bug #1702 (Closed): Inconsistent charm++ archives
The v6.8.0 archive is a @.tar.bz2@ archive while older versions are @.tar.gz@. This makes in more difficult when pack... Nils Deppe

10/01/2017

05:10 AM Bug #1676: Replicas slower than separate jobs on GNI systems
I can confirm that the bug also affects Cori (XC40), so I would assume all XC/XE/XK machines. Jim Phillips

09/30/2017

04:28 PM Bug #1701 (New): Cannot have non-copyable types in constructor arguments
If I try to pass a non-copyable type to a chare constructor my only option is to move an lvalue reference, which is n... Nils Deppe
11:37 AM Bug #1700 (New): Overloaded reduction targets result in compilation error
In my ci file I have:... Nils Deppe
10:52 AM Bug #1699 (New): [inline] entry methods should use perfect forwarding in C++11
Currently inline entry methods are not as useful as they could be in C++11 because they do not use perfect forwarding... Nils Deppe
10:42 AM Bug #1698 (New): [inline] template entry methods do not work
I'm trying to call an @[inline]@ template entry method, and it fails because the generated code calls:... Nils Deppe

09/29/2017

04:27 PM Cleanup #567 (Merged): Charm++ barnes-charm example cleanup
Phil Miller
04:27 PM Bug #1437 (Merged): CkLoop worker traces to previous entry on pe rather than caller
Phil Miller
02:02 PM Feature #1695: Allow "type aliases" for explicit instantiations of member function templates
Alright, cool. I'm not sure how far along you are with replacing interface files, but I have managed to eliminate the... Nils Deppe
12:18 PM Support #1534 (Closed): Print out the job output, error and log for failed BGQ autobuild jobs and...
Phil Miller
10:12 AM Bug #829: CkLoop projections tracing doesn't reflect the origin/traceback of work
Is this a direct duplicate of #1437? Or is there something more subtle here still to be addressed beyond that? Phil Miller

09/28/2017

05:04 PM Bug #1437: CkLoop worker traces to previous entry on pe rather than caller
https://charm.cs.illinois.edu/gerrit/#/c/3084/ Seonmyeong Bak
05:03 PM Bug #1437 (Implemented): CkLoop worker traces to previous entry on pe rather than caller
Seonmyeong Bak
02:28 PM Bug #78: AMPI failure with migration under Cray compiler due to tcmalloc bugs or incompatibility
I don't think there's going to be urgency for fixing this any time soon, so pulling it off any release target. Phil Miller
12:39 PM Bug #1696: Allow default arguments in group entry methods
Alright, thanks for the clarification! Nils Deppe
12:20 PM Bug #1696: Allow default arguments in group entry methods
I'd call this a bug, indeed. Default arguments to entry methods are supported somewhat intentionally, and it's incorr... Phil Miller
12:39 PM Support #1681: Document Exception Handling in Charm++
Thanks for the thorough explanation! I look forward to the Charm++ changes resulting from 1) and 2). These might even... Nils Deppe
12:06 PM Support #1681: Document Exception Handling in Charm++
Thanks for bringing these questions.
1. @std::terminate@: We should probably modify the runtime system itself to m...
Phil Miller
12:37 PM Bug #1685: charmc Chokes on @explicit@ constructors
I'm trying to maintain consistency between my C++ code and the CI file, that is all. I would not expect marking the c... Nils Deppe
12:11 PM Bug #1685 (Feedback): charmc Chokes on @explicit@ constructors
I'm not sure what it would mean to mark a chare class constructor @explicit@ at all, let alone declaring it in the .c... Phil Miller
12:26 PM Bug #1669: Add machine layer files as Makefile dependencies for building the machine layer library
Could this be addressed by running something equivalent to the current @make depends@ and putting the output directly... Phil Miller
12:18 PM Feature #1695: Allow "type aliases" for explicit instantiations of member function templates
Along the path to prospectively eliminating the need for .ci files and just using straight C++, we're going to try to... Phil Miller
12:14 PM Feature #1682: Expose Arrays' Index Type as a Type Alias
Sounds simple enough. Any suggestions for the name of the typedef? Evan Ramos
12:07 PM Feature #1682: Expose Arrays' Index Type as a Type Alias
Good suggestion. We'll try to get it into the next release. Phil Miller
12:13 PM Bug #1692: charmc Dies on Array Indices with Template Parameters
The long-range direction on this is to get away from .ci files entirely, so we don't need to run on the 'parsing C++'... Phil Miller
12:11 PM Bug #1686: Use a namespace for Charm++ code
When we take an API break with the next major release, addressing this will be part of it. Phil Miller
11:12 AM Bug #1686: Use a namespace for Charm++ code
Looking through more of the Charm++ code, this ticket should be made a higher priority than I initially thought. Char... Nils Deppe
12:01 PM Bug #1679 (In Progress): Do Not Require Default Constructors for Serializable Classes
We actually ended up implementing something that should satisfy this request for a contract customer who wanted it AS... Phil Miller
11:53 AM Bug #1687 (Merged): charmc Strips Empty Template Parameter Lists
Phil Miller
11:14 AM Support #1534 (In Progress): Print out the job output, error and log for failed BGQ autobuild job...
Fix to print out the error, output and scheduler log: https://charm.cs.illinois.edu/gerrit/#/c/3083/ Nitin Bhat
11:01 PM Bug #1441 (Merged): Lies at startup about "The comm. thread both sends and receives messages"
Phil Miller
07:32 PM Bug #1690 (Merged): Verbs autobuild crashes for rdma example zerocopy/simpleZeroCopy on Golub in ...
Phil Miller

09/27/2017

05:30 PM Bug #1441 (Implemented): Lies at startup about "The comm. thread both sends and receives messages"
Did a rundown of the LRTS layers to confirm that the displayed message is correct. The default mode is COMM_THREAD_SE... Nitin Bhat
03:04 PM Bug #902 (In Progress): Projections shows garbage for the source PE of a chare array insertion event
The straight-forward fix for this messes up AMPI intercommunicator creation. Deferring for now, since the full fix wi... Phil Miller
02:59 PM Bug #529 (New): LrtsInitCpuTopo() operates asynchronously and overlaps with group constructors
I'm pretty sure this bug has been solved because the above mentioned patches prevent Charm init from progressing on a... Juan Galvez
02:47 PM Documentation #1611: Document network dependent rdma thresholds, above which benefits of the zero...
Since the memory registration API and registration cache optimization hasn't been done for the Rdma Nocopy Entry meth... Nitin Bhat
01:29 PM Feature #1697 (New): Distinguishing local object calls from entry method calls
I would like to be able to check if a method invocation was made locally via a @ckLocalBranch()@ or as a remote call ... Nils Deppe
07:42 AM Bug #1696 (New): Allow default arguments in group entry methods
I'm not sure this is a bug or a feature request since that depends on whether or not default arguments are supposed t... Nils Deppe
01:01 AM Bug #1437 (In Progress): CkLoop worker traces to previous entry on pe rather than caller
This is because CkLoop uses converse msgs by default. The msg doesn't contain info of the caller.
Figure out how t...
Seonmyeong Bak

09/26/2017

04:37 PM Feature #1695 (New): Allow "type aliases" for explicit instantiations of member function templates
I have a class template that has a non-negligible number of template parameters, which also has an entry method templ... Nils Deppe
12:23 PM Bug #902 (Implemented): Projections shows garbage for the source PE of a chare array insertion event
https://charm.cs.illinois.edu/gerrit/3079 Phil Miller
10:52 AM Bug #1694 (New): Projections shows garbage for indices of 4d, 5d, 6d array elements
Projections seems to come with an implicit assumption that chare arrays have (at most) 3 dimensions, and hence elemen... Phil Miller
10:39 AM Feature #1693 (New): Print template parameters when missing explicit instantiation
First, thanks for now reporting missing explicit instantiations of template entry methods! I have a minor feature req... Nils Deppe
10:04 AM Bug #1692 (New): charmc Dies on Array Indices with Template Parameters
If the index to a chare array has template parameters then charmc chokes. A workaround is defining a type alias to th... Nils Deppe
09:24 AM Bug #1661 (Merged): Building charm with papi support is successful even when papi is not found (-...
Phil Miller
11:32 PM Bug #1687: charmc Strips Empty Template Parameter Lists
I haven't tested yet, but thanks for getting to this so quickly! Nils Deppe

09/25/2017

04:53 PM Bug #902 (In Progress): Projections shows garbage for the source PE of a chare array insertion event
I'll try to take care of this quickly. Phil Miller
03:33 PM Feature #1691 (New): GPU Handler PEs
It is often the case that the execution of heterogeneous tasks on the same PE deters the execution of GPU tasks:
1) ...
Jaemin Choi
03:15 PM Bug #1690 (Implemented): Verbs autobuild crashes for rdma example zerocopy/simpleZeroCopy on Golu...
This bug caused the verbs-smp crash in autobuild. The crash occurred because of the default send tokens in verbs bein... Nitin Bhat
02:59 PM Bug #1690 (Merged): Verbs autobuild crashes for rdma example zerocopy/simpleZeroCopy on Golub in ...
... Nitin Bhat
02:41 PM Bug #1687 (Implemented): charmc Strips Empty Template Parameter Lists
Fix here: https://charm.cs.illinois.edu/gerrit/3077 Phil Miller
10:16 AM Bug #1687: charmc Strips Empty Template Parameter Lists
Here is the simplest ci file I can think of that shows the problem:... Nils Deppe
10:05 AM Bug #1687: charmc Strips Empty Template Parameter Lists
Could you please post a complete .ci file that exhibits this? Phil Miller
01:01 PM Bug #1683 (Implemented): Charmc ends up deleting .C/.cpp file in case you forget to specify the o...
https://charm.cs.illinois.edu/gerrit/3076 Phil Miller
12:06 PM Bug #1544 (Merged): CMK_TIMER_USE_PPC64 inaccurate with variable clock speeds
Phil Miller
10:00 AM Bug #1544: CMK_TIMER_USE_PPC64 inaccurate with variable clock speeds
Switch to just use getrusage for now posted here: https://charm.cs.illinois.edu/gerrit/3075 Phil Miller
12:06 PM Bug #1689 (Implemented): CMK_TIMER_USE_PPC64 inaccurate with variable clock speeds
The pami-linux-ppc64le machine layer is defaulting to CMK_TIMER_USE_PPC64
src/arch/pami-linux-ppc64le/conv-mach.h:...
Phil Miller
10:48 PM Bug #1675: OFI replica crashes
Replicas never share a process, they simply distribute the processes among the replicas. Jim Phillips

09/24/2017

06:28 PM Feature #1321: multiple communication threads per process
Hello team, is there any update on that work?
The latest versions of PSM2 and OFI have improved support for multi-th...
Mikhail Shiryaev

09/23/2017

05:37 PM Bug #1281: Bigsim crash in SDAG code in examples/charm++/load_balancing/stencil3d
Pushed my change here: https://charm.cs.illinois.edu/gerrit/3074
As I said before, no idea whether this is right or ...
Phil Miller

09/22/2017

05:01 PM Bug #1675: OFI replica crashes
In a single instance, 1 thread is created. But I'm not sure how exactly replicas work. Will 2 replicas (2 charm insta... Nitin Bhat
01:27 PM Bug #1675: OFI replica crashes
But are there actually multiple threads launched, or is there just a single thread making pthread calls? Jim Phillips
09:39 AM Bug #1675: OFI replica crashes
Yes. It is a non-smp run. I'm guessing these are pthreads created by libfabric. I've asked the folks from Intel about... Nitin Bhat
08:00 PM Bug #1675: OFI replica crashes
This is a non-smp run, right? Are there other threads created by libfabric? Does pthreads require initialization? Jim Phillips
12:17 PM Bug #1688 (New): Core Dump file not available unless `--disable-charmdebug` is used while building.
While building charm with debug symbols without production mode, CMK_CHARMDEBUG is turned on and this leads to Charm ... Nitin Bhat
12:10 PM Bug #1441: Lies at startup about "The comm. thread both sends and receives messages"
Sure. I'll do that. Nitin Bhat

09/21/2017

04:07 PM Bug #1675: OFI replica crashes
I was able to replicate the hang for a smaller case with following command on Stampede 2:... Karthik Senthil
12:16 PM Bug #1687 (Merged): charmc Strips Empty Template Parameter Lists
If I have @Klass<>@ in a ci file the generated code is @Klass@, which has a completely different meaning and results ... Nils Deppe
11:29 AM Bug #1686 (New): Use a namespace for Charm++ code
Charm++ code that is not generated should be in a namespace. Currently name collisions arise (eg Group). In general, ... Nils Deppe
10:44 AM Bug #1685 (Closed): charmc Chokes on @explicit@ constructors
If I mark a constructor explicit using the explicit keyword as follows:... Nils Deppe
 

Also available in: Atom