Activity

From 05/08/2017 to 06/06/2017

06/06/2017

04:03 PM Bug #1580 (Implemented): Abort from AMPI RMA routines on derived datatypes
Phil Miller
12:29 PM Bug #1570 (Merged): Cpuaffinity ignores '++quiet'
Sam White
12:20 PM Feature #1551 (Implemented): Better support for AMPI/Projections with multiple virtual ranks
Matthias Diener
11:20 AM Feature #1098: Better CMake support
We're going in on this heavily at Charmworks. Assigning to myself for now. Phil Miller
02:02 AM Bug #1577 (Implemented): User-level thread based OpenMP integration support on Mac
The change to resolve this issue is included in this patch.
https://charm.cs.illinois.edu/gerrit/#/c/2427/
Seonmyeong Bak
12:18 AM Bug #1539: Failure in migration when using RDMA sends in AMPI
RDMA sends are being used, the failure here just doesn't look like it's RDMA because the message itself has arrived b... Sam White
12:12 AM Bug #1542: CkArrayCreated callback should be part of CkArrayOptions
Core discussion is needed here Sam White
10:33 PM Bug #1502 (Implemented): Fix gfortran paths on Darwin
Matthias Diener

06/05/2017

06:17 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
Looking at the core dump on failure for multicore-linux-x86_64, it looks like the segfault doesn't happen through the... Nitin Bhat
05:19 PM Bug #1502: Fix gfortran paths on Darwin
Patch here: https://charm.cs.illinois.edu/gerrit/2590 Matthias Diener
03:12 PM Bug #635 (Implemented): all trees should be pe/node/physnode/network topology aware
Patches:
https://charm.cs.illinois.edu/gerrit/#/c/2578/
https://charm.cs.illinois.edu/gerrit/#/c/2585/
https://c...
Juan Galvez
03:11 PM Bug #1570 (Implemented): Cpuaffinity ignores '++quiet'
Fix here: https://charm.cs.illinois.edu/gerrit/#/c/2589/ Juan Galvez
01:44 PM Bug #738: prevent direct calls to sdag entry functions
Changed the prefix for sdag entry methods to "_sdag_fnc_" as "_sdag_init" is an already generated xi function.
Nitin Bhat
12:14 PM Feature #1575: The OpenMP integration modified to run on Converse user-level threads
Implementation here: https://charm.cs.illinois.edu/gerrit/#/c/2561/ (for ease of cross-reference) Phil Miller
11:51 AM Bug #1585 (Merged): CmiCheckAffinity fails on large core counts with SMP
Sam White
08:56 PM Bug #1585: CmiCheckAffinity fails on large core counts with SMP
Yep, diagnosis is correct. CmiPhysicalNodeID can only be called for non-comm threads/pes. I had the order of conditio... Juan Galvez
08:51 PM Bug #1585 (Implemented): CmiCheckAffinity fails on large core counts with SMP
Fix here: https://charm.cs.illinois.edu/gerrit/#/c/2584/ Juan Galvez

06/04/2017

03:23 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
Note that the problem occurs only sometimes. On my laptop, running with netlrts-darwin-x86_64-smp, @make test@ in exa... Matthias Diener
02:47 PM Bug #1585 (Merged): CmiCheckAffinity fails on large core counts with SMP
Running ChaNGa on Blue Waters with 1024 nodes and affinity options of:
+ppn 15 +setcpuaffinity +pemap 1-15,17-31 +...
Thomas Quinn
02:17 PM Feature #1584 (Merged): Allow exit codes in CkExit() / ConverseExit()
Currently, CkExit() and CkAbort() do not allow passing a custom exit code to the calling shell, making it necessary t... Matthias Diener

06/02/2017

02:51 PM Bug #1502: Fix gfortran paths on Darwin
* Homebrew's gfortran is called just "gfortran", and fortran programs work out of the box when specifying "gcc gfortr... Matthias Diener
11:59 AM Documentation #1583 (Merged): Check commented old template documentation in doc/charm++/inhertmpl...
Phil Miller
11:48 AM Documentation #1196 (Implemented): Document and exemplify chare class inheritance
Phil Miller

06/01/2017

04:35 PM Cleanup #1192 (Merged): AMPI is missing const qualifiers on buffer arguments to communication rou...
Sam White
09:42 PM Cleanup #1192 (Implemented): AMPI is missing const qualifiers on buffer arguments to communicatio...
https://charm.cs.illinois.edu/gerrit/#/c/2571/ Sam White
04:06 PM Feature #1582 (New): DDT support for direct copy from a noncontiguous type to another noncontiguo...
DDT currently only has a method serialize() which can either pack a noncontiguous datatype into a contiguous buffer o... Sam White
02:14 PM Bug #1507 (Merged): ckio test failure on gni-crayxc
The subsequent fixed issue you found seems likely to have been the culprit. Reclosing for now. Phil Miller
10:45 PM Bug #1507: ckio test failure on gni-crayxc
I tried this with a later version of charm (cd4d6f8), and things seem to now be OK.
Perhaps this was related to Bug ...
Thomas Quinn
11:22 AM Bug #1581 (Implemented): AMPI_Get_accumulate is broken
Sam White
11:22 AM Bug #1581: AMPI_Get_accumulate is broken
Fix for Get_accumulate: https://charm.cs.illinois.edu/gerrit/#/c/2574/ Sam White
10:39 AM Bug #1581: AMPI_Get_accumulate is broken
That can be added to this: https://charm.cs.illinois.edu/gerrit/#/c/2148/ Sam White
10:30 AM Bug #1581: AMPI_Get_accumulate is broken
Might be worth adding a test case in examples/ampi or tests/ampi . Matthias Diener
10:14 AM Bug #1581 (Merged): AMPI_Get_accumulate is broken
AMPI's implementation of MPI_Get_accumulate is completely wrong, and will always produce incorrect results at the tar... Sam White
02:40 AM Feature #1546 (Implemented): RDMA example with migration
Added the stencil3d example as a part of the SDAG support for rdma marked entry methods. This example tests both Migr... Nitin Bhat
02:36 AM Bug #1553 (Implemented): Support for sdag entry method with rdma parameter
The receive side code changes because we need to follow the same order in unmarshalling as we did in marshalling i.e ... Nitin Bhat
07:58 PM Bug #1580 (Merged): Abort from AMPI RMA routines on derived datatypes
We don't currently support these, and won't before the 6.8.0 release, so it would be good to add aborts to these rout... Sam White

05/31/2017

02:07 PM Feature #1575: The OpenMP integration modified to run on Converse user-level threads
This implementation adds a CMIQueue for suspended OpenMP user level threads.
When OpenMP ULTs are suspended and r...
Seonmyeong Bak
12:43 PM Feature #1579 (Merged): Mark CkAbort [[noreturn]]
Once C++11 is required @ckExit@ should be marked @[[noreturn]]@ Nils Deppe
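For reference, a minimal sketch of what the C++11 @[[noreturn]]@ attribute buys the caller. The names @abortWith@ and @parsePositive@ are hypothetical stand-ins for illustration, not Charm++ functions, and the stand-in throws instead of terminating so the sketch can be exercised.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical stand-in for an abort routine (NOT the Charm++ CkAbort).
// It throws instead of terminating the process; a [[noreturn]] function
// may leave via an exception, it just must never return normally.
[[noreturn]] void abortWith(const std::string& msg) {
    throw std::runtime_error(msg);
}

// Because abortWith is marked [[noreturn]], the compiler knows control
// cannot fall off the end of this function, so no dummy return statement
// is needed after the call.
int parsePositive(int value) {
    if (value > 0) {
        return value;
    }
    abortWith("value must be positive");
}
```

Without the attribute, compilers will typically warn that control reaches the end of a non-void function in @parsePositive@.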
11:18 AM Bug #1578 (Rejected): win64 debug build fails to link
Build options are "--no-build-shared --enable-randomized-msgq --with-prio-type=int --enable-error-checking -debug", a... Jim Phillips
10:00 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
Core decided that this bug and support for SDAG should be prioritized over the GNI RDMA implementation. Sam White

05/30/2017

05:05 PM Bug #1560 (Merged): icc build fails on NASA Pleiades
Phil Miller
01:28 PM Bug #1560: icc build fails on NASA Pleiades
The fix works. Thanks. Jim Phillips
12:43 PM Bug #1560: icc build fails on NASA Pleiades
Seems to have failed the auto-test. Will test after that passes. Jim Phillips
11:52 AM Bug #1560: icc build fails on NASA Pleiades
Jim:
I think the definitive resolution of the exact issue you're now seeing would be to pass @$OPTS@ to the various ...
Phil Miller
11:09 AM Bug #1560 (In Progress): icc build fails on NASA Pleiades
Phil Miller
11:03 AM Bug #1560: icc build fails on NASA Pleiades
This test is now failing on my build box because it doesn't pass through the "-gcc-name=gcc44 -gxx-name=g++44" option... Jim Phillips
09:51 AM Bug #1560 (Merged): icc build fails on NASA Pleiades
Was working as intended. I had screwed up the login script configuration for the account used by autobuild, to say @m... Phil Miller
09:05 AM Bug #1560 (In Progress): icc build fails on NASA Pleiades
This seems to have broken the Edison autobuild targets. Sam White
04:18 PM Documentation #1256 (New): Document '-tracemode perfReport' in the manual
Sam White
02:13 PM Bug #1577 (Closed): User-level thread based OpenMP integration support on Mac
OpenMP integration can work with GCC, ICC and Clang, which are OpenMP enabled.
Apple Clang, the default compiler ...
Seonmyeong Bak
11:51 AM Bug #1576 (Merged): OpenAtom crashes during launch at startup on BlueWaters using master charm
Only happens when multinode runs are attempted. Requires more experimentation to isolate issue.
Does not happen...
Eric Bohm
10:19 AM Bug #1572: Improve pup_stl performance
Ah, yeah, that makes sense. I'd agree with your preferred expression. Phil Miller
09:53 AM Bug #1572: Improve pup_stl performance
@push_back@ has an rvalue overload in C++11, so as long as @std::move@ is used that will work fine. I prefer using @... Nils Deppe
09:44 AM Bug #1572: Improve pup_stl performance
Also to use @emplace_back@ instead of @push_back@ where possible to move instead of copy. Phil Miller
09:34 AM Bug #1572: Improve pup_stl performance
The two patches I posted above were merged, so the remaining issues are to specialize for all arithmetic types (or al... Sam White
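The @push_back@ / @emplace_back@ point in the comments above can be illustrated with a small standalone sketch. @Tracked@, @Counts@ and @appendThreeWays@ are hypothetical names made up for this illustration; this is not the pup_stl code.

```cpp
#include <string>
#include <utility>
#include <vector>

// Illustrative element type that counts how it gets copied or moved.
struct Tracked {
    static int copies;
    static int moves;
    std::string payload;
    explicit Tracked(std::string p) : payload(std::move(p)) {}
    Tracked(const Tracked& o) : payload(o.payload) { ++copies; }
    Tracked(Tracked&& o) noexcept : payload(std::move(o.payload)) { ++moves; }
};
int Tracked::copies = 0;
int Tracked::moves = 0;

struct Counts {
    int copies;
    int moves;
};

// Appends three elements in three different ways and reports how many
// copy and move constructions the vector performed.
Counts appendThreeWays() {
    Tracked::copies = 0;
    Tracked::moves = 0;
    std::vector<Tracked> v;
    v.reserve(3);               // no reallocation, so no extra moves
    Tracked a("a");
    Tracked b("b");
    v.push_back(a);             // lvalue argument: copy-constructs
    v.push_back(std::move(b));  // rvalue overload: move-constructs
    v.emplace_back("c");        // constructs in place: no copy, no move
    return {Tracked::copies, Tracked::moves};
}
```

Under these assumptions @appendThreeWays()@ reports one copy and one move: @push_back@ with @std::move@ avoids the copy, and @emplace_back@ additionally avoids creating a temporary element.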

05/29/2017

12:20 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
See also http://charm.cs.illinois.edu/autobuild/old.2017_05_29__01_01/mpi-crayxc.txt Phil Miller
12:19 PM Bug #1553: Support for sdag entry method with rdma parameter
This may be a rather naive question, but why is the receive side code (as suggested by the mention of closure and unm... Phil Miller

05/28/2017

03:34 PM Feature #1575: The OpenMP integration modified to run on Converse user-level threads
I just started this feature from the most frequently used implementation.
Even though Converse threads use common APIs...
Seonmyeong Bak
08:08 AM Feature #1575: The OpenMP integration modified to run on Converse user-level threads
... Sam White
08:30 AM Bug #1539 (In Progress): Failure in migration when using RDMA sends in AMPI
We're still seeing the same failure in autobuild, though now less frequently. See multicore-linux-x86_64 and mpi-linu... Sam White

05/27/2017

10:31 PM Feature #1575 (Merged): The OpenMP integration modified to run on Converse user-level threads
In the original implementation, OpenMP integration is implemented by Converse message, which is kind of stackless use... Seonmyeong Bak

05/26/2017

03:21 PM Bug #1574 (Merged): lrts smp/multicore megacon build fails with undefined reference to `TraceTime...
Phil Miller
12:27 PM Bug #1574 (Implemented): lrts smp/multicore megacon build fails with undefined reference to `Trac...
https://charm.cs.illinois.edu/gerrit/2558 Phil Miller
12:17 PM Bug #1574: lrts smp/multicore megacon build fails with undefined reference to `TraceTimerCommon'
OK, reproduced with @./build charm++ multicore-darwin-x86_64 --no-build-shared --enable-tracing --enable-tracing-comm... Phil Miller
01:19 PM Bug #1572: Improve pup_stl performance
I added explicit specializations for the common arithmetic types so that we have that for the common cases in 6.8.0: ... Sam White
07:17 AM Bug #1572: Improve pup_stl performance
Thank you! I think we should keep this ticket around setting the target to 6.9.0 for the second optimization. Thinkin... Nils Deppe
10:49 AM Bug #1560 (Merged): icc build fails on NASA Pleiades
Phil Miller

05/25/2017

06:36 PM Bug #1574 (Merged): lrts smp/multicore megacon build fails with undefined reference to `TraceTime...
One example, on Bridges:... Jim Phillips
05:45 PM Bug #1572: Improve pup_stl performance
I can't imagine why it was written this way, and the git history offers no explanation. I added Nils's first suggesti... Sam White
12:08 PM Bug #1572 (Merged): Improve pup_stl performance
The serialization of @std::vector@ (and other STL containers) is extremely slow. The reason for this is twofold. Firs... Nils Deppe
04:46 PM Bug #1507 (In Progress): ckio test failure on gni-crayxc
Re-opening pending investigation Phil Miller
08:57 AM Bug #1507: ckio test failure on gni-crayxc
I'm not sure if it is the same bug, but after several checkpoints I get the following errors while in CkIO:
--------...
Thomas Quinn
04:21 PM Bug #1559 (Merged): cpuaffinity.c build errors on Linux and Win64 with --enable-tracing
The patches intended to fix the first two errors have been merged, so I'm closing this. If the @TraceTimerCommon@ fai... Phil Miller
02:44 PM Bug #635 (In Progress): all trees should be pe/node/physnode/network topology aware
Juan Galvez
02:44 PM Feature #1573: Make HDF5 library available for AMPI
CkExit has no return code parameter - how do charm++ applications signal failure? Matthias Diener
02:18 PM Feature #1573: Make HDF5 library available for AMPI
We can now. It used to be that CkExit could only be called once by something on PE 0, I believe. That was changed rec... Sam White
02:16 PM Feature #1573: Make HDF5 library available for AMPI
Ok, I'll do that. Regarding the exit() vs. CkExit() calls, can't we just @#define exit(foo) CkExit(foo)@ in ampi.h ? Matthias Diener
02:14 PM Feature #1573: Make HDF5 library available for AMPI
I would recompile AMPI with MSG_ORDER_DEBUG prints (top of ampi.C) enabled. That usually helps debug hangs. Sam White
02:11 PM Feature #1573: Make HDF5 library available for AMPI
With some modest changes, hdf5 compiles successfully (serial+parallel version).
The following serial tests (@make te...
Matthias Diener
12:31 PM Feature #1573 (Merged): Make HDF5 library available for AMPI
Currently, the hdf5 library needs some changes to work correctly under AMPI:
* -exit vs. CkExit()-
* -charmrun outp...
Matthias Diener
12:41 PM Bug #1571 (Merged): Documentation for ReadOnly is inaccurate regarding the number of copies per p...
Phil Miller

05/24/2017

12:55 PM Projections Feature #1524: Time Profile With Bracketed User Events
There have been substantial changes to the visualization of user bracketed events in Projections that are scheduled t... Matthias Diener
10:47 PM Bug #1571 (Implemented): Documentation for ReadOnly is inaccurate regarding the number of copies ...
https://charm.cs.illinois.edu/gerrit/#/c/2549/ Sam White

05/23/2017

02:51 PM Bug #1571 (Merged): Documentation for ReadOnly is inaccurate regarding the number of copies per p...
"They are broadcast to every PE by the Charm++ runtime, and can be accessed in the same way as C++ ``global'' variabl... Eric Bohm
01:47 PM Bug #1553 (In Progress): Support for sdag entry method with rdma parameter
I have finished implementing the SDAG code for rdma entry methods. The use case for this is the stencil3d example wher... Nitin Bhat

05/22/2017

09:52 AM Feature #1237 (Implemented): Onesided sender side implementation for GNI layer
Feature: https://charm.cs.illinois.edu/gerrit/#/c/1908/
- Used buffering of short messages for sending messages whe...
Nitin Bhat
09:33 PM Bug #1509: -tracemode summary always fails an assertion at exit
Does '-tracemode summary' now pass 'make test' for LIBS? If so, we should add it to one of the autobuild targets Sam White
09:26 PM Bug #1522 (Rejected): Verbs failure on small messages
This only happens with CmiDirect, which I think is being replaced by rdma entry methods... Sam White

05/21/2017

04:32 PM Bug #1542: CkArrayCreated callback should be part of CkArrayOptions
We may want to get any API change here into 6.8.0. We need to decide on the relationship between the new initCallback... Sam White
03:03 PM Feature #1088: Trace MPI_ functions in AMPI
Change std::map to std::unordered_map: https://charm.cs.illinois.edu/gerrit/#/c/2545/ Sam White
02:46 PM Bug #1570 (Merged): Cpuaffinity ignores '++quiet'
When specifying a commap, cpuaffinity ignores '++quiet' and prints anyways:... Sam White
12:31 PM Bug #1561 (Merged): RDMA failures on multicore/SMP builds
Sam White

05/19/2017

01:30 PM Bug #1561 (Implemented): RDMA failures on multicore/SMP builds
Patch: https://charm.cs.illinois.edu/gerrit/#/c/2543/ Vipul Harsh
01:27 PM Bug #1561 (In Progress): RDMA failures on multicore/SMP builds
Vipul Harsh
09:44 AM Bug #1561: RDMA failures on multicore/SMP builds
The issue manifests in AMPI tests, but is not an AMPI issue as such. So, recategorizing and tagging Phil Miller
09:09 AM Bug #1568 (Merged): ckio failure on netlrts-linux
Phil Miller
09:08 AM Bug #1560 (Implemented): icc build fails on NASA Pleiades
https://charm.cs.illinois.edu/gerrit/2542 Phil Miller
08:26 PM Feature #1569 (Merged): Support the Flang Fortran compiler
Add Flang configurations to the build system: https://github.com/flang-compiler/flang Sam White

05/18/2017

03:37 PM Bug #1560: icc build fails on NASA Pleiades
The Intel 16 / GCC 6 combination errors at compile time. The configure output also shows error messages, but somehow ... Phil Miller
02:34 PM Bug #1560: icc build fails on NASA Pleiades
Notes on the errors encountered with different icc/gcc version matchups:
|_.gcc \ Intel |_.12 ...
Phil Miller
11:21 AM Bug #1568 (Implemented): ckio failure on netlrts-linux
https://charm.cs.illinois.edu/gerrit/2537 Phil Miller
08:29 AM Bug #1568 (Merged): ckio failure on netlrts-linux
Autobuild for netlrts-linux failed in the code added here: https://charm.cs.illinois.edu/autobuild/cur/netlrts-linux.txt Sam White

05/17/2017

05:45 PM Feature #1492: Remove need for +LBCommOff
Related: this patch allows '+LBCommOff' to avoid more of the overhead that comm stats collection entails: https://cha... Sam White
05:43 PM Bug #1514 (Merged): Throw a runtime error for registrations that occur after startup
Sam White
04:17 PM Bug #1564 (Implemented): Inline entry methods don't respect group dependence from CkEntryOptions
Phil Miller
02:59 PM Bug #1564 (Merged): Inline entry methods don't respect group dependence from CkEntryOptions
When the target object exists locally and delivery happens inline, the CkEntryOptions are ignored, even if it indicat... Phil Miller
04:17 PM Bug #1567 (Merged): [aggregate] entry methods should refuse to accept CkEntryOptions, since the c...
Without C++11 support, we can stick in... Phil Miller
04:10 PM Bug #1566 (Implemented): Parameter marshalled entry methods mostly don't set group dependence in ...
Phil Miller
04:08 PM Bug #1566 (Merged): Parameter marshalled entry methods mostly don't set group dependence in messa...
For parameter marshalled entry methods, charmxi only generated code to pull the group dependence from CkEntryOptions ... Phil Miller
03:55 PM Bug #1565 (Implemented): Non-group entry methods don't respect envelope group dependence
Phil Miller
03:37 PM Bug #1565 (Merged): Non-group entry methods don't respect envelope group dependence
Phil Miller
03:35 PM Feature #1417: Reduce CkReductionMsg envelope size
We could potentially have a boolean field for if the CkReductionMsg is for a section and only allocate the space for ... Sam White
02:52 PM Bug #1563 (Merged): Chare Array construction doesn't respect setGroupDepID in CkEntryOptions (or ...
@CProxy_ArrayFoo::ckNew@ accepts a @CkEntryOptions@ argument, but doesn't put it to sensible use.
When the underly...
Phil Miller
02:06 PM Bug #1509 (Merged): -tracemode summary always fails an assertion at exit
Phil Miller
08:51 PM Bug #1509 (Implemented): -tracemode summary always fails an assertion at exit
https://charm.cs.illinois.edu/gerrit/#/c/2527/ Ronak Buch
10:20 AM Bug #1561: RDMA failures on multicore/SMP builds
At least on netlrts-darwin-x86_64-smp, this does not crash for me when specifying ppn (current charm master, examples... Matthias Diener
09:21 AM Bug #1561: RDMA failures on multicore/SMP builds
Yes, we talked to Vipul after group meeting, and he is taking this issue over. For RDMA sends within a process, the m... Sam White
09:19 AM Bug #1561: RDMA failures on multicore/SMP builds
I suspect that the issue is that the RDMA code is doing packing/unpacking in something of the wrong place - when the ... Phil Miller
09:18 AM Bug #1561: RDMA failures on multicore/SMP builds
That started failing exactly the same as the multicore builds:
http://localhost:8080/job/Nightly-Build/label=linux,p...
Phil Miller
09:17 AM Bug #1561: RDMA failures on multicore/SMP builds
Indeed it does:
http://localhost:8080/job/Nightly-Build/label=linux,platform=mpi-linux-x86_64-smp/
(correct for por...
Phil Miller
08:52 AM Documentation #1219 (Merged): Update SDAG forall documentation
Sam White

05/16/2017

05:41 PM Feature #1353: charmc hardcodes unversioned compiler names
Unfortunately, I've found that using different compilers for building and linking can cause linking failures, so redu... William Throwe
09:22 AM Feature #1353: charmc hardcodes unversioned compiler names
Also, my apologies for the slow response to this. Phil Miller
09:21 AM Feature #1353: charmc hardcodes unversioned compiler names
A fix for this, and more generally to enable standard build configuration practices for Charm++, will be available th... Phil Miller
05:14 PM Bug #1509: -tracemode summary always fails an assertion at exit
OK, I've confirmed that this really did come about precisely with the commit changing the exit process, 67aa76d3b7e42... Phil Miller
04:03 PM Bug #1561: RDMA failures on multicore/SMP builds
The same failure should happen in SMP mode when running with ppn > 1. Sam White
01:56 PM Bug #1561: RDMA failures on multicore/SMP builds
git bisect shows that this commit causes the error:
https://charm.cs.illinois.edu/gerrit/#/c/2520/
Matthias Diener
01:17 PM Bug #1561: RDMA failures on multicore/SMP builds
K, doing a git bisect on './build AMPI multicore-darwin-x86_64 -j8 -g -O0' to find what commit broke this would be go... Sam White
01:12 PM Bug #1561: RDMA failures on multicore/SMP builds
(Note that the stack trace in message #8 was with RDMA enabled).
I think I did not run @make clean@ in the first t...
Matthias Diener
10:52 AM Bug #1561: RDMA failures on multicore/SMP builds
Whoa, what is SDAG doing in there?!
Can you try reverting this recently merged series of 3 commits (in which Eric ...
Sam White
10:47 AM Bug #1561: RDMA failures on multicore/SMP builds
Full stack trace with @-g -O0@, RDMA is enabled:... Matthias Diener
10:34 AM Bug #1561: RDMA failures on multicore/SMP builds
We still use the RDMA path for sends that are local (the sender and the recver are on the same PE), so that's one pos... Sam White
10:28 AM Bug #1561: RDMA failures on multicore/SMP builds
Full stack frame just before the crash (this is with RDMA off, supposedly):... Matthias Diener
10:25 AM Bug #1561: RDMA failures on multicore/SMP builds
Weird, but megampi may be simpler to debug and is the first issue here.
That stack trace is the same one we saw be...
Sam White
10:19 AM Bug #1561: RDMA failures on multicore/SMP builds
GDB output:... Matthias Diener
10:15 AM Bug #1561: RDMA failures on multicore/SMP builds
On Darwin, compiling with @-DAMPI_RDMA_IMPL=0@ does not fix the crash. (Full build command: @./build AMPI multicore-d... Matthias Diener
12:37 PM Bug #1560: icc build fails on NASA Pleiades
If we don't catch this during configure and abort with a message explaining why it failed, I think we're basically as... Sam White
09:17 AM Bug #1560: icc build fails on NASA Pleiades
We ran into the exact same issue on some of the NERSC Cray systems.
We could maybe push a test that would trigger ...
Phil Miller
12:13 PM Documentation #1219 (Implemented): Update SDAG forall documentation
https://charm.cs.illinois.edu/gerrit/2526 Phil Miller
11:56 AM Bug #635: all trees should be pe/node/physnode/network topology aware
This is a potentially serious performance defect, not just something to tidy up. Phil Miller
11:53 AM Bug #635: all trees should be pe/node/physnode/network topology aware
Reductions are now at least SMP aware due to the fix for #1278. We still form a topology-oblivious tree over the node... Phil Miller
09:14 AM Charm-NG Feature #1562: Enable message allocation, construction, packing, etc, without generated .ci file ...
One key goal in any API evolution or redesign would be eliminating the need for the @-fno-lifetime-dse@ flag passed t... Phil Miller
09:12 AM Charm-NG Feature #1562 (New): Enable message allocation, construction, packing, etc, without generated .ci...
If we keep explicit message types around as Charm++'s API design moves forward, we need to address how they will be h... Phil Miller
09:08 AM Feature #1343: Let user-defined main() work for all execution environments
Revisiting this, are there situations where the desired outcome of this issue isn't satisfied?
Maybe @readonly@ var...
Phil Miller
08:46 AM Bug #1507 (Merged): ckio test failure on gni-crayxc
Phil Miller

05/15/2017

06:28 PM Bug #1561: RDMA failures on multicore/SMP builds
One potential source of this issue is the RDMA stuff that was recently merged in AMPI. You can build AMPI with '-DAMP... Sam White
08:15 AM Bug #1561 (Merged): RDMA failures on multicore/SMP builds
multicore builds for linux, darwin, and win all failed in tests/ampi/megampi/ with +p2 +vp2 Sam White
05:10 PM Bug #1509: -tracemode summary always fails an assertion at exit
Ping. Progress? Phil Miller
05:05 PM Bug #1507 (Implemented): ckio test failure on gni-crayxc
Underlying issue with the patch provided, given that it was failing after restart from a checkpoint, was that an arra... Phil Miller
03:53 PM Bug #1507 (In Progress): ckio test failure on gni-crayxc
I'm seeing issues with that patch on simple ChaNGa test runs. Working through them now. Phil Miller
12:05 PM Bug #1559: cpuaffinity.c build errors on Linux and Win64 with --enable-tracing
This https://charm.cs.illinois.edu/gerrit/#/c/2524/ should take care of all cpuaffinity errors, although I didn't exp... Juan Galvez
10:55 PM Bug #1560: icc build fails on NASA Pleiades
Yes, that is true on a few other systems too. AFAIK we decided to require at least gcc v4.4 headers for 6.8.0, and we... Sam White
10:20 PM Bug #1560: icc build fails on NASA Pleiades
That last line was a clue: it looks like the intel compiler depends on the g++ libraries, so a modern gcc has to be l... Thomas Quinn
08:42 PM Bug #1560 (Merged): icc build fails on NASA Pleiades
Building with
./build ChaNGa verbs-linux-x86_64 cuda smp icc -j8 --with-production
gives errors like:
../bin/charm...
Thomas Quinn

05/14/2017

11:37 AM Bug #1556 (Merged): AMPI Fortran bindings for MPI_STATUS(ES)_IGNORE are broken
Phil Miller
09:53 AM Feature #1352 (Merged): CkArrayOptions callback for completion of chare array initialization
Sam White
09:52 AM Bug #1558 (Merged): win64 debug build fails to build due to missing lrand48
Sam White
08:30 PM Bug #1559 (In Progress): cpuaffinity.c build errors on Linux and Win64 with --enable-tracing
This should fix some of the machine layers but possibly not all of them:
https://charm.cs.illinois.edu/gerrit/#/c/25...
Juan Galvez

05/13/2017

12:14 AM Bug #1559: cpuaffinity.c build errors on Linux and Win64 with --enable-tracing
Somewhat arbitrarily assigning to Juan only because he's touched cpuaffinity before and doesn't appear to have any ot... Sam White
10:41 PM Feature #1546: RDMA example with migration
I think it's important to have an SDAG + Migration + RDMA example/test, but up to you whether that is this issue or not Sam White

05/12/2017

05:20 PM Bug #1559 (Merged): cpuaffinity.c build errors on Linux and Win64 with --enable-tracing
Building charm --no-build-shared --enable-tracing --enable-tracing-commthread -optimize
On multicore-linux64-iccst...
Jim Phillips
05:02 PM Feature #1546: RDMA example with migration
The bug that spawned this request, #1539, has now been fixed. Is it still critical to have a new test/example that sp... Phil Miller
08:56 AM Feature #1546: RDMA example with migration
I believe Nitin modified the stencil load balancing to use RDMA, and that is pending on a fix for RDMA entry methods ... Sam White
04:59 PM Bug #1539 (Merged): Failure in migration when using RDMA sends in AMPI
Phil Miller
08:33 PM Bug #1539 (Implemented): Failure in migration when using RDMA sends in AMPI
Patch: https://charm.cs.illinois.edu/gerrit/#/c/2520/ Vipul Harsh
04:23 PM Bug #1558 (Implemented): win64 debug build fails to build due to missing lrand48
https://charm.cs.illinois.edu/gerrit/#/c/2522/ Sam White
01:42 PM Bug #1558: win64 debug build fails to build due to missing lrand48
The temporary fix that I have now is to replace @lrand48()@ with @rand()@ for WIN64 builds. Would that be sufficient?... Karthik Senthil
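A minimal sketch of the kind of fallback described in the comment above, assuming a wrapper function is acceptable. The name @portableRandom@ is hypothetical and this is not the actual patch.

```cpp
#include <stdlib.h>

// Hypothetical wrapper (illustration only): use rand() where lrand48()
// is unavailable, and lrand48() everywhere else.
inline long portableRandom() {
#if defined(_WIN32)
    // Windows C runtimes do not provide lrand48(); fall back to rand().
    return static_cast<long>(rand());
#else
    return lrand48();
#endif
}
```

One caveat with this substitution: @lrand48()@ returns values in [0, 2^31), while @rand()@ only covers [0, RAND_MAX], and RAND_MAX may be as small as 32767. That is probably fine for randomizing queue order but would matter anywhere the full range is assumed.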
12:43 PM Bug #1558: win64 debug build fails to build due to missing lrand48
Yes. My "debug" build options are "--no-build-shared --enable-randomized-msgq --with-prio-type=int --enable-error-ch... Jim Phillips
12:35 PM Bug #1558: win64 debug build fails to build due to missing lrand48
I think this is only on builds with randomized queues on Windows, but it still needs to be fixed for 6.8.0.
Assign...
Sam White
09:53 AM Bug #1558 (Merged): win64 debug build fails to build due to missing lrand48
... Jim Phillips
03:09 PM Bug #1556 (Implemented): AMPI Fortran bindings for MPI_STATUS(ES)_IGNORE are broken
https://charm.cs.illinois.edu/gerrit/#/c/2521/ Sam White
12:26 PM Bug #1556 (In Progress): AMPI Fortran bindings for MPI_STATUS(ES)_IGNORE are broken
Sam White
12:25 PM Documentation #1432 (Merged): Document CkLoop caller function
Sam White
12:25 PM Bug #1555 (Merged): converse segfaults processing msg whose handler has not been registered on th...
Sam White
12:24 PM Bug #833 (Merged): mpi smp build is locked to one core per node by default
Sam White

05/11/2017

06:27 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
If that's all, I think the message just needs to have pack called in it before it gets forwarded, and unpack after it... Phil Miller
06:23 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
I looked at the code and it never actually changes the pointers inside the rdma wrappers in the message, hence the rd... Vipul Harsh
05:10 PM Bug #1507: ckio test failure on gni-crayxc
It will take me a little while to reproduce my problem, since it usually happens after restarting from a checkpoint.
...
Thomas Quinn
04:40 PM Bug #1507 (Implemented): ckio test failure on gni-crayxc
https://charm.cs.illinois.edu/gerrit/2519
Tom, if you're still seeing this as an issue here, could you try the above ...
Phil Miller
02:38 PM Bug #1507 (In Progress): ckio test failure on gni-crayxc
Phil Miller
02:36 PM Bug #1507: ckio test failure on gni-crayxc
Looks like the issue is that a message referencing the newly-constructed write session is reaching PEs other than 0 b... Phil Miller
04:21 PM Feature #1352 (Implemented): CkArrayOptions callback for completion of chare array initialization
Hackishly re-using the reduction manager's spanning tree now. Phil Miller
02:38 PM Feature #1352: CkArrayOptions callback for completion of chare array initialization
It looks like there's another use case for this outside AMPI - CkIO. Phil Miller
03:05 PM Bug #1557 (New): AMPI bindings for C-Fortran interop are incomplete
AMPI is missing definitions for MPI_F_STATUS(ES)_IGNORE and MPI_Status_f2c and MPI_Status_c2f. There may be other sim... Sam White
03:04 PM Bug #1556 (Merged): AMPI Fortran bindings for MPI_STATUS(ES)_IGNORE are broken
MPI_STATUS_IGNORE and MPI_STATUSES_IGNORE are both declared as arrays of 8 integers in ampif.h while the C++ code in ... Sam White
12:39 PM Bug #1555 (Implemented): converse segfaults processing msg whose handler has not been registered ...
Fix here: https://charm.cs.illinois.edu/gerrit/#/c/2517/
Juan Galvez
11:50 AM Bug #1555 (Merged): converse segfaults processing msg whose handler has not been registered on th...
No error is printed even with error checking enabled.
So, at least, with error checking there should be an explici...
Juan Galvez
10:04 AM Bug #1275: DistributedLB: Objects not migrating after strategy runs
I have updated the gerrit patch https://charm.cs.illinois.edu/gerrit/#/c/1951/ with Harshitha's fix from her branch. ... Kavitha Chandrasekar
09:13 PM Bug #1514 (Implemented): Throw a runtime error for registrations that occur after startup
Seonmyeong Bak

05/10/2017

06:01 PM Bug #833 (Implemented): mpi smp build is locked to one core per node by default
Posted a new patch in gerrit.
Should work on all architectures, including Cray because it does not rely on the Net...
Juan Galvez
03:55 PM Bug #1514: Throw a runtime error for registrations that occur after startup
After initialization is done, calling the templated entry method in an uninstantiated form leads to CkAbort. ... Seonmyeong Bak
02:40 AM Bug #1514: Throw a runtime error for registrations that occur after startup
https://charm.cs.illinois.edu/gerrit/#/c/2510/ Seonmyeong Bak
01:38 AM Bug #1553 (Merged): Support for sdag entry method with rdma parameter
I tried adding rdma functionality to the receiveGhosts method in examples/charm++/load_balancing/stencil3d. The closu... Nitin Bhat

05/09/2017

05:37 PM Bug #1547 (In Progress): Deprecate the FFT library in ck-libs in favor of Nikhil's new FFT library
There are a few complications to accomplishing this in that Nikhil's fft library is not a drop-in replacement for the... Eric Bohm
04:59 PM Bug #647 (Merged): Make MeshStreamer classes [migratable] to support checkpoint/restart
Phil Miller
03:17 PM Bug #647 (Implemented): Make MeshStreamer classes [migratable] to support checkpoint/restart
Sam White
04:58 PM Bug #854 (Merged): RRMap broken for >1D chare arrays
Phil Miller
03:18 PM Bug #854 (Implemented): RRMap broken for >1D chare arrays
Sam White
04:09 PM Feature #1352: CkArrayOptions callback for completion of chare array initialization
We decided that since getting a proper reduction done inside CkArray will be ugly and doing all-to-one pt2pt sends wi... Sam White
03:52 PM Feature #1468: Enable pre-pinning memory for the zero-copy message sends through the Entry Method...
The automatic caching approach could go in 6.8.1, but an explicit API would have to be the next feature release. Phil Miller
03:50 PM Feature #1394: Node-level message aggregation for CkMulticast
This won't be an API change, AFAICT, so it could be done in a patch release. Phil Miller
03:43 PM Bug #1539: Failure in migration when using RDMA sends in AMPI
To reproduce this, do './build AMPI mpi-linux-x86_64 -g -O0' then 'make test' in examples/ampi/Cjacobi3D/.
Basical...
Sam White
03:18 PM Bug #833 (In Progress): mpi smp build is locked to one core per node by default
Sam White
02:00 PM Feature #1551: Better support for AMPI/Projections with multiple virtual ranks
Related issues:
https://charm.cs.illinois.edu/redmine/issues/1005
https://charm.cs.illinois.edu/redmine/issues/1524
Sam White
09:19 PM Bug #1540 (Merged): Memory leaks in RDMA
Phil Miller
07:28 PM Feature #1459: Zero-copy send support for the netlrts machine layer
We could do the packetization in a set aside buffer that we copy the user's data through as we send it. The key is to... Phil Miller
07:18 PM Feature #1459 (In Progress): Zero-copy send support for the netlrts machine layer
The current netlrts layer (UDP) in machine-eth.c sends a Datagram header with every packet it sends.
For every pack...
Nitin Bhat

05/08/2017

06:14 PM Feature #1551 (In Progress): Better support for AMPI/Projections with multiple virtual ranks
Two WIP patches:
- https://charm.cs.illinois.edu/gerrit/2503 (Projections)
- https://charm.cs.illinois.edu/gerrit/2...
Matthias Diener
05:20 PM Bug #1540: Memory leaks in RDMA
That fixes the memory leaks in the lower layer implementations.
In the rdma example too, there was a memory leak ...
Nitin Bhat
05:02 PM Bug #1540: Memory leaks in RDMA
Is that fix addressing all of the known leaks? If so, this can be marked Merged Phil Miller
05:05 PM Bug #887 (Closed): Investigate initialization of NullLB WRT thread safety
Phil Miller
 
