Project

General

Profile

Activity

From 01/17/2017 to 02/15/2017

02/16/2017

06:42 PM Cleanup #1423 (Merged): Conditionalize PUP sizing on load balancers' needs
Currently an element's PUP sizer is called before the LB strategy runs so that that information can possibly be taken... Sam White
06:33 PM Bug #1244: mlogft and causalft builds do not compile
A large number of the issues look to be in src/ck-core/ckobjid.h, which wasn't updated to use a 64bit ID instead of a... Eric Mikida
06:13 PM Bug #1244: mlogft and causalft builds do not compile
It would be good for these to at least compile in the release. Sam White

02/15/2017

05:55 PM Cleanup #1422: Cleanup dangling issues from 64bit merge
inform(CmiUInt8, CkArrayIdx, int) and inform(CmiUInt8, int) do completely different things and it is undocumented as ... Eric Mikida
05:29 PM Cleanup #1422 (In Progress): Cleanup dangling issues from 64bit merge
https://charm.cs.illinois.edu/gerrit/#/c/2250/ is being implemented to address the poor hashing issue for CmiUInt8. Eric Mikida
05:28 PM Cleanup #1422 (In Progress): Cleanup dangling issues from 64bit merge
CkLocMgr still has some unnecessary or erroneous bits of code that were left over from the 64bit merge. At a quick gl... Eric Mikida
03:54 PM Cleanup #1416: Eliminate use of deprecated/removed keyword 'register'
Thanks for pointing that out, fix is here: https://charm.cs.illinois.edu/gerrit/#/c/2256/ Sam White
03:37 PM Cleanup #1416: Eliminate use of deprecated/removed keyword 'register'
Breaks BG/Q:... Jim Phillips
10:39 AM Cleanup #1416 (Merged): Eliminate use of deprecated/removed keyword 'register'
Phil Miller
01:41 PM Bug #1278 (Merged): ReductionStarting overhead in SMP mode
And, merged. Finally. Phil Miller
01:07 PM Bug #1421: Running leanmd with error checking enabled in Charm++ triggers assertion error in lbdb.h
Solution might be to add support in CentralLB::removeCommDataOfDeletedObjs for multicast messages, going through all ... Juan Galvez
12:30 PM Bug #1421: Running leanmd with error checking enabled in Charm++ triggers assertion error in lbdb.h
The assertion seems to fail on type 3, LD_OBJLIST_MSG, for multicast messages as you suggest. The CentralLB::removeNo... Kavitha Chandrasekar
11:50 AM Bug #1421 (Merged): Running leanmd with error checking enabled in Charm++ triggers assertion erro...
Building Charm++ with error checking:
./build charm++ netlrts-linux-x86_64 --with-production --enable-error-checking...
Juan Galvez
11:58 AM Cleanup #1418 (Implemented): Remove commlibInfo from _CkSectionInfoStruct
Patch here:
https://charm.cs.illinois.edu/gerrit/#/c/2255/
Juan Galvez
10:40 AM Bug #1413 (Merged): Various Mac clang warnings are likely bugs
Phil Miller
10:29 AM Bug #1419 (Merged): Verbs builds on Omnipath should not build and suggest mpi instead
Phil Miller
10:08 AM Bug #1419 (Implemented): Verbs builds on Omnipath should not build and suggest mpi instead
https://charm.cs.illinois.edu/gerrit/#/c/2253/ Bilge Acun
10:17 AM Bug #1155 (In Progress): AMPI's non-blocking collectives are not sequenced
Sam White
10:17 PM Bug #1404 (Implemented): Support Cray CC on {mpi,gni}-crayxe
Fix for mpi- and gni- crayxe is here: https://charm.cs.illinois.edu/gerrit/#/c/2252/
Works on Blue Waters with the...
Sam White
09:35 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
The original patch from Cray already includes support for mpi-crayxc. Sam White
09:28 PM Bug #1360 (Merged): AMPI megampi test fails on mpi-crayxc and darwin builds
The real issue was that we were returning the address of a variable on the stack when getting builtin keyvals like AM... Sam White
09:27 PM Bug #1050 (Merged): AMPI collectives support for MPI_IN_PLACE
Sam White

02/14/2017

04:54 PM Bug #1390: AMPI_Alltoall crashes for short messages
I think we can safely abandon the short message protocol (and use the medium message protocol for those). Sam White
04:35 PM Feature #1420 (Merged): Lockless queue build option --enable-lockless-queue (off by default)
Implementation of the new lockless queues: https://charm.cs.illinois.edu/gerrit/#/c/1302
Performance tests from vari...
Bilge Acun
04:19 PM Feature #1234: Avoid sender-side copy for large contiguous messages. API for charm and converse l...
The initial implementation of this doesn't need to be based on RDMA. The problem is not that the data is copied, but ... Jim Phillips
03:51 PM Bug #1419 (Merged): Verbs builds on Omnipath should not build and suggest mpi instead
Detect if it's an Omnipath architecture, give a warning to use MPI layer instead and quit. Bilge Acun
03:49 PM Feature #1401 (Implemented): Converting OpenMP test suite for the OpenMP integration.
Seonmyeong Bak
03:48 PM Bug #1276 (Rejected): net-cygwin: multiple testing failures
The net-cygwin target is no longer relevant. Ronak Buch
03:44 PM Bug #1289 (Rejected): net-cygwin crashes in megampi
The net-cygwin target is no longer relevant. Ronak Buch
01:49 PM Feature #1417: Reduce CkReductionMsg envelope size
Remove unused member variable 'no': https://charm.cs.illinois.edu/gerrit/#/c/2244/ Sam White
10:58 AM Feature #1417 (Merged): Reduce CkReductionMsg envelope size
CkReductionMsg has accumulated fields that are often unused. Here are its fields that are unconditionally compiled:
...
Sam White
12:34 PM Feature #275 (In Progress): Performance testing and tuning of CkIO
Ronak Buch
12:34 PM Bug #1274 (In Progress): Tracemode utilization produces incorrect results for EP utilization metric
These were fixed as far as I remember, but I think Karthik had the relevant information (I sat down with him at his d... Ronak Buch
12:33 PM Bug #1273 (In Progress): Tracemode utilization crashes in production build of Charm++
These were fixed as far as I remember, but I think Karthik had the relevant information (I sat down with him at his d... Ronak Buch
12:32 PM Projections Bug #1293 (Closed): Projections summary shows 0s for EP Exec Time for a single core large ChaNGa ...
This is correct as far as I know, those EPs just didn't have attributable time. This was part of a bigger issue wher... Ronak Buch
12:29 PM Cleanup #1418: Remove commlibInfo from _CkSectionInfoStruct
Taking over this since it looks like I'm going to make changes to CkSectionInfo anyway. Juan Galvez
11:14 AM Cleanup #1418 (Merged): Remove commlibInfo from _CkSectionInfoStruct
Commlib has been removed from Charm for a while now right? Remove this leftover data structure. While at it, I don't ... Sam White
12:04 PM Feature #176 (In Progress): objid_t: tracing infrastructure should use objid_t
Ronak Buch
10:16 AM Bug #1410: Tuple reducer leaks memory when using set/concat/custom reducers
One of the TODOs, in CkReduction::tupleReduction(), is concerning:... Sam White
09:58 PM Bug #1406: Mac OS X builds broken due to sigaction macro inconsistency
net-win64 is still broken on this, though I think it is deprecated and can be removed from autobuild at this point...... Sam White
09:47 PM Cleanup #1416 (Implemented): Eliminate use of deprecated/removed keyword 'register'
Removed from all files above except for the generated file xi-scan.C and the upstream LLVM OpenMP runtime: https://ch... Sam White
07:10 PM Cleanup #1416: Eliminate use of deprecated/removed keyword 'register'
I removed a bunch of these uses a while ago here: https://charm.cs.illinois.edu/gerrit/#/c/2005/
Here are the files ...
Sam White
06:21 PM Cleanup #1416 (Merged): Eliminate use of deprecated/removed keyword 'register'
C++17 eliminates the @register@ keyword, and compilers have already been warning about it. We should clean it out of ... Phil Miller
09:30 PM Bug #1413 (Implemented): Various Mac clang warnings are likely bugs
https://charm.cs.illinois.edu/gerrit/#/c/2242/ Sam White
07:02 PM Bug #1413: Various Mac clang warnings are likely bugs
For the machine.c case, it would be easy enough to add the suggested volatile to it. That is precisely an intended use. Phil Miller
07:03 PM Bug #1278: ReductionStarting overhead in SMP mode
The above commit leaves a TODO behind for the following:... Sam White

02/13/2017

05:42 PM Bug #1050 (Implemented): AMPI collectives support for MPI_IN_PLACE
Fix for large Alltoalls using MPI_IN_PLACE, the last part of this issue: https://charm.cs.illinois.edu/gerrit/#/c/2240/ Sam White
04:09 PM Bug #1050: AMPI collectives support for MPI_IN_PLACE
I just pushed a patch that implements MPI_IN_PLACE for MPI_Alltoall:
https://charm.cs.illinois.edu/gerrit/#/c/2240...
Matthias Diener
06:20 PM Bug #1050: AMPI collectives support for MPI_IN_PLACE
What's the status of this? It would be good to have a fix merged before the 6.8.0 release Sam White
05:31 PM Support #1414: Autobuild should run tests on SMP builds with multiple threads per process
tests/charm++/kNeighbor/Makefile already has a separate test-smp target, perhaps that should be adopted by all tests ... Sam White
05:29 PM Support #1414 (New): Autobuild should run tests on SMP builds with multiple threads per process
Since SMP builds are becoming more and more the normal build targets used by applications, we should be testing SMP b... Sam White
05:26 PM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
The hang does not happen when using run options like @+p4 ++ppn4@. However, for @+p8 ++ppn4@ the program hangs. Karthik Senthil
04:40 PM Bug #1329 (In Progress): Hang in exit in TRAM test on gni-crayxc-smp
The hang is not related to TRAM, it just happens in the TRAM test.
The real issue seems to be somewhere in our exi...
Sam White
03:42 PM Bug #1201: SMP builds segfault on NULL lock in tests/charm++/chkpt
In case it's not clear, this is ongoing:
http://ppl-jenkins:8080/job/Nightly-Build/label=linux,platform=mpi-linux-x8...
Phil Miller
03:15 PM Bug #1411 (Rejected): Reductions to CkCallback::ignore should be no-ops
I agree that this is not worth implementing. Sam White
10:24 AM Bug #1411: Reductions to CkCallback::ignore should be no-ops
I think that could lead to state inconsistency, or at least a weakening of the checking for inconsistent callbacks be... Phil Miller
09:09 AM Bug #1411 (Rejected): Reductions to CkCallback::ignore should be no-ops
Currently if you perform a reduction targeted to a CkCallback::ignore routine, the RTS will still perform the reducti... Sam White
03:11 PM Bug #1413: Various Mac clang warnings are likely bugs
This commit will remove ckarrayreductionmgr completely, so that warning will go away: https://charm.cs.illinois.edu/g... Sam White
12:28 PM Bug #1413 (Merged): Various Mac clang warnings are likely bugs
The pup_xlater.C error turns cvt_bool() into an infinite loop.
The machine.c error will make LrtsAbort do nothing ...
Jim Phillips
01:53 PM Bug #1376 (Implemented): AMPI_Ireduce only creates a request at the root
https://charm.cs.illinois.edu/gerrit/#/c/2239/ Sam White
09:02 PM Bug #1376 (New): AMPI_Ireduce only creates a request at the root
I don't remember marking this rejected? It is a real issue, so reopening... Sam White
01:29 PM Bug #1412 (Implemented): AMPI collectives on COMM_SELF using derived datatypes are broken
https://charm.cs.illinois.edu/gerrit/#/c/2237/
Also realized that MPI_Scan and MPI_Exscan are the only collectiv...
Sam White
09:51 AM Bug #1412 (Merged): AMPI collectives on COMM_SELF using derived datatypes are broken
For collectives on MPI_COMM_SELF, we just copy the message data from the send buffer to the recv buffer, but currentl... Sam White
10:33 AM Bug #1048: Verbs on Bridges at PSC crashes or hangs
Would it be possible to try loading older versions of the relevant libraries to see which ones do or don't crash? The... Phil Miller
10:29 AM Bug #1409: verbs crashes on Stampede KNL and Bridges
Format description Phil Miller
10:20 AM Bug #1406 (Merged): Mac OS X builds broken due to sigaction macro inconsistency
https://charm.cs.illinois.edu/gerrit/#/c/2235/ Phil Miller
09:44 AM Bug #1390: AMPI_Alltoall crashes for short messages
I think there are multiple bugs associated with this issue.
1. When I run the test as @./pgm +vp 5@, the program c...
Karthik Senthil
06:18 PM Bug #1390 (In Progress): AMPI_Alltoall crashes for short messages
What's the status of this? Sam White
11:11 PM Bug #1410 (Merged): Tuple reducer leaks memory when using set/concat/custom reducers
Tuple reductions were added after 6.7.1, but it looks like only CkReductionMgr::reduceMessages() was updated with the... Sam White

02/12/2017

08:30 PM Bug #1001: netlrts with replicas leaves zombie processes behind
For the sake of diligence, it would be useful to know what commit actually resolved this. There have been about 650 c... Phil Miller
08:24 PM Cleanup #12: Factor out massive duplication in reductions
With the fix for #1278, CkArrayReductionMgr has been abolished. Four copies are become three. Phil Miller
08:12 PM Feature #793: Write default cuda kernelSelect so cuda build works out of the box
Already done, so mark as part of the current pending release, not its successor Phil Miller
08:09 PM Bug #1376: AMPI_Ireduce only creates a request at the root
Was the code not actually wrong, or did this get fixed? 'Rejected' without an explanation is rather hard to follow. Phil Miller
08:03 PM Bug #1403 (Merged): *-win64 build fails on undefined struct sigaction
Fix merged:
https://charm.cs.illinois.edu/gerrit/#/c/2231/
Phil Miller

02/11/2017

03:13 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
= Sam White
03:06 AM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
Some notes on the experiments I performed today for this bug:
https://charm.cs.illinois.edu/newTms/tasks/1753
Karthik Senthil
06:03 PM Bug #1409: verbs crashes on Stampede KNL and Bridges
The crashes also happen in non-SMP mode.
Thought it might be a compiler issue (icc is standard on Stampede 1.5) so t...
Jaemin Choi

02/10/2017

04:25 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
It sounds like the issue here is much more that the recent Cray-provided changes weren't ported to gni-crayxe and mpi... Phil Miller
04:22 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
Same commit builds with the same command using cce/8.5.4 and cce/8.5.7 on Cori. Phil Miller
04:13 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
I think I ran into that issue on Blue Waters and then assumed it was the same on Edison, but it may have been some pr... Sam White
04:03 PM Bug #1404: Support Cray CC on {mpi,gni}-crayxe
On Edison with mainline charm, commit v6.7.0-643-g2429278, I was able to successfully run
> ./build charm++ gni-c...
Phil Miller
03:24 PM Bug #1405 (Merged): Charm fails to compile on BGQ and Cray XE giving an error in TopoManager.C
Both fixes (BGQ, Cray XT/XE) merged Phil Miller
11:39 AM Bug #1405: Charm fails to compile on BGQ and Cray XE giving an error in TopoManager.C
Related fix for Cray XE: https://charm.cs.illinois.edu/gerrit/#/c/2234/ Sam White
09:52 AM Bug #1405 (Implemented): Charm fails to compile on BGQ and Cray XE giving an error in TopoManager.C
https://charm.cs.illinois.edu/gerrit/#/c/2233/ Phil Miller
06:45 PM Bug #1405 (Merged): Charm fails to compile on BGQ and Cray XE giving an error in TopoManager.C
Machine: Vesta
Latest commit: 22035bcbda2e7987ee429d586243a90b35f4a8b7 (Bug #815: Add OPTS to hybrid API makefile)...
Nitin Bhat
03:17 PM Feature #1088 (New): Trace MPI_ functions in AMPI
Sam White
02:58 PM Bug #1409: verbs crashes on Stampede KNL and Bridges
This is the same issue as on Quartz machine: https://charm.cs.illinois.edu/redmine/issues/1381 Sam White
02:55 PM Bug #1409: verbs crashes on Stampede KNL and Bridges
The issue still persists in the build using --with-qlogic option. Karthik Senthil
02:25 PM Bug #1409: verbs crashes on Stampede KNL and Bridges
Try adding --with-qlogic to the build line. Jim Phillips
02:09 PM Bug #1409 (Rejected): verbs crashes on Stampede KNL and Bridges
Charm++ programs crash before launch with a "Length mismatch" abort for the ibverbs SMP build on Stampede KNL.
Bui...
Karthik Senthil
01:25 PM Bug #1408 (Merged): Improve visibility and usability of flushTraceLog()
I just went looking in the manual for how to force Projections logs out to disk at a defined point in my application ... Phil Miller
12:27 PM Bug #1406 (Merged): Mac OS X builds broken due to sigaction macro inconsistency
... Jim Phillips
11:31 AM Bug #1239: Cleanup reduction uses in the runtime
Another note for optimizing reductions:
If a custom reduction uses a fixed size message (the messages size doesn't...
Sam White
11:22 AM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
I have isolated the bug and it occurs in the "directSends" version of the test. The "usingTram" part is clean. As sus... Karthik Senthil
09:53 AM Bug #815 (In Progress): Makefile for hybrid API is not using the system OPTS
See Tim's complaint about passing @-optimize -production@ straight to nvcc in production builds based on this commit. Phil Miller

02/09/2017

05:38 PM Bug #1364: Review use of volatile variables in the runtime
CkLoop's curChunkIdx cleaned out here: https://charm.cs.illinois.edu/gerrit/2229
It's just a simple atomic counter.
Phil Miller
05:35 PM Bug #337 (Merged): Make topology physical node-aware when detailed info is not available
Phil Miller
10:26 AM Bug #337 (Implemented): Make topology physical node-aware when detailed info is not available
Fix here:
https://charm.cs.illinois.edu/gerrit/#/c/2227/
Juan Galvez
05:33 PM Bug #815 (Merged): Makefile for hybrid API is not using the system OPTS
Phil Miller
04:06 PM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
Any progress on this? If you need help, please ask for it. Phil Miller
04:04 PM Bug #1397 (Feedback): Document that array creation must occur on PE0
Waiting on info from bug reporter. I can't see what's causing problems in the code as it exists. Phil Miller
03:01 PM Bug #1399 (Merged): TaskQ logic for randomly choosing a victim PE that is not self is incorrect
Phil Miller
02:32 PM Bug #1399: TaskQ logic for randomly choosing a victim PE that is not self is incorrect
https://charm.cs.illinois.edu/gerrit/2206 Phil Miller
02:55 PM Bug #1372 (Closed): Using GCC-6 in support of Intel icc/icpc on Edison fails with no member "iter...
Phil Miller
01:58 PM Bug #1403: *-win64 build fails on undefined struct sigaction
Today, please. This is a small fix, and the brokenness shouldn't be left lingering. Phil Miller
06:01 PM Bug #1403 (In Progress): *-win64 build fails on undefined struct sigaction
Sam White
01:53 PM Bug #1388 (Merged): charmrun doesn't compile on netlrts-win-x86_64
Phil Miller
01:44 PM Feature #1234: Avoid sender-side copy for large contiguous messages. API for charm and converse l...
From minutes of the last core meeting:
> Callback fix to invoke on sending PE and not comm thread implemented.
>...
Phil Miller
08:22 AM Bug #1402 (Merged): MPI builds fail due to sigaction parameter type mismatch
Sam White
06:01 PM Bug #1402 (Implemented): MPI builds fail due to sigaction parameter type mismatch
Sam White

02/08/2017

05:27 PM Bug #1402: MPI builds fail due to sigaction parameter type mismatch
https://charm.cs.illinois.edu/gerrit/#/c/2225/ Seonmyeong Bak
03:06 PM Bug #1402: MPI builds fail due to sigaction parameter type mismatch
Offending commit: https://charm.cs.illinois.edu/gerrit/#/c/887/ Sam White
11:30 AM Bug #1402 (Merged): MPI builds fail due to sigaction parameter type mismatch
On Stampede 2:... Jim Phillips
05:27 PM Bug #1403: *-win64 build fails on undefined struct sigaction
This is because sigaction is POSIX API. So windows doesn't support this.
For windows, I'll reapply the previous usa...
Seonmyeong Bak
03:06 PM Bug #1403: *-win64 build fails on undefined struct sigaction
Offending commit: https://charm.cs.illinois.edu/gerrit/#/c/887/ Sam White
11:38 AM Bug #1403: *-win64 build fails on undefined struct sigaction
This actually affects all win64 builds, not just MPI.... Jim Phillips
11:32 AM Bug #1403 (Merged): *-win64 build fails on undefined struct sigaction
... Jim Phillips
04:33 PM Bug #1404 (Merged): Support Cray CC on {mpi,gni}-crayxe
Currently, Charm will build on CCE 8.5.4+. However, to do so you have to remove the check in our top-level build scri... Sam White
12:20 PM Bug #1149: Cray CC builds are broken
This now works if you build with 'craycc' specified explicitly as an option to build. In order for that to work you h... Sam White
12:09 PM Bug #1149: Cray CC builds are broken
The issue with intrinsics.h should be fixed by https://charm.cs.illinois.edu/gerrit/#/c/2140/ (deleting its inclusion... Phil Miller
12:07 PM Bug #1388 (Implemented): charmrun doesn't compile on netlrts-win-x86_64
Fix here: https://charm.cs.illinois.edu/gerrit/2224 Phil Miller
11:40 AM Cleanup #197: Replace use of signal() with sigaction()
Also broke all Windows builds. See bug #1403. Jim Phillips
11:33 AM Cleanup #197: Replace use of signal() with sigaction()
This broke MPI builds. See bugs #1402 and #1403. Jim Phillips
10:31 PM Bug #1400 (Merged): charmxi: debug #line directives show file name as STDIN
Sam White
10:13 PM Bug #1392 (In Progress): Stampede test script fails during autobuild (verbs)
Nitin Bhat
07:59 PM Bug #1392 (Implemented): Stampede test script fails during autobuild (verbs)
Fixed the 'src/scripts/testrun' script used by verbs autobuild to run jobs on stampede.
Fix included modifying the ...
Nitin Bhat

02/07/2017

05:12 PM Bug #1361 (Merged): multicore-linux-x86_64 should alias to multicore-linux64
Phil Miller
05:11 PM Bug #1396 (Merged): AMPI intercomm_coll test break BigSim autobuild
Phil Miller
05:10 PM Bug #1323 (Merged): megatest multisection test failures
Phil Miller
05:10 PM Cleanup #960 (Merged): Replace CmiEnforce(0) calls in verbs and net layer
Phil Miller
05:10 PM Cleanup #197 (Merged): Replace use of signal() with sigaction()
Phil Miller
03:57 PM Bug #1278 (Implemented): ReductionStarting overhead in SMP mode
Implemented in [[https://charm.cs.illinois.edu/gerrit/2216]] by making CkReductionMgr node aware when in SMP mode. Eric Mikida
03:56 PM Bug #1278: ReductionStarting overhead in SMP mode
Implemented in https://charm.cs.illinois.edu/gerrit/2216 by making CkReductionMgr node aware when in SMP mode. Eric Mikida
03:55 PM Bug #1360 (In Progress): AMPI megampi test fails on mpi-crayxc and darwin builds
Sam White
03:48 PM Bug #815 (Implemented): Makefile for hybrid API is not using the system OPTS
https://charm.cs.illinois.edu/gerrit/#/c/2221/ Michael Robson
03:19 PM Bug #1364: Review use of volatile variables in the runtime
@interopCommThdExit@ is removed (except for BG XLC) here: https://charm.cs.illinois.edu/gerrit/#/c/2145/2 Phil Miller
02:46 PM Bug #1397: Document that array creation must occur on PE0
This can't go into a release. Way too disruptive. Phil Miller
09:16 PM Bug #854: RRMap broken for >1D chare arrays
The real work here is in passing the bounds from CkArrayOptions through to CkArrayMap, so that RRMap can linearize th... Sam White

02/06/2017

05:42 PM Cleanup #512: Remove duplication of conv-mach files
See changes: https://charm.cs.illinois.edu/gerrit/2210 to https://charm.cs.illinois.edu/gerrit/2215 Michael Robson
05:24 PM Cleanup #512: Remove duplication of conv-mach files
I re-ran ... Michael Robson
05:39 PM Feature #1352 (In Progress): CkArrayOptions callback for completion of chare array initialization
Hacky implementation using pt2pt sends, rather than a reduction, here: https://charm.cs.illinois.edu/gerrit/#/c/2209/... Sam White
05:31 PM Feature #1401 (Implemented): Converting OpenMP test suite for the OpenMP integration.
The LLVM OpenMP open source project has a test suite for OpenMP pragmas and environmental APIs developed by Universit... Seonmyeong Bak
05:27 PM Feature #1300 (In Progress): Integrated OpenMP should co-exist with MPI interop
Seonmyeong Bak
05:03 PM Feature #1300: Integrated OpenMP should co-exist with MPI interop
OK, that's matches my expectations. Please make it work. Phil Miller
04:56 PM Feature #1300: Integrated OpenMP should co-exist with MPI interop
When MPI interoperate with Charm++ and OpenMP integration, MPI codes cannot use OpenMP properly right now.
It's b...
Seonmyeong Bak
05:27 PM Cleanup #197 (Implemented): Replace use of signal() with sigaction()
Seonmyeong Bak
05:11 PM Bug #503 (Rejected): Replace use of obsolete memalign function with posix_memalign
There is a primary concern of whether posix_memalign would be portable, and the usage of memalign for the linux build... Dong Hun Lee
03:54 PM Bug #1399 (Implemented): TaskQ logic for randomly choosing a victim PE that is not self is incorrect
Seonmyeong Bak
02:42 PM Bug #1399: TaskQ logic for randomly choosing a victim PE that is not self is incorrect
Yeah, this is not correct.
Probably, stealing happens more frequently if this is fixed. (In the previous codes, i...
Seonmyeong Bak
01:39 PM Bug #1399 (Merged): TaskQ logic for randomly choosing a victim PE that is not self is incorrect
Sanjay discovered this: the logic for randomly choosing a PE that is not self is wrong because it compares a global P... Sam White
03:37 PM Feature #1394 (In Progress): Node-level message aggregation for CkMulticast
Juan Galvez
03:36 PM Bug #337 (In Progress): Make topology physical node-aware when detailed info is not available
Juan Galvez
03:34 PM Bug #1400: charmxi: debug #line directives show file name as STDIN
https://charm.cs.illinois.edu/gerrit/2208 Phil Miller
03:33 PM Bug #1400 (Merged): charmxi: debug #line directives show file name as STDIN
When we switched to running all .ci files through the C preprocessor, we changed charmxi to show STDIN when it wasn't... Phil Miller
03:29 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
I have reproduced this bug on 2 nodes, it works on +p>1 in one node. This is likely to be the same issue we saw on Br... Bilge Acun
03:03 PM Feature #1351: For broadcasts, avoid copies for destination PEs on same node for [nokeep] messages
Seems like all destinations on the same PE do get one copy of the message for a broadcast Vipul Harsh
02:59 PM Bug #738 (Implemented): prevent direct calls to sdag entry functions
Modified every sdag entry method to be implemented in .C as "_sdag_" + entry method name
In a .ci file if there's ...
Nitin Bhat
02:54 PM Bug #1001 (Resolved): netlrts with replicas leaves zombie processes behind
Bilge Acun
02:09 PM Bug #1001: netlrts with replicas leaves zombie processes behind
I can confirm it works for me too. OK to close. Jim Phillips
01:28 PM Bug #1001: netlrts with replicas leaves zombie processes behind
I am unable to reproduce this bug. After megatest run ends with the command above, I do; _killall -9 pgm_ and I get _... Bilge Acun
01:04 PM Bug #1361 (Implemented): multicore-linux-x86_64 should alias to multicore-linux64
https://charm.cs.illinois.edu/gerrit/#/c/2203/ Dong Hun Lee
01:02 PM Bug #1202: Memory leaks in converse's cldb

Fix for cputopology mem leak:
https://charm.cs.illinois.edu/gerrit/#/c/2202/
The mem leak in cldb is just somet...
Juan Galvez
12:27 PM Feature #262 (In Progress): MetaBalancer: add mechanism to inform user of AtSync call that result...
Kavitha Chandrasekar
12:21 PM Bug #999 (Implemented): netlrts writeableDgrams is never reset to 0
https://charm.cs.illinois.edu/gerrit/#/c/2199 Bilge Acun
12:13 PM Bug #1360: AMPI megampi test fails on mpi-crayxc and darwin builds
Seems to be a clang stack alignment issue. Adding explicit alignment ... Matthias Diener
11:15 AM Bug #1360: AMPI megampi test fails on mpi-crayxc and darwin builds
megampi fails with OPTS="-optimize" (even on my macbook), compiling without optimization works fine. Matthias Diener
12:00 PM Cleanup #960 (Implemented): Replace CmiEnforce(0) calls in verbs and net layer
There were CmiEnforce(0) calls in the net layer, so also replaced those. Jaemin Choi
11:46 AM Bug #1108 (Merged): Insert "// Potentially missing FooClass_SDAG_CODE in your class definition?" ...
Phil Miller
11:10 AM Bug #1275 (In Progress): DistributedLB: Objects not migrating after strategy runs
As you suggest, this might make the receiver PE overloaded, thereby not migrating objects. Before sending to an under... Kavitha Chandrasekar
11:45 PM Documentation #1398 (Merged): Document addReducer's new option 'streamable'
Users can now ask that their custom reductions be processed in a streaming fashion when registering their reduction f... Sam White

02/05/2017

01:28 PM Bug #1397 (Merged): Document that array creation must occur on PE0
ckNew calls are restricted to PE0 since the 64-bit ID changes were integrated. Document this restriction in the array... Eric Mikida
09:38 AM Bug #1396: AMPI intercomm_coll test break BigSim autobuild
Autobuild failed inside intercomm_coll's bgtest now, during intercommunicator creation. I think we should not test Bi... Sam White
09:36 PM Feature #885: extend physical node detection across partitions
Yes, it is still desired. NAMD has a work-around in the form of a +devicesperreplica <n> command line option, but it... Jim Phillips
08:03 PM Bug #1307 (In Progress): AMPI_Comm_free should free the ampi instance
We need to reference count communicators b/c a user could create an intercommunicator out of two comms, then free bot... Sam White
07:51 PM Bug #1307: AMPI_Comm_free should free the ampi instance
We can at least clear all of the ampi instance's heap memory from AMPI_Comm_free, even if we can't delete the chare a... Sam White

02/04/2017

02:45 PM Bug #1108: Insert "// Potentially missing FooClass_SDAG_CODE in your class definition?" in genera...
I took a quick crack at this after coming across it: https://charm.cs.illinois.edu/gerrit/#/c/2190/
There are stil...
Sam White
01:19 PM Bug #1396 (Implemented): AMPI intercomm_coll test break BigSim autobuild
Karthik Senthil
08:47 AM Bug #1396 (Merged): AMPI intercomm_coll test break BigSim autobuild
https://charm.cs.illinois.edu/autobuild/cur/netlrts-linux-x86_64-bigsim.txt
The netlrts-linux-x86_64-bigsim build ...
Sam White
12:40 PM Feature #1357: Add AMPI RMA routine tests to megampi
https://charm.cs.illinois.edu/gerrit/#/c/2148/ Sam White
12:39 PM Feature #1357: Add AMPI RMA routine tests to megampi
We'd like to have this testing in place for the 6.8.0 release. Sam White
11:38 AM Bug #1327: AMPI derived datatype support for RMA routines
I think we can maybe get the basic support for this into the 6.8.0 release (~ next week), and then follow up with per... Sam White
06:56 PM Bug #1395: AMPI_Get_elements returns incorrect results for struct type
Low priority until we have a concrete use case other than mpich-tests.
The basic issue is that the MPI standard al...
Sam White

02/03/2017

05:14 PM Bug #647 (New): Make MeshStreamer classes [migratable] to support checkpoint/restart
Sam White
03:37 PM Bug #1374 (Merged): Reduction clause doesn't work on the OpenMP integration with icc and clang
Seonmyeong Bak
02:47 PM Bug #1155: AMPI's non-blocking collectives are not sequenced
Refactor AmpiMsg so that seq is the refnum, to ease future support of sequencing on CkReductionMsg's: https://charm.c... Sam White
09:38 AM Bug #1395 (Merged): AMPI_Get_elements returns incorrect results for struct type
test/ampi/mpich-tests/pt2pt/getelm fails with "Wrong number of elements for struct recv; got 1001 expected 24".
Th...
Sam White

02/02/2017

04:08 PM Bug #684: Isomalloc PUP has invalid read memory errors
These are still present in Isomalloc. I'll take a look. Sam White
02:38 PM Feature #1394 (In Progress): Node-level message aggregation for CkMulticast
Because CkMulticastMgr is a group, it uses a tree structure of PEs to send group messages. The problem is that if one... Juan Galvez
12:02 AM Bug #1050: AMPI collectives support for MPI_IN_PLACE
As noted above, the only thing left to do on this is MPI_Alltoall. Sam White
11:55 PM Cleanup #960: Replace CmiEnforce(0) calls in verbs and net layer
So what do you want me to do here?
Replace CmiEnforce(0) with CmiAbort()?
Jaemin Choi
11:44 PM Bug #1390: AMPI_Alltoall crashes for short messages
We'll want to have MPI_Alltoall working for all message sizes in our 6.8.0 release. In the worst case, we could alway... Sam White
10:41 PM Feature #239: Charmrun wrapper around ibrun.symm for dual binary execution on host+mic with IMPI ...
Dual binary launching no longer relevant for KNL Sam White
10:38 PM Feature #190: Build target(s) for Intel MIC
KNL is self-hosted and doesn't require the complicated dual binary launch scheme. Since this is off the radar for now... Sam White
10:35 PM Bug #968: charm++ programs fail to run on BlueWaters due to craype-hugepages8M
Can this be closed? Sam White

02/01/2017

02:19 PM Feature #793 (Closed): Write default cuda kernelSelect so cuda build works out of the box
This bug is fixed/obviated by patch https://charm.cs.illinois.edu/gerrit/#/c/2063/ Michael Robson
02:18 PM Bug #792 (Closed): Enable cuda builds of charm to run non-cuda programs
This bug is fixed/obviated by patch https://charm.cs.illinois.edu/gerrit/#/c/2063/ Michael Robson
12:33 PM Bug #792: Enable cuda builds of charm to run non-cuda programs
This seems like a significant bug to me if I understand it correctly and if it is still pertinent: you can't build Ch... Sam White
01:58 PM Feature #315 (New): Reduction Starting messages - Performance optimal solution
Reassigning to Eric since he is already assigned the related issue. Sam White
01:57 PM Bug #598 (Closed): Unused tests in test/charm++
Sam White
01:55 PM Bug #805 (New): SDAG tracing counts time waiting for a when to be satisfied as overhead
Sam White
01:53 PM Bug #1005 (New): Some Projections views broken for AMPI with user-registered functions
Sam White
01:46 PM Cleanup #1065 (In Progress): Create a more efficient caching structure for location lookup
Change from std::map to std::unordered_map: https://charm.cs.illinois.edu/gerrit/#/c/1978/
Use tr1/unordered_map on ...
Sam White
12:56 PM Feature #105: Unify memory pool implementations where possible
Here's a related patch Nikhil originally wrote to make exceptions for large allocations (to not use the mempool when ... Sam White
12:44 PM Feature #105: Unify memory pool implementations where possible
CmiPool is now only used in the state-space search engine, so we don't really need to worry about that one anymore.
...
Sam White
12:40 PM Bug #234 (Rejected): net-linux-x86_64-*-smp-pgcc crashes in megatest
closing due to net- being deprecated and pgcc not generally being able to compile Charm (and the lack of requests fro... Sam White
12:36 PM Cleanup #537: Data races in handler registration and assignment to global index variables
Still hanging in gerrit, waiting on cleanup... Sam White
12:36 PM Feature #975: OFI Layer
Version 2 patch is here: https://charm.cs.illinois.edu/gerrit/#/c/1945/ Bilge Acun
12:35 PM Cleanup #536: Data Races in SMP PCQueue
Does the new lockless queue fix this? Sam White
12:33 PM Bug #756: CUDA build does not correctly find cuda location
Any update? Sam White
12:31 PM Feature #808 (Rejected): Implement a communication aware load balancer
This issue description is pretty vague to me, so closing for now. It can be reopened under a more descriptive listing... Sam White
12:30 PM Bug #815: Makefile for hybrid API is not using the system OPTS
Is this a real issue? If not close it please. Sam White
12:29 PM Feature #869 (Merged): Merge PICS framework to Charm
Marking this merged since there is nothing left from PICS that we decided we wanted to merge Sam White
12:28 PM Support #865: Update section 7.3 of Manual on Load Balancing Chare arrays
It would be good if you can do a quick read over and correction of this documentation since no else in the lab has yo... Sam White
12:25 PM Projections Feature #976: Tool for analyzing network message latencies
Can you document here where this tool is and how to use it? Sam White
12:04 PM Bug #571: pxshm shared queue lockless implementation is invalid
pxshm is becoming slightly more important with wider nodes, so bumping up the priority. Also putting under SMP becaus... Sam White
12:03 PM Bug #802: CUDA examples broken on Blue Waters
Is this issue up to date or did things get merged for it? Sam White
12:02 PM Cleanup #966 (Merged): cuda build fails unituitvely on blue waters when cudatoolkit isn't loaded
Jaemin Choi
12:01 PM Feature #885: extend physical node detection across partitions
Is this still desired? We should assign it to someone if so. Sam White
11:42 AM Feature #641: protect load balancer from variable cpu clock
It looks like this was fixed a while ago in the following two commits:
https://charm.cs.illinois.edu/gerrit/#/c/577/...
Sam White
11:38 AM Bug #1035 (Implemented): Idle PEs compete with comm thread for node queue lock
The new lockless queue (gerrit patch linked above) will address this issue, so re-assigning to Bilge as the owner of ... Sam White
11:24 AM Feature #1393 (Merged): Redesign of Hybrid API (GPU Manager) to support concurrent kernel execution
The original design of GPUManager had two data transfer streams and one kernel stream per GPUManager instance, which ... Jaemin Choi
09:59 AM Feature #1133: PMPI_ interface for AMPI
I started the implementation for weak symbols and non-MPI communication layers (#2 above) on a branch named 'ampi_wit... Sam White
09:29 AM Cleanup #1365 (Merged): Deduplicate AmpiRequest routines and their handling of NULL statuses
https://charm.cs.illinois.edu/gerrit/#/c/2163/ Sam White
08:32 PM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
The bug does not happen in 'debug' build of Charm++ on Edison.
I am currently trying to find the exact location of...
Karthik Senthil

01/31/2017

05:53 PM Bug #1376 (Rejected): AMPI_Ireduce only creates a request at the root
Sam White
04:58 PM Cleanup #1058 (Merged): Make tracing warning about flushing logs more informative
Ronak Buch
03:06 PM Cleanup #1058 (Implemented): Make tracing warning about flushing logs more informative
Implemented at https://charm.cs.illinois.edu/gerrit/#/c/2177/. The documentation already contained the correct infor... Ronak Buch
03:42 PM Bug #1392 (Closed): Stampede test script fails during autobuild (verbs)
Testrun script used in autobuild uses charmrun that isn't allowed to run on a login node.
../../bin/testrun +p4 ...
Nitin Bhat
10:55 AM Feature #1351: For broadcasts, avoid copies for destination PEs on same node for [nokeep] messages
First, all features should have a target version of 6.8.0-beta1 or 6.8.1, not 6.8.0.
Second, even in smp mode it sho...
Jim Phillips
10:04 PM Support #1391 (Closed): Add an SMP/multicore build test to Jenkins
Decided at Core last week: before any commit is merged, it should have to pass the test suite on an SMP or multicore ... Sam White
06:47 PM Bug #1148: Define 'thisIndex' for Groups
It would be good to have this soon, before the 6.8.0 release. Sam White

01/30/2017

04:05 PM Bug #1390 (Merged): AMPI_Alltoall crashes for short messages
Running the megampi test for more number of ranks(instead of 4) crashes with a memory corruption. More investigation ... Karthik Senthil
10:29 AM Feature #858: improve efficiency of exclusive entry methods
In the parameter marshalling case, it seems like CmiReference(impl_msg) would suffice to balance out the deletion tha... Phil Miller
06:41 PM Feature #858: improve efficiency of exclusive entry methods
Some progress on at least the simple case here before the 6.8.0 release would be valuable. Sam White
10:18 AM Feature #1389 (Merged): AMPI ATAReq test/wait performs poorly
AMPI uses IATAReqs to track completion of a group of requests from MPI_Ialltoall* and MPI_Ineighbor_*. Currently we t... Sam White
07:02 PM Bug #1388 (Merged): charmrun doesn't compile on netlrts-win-x86_64
This commit broke charmrun compilation on net-win64 and netlrts-win-x86_64: https://charm.cs.illinois.edu/gerrit/#/c/... Sam White
06:40 PM Cleanup #1058: Make tracing warning about flushing logs more informative
This should be quick and would be valuable to have merged before the 6.8.0 release. Sam White
06:35 PM Feature #1105: AMPI support for large counts
We will merge this once the same changes for 64 bit messages are made to Charm. That is a separate issue. Sam White

01/29/2017

07:00 PM Support #1387 (New): Optimised algorithms for scatterv
Vipul Harsh
06:58 PM Feature #1248 (Implemented): Charm++ scatter(v) collective interface
Naive point to point implementation: https://charm.cs.illinois.edu/gerrit/#/c/2083/ Vipul Harsh

01/28/2017

10:11 AM Feature #1386 (Rejected): ckDestroy for Groups and NodeGroups
Jozsef Bakosi pointed out on the charm mailing list that we currently do not support deleting Groups. ckDestroy is on... Sam White

01/27/2017

09:56 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
OK I didn't know that you couldn't do marshaling in callbacks although there are still two oddities about what you pr... Eric Mikida

01/26/2017

05:57 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
I think we may be able to simplify things.
For a common parameter-marshalled method invocation, we can make sure t...
Phil Miller
05:46 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
My thoughts after looking at this:
There are two ways in which reference numbers are used in SDAG which are "well-de...
Eric Mikida
01:02 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
Semi-related is the issue of tuple reductions not being able to be delivered to [reductiontarget] entry methods: http... Sam White
12:58 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
I think we should try to get this dealt with before 6.8.0 is released Phil Miller
01:39 PM Feature #1382: Add streaming floating point sum reducers
https://charm.cs.illinois.edu/gerrit/#/c/2159/ Sam White
01:39 PM Feature #1382 (In Progress): Add streaming floating point sum reducers
Sam White
01:29 PM Feature #1382 (Merged): Add streaming floating point sum reducers
We chose to make the default floating point sum reducer be not streaming when streaming reducers were added.
We can ...
Sam White
01:25 PM Feature #1322: PSM2 network layer
More on active messages:
https://github.com/01org/opa-psm2/blob/master/psm2_am.h
Jim Phillips
01:15 PM Feature #1322: PSM2 network layer
Presentation on OFI in MPICH:
https://www.openfabrics.org/images/eventpresos/workshops2015/DevWorkshop/Monday/mond...
Jim Phillips
01:07 PM Feature #1322: PSM2 network layer
Look for the first occurrence of psm2_am in the following:
https://github.com/01org/opa-psm2/blob/master/psm2.h
...
Jim Phillips
12:58 PM Feature #1113: smart-build.pl should detect supercomputers with specialized software environments...
Eric B - NB Phil Miller
12:57 PM Feature #1113: smart-build.pl should detect supercomputers with specialized software environments...
I think we should try to get this dealt with before 6.8.0 is released Phil Miller
12:54 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
This is the commit that I thought automated Mellanox vs. QLogic choice in verbs: https://charm.cs.illinois.edu/gerrit... Sam White
12:50 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
Building with --with-qlogic didn't help, I get the same error. Sam White
12:39 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
I only see this:
https://charm.cs.illinois.edu/gerrit/gitweb?p=charm.git;a=commitdiff;h=e3cfe89fffdd9349ababf5b13b01...
Jim Phillips
12:36 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
I added Bilge as a watcher, since I thought we had automated that build option? Sam White
12:35 PM Bug #1381: Crash in LrtsInitCpuTopo() on Quartz with verbs layer
You can fix this by adding --with-qlogic to the command line, although it will probably still be slower than MPI. Jim Phillips
12:03 PM Bug #1381 (Rejected): Crash in LrtsInitCpuTopo() on Quartz with verbs layer
Quartz is a new cluster with an Intel Omni-Path 100 Gb/s interconnect. If I build on verbs layer and run on more than... Sam White
12:44 PM Bug #1329: Hang in exit in TRAM test on gni-crayxc-smp
Core group would like to see some progress on this issue. Do you have access to Edison to reproduce this? Sam White
12:33 PM Feature #68 (In Progress): LRTS support for setting up a message to send and transmitting a GET h...
Is this solved by #1234? Eric Bohm
12:30 PM Feature #117: Projections traces with fewer files than 1-per-PE
This is a good point, and a change should definitely be made to plan ahead. I'll sketch out some possible designs as... Ronak Buch
12:28 PM Feature #109 (Merged): Test and merge section ID and manager work
Eric Bohm
10:32 AM Bug #1367 (Rejected): AMPI comm creation should use bulk construction
Distributed creation makes more sense and is already used. It's not distributed vs bulk that is the problem, it's tha... Sam White

01/25/2017

02:55 PM Bug #1294 (Merged): Darwin SMP failure in exit
Michael Robson
10:18 AM Bug #1294: Darwin SMP failure in exit
Reported to Apple as Bug #30186931. There doesn't seem to be any way in their system to CC others, so I'll update if/... Phil Miller
02:53 PM Bug #1379: SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
Simple example program to reproduce this issue (hang) is attached. Written by Ed Hutter. Sam White
06:18 PM Feature #1378: 64-bit Charm message sizes
Previously merged, related patches:
Use size_t instead of int for all PUP interfaces: https://charm.cs.illinois.e...
Sam White

01/24/2017

04:46 PM Bug #1379 (Merged): SDAG doesn't properly handle callbacks to [reductiontarget] methods with refnums
If you want to do something like 'when rednResult[refnum](int iter, double sum) serial { }' where rednResult is a [re... Sam White
02:59 PM Feature #1378: 64-bit Charm message sizes
Also, note that AMPI has a patch in gerrit for MPI-3 large count support, which should work once this Charm message s... Sam White
02:59 PM Feature #1378 (In Progress): 64-bit Charm message sizes
Currently, PUP supports 64bit sizes, sizes of chares are 64bit, the GNI/Isomalloc mempool uses 64bit sizes, but Charm... Sam White
02:51 PM Feature #1164 (Merged): AMPI support for MPI-3's request-based RMA routines
Sam White
02:07 PM Bug #1360: AMPI megampi test fails on mpi-crayxc and darwin builds
Also seeing a failure on netlrts-darwin-x86_64 on Wit in megampi with +p1 +vp1. Doesn't reproduce on my Mac, but is ... Sam White
02:02 PM Feature #1366 (Merged): AMPI thread should not be awoken unless progress can be made
I will follow up with the similar changes for recv and probe. They each have their own complications.
Recv: https:...
Sam White
09:46 PM Feature #1366 (Implemented): AMPI thread should not be awoken unless progress can be made
Sam White

01/23/2017

03:34 PM Bug #1375: os-isomalloc failures during startup on SMP builds
After testing this more, it appears the above change broke '-memory isomalloc' on SMP mode (it was already broken for... Sam White
01:01 PM Bug #1375: os-isomalloc failures during startup on SMP builds
Fixed the hang during initialization: https://charm.cs.illinois.edu/gerrit/#/c/2151/
But now we get a failure afte...
Sam White
02:24 PM Feature #1366 (In Progress): AMPI thread should not be awoken unless progress can be made
Sam White
12:06 PM Cleanup #966 (Implemented): cuda build fails unituitvely on blue waters when cudatoolkit isn't lo...
Same thing happens on NCSA nano cluster when no CUDA toolkit directory exists.
Modified build script to bail out as ...
Jaemin Choi
09:48 PM Bug #1377 (Closed): linking megatest on Eos fails with undefined references to ceil and floor fro...
The trick for C++ 11 that works on Edison (Cray XC) fails to link C++ on OLCF Eos:... Jim Phillips
09:21 PM Bug #1376: AMPI_Ireduce only creates a request at the root
examples/ampi/creduce only waits on the result at the root. That should be changed as well. Sam White
09:20 PM Bug #1376 (Merged): AMPI_Ireduce only creates a request at the root
MPI_Ireduce and other "all-to-one" type nonblocking collectives currently only create and set the request on the root... Sam White

01/22/2017

04:16 PM Feature #1357 (In Progress): Add AMPI RMA routine tests to megampi
This isn't really a bug, update the commit message accordingly: https://charm.cs.illinois.edu/gerrit/#/c/2148/ Sam White
10:40 AM Bug #1331: Isomalloc hangs in startup for Clang non-SMP builds
I opened a separate issue for os-isomalloc failures on SMP builds here: https://charm.cs.illinois.edu/redmine/issues/... Sam White
10:39 AM Bug #1375 (Merged): os-isomalloc failures during startup on SMP builds
os-isomalloc fails during startup on the first access to a lock in cputopolgy.C. os-isomalloc works on Clang non-SMP ... Sam White

01/21/2017

10:42 PM Bug #1358 (Merged): AMPI Iget extensions are broken
Sam White

01/20/2017

05:41 PM Bug #1374 (Merged): Reduction clause doesn't work on the OpenMP integration with icc and clang
Reduction clause doesn't work on the OpenMP integration with icc and clang but works with GCC.
The reason why it ...
Seonmyeong Bak
04:46 PM Bug #1294 (Implemented): Darwin SMP failure in exit
We (Phil and I) isolated what we think might be a compiler bug that's causing this issue. See the attached files for ... Michael Robson
02:28 PM CharmDebug Bug #1373 (Merged): Unable to attach to an application it launched
Created launch configuration using the built in gui. When "start" is triggered, it was capable of launching the appli... Eric Bohm
02:23 PM Bug #1372 (Resolved): Using GCC-6 in support of Intel icc/icpc on Edison fails with no member "it...
Compatible versions of gcc and icc must be used together (gcc/4.9 and icc/15.9 work together). Sam White
09:15 AM Bug #1372: Using GCC-6 in support of Intel icc/icpc on Edison fails with no member "iterator_cate...
Using gcc/4.9.3 works. I'm not sure if gcc-5 works or not (or if that's available on Edison).
This suggests that t...
Sam White
09:01 AM Bug #1372 (Closed): Using GCC-6 in support of Intel icc/icpc on Edison fails with no member "iter...
Trying to build on Edison (which has an older gcc in /usr/bin) the advice
for C++11 code is to module load gcc (6.1...
Sam White
09:03 AM Feature #1366: AMPI thread should not be awoken unless progress can be made
Support in AMPI_Wait* routines: https://charm.cs.illinois.edu/gerrit/#/c/2152/
Similar changes can be made in bloc...
Sam White

01/19/2017

05:46 PM Feature #1371 (New): Within-node PUP API
When migrating a chare within the same logical node, we can optimize migration time by allowing the application to no... Sam White
05:40 PM Bug #1278: ReductionStarting overhead in SMP mode
In Core we discussed the possibility of passing CkArrayOptions through to the reduction managers just like what is al... Sam White
04:51 PM Feature #1352: CkArrayOptions callback for completion of chare array initialization
Reassign if urgent. Isomalloc hangs and other AMPI things are more important to me at the moment. This is only really... Sam White
11:49 AM Feature #1352: CkArrayOptions callback for completion of chare array initialization
AMPI's use case for this (intercomm creation) is one-at-a-time/element-by-element initial insertion. Perhaps we shoul... Sam White
04:47 PM Bug #1189 (Merged): AMPI resumeOnRecv should be a property of the thread, not each comm
Sam White
04:46 PM Bug #1358 (Implemented): AMPI Iget extensions are broken
Sam White
04:46 PM Bug #1240 (New): MPI_COMM_SELF handling issues in AMPI
Sam White
02:47 PM Bug #1367 (Rejected): AMPI comm creation should use bulk construction
Currently, AMPI subcommunicators and intercommunicators are created by ckNew with numInitial=0, then proxy[idx].inser... Sam White
11:27 AM Feature #1366 (Merged): AMPI thread should not be awoken unless progress can be made
AMPI's multiple completion routines currently block on any recv, while they could sleep until all or a specific reque... Sam White
11:33 PM Cleanup #1365 (Merged): Deduplicate AmpiRequest routines and their handling of NULL statuses
There are 2 parts to this:
1. A successful itest/test should have the same effect as a call to complete/wait, so tha...
Sam White

01/18/2017

05:16 PM Bug #1342 (Merged): Segfaults when using '-tracemode perfReport'
Ronak Buch
04:33 PM Bug #1342 (Implemented): Segfaults when using '-tracemode perfReport'
https://charm.cs.illinois.edu/gerrit/#/c/2136/1 Phil Miller
02:30 PM Cleanup #1363: Remove/deprecate dead machine layers
Not really a bug - their presence doesn't much hurt anything. Phil Miller
02:30 PM Cleanup #1363: Remove/deprecate dead machine layers
Format description to avoid ugly strike-through display Phil Miller
10:52 AM Bug #1080 (Merged): multicore projections tracing runs hang at startup on 129 pes
Phil Miller
08:58 AM Bug #1364: Review use of volatile variables in the runtime
The one in ckarrayreductionmgr was unused, so removed here: https://charm.cs.illinois.edu/gerrit/#/c/2132/ Sam White
08:36 AM Bug #1364 (New): Review use of volatile variables in the runtime
We've seen instances of volatile being used in places in the runtime where memory fences/barriers are actually needed... Sam White

01/17/2017

11:12 AM Bug #1331: Isomalloc hangs in startup for Clang non-SMP builds
Does rearranging that to not be a static constructor fix this crash? Phil Miller
09:03 AM Bug #1331: Isomalloc hangs in startup for Clang non-SMP builds
Yes, the failure is that a CmiNodeLock in conv-core/cputopology.C is NULL when it shouldn't be. Trying to figure out ... Sam White
09:00 AM Bug #1331: Isomalloc hangs in startup for Clang non-SMP builds
It looked like fairly similar failures across several of them to me. I suspect that the one we're seeing now ought to... Phil Miller
08:40 AM Bug #1331: Isomalloc hangs in startup for Clang non-SMP builds
Looking at autobuild from last night, os-isomalloc passed the tests isomalloc had been except netlrts-linux-smp, netl... Sam White
10:26 AM Bug #1337: Cpv Declarations of types with constructors may induce 'static initialization order fi...
Found another static variable of complex type in src/conv-core/cputopology.C: static CpuTopology cpuTopo;
Edit: nv...
Sam White
 

Also available in: Atom