Project

General

Profile

Activity

From 03/11/2018 to 04/09/2018

04/10/2018

07:12 PM Bug #1849 (In Progress): mpi-linux-x86_64-syncft broken by zero copy API
Yeah, in-memory checkpointing can't work in the +p1 case, since there's no remote memory to checkpoint and restart fr... Sam White

04/09/2018

06:40 PM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
The first expression is true. The while loop seems to become an infinite loop when both pe and budpe are 0.
Print...
Nitin Bhat
03:28 PM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
Which of the while expressions is always true? Sam White
02:02 PM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
Launching a serial version using gdb ./pgm causes the hang at the following location:
*CkMemCheckPT::BuddyPE (pe=...
Nitin Bhat
01:41 PM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
I see the hang as well, when I add the fix. However, I am intermittently seeing the hang on earlier commits too. I tr... Nitin Bhat
08:44 AM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
This is a release blocker Sam White
06:34 PM Bug #1852 (Merged): verbs-linux-x86_64 failure in tests/charm++/pingpong with +p2 after direct AP...
Sam White
05:50 PM Bug #1852 (Implemented): verbs-linux-x86_64 failure in tests/charm++/pingpong with +p2 after dire...
Fix: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3971/ Nitin Bhat
08:44 AM Bug #1852: verbs-linux-x86_64 failure in tests/charm++/pingpong with +p2 after direct API merge
This is a release blocker Sam White
06:04 PM Bug #1853: Charmxi doesn't parse std::tuple
Evan, can you take a stab at this and parameter packs (issue #1854) in the next week or two to see if we can get it i... Sam White
03:22 PM Bug #1853 (Merged): Charmxi doesn't parse std::tuple
Nils pointed this out, Spectre currently patches charmxi's generated code to get around this. Sam White
04:53 PM Feature #559: Generic (un)marshalling code to replace code generation per entry-method
From Nils, we can do this with an initializer_list that uses std::forward on the variadic template arguments. Sam White
04:47 PM Bug #1856 (Merged): Keep track of entry method template arguments in traces for Projections
We currently don't record the template arguments of entry methods in traces for Projections, so if you have two 'entr... Sam White
04:22 PM Bug #1855 (Rejected): Zero copy send with std::move'ing the buffer to the runtime
This would allow the application to tell the runtime to send a buffer without copying it and remove the need for the ... Sam White
03:25 PM Bug #1679: Do Not Require Default Constructors for Serializable Classes
For direct documentation here, you can now specify a constructor that takes "PUP::reconstruct" as its argument Sam White
03:23 PM Bug #1854 (Merged): Charmxi chokes on parameter packs
Nils is currently working around this in Spectre. charmxi doesn't parse "..." properly. Sam White
08:51 PM Support #1681: Document Exception Handling in Charm++
Make the RTS use std::set_terminate: https://charm.cs.illinois.edu/redmine/issues/1730
Mark the entire RTS as 'noe...
Sam White
07:59 PM Bug #1737: tests/charm++/pingpong and examples/charm++/zerocopy/pingpong fail when run on 2 proce...
Just curious: which system do we want pamilrts-linux-x86_64 (not ppc64le?) to run on? Sam White

04/08/2018

10:36 AM Bug #1852 (Merged): verbs-linux-x86_64 failure in tests/charm++/pingpong with +p2 after direct AP...
It seems to be failing autobuild consistently here since the direct API verbs patch was merged. The next case in this... Sam White

04/07/2018

04:36 PM Bug #1850 (Merged): multicore builds failing to parse CLAs correctly in tests/charm++/load_balanc...
Sam White
12:22 PM Bug #1850 (Implemented): multicore builds failing to parse CLAs correctly in tests/charm++/load_b...
https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3964/ Sam White
10:41 AM Bug #1850 (Merged): multicore builds failing to parse CLAs correctly in tests/charm++/load_balanc...
Since the change to command line argument parsing in charmrun on multicore builds, they are all failing to parse lb_t... Sam White
02:11 PM Bug #1849: mpi-linux-x86_64-syncft broken by zero copy API
The issue looks to be that mpi/conv-mach-syncft.h was not updated with the additional field 'msgType' in the MPI dire... Sam White
10:39 AM Bug #1849 (Merged): mpi-linux-x86_64-syncft broken by zero copy API
... Sam White
01:02 PM Feature #1851 (New): Ensure PPCAtomicQueue has proper alignment/padding on ppc64le
PPCAtomicQueue is used by pamilrts on both BGQ and POWER8/9 systems. POWER8/9 has 128 Bytes cache line size, while BG... Sam White

04/06/2018

06:41 PM Bug #1846 (Merged): mpi-win64 machine-rdma.h 'void *': unknown size error
Sam White
04:58 PM Bug #1846 (Implemented): mpi-win64 machine-rdma.h 'void *': unknown size error
https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3960/ Sam White
11:46 AM Bug #1846: mpi-win64 machine-rdma.h 'void *': unknown size error
Offending commit: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3120/ Sam White
06:41 PM Bug #1848 (Merged): Make multicore charmrun parse all arguments
Sam White
05:31 PM Bug #1848 (Implemented): Make multicore charmrun parse all arguments
https://charm.cs.illinois.edu/gerrit/3961 Evan Ramos
05:17 PM Bug #1848 (Merged): Make multicore charmrun parse all arguments
Currently the multicore version of charmrun only parses arguments until it hits one that doesn't start with a '+'. Th... Sam White
05:18 PM Cleanup #1847 (In Progress): Eliminate sources of various compiler warnings
Sam White
04:38 PM Cleanup #1847: Eliminate sources of various compiler warnings
I've updated the patch on gerrit. These are the only warnings left now from what I originally posted above:... Sam White
02:35 PM Cleanup #1847: Eliminate sources of various compiler warnings
> The if condition is only to avoid warnings that would happen if we define _GNU_SOURCE when it is already defined. E... Sam White
02:31 PM Cleanup #1847: Eliminate sources of various compiler warnings
Sam White wrote:
> I think the noreturn warnings can be solved by adding "while(1);" to the end of those functions: ...
Evan Ramos
12:41 PM Cleanup #1847: Eliminate sources of various compiler warnings
I think the noreturn warnings can be solved by adding "while(1);" to the end of those functions: https://stackoverflo... Sam White
10:16 AM Cleanup #1847: Eliminate sources of various compiler warnings
I got rid of most of these here: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3955/
The remaining ones are:
...
Sam White
09:30 AM Cleanup #1847 (Merged): Eliminate sources of various compiler warnings
Looking over autobuild I see a lot compiler warnings, some of which were added recently, that should be easily remove... Sam White
02:22 PM Bug #1801: charmrun on multicore builds doesn't consume ++local
Yikes, that's correct, it works if ++local comes before the program's name. Sam White
02:17 PM Bug #1801: charmrun on multicore builds doesn't consume ++local
It looks like the multicore charmrun shell script is only set up to parse arguments until it reaches one that does no... Evan Ramos
02:14 PM Bug #1801: charmrun on multicore builds doesn't consume ++local
I am still seeing ++local not consumed on multicore-darwin-x86_64. It looks like it gets passed through to the RTS wh... Sam White
02:03 PM Bug #1789: ++oneWthPerSocket doesn't work on Darwin
https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3958/ Sam White
07:51 PM Feature #1742: Send large readonly variables using the Nocopy API to reduce memory footprint and ...
Moving this after 6.9 Nitin Bhat
07:16 PM Bug #937: [local] entry methods don't set tracing event dependencies
Bump Sam White
07:11 PM Feature #921: Entry tag [inline] is unable to optimize away most of the overhead
tests/charm++/pingpong has some of them, though I don't think it passes its arguments by const& Sam White
07:09 PM Feature #921: Entry tag [inline] is unable to optimize away most of the overhead
Is there any existing code to profile/time these? Evan Ramos
07:01 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
I think so. Evan Ramos

04/05/2018

06:56 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
Would that mean we'd basically have to make every single entry method a templated method? Well, I guess just every si... Eric Mikida
06:41 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
I believe the build failures we are seeing with Eric's patch are a result of the generated code not meeting the requi... Evan Ramos
04:59 PM Bug #1846 (Merged): mpi-win64 machine-rdma.h 'void *': unknown size error
... Jim Phillips
04:45 PM Feature #921: Entry tag [inline] is unable to optimize away most of the overhead
Yeah, I am unsure what is left to do here that isn't covered by Bug #1699. It might be good to have another round of ... Sam White
04:30 PM Feature #921: Entry tag [inline] is unable to optimize away most of the overhead
I'm having trouble understanding what problem(s) still need fixing for this issue to be considered complete. It appea... Evan Ramos
02:57 PM Support #1725 (In Progress): Improve pup_stl testing
Samarth Kulshreshtha
02:42 PM Feature #1809 (Merged): Direct API for the OFI layer
Nitin Bhat
02:42 PM Feature #1807 (Merged): Direct API for the Verbs layer
Nitin Bhat
12:25 PM Feature #1808 (Merged): Direct API for the MPI layer
Nitin Bhat
11:55 AM Feature #1805 (Merged): Direct API for the GNI layer
Nitin Bhat
10:15 AM Feature #1810 (Merged): Direct API - Support multiple user operation modes
Nitin Bhat

04/04/2018

05:03 PM Support #1842 (Closed): Make Jenkins SMP builds run faster
Adding +CmiSleepOnIdle seems to have fixed the issue in Jenkins SMP builds. I also increased the parallel builds from... Sam White
02:33 PM Support #1842: Make Jenkins SMP builds run faster
Actually it is a much more drastic improvement in some cases: from 64511.965036 us to 36.540985 us for 1D array pingp... Sam White
02:27 PM Support #1842: Make Jenkins SMP builds run faster
On my lab machine (beauty), adding +CmiSleepOnIdle to TESTOPTS cuts down the time taken for SMP pingpong with +p2 by ... Sam White
09:36 AM Support #1842: Make Jenkins SMP builds run faster
The netlrts-linux-x86_64-smp autobuild target should I think have +CmiSleepOnIdle added to its TESTOPTS as well. netl... Sam White
08:44 PM Support #1842 (Closed): Make Jenkins SMP builds run faster
We are seeing a lot of timeouts in SMP builds for Jenkins commit-triggered builds, because tests/charm++/pingpong/ ta... Sam White
05:02 PM Feature #984: AMPI support for missing MPI-3 non-blocking collectives
Implemented MPI_I{(ex)scan,reduce_scatter(_block)} using calls to their blocking variants: https://charm.cs.illinois.... Sam White
04:57 PM Bug #1843 (Merged): Undefined references to Converse QD
Sam White
11:52 AM Bug #1843 (Implemented): Undefined references to Converse QD
Nitin Bhat
11:30 AM Bug #1843: Undefined references to Converse QD
Fix is here: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3944/ Sam White
11:27 AM Bug #1843: Undefined references to Converse QD
The error I am seeing is:... Nitin Bhat
11:23 AM Bug #1843 (Merged): Undefined references to Converse QD
Builds with -j1, -j2, -j4 seems to be going fine. (For this reason, this was probably not caught in autobuild).
I'...
Nitin Bhat
03:03 PM Feature #65: MetaLB: Separate cost/benefit analysis for running LB strategy and performing migration
Estimation of migration costs is already included in the model based method to select Load balancing strategy. So thi... Kavitha Chandrasekar
02:26 PM Feature #1667: Direct API for nocopy operations on sender-side and receiver-side
Except for documentation for the Direct API, which is targeted for 6.9, all the other tasks are for later releases. Nitin Bhat
02:23 PM Documentation #1845 (Merged): Documentation for the Zerocopy Direct API
Nitin Bhat
02:19 PM Feature #1844 (New): SMP/non-SMP agnostic job launching arguments
Currently a user has to know whether Charm++ was built in SMP mode or not when they are running their application. We... Sam White
12:14 PM Feature #921: Entry tag [inline] is unable to optimize away most of the overhead
Reassigning to Evan in the hopes that he will have the right mix of time and expertise. Eric Bohm
11:45 AM Support #1841 (Closed): Add -Werror=vla to Jenkins per-commit builds
Nitin Bhat
11:45 AM Support #1841: Add -Werror=vla to Jenkins per-commit builds
Added it to non-smp build - charm-commit-triggered and smp build - test-charm-smp.
Nitin Bhat
08:38 PM Support #1841 (Closed): Add -Werror=vla to Jenkins per-commit builds
This was done a year ago, but has apparently been undone sometime since then: https://charm.cs.illinois.edu/redmine/i... Sam White
10:51 AM Feature #1579: Mark CkAbort [[noreturn]]
I'm seeing compiler warnings about this now:... Sam White
07:57 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
It is certainly possible that my implementation is correct, this is really my first time doing anything specifically ... Eric Mikida
07:21 PM Feature #1803 (Merged): Direct API for the LRTS layer
Sam White

04/03/2018

10:12 PM Cleanup #1840 (Merged): Direct API - Pass CkNcpyDestination and CkNcpySource by reference (instea...
Sam White

04/02/2018

05:43 PM Cleanup #1840 (Implemented): Direct API - Pass CkNcpyDestination and CkNcpySource by reference (i...
Fix: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3935/ Nitin Bhat
05:39 PM Cleanup #1840 (Merged): Direct API - Pass CkNcpyDestination and CkNcpySource by reference (instea...
Nitin Bhat
04:14 PM Feature #1804 (Merged): Direct API - Add CMA support for netlrts, multicore and pami builds
Nitin Bhat
03:47 PM Bug #1832 (Merged): SMP hangs in megatest/multisectiontest
Nitin Bhat
02:51 PM Bug #1832 (Implemented): SMP hangs in megatest/multisectiontest
Fix: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3934/ Nitin Bhat
02:53 PM Bug #738: prevent direct calls to sdag entry functions
Openatom changes corresponding to this bug fix have been merged: https://charm.cs.illinois.edu/gerrit/#/c/openatom/+/... Nitin Bhat
08:12 AM Cleanup #1839 (Merged): tests/charm++/pmetest/ doesn't have any source files in it?
Sam White

03/30/2018

12:23 PM Bug #1220 (Merged): AMPI: Support tlsglobals with dynamically linked objects
Sam White
10:03 AM Bug #1811 (In Progress): Cross-array section reduction and multicast with callback set at contrib...
Raghavendra Kanakagiri
09:29 PM Bug #1832: SMP hangs in megatest/multisectiontest
This is a QD bug introduced by the group dependence patch.
It hangs multisection test only because that test uses ...
Juan Galvez

03/29/2018

05:10 PM Feature #1579 (Merged): Mark CkAbort [[noreturn]]
Sam White
10:51 AM Feature #1579: Mark CkAbort [[noreturn]]
The patch above has been updated to only mark CkAbort() as noreturn. We can't safely mark CkExit() as noreturn becaus... Sam White
04:04 PM Cleanup #1839 (Implemented): tests/charm++/pmetest/ doesn't have any source files in it?
Nikhil removed this test in 2015, but forgot to get rid of the Makefile: https://charm.cs.illinois.edu/gerrit/#/c/cha... Sam White
03:38 PM Cleanup #1839 (Merged): tests/charm++/pmetest/ doesn't have any source files in it?
I don't know what's going on with this test, but there's nothing besides a Makefile in its directory Sam White
03:46 PM Bug #1789 (Merged): ++oneWthPerSocket doesn't work on Darwin
Sam White
03:42 PM Bug #1797 (Merged): ARM64 build and stability issues
Evan confirmed that making context threads the default has fixed the ARM 64-bit builds. Sam White
03:41 PM Bug #1798 (Merged): uFcontext thread support for 64-bit ARM systems
Marking this complete for now since we've worked around it by using context threads on ARM8.
There's another issue...
Sam White
03:13 PM Feature #1417 (Merged): Reduce CkReductionMsg envelope size
Sam White
01:33 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
If it would be useful at all, it is possible for lvalue and rvalue reference overloads to exist side by side.
<pre...
Evan Ramos
10:46 AM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
Here's the patch: https://charm.cs.illinois.edu/gerrit/#/c/charm/+/3788/ Sam White
09:20 AM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
This should not be an API breaking change. If implemented correctly perfect forwarding does not break existing code, ... Nils Deppe
10:52 AM Cleanup #980: Cleanup examples/charm++/cuda/hello
Not a release blocker Sam White
10:48 AM Feature #1695: Allow "type aliases" for explicit instantiations of member function templates
This seems unlikely to get done before 6.9.0 Sam White

03/28/2018

05:41 PM Feature #1731: Complete spack installation script
I just asked Edward and he just hadn't seen the comments on the pull request for whatever reason. So he'll keep worki... Sam White
05:22 PM Feature #1731: Complete spack installation script
@Sam, do you know what the status is? I'd be happy to take this over, but I don't have permissions to modify the pull... Matthias Diener
08:16 PM Feature #1731: Complete spack installation script
Spack pull request here: https://github.com/spack/spack/pull/7291
I'm not sure if Edward Kim is still working for ...
Sam White
03:32 PM Bug #1838: Reevaluate LBCommTable initialization
LBCommData is 88 Bytes, and TableState is 4 Bytes.
This commit cuts TableState from 4 Bytes to 1 Byte: https://cha...
Sam White
02:17 PM Bug #1838 (New): Reevaluate LBCommTable initialization
This is what LBCommTable currently looks like:... Sam White
03:13 PM Bug #1831 (Merged): Shrink/expand broken by charmrun refactoring
Sam White
02:33 PM Feature #177 (In Progress): objid_t: load balancing infrastructure should use objid_t
https://charm.cs.illinois.edu/gerrit/#/c/3742/ Kavitha Chandrasekar
10:32 AM Bug #1832: SMP hangs in megatest/multisectiontest
Most of the hangs or crashes are in megatest/multisection test, but today SMP seemed to fail in tests/converse/commbe... Juan Galvez
09:34 AM Bug #1832: SMP hangs in megatest/multisectiontest
Yeah, I thought it had been fixed by the above patch.
So what we are getting now is a random hang, not always in t...
Juan Galvez
08:37 AM Bug #1832: SMP hangs in megatest/multisectiontest
Originally [1] caused failures in this test. That initial failure was then fixed by [2], but now we are seeing this h... Sam White
08:27 AM Bug #1832: SMP hangs in megatest/multisectiontest
Do we know the commit that triggered this bug? Juan Galvez
08:26 PM Bug #1832: SMP hangs in megatest/multisectiontest
This is one of the few bugs left that would block the release of 6.9.0 Sam White
11:19 PM Bug #1837 (New): uFcontext thread issue on ARM 64 bit systems
Currently, uFcontext based ULT doesn't work on ARM64 bit systems.
For ARM 64bit systems, context is set by defaul...
Seonmyeong Bak
08:21 PM Feature #641: protect load balancer from variable cpu clock
Please close this issue if it is done Sam White
08:19 PM Documentation #1656: Update manual entries on Load Balancing strategies
This would be good to have for 6.9.0 Sam White

03/27/2018

06:09 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
It's not clear what to do here for 6.9.0, any thoughts Nils? Sam White
06:08 PM Bug #1798 (In Progress): uFcontext thread support for 64-bit ARM systems
As noted on the gerrit issue, this still isn't passing any tests that use ULTs. Maybe for now just make the default o... Sam White
06:06 PM Bug #1665 (Merged): DDT needs to reference count and free its type objects
Sam White
03:26 PM Feature #1657: CMA support for nocopy sends using the Entry Method API across processes on the sa...
Moving it out of 6.9 release as the Broadcast API has higher priority. Nitin Bhat
03:26 PM Feature #1468: Enable pre-pinning memory for the zero-copy message sends through the Entry Method...
Moving it out of 6.9 release as the Nocopy Broadcast API has higher priority. Nitin Bhat

03/26/2018

05:35 PM Feature #1378: 64-bit Charm message sizes
I don't think we want to expand to support 64bit sizes for all messages.
That would pointlessly blow up the size f...
Eric Bohm
03:44 PM Feature #1378 (In Progress): 64-bit Charm message sizes
Evan Ramos
03:42 PM Bug #1220 (Implemented): AMPI: Support tlsglobals with dynamically linked objects
Evan Ramos
03:29 PM Feature #1113 (Merged): smart-build.pl should detect supercomputers with specialized software env...
Evan Ramos
03:28 PM Bug #522 (Feedback): static linking breaks on multicore builds with 'undefined reference to `get_...
Proposing closure unless the issue can be reproduced. Evan Ramos
02:55 PM Bug #1831 (Implemented): Shrink/expand broken by charmrun refactoring
Fixed in: https://charm.cs.illinois.edu/gerrit/3900
Tested with netlrts-darwin-x86_64 and netlrts-linux-x86_64.
Evan Ramos
09:28 AM Cleanup #1836 (New): Cleanup warnings in charmdebug-python
Compiling multicore-linux-x86_64 with ICC on Stampede2 gives the following warnings, some in the generated code:
<pr...
Sam White

03/20/2018

03:57 PM Bug #738: prevent direct calls to sdag entry functions
I have made changes to Openatom to convert direct sdag calls to proxy calls and have this change on a branch called `... Nitin Bhat
03:38 PM Bug #522 (In Progress): static linking breaks on multicore builds with 'undefined reference to `g...
I am unable to reproduce this. My command was:
./build AMPI multicore-linux-x86_64 -j4 --no-build-shared -optimize...
Evan Ramos
02:38 PM Bug #756: CUDA build does not correctly find cuda location
Sam White wrote:
> What does that documentation patch have to do with this issue?
From the Description:
> tl;dr ...
Michael Robson

03/19/2018

10:47 AM Bug #1832 (In Progress): SMP hangs in megatest/multisectiontest
Nitin Bhat
10:38 AM Feature #541 (Implemented): SMP mesage passing must enforce memory ordering consistency
Gerrit: https://charm.cs.illinois.edu/gerrit/#/c/3877/ Nitin Bhat
10:00 AM Feature #1834 (Implemented): Functionality to Broadcast large buffers using the nocopy API
Nitin Bhat

03/17/2018

04:42 PM Cleanup #1833 (New): Cleanup SDAG Closure refnum handling with enable_if
The SDAG code has this in a comment:... Sam White
04:21 PM Bug #1832 (Merged): SMP hangs in megatest/multisectiontest
Jenkins builds and the netlrts-linux-x86_64-smp autobuild target are hanging pretty consistently in tests/charm++/meg... Sam White
12:02 PM Bug #1822 (Merged): megatest/multisection test failures caused by changes to group dependencies
Evan created a patch to prevent similar issues in user code: https://charm.cs.illinois.edu/gerrit/#/c/3871/ Sam White
12:00 PM Bug #1819 (Merged): bigsim failing lb_test inside CmiMemoryUsage()
Sam White
12:00 PM Bug #1828 (Merged): Infinite recursion inside malloc_info in CmiMemoryUsage
Sam White

03/16/2018

01:08 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
We reverted the malloc_info patch for now Sam White
01:07 PM Bug #1819: bigsim failing lb_test inside CmiMemoryUsage()
We reverted the malloc_info patch for now Sam White

03/15/2018

06:53 PM Feature #1417: Reduce CkReductionMsg envelope size
This commit reduces the size of CkReductionMsg's from 168 to 136 Bytes and the size of CkCallback's from 72 to 64 Byt... Sam White
05:25 PM Bug #1800 (Merged): Section reduction contribute() should accept std::vector
Sam White
05:25 PM Bug #1830 (Merged): ChaNGa deadlocks due to recent change to QD types
Sam White
08:25 PM Bug #1830 (Implemented): ChaNGa deadlocks due to recent change to QD types
Sam White
08:20 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
Commit 6b1b6d708 fixes the problem. Thomas Quinn
03:32 PM Bug #756: CUDA build does not correctly find cuda location
Not a release blocker Sam White
03:29 PM Bug #756: CUDA build does not correctly find cuda location
What does that documentation patch have to do with this issue? Sam White
03:31 PM Bug #802: CUDA examples broken on Blue Waters
Not a release blocker Sam White
03:31 PM Bug #815: Makefile for hybrid API is not using the system OPTS
Not a release blocker Sam White
03:28 PM Bug #1797: ARM64 build and stability issues
Patch for uFcontext threads support: https://charm.cs.illinois.edu/gerrit/#/c/3847/
That just needs testing valida...
Sam White
03:24 PM Bug #1816 (New): Cray CCE/8.5 builds are broken
CCE 8.6 works on Cori Sam White
02:50 PM Bug #1831: Shrink/expand broken by charmrun refactoring
k, since it doesn't have automated testing it may have been broken for a while now. Though I believe we tested it for... Sam White
02:12 PM Bug #1831: Shrink/expand broken by charmrun refactoring
The example crashes. I will have to debug this. Evan Ramos
01:53 PM Bug #1831: Shrink/expand broken by charmrun refactoring
examples/charm++/shrink_expand. It doesn't look like it's part of automated testing Sam White
01:50 PM Bug #1831 (In Progress): Shrink/expand broken by charmrun refactoring
https://charm.cs.illinois.edu/gerrit/3860
Are there any tests for shrink/expand?
Evan Ramos
12:04 PM Bug #1831 (Merged): Shrink/expand broken by charmrun refactoring
Trying to build with '--enable-shrinkexpand' fails on compiling charmrun with the following on netlrts-darwin-x86_64:... Sam White
02:49 PM Bug #1239: Cleanup reduction uses in the runtime
Remove uses in tests/ and examples/ directories:
https://charm.cs.illinois.edu/gerrit/#/c/2792/
https://charm.cs....
Sam White
09:55 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Thomas Quinn wrote:
> I still get the crash with "-O2".
>
> Running with
> ./hello +stack-size 10000000 ++ppn 8
...
Shaoqin Lu
09:51 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
I still get the crash with "-O2".
Running with
./hello +stack-size 10000000 ++ppn 8
gets me past the problem.
Thomas Quinn
09:25 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Does -O2 work though?
I know relying on compiler optimization is not reliable. I will come up with some workaroun...
Shaoqin Lu
09:22 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
I built with "-g -O2", and I still get the crash.
Thomas Quinn
07:11 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
This issue is potentially resolved. See the g++ stdlib thread
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84865
...
Shaoqin Lu

03/14/2018

04:42 PM Feature #1704: Add a pamilrts-linux-ppc64le build target
Is it possible for us to set up an autobuild target for pamilrts-linux-ppc64le on Summit or SummitDev? I think Vesta ... Sam White
03:25 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
Actually, we found another issue in the original patch, I updated this patch to fix that. Can you try ChaNGa on the u... Sam White
10:48 AM Bug #1830: ChaNGa deadlocks due to recent change to QD types
Could you provide the run command for ChaNGa that reproduces the deadlock? In case we want to pursue the datatype cha... Sam White
10:35 AM Bug #1830: ChaNGa deadlocks due to recent change to QD types
Revert commit: https://charm.cs.illinois.edu/gerrit/#/c/3848/ Sam White
10:33 AM Bug #1830: ChaNGa deadlocks due to recent change to QD types
K we can just revert that change for now then, since I'm not sure how that is causing a problem and it's an easy one ... Sam White
11:21 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
The problem goes away if I run with commit 8de6719c613f34a3d6b1baccb9aa4aa8f78c12db, the commit before 2897. Thomas Quinn
02:19 PM Bug #992 (Closed): +setcpuaffinity ignores cpuset
Eric Bohm
02:18 PM Bug #1010 (Closed): +setcpuaffinity +showcpuaffinity doesn't actually print the affinity settings
since resolved ins't closed, we choose closed Eric Bohm
02:16 PM Bug #1010 (Resolved): +setcpuaffinity +showcpuaffinity doesn't actually print the affinity settings
Was fixed by Evan's work. Eric Bohm
02:14 PM Bug #756: CUDA build does not correctly find cuda location
Renewing an old issue: https://charm.cs.illinois.edu/gerrit/#/c/2048/ Michael Robson
02:02 PM Bug #1473: verbs build hangs in tests/charm++/communication_overhead
Current issue is not a hang, it crashes.
PE MSG SIZE PER MSG TIME(us) BW(MB/s) ...
Eric Bohm
01:53 PM Bug #1822 (Implemented): megatest/multisection test failures caused by changes to group dependencies
Thanks for the valgrind outputs, Sam.
The bug was caused because of the charm++ message in multisection test not al...
Nitin Bhat
09:22 AM Bug #1822 (In Progress): megatest/multisection test failures caused by changes to group dependencies
Nitin Bhat
12:55 PM Bug #1699: [inline] entry methods should use perfect forwarding in C++11
The patch where this is implemented just replaces the currently generated code with the code that Nils provided above... Eric Mikida
12:16 PM Bug #1337: Cpv Declarations of types with constructors may induce 'static initialization order fi...
no known use cases to justify making this a release 6.9.1 target Eric Bohm
11:13 AM Bug #1811 (Implemented): Cross-array section reduction and multicast with callback set at contrib...
Raghavendra Kanakagiri
10:51 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
I originally got the failure with this version:
trq@daint105:~/src/charm> gcc -v
Using built-in specs.
COLLECT_G...
Thomas Quinn
10:48 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Thomas Quinn wrote:
> gcc -v:
> Using built-in specs.
> COLLECT_GCC=/opt/gcc/5.3.0/bin/../snos/bin/gcc
> COLLECT_...
Shaoqin Lu
10:47 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
gcc -v:
Using built-in specs.
COLLECT_GCC=/opt/gcc/5.3.0/bin/../snos/bin/gcc
COLLECT_LTO_WRAPPER=/opt/gcc/5.3.0/sn...
Thomas Quinn
10:45 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
g++ bug report link:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84865
Shaoqin Lu
10:42 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Hi Thomas, I am filing a bug report to g++ on this issue. Do you mind posting the
`g++ -v` and `uname -a` informat...
Shaoqin Lu
01:40 AM Bug #1798: uFcontext thread support for 64-bit ARM systems
https://charm.cs.illinois.edu/gerrit/#/c/3847/
Added configuration directories for ARMv8 processors.
Except Cort...
Seonmyeong Bak
01:37 AM Bug #1798 (Implemented): uFcontext thread support for 64-bit ARM systems
Seonmyeong Bak
10:22 PM Feature #1807 (Implemented): Direct API for the Verbs layer
Nitin Bhat

03/13/2018

06:49 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
But reverting 2897 makes the problem go away?
https://charm.cs.illinois.edu/gerrit/#/c/2897/
Sam White
03:29 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
I tried changeset 3844, and I still get the hang. Thomas Quinn
10:02 PM Bug #1830: ChaNGa deadlocks due to recent change to QD types
Does this patch fix the problem? https://charm.cs.illinois.edu/gerrit/#/c/3844/ Sam White
07:48 PM Bug #1830 (Merged): ChaNGa deadlocks due to recent change to QD types
I'm getting a deadlock when running ChaNGa with recent versions of charm. "git bisect" says that the problem commit ... Thomas Quinn
05:06 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Thomas Quinn wrote:
> Somewhat of an aside, but the usage numbers reported by malloc_info() on the cray-xc are not i...
Shaoqin Lu
05:04 PM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
Somewhat of an aside, but the usage numbers reported by malloc_info() on the cray-xc are not informative. I doubt th... Thomas Quinn
09:42 AM Bug #1828 (In Progress): Infinite recursion inside malloc_info in CmiMemoryUsage
Sam White
02:42 PM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
multisectiontest sets group dependencies on messages that are multicasted over a section via CkMulticast. Perhaps the... Sam White
01:26 PM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
I added the valgrind output to the issue description above. Sam White
01:26 PM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
Running the multisection test under Valgrind shows a bunch of leaks coming from _allocEnv() and the following invalid... Sam White
01:25 PM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
Running the multisection test under Valgrind shows a bunch of leaks coming from _allocEnv() and the following invalid... Sam White
01:00 PM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
I bisected the failure to the Group Dependence patch. I had to run "./pgm +p4" many times to trigger the failure, but... Sam White
09:44 AM Bug #1822: megatest/multisection test failures caused by changes to group dependencies
netlrts-linux-x86_64-smp appears to have hung in this last night, and the various Darwin builds have all been failing... Sam White
02:34 PM Bug #1800 (Implemented): Section reduction contribute() should accept std::vector
Raghavendra Kanakagiri
02:33 PM Bug #1811 (In Progress): Cross-array section reduction and multicast with callback set at contrib...
Raghavendra Kanakagiri
02:32 PM Bug #1811: Cross-array section reduction and multicast with callback set at contribute time
Gerrit link:
https://charm.cs.illinois.edu/gerrit/#/c/3827/
Raghavendra Kanakagiri
09:42 AM Bug #1819 (In Progress): bigsim failing lb_test inside CmiMemoryUsage()
Sam White

03/12/2018

04:57 PM Bug #1816: Cray CCE/8.5 builds are broken
Yeah, we don't want to disable warnings outside of configure checks. Maybe we should just not care about CCE 8.5 sinc... Sam White
04:30 PM Bug #1816: Cray CCE/8.5 builds are broken
Ok, but it's been suggested other times in the past to use msglevel_4 but apparently it was never done. Are we sure w... Juan Galvez
03:45 PM Bug #1816: Cray CCE/8.5 builds are broken
Can you submit a patch for ignoring the warning when checking C++11 support of Cray CC? Sam White
12:56 PM Bug #1816: Cray CCE/8.5 builds are broken
It builds on BW without -MM flag (didn't test with -M) using craycc 8.5.8 and `-h std=c++11 -h msglevel_4` to suppres... Juan Galvez
12:43 PM Bug #1816: Cray CCE/8.5 builds are broken
Fix for -MM issue: https://charm.cs.illinois.edu/gerrit/#/c/3841/ Sam White
12:40 PM Bug #1816: Cray CCE/8.5 builds are broken
Changing "-MM" to "-M" works. Sam White
11:56 AM Bug #1816 (In Progress): Cray CCE/8.5 builds are broken
Yes, removing the "-MM" from src/scripts/Makefile solves the issue. Charm++ build then passes on Cori with CCE 8.6.2.... Sam White
11:18 AM Bug #1816: Cray CCE/8.5 builds are broken
I'll that patch try on Cori, but we should also fix this configure error on BW. We should ignore warnings when testin... Sam White
11:09 AM Bug #1816: Cray CCE/8.5 builds are broken
It seems like the configure C++11 test with `-h std=c++11` passes, but with a warning. config.log doesn't contain any... Juan Galvez
11:04 AM Bug #1816: Cray CCE/8.5 builds are broken
I'll try to look into that.
But could you also try building on Cori without -MM flag (or just -M)?...
Juan Galvez
10:53 AM Bug #1816: Cray CCE/8.5 builds are broken
Ugh, can you investigate why "checking "whether C++ compiler supports C++11 with '-h std=c++11'"... "yes" " doesn't e... Sam White
10:40 AM Bug #1816: Cray CCE/8.5 builds are broken
I switched to craycc 8.5.8 on Blue Waters (which is apparently the most recent version they have) by doing
`module...
Juan Galvez
03:49 PM Bug #1827 (Merged): +showcpuaffinity doesn't show info about remote comm thread (comm thread not ...
Sam White
12:08 PM Bug #1827 (Implemented): +showcpuaffinity doesn't show info about remote comm thread (comm thread...
Potential fix: https://charm.cs.illinois.edu/gerrit/3840
Test output:...
Evan Ramos
11:15 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
This failure can also be reproduced on a Linux machine using a non-SMP bigsim build: https://charm.cs.illinois.edu/re... Sam White
12:06 AM Bug #1828: Infinite recursion inside malloc_info in CmiMemoryUsage
I just want to post an update to this issue. I am able to reproduce this on crayxc machine and it is a multi thread r... Shaoqin Lu
11:14 AM Bug #1819: bigsim failing lb_test inside CmiMemoryUsage()
The issue here is actually infinite recursion in std::regex in CmiMemoryUsage(), duplicate of issue #1828.
To repr...
Sam White
08:47 AM Bug #1819: bigsim failing lb_test inside CmiMemoryUsage()
This test passes on netlrts-darwin-x86_64 Sam White
10:58 AM Feature #1829 (In Progress): Enable low memory footprint migration with Isomalloc via RDMA
Using zero copy direct API, we can use RDMA to reduce the memory footprint of AMPI programs during LB. When migrating... Sam White
10:52 AM Feature #1809 (Implemented): Direct API for the OFI layer
Nitin Bhat
08:49 AM Feature #1748 (Merged): CharmPy support in Charm++
Sam White

03/11/2018

01:17 AM Feature #1353 (Merged): charmc hardcodes unversioned compiler names
Ronak Buch
 

Also available in: Atom