charm.git
Nikhil Jain [Thu, 10 Nov 2011 01:34:02 +0000 (19:34 -0600)]
Merge branch 'charm' of charmgit:charm into isomalloc

Lukasz Wesolowski [Wed, 9 Nov 2011 23:34:26 +0000 (17:34 -0600)]
MeshStreamer: bug fix in final reduction.

Yanhua Sun [Wed, 9 Nov 2011 20:07:31 +0000 (12:07 -0800)]
fixed the registration

Yanhua Sun [Wed, 9 Nov 2011 06:52:01 +0000 (01:52 -0500)]
fixed 100M smp ppn5 by fragmentation

Yanhua Sun [Wed, 9 Nov 2011 04:11:40 +0000 (20:11 -0800)]
fragmentation for large msg

Gengbin Zheng [Sun, 6 Nov 2011 20:53:05 +0000 (12:53 -0800)]
if a steal fails, also linearly search all memblocks to free unused ones and deregister the memory.

Gengbin Zheng [Sat, 5 Nov 2011 05:00:47 +0000 (22:00 -0700)]
some minor code cleanup and comments

Gengbin Zheng [Sat, 5 Nov 2011 03:57:08 +0000 (20:57 -0700)]
when stealing from a mempool succeeds, just return

Lukasz Wesolowski [Sat, 5 Nov 2011 02:45:21 +0000 (21:45 -0500)]
MeshStreamer: added hashing of coordinate information and refactored the code
to use coordinate objects which encapsulate row, column, plane, msgType

Gengbin Zheng [Sat, 5 Nov 2011 02:10:15 +0000 (19:10 -0700)]
does not have to clear the buffer to 0.

Gengbin Zheng [Sat, 5 Nov 2011 02:05:55 +0000 (19:05 -0700)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Sat, 5 Nov 2011 02:04:36 +0000 (19:04 -0700)]
add a mempool stealing scheme. When running out of registered memory in the SMP version, try to steal free memblocks from other ranks on the same node.

Gengbin Zheng [Sat, 5 Nov 2011 01:59:28 +0000 (18:59 -0700)]
initialize cmiArgDebugFlag to 0

Ramprasad Venkataraman [Fri, 4 Nov 2011 22:26:54 +0000 (17:26 -0500)]
ckmulticast: more docs for sendToSection

Akhil Langer [Fri, 4 Nov 2011 20:06:55 +0000 (15:06 -0500)]
ckmcast - avoid message copying when only 1 fragment; patch provided by Gengbin

Gengbin Zheng [Fri, 4 Nov 2011 15:33:44 +0000 (08:33 -0700)]
SMP uses context thread

Gengbin Zheng [Fri, 4 Nov 2011 09:04:17 +0000 (02:04 -0700)]
fix a size bug in LrtsAlloc (should subtract the mempool header);
also fix a bug in expanding the mempool with a size greater than the default. The size should account for the mempool_block size.

Ramprasad Venkataraman [Fri, 4 Nov 2011 01:54:54 +0000 (20:54 -0500)]
ckmulticast: Fix recent checkin with default args gone awry

Akhil Langer [Fri, 4 Nov 2011 01:06:55 +0000 (20:06 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Akhil Langer [Fri, 4 Nov 2011 01:06:42 +0000 (20:06 -0500)]
Always do pipelining in CkMulticast, and make SPLIT_SIZE and SPLIT_THRESHOLD run-time variables rather than compile-time constants

Lukasz Wesolowski [Thu, 3 Nov 2011 23:04:14 +0000 (18:04 -0500)]
MeshStreamer: Added the option to let users specify when each group member
is done inserting, instead of using quiescence. When using periodic flushing,
added an optimization to prevent unnecessary flushing if recent progress
has been made.

Lukasz Wesolowski [Thu, 3 Nov 2011 05:02:06 +0000 (00:02 -0500)]
MeshStreamer: reintroduced a periodic flush function to help with
the use case of applications which generate additional messages
in response to streamed messages.

Gengbin Zheng [Thu, 3 Nov 2011 06:04:53 +0000 (01:04 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Thu, 3 Nov 2011 06:04:03 +0000 (01:04 -0500)]
test gfortran lib in another possible location.

Yanhua Sun [Wed, 2 Nov 2011 20:28:53 +0000 (13:28 -0700)]
Merge branch 'charm' of charmgit:charm into charm

Yanhua Sun [Wed, 2 Nov 2011 20:28:35 +0000 (13:28 -0700)]
added error information for memory registration

Gengbin Zheng [Wed, 2 Nov 2011 19:22:51 +0000 (14:22 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Wed, 2 Nov 2011 18:55:14 +0000 (13:55 -0500)]
make receiveCombinedMessage a virtual function.

Harshitha Menon [Wed, 2 Nov 2011 18:49:28 +0000 (13:49 -0500)]
removing stupid binary that I pushed

Harshitha Menon [Wed, 2 Nov 2011 18:47:08 +0000 (13:47 -0500)]
Example usage of cksequence

Gengbin Zheng [Wed, 2 Nov 2011 00:10:16 +0000 (19:10 -0500)]
a flag to tell whether to yield or not

Gengbin Zheng [Tue, 1 Nov 2011 21:23:29 +0000 (16:23 -0500)]
significantly simplify interface
The user only needs to provide a function call process(data); the library takes care of all streaming stuff.

Lukasz Wesolowski [Tue, 1 Nov 2011 20:18:02 +0000 (15:18 -0500)]
MeshStreamer: function rename to improve readability and avoid confusion

Lukasz Wesolowski [Tue, 1 Nov 2011 20:15:44 +0000 (15:15 -0500)]
MeshStreamer: Modified the "start" function name to the more descriptive "callWhenFinished"

Lukasz Wesolowski [Tue, 1 Nov 2011 19:25:26 +0000 (14:25 -0500)]
MeshStreamer: commented out prototype for a function which is no longer being used.

Gengbin Zheng [Tue, 1 Nov 2011 16:43:19 +0000 (11:43 -0500)]
restrict the hack for pxshm only for cray xe and smp.

Gengbin Zheng [Tue, 1 Nov 2011 14:37:26 +0000 (09:37 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Tue, 1 Nov 2011 14:32:51 +0000 (09:32 -0500)]
fix for VC++

Gengbin Zheng [Tue, 1 Nov 2011 14:30:30 +0000 (09:30 -0500)]
guard ft code by ft macro

Gengbin Zheng [Tue, 1 Nov 2011 04:12:51 +0000 (23:12 -0500)]
add a start(callback) function to remove the need for a second QD in user code to flush the buffered messages

Gengbin Zheng [Mon, 31 Oct 2011 17:20:53 +0000 (12:20 -0500)]
similar fix for cray-xe mallinfo

Gengbin Zheng [Mon, 31 Oct 2011 17:19:18 +0000 (12:19 -0500)]
minor change about a print

Phil Miller [Mon, 31 Oct 2011 13:49:18 +0000 (08:49 -0500)]
SDAG: Move CCounter's pointer stars into its type string, to match changes for template support

Now tested against the full set of Charm++ examples and tests, rather than the
couple I happened to look at.

Phil Miller [Mon, 31 Oct 2011 07:44:47 +0000 (02:44 -0500)]
SDAG: simplify entry method matching logic

Phil Miller [Mon, 31 Oct 2011 07:43:56 +0000 (02:43 -0500)]
SDAG: Consolidate handling of parameter types to improve template support

Phil Miller [Mon, 31 Oct 2011 07:32:17 +0000 (02:32 -0500)]
xi: rearrange inclusion a bit

Gengbin Zheng [Mon, 31 Oct 2011 00:50:01 +0000 (19:50 -0500)]
don't print pid for mpi version. still print for net, which makes it easy to pick a process to kill

Gengbin Zheng [Mon, 31 Oct 2011 00:23:31 +0000 (19:23 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Mon, 31 Oct 2011 00:23:07 +0000 (19:23 -0500)]
implement CkDieNow function for net version too

Xiang Ni [Mon, 31 Oct 2011 00:22:42 +0000 (19:22 -0500)]
clear

Gengbin Zheng [Sun, 30 Oct 2011 22:32:04 +0000 (17:32 -0500)]
missing initialization of total_pes when +wp is not present

Gengbin Zheng [Sun, 30 Oct 2011 15:25:15 +0000 (10:25 -0500)]
fix a bug in generating the bufferMessage call for a when trigger function with no arguments.

Gengbin Zheng [Sun, 30 Oct 2011 02:24:53 +0000 (21:24 -0500)]
FT: print out which spare processor is used to replace the failed PE.

Gengbin Zheng [Sun, 30 Oct 2011 01:33:13 +0000 (01:33 +0000)]
avoid TimerInit global synchronization under bgp for FT

Gengbin Zheng [Sun, 30 Oct 2011 01:28:50 +0000 (01:28 +0000)]
clean up

Gengbin Zheng [Sat, 29 Oct 2011 23:50:37 +0000 (18:50 -0500)]
updated

Gengbin Zheng [Sat, 29 Oct 2011 23:48:41 +0000 (23:48 +0000)]
don't test -lmpich for bgp builds

Gengbin Zheng [Sat, 29 Oct 2011 22:23:01 +0000 (17:23 -0500)]
minor cleanup

Gengbin Zheng [Sat, 29 Oct 2011 22:16:00 +0000 (17:16 -0500)]
a minor optimization to find reverse buddy

Gengbin Zheng [Sat, 29 Oct 2011 22:11:05 +0000 (17:11 -0500)]
guard new function with mem checkpoint macro

Nikhil Jain [Sat, 29 Oct 2011 16:41:17 +0000 (11:41 -0500)]
Further reducing the size for memoryAccess to make sure that it runs on every machine

Harshitha Menon [Sat, 29 Oct 2011 07:05:03 +0000 (02:05 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Harshitha Menon [Sat, 29 Oct 2011 06:52:19 +0000 (01:52 -0500)]
cksequence implementation with memory compaction

Gengbin Zheng [Sat, 29 Oct 2011 06:27:11 +0000 (01:27 -0500)]
fix a bug with multiple crashes. The spare processors need to be updated with the petorank array.
added a function CkHasCheckpoints() which returns true when there are already checkpoints.

Gengbin Zheng [Sat, 29 Oct 2011 03:04:20 +0000 (22:04 -0500)]
fix issue at end of program, where MPI_Barrier needs to be called on all processors including spare and crashed processors and MPI_Finalize is called to properly exit the MPI program.

Lukasz Wesolowski [Fri, 28 Oct 2011 22:33:20 +0000 (17:33 -0500)]
MeshStreamer: Tried to add const wherever possible and inserted some
missing parameter names in function prototypes.

Gengbin Zheng [Fri, 28 Oct 2011 22:05:28 +0000 (17:05 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Fri, 28 Oct 2011 22:04:08 +0000 (17:04 -0500)]
demonstrate inmem checkpoint restart on MPI layer:
1. CkDieNow() to fake the death of a processor
2. +wp <numpe> to specify the number of charm processors; +p fires with spare processors
3. a spare processor will be activated to replace the crashed one.

Lukasz Wesolowski [Fri, 28 Oct 2011 21:56:48 +0000 (16:56 -0500)]
MeshStreamer: Removed capacity from MeshStreamerMessage to decrease
message size. Capacity can be obtained directly from the library.
Moved assert which checks that no messages remain after flush.
The check now occurs after all three flush phases are finished.

Akhil Langer [Fri, 28 Oct 2011 19:09:50 +0000 (14:09 -0500)]
fix a bug in ckmulticast pipelining

Akhil Langer [Fri, 28 Oct 2011 18:17:04 +0000 (13:17 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Akhil Langer [Fri, 28 Oct 2011 18:16:50 +0000 (13:16 -0500)]
ckmulticast.C - SPLIT_NUM is redundant, set the FRAGMENT_SIZE for pipelining to be 32k and increase the MAX_FRAGS to be 100 to allow large messages

Gengbin Zheng [Fri, 28 Oct 2011 03:54:43 +0000 (20:54 -0700)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Fri, 28 Oct 2011 03:54:18 +0000 (20:54 -0700)]
tweak pxshm to work for SMP build

Phil Miller [Fri, 28 Oct 2011 01:55:36 +0000 (20:55 -0500)]
Mempool: Declare variables before other statements, to appease MSVC

Nikhil Jain [Thu, 27 Oct 2011 17:15:29 +0000 (12:15 -0500)]
reducing the test count

Yanhua Sun [Thu, 27 Oct 2011 06:23:47 +0000 (23:23 -0700)]
Merge branch 'charm' of charmgit:charm into charm

Yanhua Sun [Thu, 27 Oct 2011 06:23:27 +0000 (23:23 -0700)]
make mempool thread-safe in smp; fix smp performance problem in gemini

Gengbin Zheng [Thu, 27 Oct 2011 05:03:00 +0000 (00:03 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Thu, 27 Oct 2011 05:02:44 +0000 (00:02 -0500)]
fix CmiGetRestartPhase definition

Nikhil Jain [Wed, 26 Oct 2011 23:35:14 +0000 (18:35 -0500)]
correct the previous error of misplacing the memoryAccess file

Gengbin Zheng [Wed, 26 Oct 2011 21:20:23 +0000 (16:20 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Wed, 26 Oct 2011 21:19:56 +0000 (16:19 -0500)]
in FT when using GROUP_LEVEL_REDUCTION, don't flush nodeproxy since it does not exist

Nikhil Jain [Wed, 26 Oct 2011 21:10:42 +0000 (16:10 -0500)]
adding a new test to commbench

Lukasz Wesolowski [Wed, 26 Oct 2011 20:41:38 +0000 (15:41 -0500)]
Remove redundant check for NULL in MeshStreamer

Yanhua Sun [Wed, 26 Oct 2011 18:13:11 +0000 (11:13 -0700)]
fix a compile problem when not using mempool

Gengbin Zheng [Tue, 25 Oct 2011 07:10:50 +0000 (02:10 -0500)] (tag: hpcc-2011)
when flushing message buffers that are below full capacity, set the message size to the actual data size.

Gengbin Zheng [Tue, 25 Oct 2011 04:13:01 +0000 (23:13 -0500)]
make MeshStreamer a ck-lib library

Xiang Ni [Mon, 24 Oct 2011 21:52:54 +0000 (16:52 -0500)]
Fixing bug in compilation flag for fault tolerance.

Gengbin Zheng [Mon, 24 Oct 2011 21:02:26 +0000 (16:02 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Mon, 24 Oct 2011 21:01:09 +0000 (16:01 -0500)]
fix the HACK when dealing with the WhenTrigger data structure.
It only works when the "when" function is of "void" type.

Phil Miller [Sun, 23 Oct 2011 23:43:09 +0000 (18:43 -0500)]
Typed Reductions example: Don't leak command line argument message

Phil Miller [Sun, 23 Oct 2011 23:42:45 +0000 (18:42 -0500)]
Typed Reductions example: accept standard Charm++ make arguments

Phil Miller [Sun, 23 Oct 2011 23:42:09 +0000 (18:42 -0500)]
Typed Reductions: Don't double-free message delivered from null-reduction

Gengbin Zheng [Sun, 23 Oct 2011 05:06:04 +0000 (00:06 -0500)]
A 32K stack seems to be too small for the MPI version of charm, where the program can trap in charm send and call the progress engine for flow control.

Harshitha Menon [Sat, 22 Oct 2011 19:43:51 +0000 (14:43 -0500)]
Scotch based LBs including ScotchRefine and ScotchTopo to work with the latest version of Scotch

Gengbin Zheng [Sat, 22 Oct 2011 03:01:08 +0000 (22:01 -0500)]
Merge branch 'charm' of charmgit:charm into charm

Gengbin Zheng [Sat, 22 Oct 2011 02:57:17 +0000 (21:57 -0500)]
autoconfig test offsetof

Ramprasad Venkataraman [Fri, 21 Oct 2011 18:08:59 +0000 (13:08 -0500)]
typed redn: Update example program to also reduce an array of doubles

Intended as a sanity check to ensure recent bug fixes don't get unfixed

Ramprasad Venkataraman [Fri, 21 Oct 2011 17:39:01 +0000 (12:39 -0500)]
xlat: Generate redn_wrapper fn definitions for mainchare redn targets too

Mystifyingly, the typed reductions example code would not build because the
generated code contained declarations, but no definitions, of the redn_wrapper
functions. It turns out this behavior could be toggled by changing the
reduction root from a mainchare to a regular chare. Some investigation with
Phil's help seemed to indicate that the genCall to generate the redn_wrapper
definition could not be reached for mainchares because of an incomplete if
condition. Fix.