charm.git
16 years agopreserve OPTS_LDXX when update its value in conv-mach-pre.sh
Gengbin Zheng [Sun, 16 Jan 2005 06:35:23 +0000 (06:35 +0000)]
preserve OPTS_LDXX when update its value in conv-mach-pre.sh

16 years agomoved -Wl,-u,_gmpi_macosx_malloc_hack from LD options to CMK_LIBS together with linki...
Gengbin Zheng [Sun, 16 Jan 2005 06:03:05 +0000 (06:03 +0000)]
moved -Wl,-u,_gmpi_macosx_malloc_hack from LD options to CMK_LIBS together with linking of MPI libs, because this flag does not work if not linking with mpich-gm.

16 years agoput all libraries in CMK_SYSLIBS to the very end of link command line.
Gengbin Zheng [Sun, 16 Jan 2005 05:50:35 +0000 (05:50 +0000)]
put all libraries in CMK_SYSLIBS to the very end of link command line.
When specifying "-save", leave moduleinit.Co.

16 years agorewritten and merged readme.
Gengbin Zheng [Sun, 16 Jan 2005 05:44:54 +0000 (05:44 +0000)]
rewritten and merged readme.

16 years agoremoved
Gengbin Zheng [Sun, 16 Jan 2005 05:44:24 +0000 (05:44 +0000)]
removed

16 years agoremoved -Wno-deprecated flag for c compiler.
Gengbin Zheng [Sun, 16 Jan 2005 05:33:39 +0000 (05:33 +0000)]
removed -Wno-deprecated flag for c compiler.

16 years agomake charm++ compilable with mpich-gm.
Gengbin Zheng [Sun, 16 Jan 2005 05:22:23 +0000 (05:22 +0000)]
make charm++ compilable with mpich-gm.

16 years agodefined OPTS_LD, OPTS_CXX instead of CMK_LIBDIR CMK_INCDIR
Gengbin Zheng [Sun, 16 Jan 2005 05:17:59 +0000 (05:17 +0000)]
defined OPTS_LD, OPTS_CXX instead of CMK_LIBDIR CMK_INCDIR
OPTS_CXX normally are put after charm headers, so that for example mpi.h is looked at charm include first.

16 years agostatistics collection instrument maximum of send queue length.
Gengbin Zheng [Sat, 15 Jan 2005 23:54:40 +0000 (23:54 +0000)]
statistics collection instrument maximum of send queue length.

16 years agoafter ConverseExit, Cmi_check_delay is set to be one second frequency so that once...
Gengbin Zheng [Sat, 15 Jan 2005 23:52:20 +0000 (23:52 +0000)]
after ConverseExit, Cmi_check_delay is set to be one second frequency so that once charmrun goes away, this process dies promptly.

16 years agofixed charmc path
Gengbin Zheng [Sat, 15 Jan 2005 08:55:10 +0000 (08:55 +0000)]
fixed charmc path

16 years agoa bug fix for GM - program hang after normal exit.
Gengbin Zheng [Sat, 15 Jan 2005 08:52:24 +0000 (08:52 +0000)]
a bug fix for GM - program hang after normal exit.
It seems that it hang when call to service stdout via charmrun fd.
Also added GM communication statistics collection which output information to one file for each processor.

16 years agofixed the previous fix.
Gengbin Zheng [Sat, 15 Jan 2005 01:10:14 +0000 (01:10 +0000)]
fixed the previous fix.

16 years agoupdated
Gengbin Zheng [Fri, 14 Jan 2005 23:25:58 +0000 (23:25 +0000)]
updated

16 years agoadded.
Gengbin Zheng [Fri, 14 Jan 2005 23:25:23 +0000 (23:25 +0000)]
added.

16 years agodefine seq/native compilers.
Gengbin Zheng [Fri, 14 Jan 2005 23:19:22 +0000 (23:19 +0000)]
define seq/native compilers.

16 years agodirected error to /dev/null.
Gengbin Zheng [Fri, 14 Jan 2005 23:18:44 +0000 (23:18 +0000)]
directed error to /dev/null.

16 years agoadded -fPIC.
Gengbin Zheng [Fri, 14 Jan 2005 22:46:27 +0000 (22:46 +0000)]
added -fPIC.

16 years agoAdded interface for real fft computation.
Yan Shi [Fri, 14 Jan 2005 20:17:52 +0000 (20:17 +0000)]
Added interface for real fft computation.

16 years ago*** empty log message ***
Esteban Pauli [Fri, 14 Jan 2005 03:41:28 +0000 (03:41 +0000)]
*** empty log message ***

16 years agoSupport for AMPI_DOUBLE_COMPLEX
Chao Huang [Thu, 13 Jan 2005 19:57:13 +0000 (19:57 +0000)]
Support for AMPI_DOUBLE_COMPLEX

16 years agoAdded support for AMPI_DOUBLE_COMPLEX, which should behave exactly the same
Chao Huang [Thu, 13 Jan 2005 19:52:04 +0000 (19:52 +0000)]
Added support for AMPI_DOUBLE_COMPLEX, which should behave exactly the same
as AMPI_COMPLEX.

16 years agoremoved PBS -ncpus which new turing doesn't like.
Gengbin Zheng [Thu, 13 Jan 2005 16:06:45 +0000 (16:06 +0000)]
removed  PBS -ncpus which new turing doesn't like.

16 years agoPermanantly (hopefully) fix out-of-bounds errors for boundary
Orion Lawlor [Thu, 13 Jan 2005 00:50:07 +0000 (00:50 +0000)]
Permanantly (hopefully) fix out-of-bounds errors for boundary
condition maintainance.  If this doesn't work, just hardcode
prop=1.0 and slope=0.0.

16 years agochange LBAgent.h to reflect name change in cvs.
Gengbin Zheng [Wed, 12 Jan 2005 06:46:29 +0000 (06:46 +0000)]
change LBAgent.h to reflect name change in cvs.

16 years agocleanup moduleinit file at end.
Gengbin Zheng [Wed, 12 Jan 2005 04:58:05 +0000 (04:58 +0000)]
cleanup moduleinit file at end.
trap control^C for better cleanup.

16 years agoadded OPTS to link time.
Gengbin Zheng [Tue, 11 Jan 2005 22:23:33 +0000 (22:23 +0000)]
added OPTS to link time.

16 years agoadd cflags to Makefile.
Gengbin Zheng [Tue, 11 Jan 2005 21:50:30 +0000 (21:50 +0000)]
add cflags to Makefile.

16 years ago*** empty log message ***
Amit Sharma [Tue, 11 Jan 2005 21:15:01 +0000 (21:15 +0000)]
*** empty log message ***

16 years agonew agent class for load balancer to consult specific knowledge about topology, commu...
Amit Sharma [Tue, 11 Jan 2005 20:54:10 +0000 (20:54 +0000)]
new agent class for load balancer to consult specific knowledge about topology, communication, etc. It returns a prefered list of processors for consideration.

16 years agoremoved
Gengbin Zheng [Tue, 11 Jan 2005 09:06:27 +0000 (09:06 +0000)]
removed

16 years agocheck against CMK_HAS_SYNC for sync
Gengbin Zheng [Tue, 11 Jan 2005 08:55:55 +0000 (08:55 +0000)]
check against CMK_HAS_SYNC for sync

16 years agoupdated for running on teragrid
Gengbin Zheng [Tue, 11 Jan 2005 08:52:48 +0000 (08:52 +0000)]
updated for running on teragrid

16 years agoadded crack2D back.
Gengbin Zheng [Mon, 10 Jan 2005 19:13:07 +0000 (19:13 +0000)]
added crack2D back.

16 years agoMinimal fix for timestepping bug. There are two problems here:
Orion Lawlor [Mon, 10 Jan 2005 19:09:08 +0000 (19:09 +0000)]
Minimal fix for timestepping bug.  There are two problems here:
  1.) The boundary condition control variables "kk", prop, slope, etc
have strange meanings and complicated processing.

  2.) Everything assumes Fortran 1-based array indexing, but we're
in 0-based C here.

16 years agoadded general k-ary tree.
Gengbin Zheng [Mon, 10 Jan 2005 06:45:34 +0000 (06:45 +0000)]
added general k-ary tree.

16 years agofixed a bug in print load summary for max obj load.
Gengbin Zheng [Mon, 10 Jan 2005 06:42:00 +0000 (06:42 +0000)]
fixed a bug in print load summary for max obj load.

16 years agoincreased version number.
Gengbin Zheng [Mon, 10 Jan 2005 04:30:01 +0000 (04:30 +0000)]
increased version number.

16 years agoupdated CentralLB::LDStats to BaseLB::LDStats
Gengbin Zheng [Mon, 10 Jan 2005 04:20:39 +0000 (04:20 +0000)]
updated CentralLB::LDStats to BaseLB::LDStats

16 years agomoved LDStats from class CentralLB to BaseLB so that it is available for other catego...
Gengbin Zheng [Mon, 10 Jan 2005 04:11:10 +0000 (04:11 +0000)]
moved LDStats from class CentralLB to BaseLB so that it is available for other categories of load balancers.

16 years agoallow registering multicast communication for multicast messages in one call.
Gengbin Zheng [Sun, 9 Jan 2005 23:55:23 +0000 (23:55 +0000)]
allow registering multicast communication for multicast messages in one call.
Moving function of printing LB database to be a member function of LDStats.

16 years agoallow calling addMessage with an optional "nMessages" parameter.
Gengbin Zheng [Sun, 9 Jan 2005 23:51:57 +0000 (23:51 +0000)]
allow calling addMessage with an optional "nMessages" parameter.

16 years agodifferenitate ObjOnly mode and ignoring Background load mode.
Gengbin Zheng [Sun, 9 Jan 2005 23:51:18 +0000 (23:51 +0000)]
differenitate ObjOnly mode and ignoring Background load mode.

16 years agobug fix for +LBObjOnly mode.
Gengbin Zheng [Sun, 9 Jan 2005 23:50:42 +0000 (23:50 +0000)]
bug fix for +LBObjOnly mode.

16 years agotake multicast into account.
Gengbin Zheng [Sun, 9 Jan 2005 23:40:02 +0000 (23:40 +0000)]
take multicast into account.

16 years agoremove crack2D test for now since it is broken on lemieux.
Gengbin Zheng [Sun, 9 Jan 2005 04:44:45 +0000 (04:44 +0000)]
remove crack2D test for now since it is broken on lemieux.

16 years agodon't remove output file and job script when job error occurs.
Gengbin Zheng [Sat, 8 Jan 2005 09:17:40 +0000 (09:17 +0000)]
don't remove output file and job script when job error occurs.

16 years agoset stacksize to 0 to use the default
Gengbin Zheng [Sat, 8 Jan 2005 06:57:01 +0000 (06:57 +0000)]
set stacksize to 0 to use the default

16 years agoinclude memory-gnu.c if gnu malloc is prefered to memory-gnuold.c
Gengbin Zheng [Sat, 8 Jan 2005 06:29:29 +0000 (06:29 +0000)]
include memory-gnu.c if gnu malloc is prefered to memory-gnuold.c

16 years agotweaked netpoll mode, instead of completely eliminated sigio, now open it only for...
Gengbin Zheng [Sat, 8 Jan 2005 06:15:47 +0000 (06:15 +0000)]
tweaked netpoll mode, instead of completely eliminated sigio, now open it only for stdout. So that print with large amount of output won't hang.

16 years agouse exit status of mpirun to better catch the runtime failure. also detect if pr
Gengbin Zheng [Sat, 8 Jan 2005 00:25:06 +0000 (00:25 +0000)]
use exit status of mpirun to better catch the runtime failure. also detect if pr
ogram hang.

16 years agotested for setpriority call, and test if "sync" command exists
Gengbin Zheng [Fri, 7 Jan 2005 21:54:40 +0000 (21:54 +0000)]
tested for setpriority call, and test if "sync" command exists

16 years agoa fix in echo $? in here file.
Gengbin Zheng [Fri, 7 Jan 2005 15:22:58 +0000 (15:22 +0000)]
a fix in echo $? in here file.

16 years agoa minor change
Gengbin Zheng [Fri, 7 Jan 2005 08:36:38 +0000 (08:36 +0000)]
a minor change

16 years agoremoved several questionabale PBS directives.
Gengbin Zheng [Fri, 7 Jan 2005 07:46:49 +0000 (07:46 +0000)]
removed several questionabale PBS directives.

16 years agofor real implementation for CmiBarrierZero() for synchronizing timers.
Gengbin Zheng [Fri, 7 Jan 2005 06:00:04 +0000 (06:00 +0000)]
for real implementation for CmiBarrierZero() for synchronizing timers.

16 years agoadded an empty CmiBarrierZero for now.
Gengbin Zheng [Thu, 6 Jan 2005 22:59:42 +0000 (22:59 +0000)]
added an empty CmiBarrierZero for now.

16 years agoa minor change to make some turing specific parameters
Gengbin Zheng [Thu, 6 Jan 2005 22:16:17 +0000 (22:16 +0000)]
a minor change to make some turing specific parameters

16 years agotest $PBS_NODEFILE to see if charmrun is runnin inside job script or not.
Gengbin Zheng [Thu, 6 Jan 2005 21:45:50 +0000 (21:45 +0000)]
test $PBS_NODEFILE to see if charmrun is runnin inside job script or not.

16 years agomake job submission script smarter by detecting unexpecting errors.
Gengbin Zheng [Thu, 6 Jan 2005 20:45:44 +0000 (20:45 +0000)]
make job submission script smarter by detecting unexpecting errors.

16 years agoreturn correct status and does better cleanup
Gengbin Zheng [Thu, 6 Jan 2005 17:41:44 +0000 (17:41 +0000)]
return correct status and does better cleanup

16 years agouse PBS_QUEUE to specify the job queue to submit.
Gengbin Zheng [Thu, 6 Jan 2005 17:04:30 +0000 (17:04 +0000)]
use PBS_QUEUE to specify the job queue to submit.

16 years agoprint 'End of program' to fake parallel run for autobuild.
Gengbin Zheng [Thu, 6 Jan 2005 09:23:46 +0000 (09:23 +0000)]
print 'End of program' to fake parallel run for autobuild.

16 years agochanged to check end of program to tell if a job succeed or not.
Gengbin Zheng [Thu, 6 Jan 2005 09:12:22 +0000 (09:12 +0000)]
changed to check end of program to tell if a job succeed or not.

16 years agoremove tmp files generated
Gengbin Zheng [Thu, 6 Jan 2005 06:55:54 +0000 (06:55 +0000)]
remove tmp files generated

16 years agotest error output to detect error.
Gengbin Zheng [Thu, 6 Jan 2005 06:47:10 +0000 (06:47 +0000)]
test error output to detect error.

16 years agoautomatically qsub a job if a queueing system (qsub) is needed to run a job.
Gengbin Zheng [Thu, 6 Jan 2005 05:45:26 +0000 (05:45 +0000)]
automatically qsub a job if a queueing system (qsub) is needed to run a job.
PBS_TEST environment variable need to be set.

16 years agofixed a bug in checkpointing, without setting maxStartRequest to 0, processors with...
Gengbin Zheng [Thu, 6 Jan 2005 05:16:47 +0000 (05:16 +0000)]
fixed a bug in checkpointing, without setting maxStartRequest to 0, processors with 0 array elements can trap in an infinite loop doing reductions.

16 years agoset default thread stack size to be 64K
Gengbin Zheng [Wed, 5 Jan 2005 18:03:19 +0000 (18:03 +0000)]
set default thread stack size to be 64K

16 years agoset default stack size to be 64K.
Gengbin Zheng [Wed, 5 Jan 2005 18:02:08 +0000 (18:02 +0000)]
set default stack size to be 64K.

16 years agomake thread default stacksize a macro which can be changed in config.
Gengbin Zheng [Wed, 5 Jan 2005 18:01:07 +0000 (18:01 +0000)]
make thread default stacksize a macro which can be changed in config.

16 years agoswitch to os malloc instead of gnu malloc which seemed to fix a crash bug with fem.
Gengbin Zheng [Wed, 5 Jan 2005 06:54:59 +0000 (06:54 +0000)]
switch to os malloc instead of gnu malloc which seemed to fix a crash bug with fem.

16 years agoadded comlib_stream back for bigsim
Gengbin Zheng [Wed, 5 Jan 2005 05:14:40 +0000 (05:14 +0000)]
added comlib_stream back for bigsim

16 years agoset default size to 64K which seemed to fix the crash bug on mpi-ppc-darwin
Gengbin Zheng [Wed, 5 Jan 2005 04:35:32 +0000 (04:35 +0000)]
set default size to 64K which seemed to fix the crash bug on mpi-ppc-darwin

16 years agotook off commlib_stream
Gengbin Zheng [Wed, 5 Jan 2005 02:11:36 +0000 (02:11 +0000)]
took off commlib_stream

16 years agotake off commlib_stream since it broke on mac cluster
Gengbin Zheng [Wed, 5 Jan 2005 01:11:33 +0000 (01:11 +0000)]
take off commlib_stream since it broke on mac cluster

16 years agogm config for mac.
Gengbin Zheng [Tue, 4 Jan 2005 23:09:30 +0000 (23:09 +0000)]
gm config for mac.

16 years agoAdded support for reduction to a method that does not belong to this
Chao Huang [Tue, 4 Jan 2005 22:52:21 +0000 (22:52 +0000)]
Added support for reduction to a method that does not belong to this
class. Eg, in WorkerClass we do a reduction to mainProxy.target, the
reduction client WorkerClass._target will still be generated, and the
reduction msgs all go to thisProxy(0), or (0,0) if 2D array, afterwards,
it will pass the result by calling MainClass.target.

16 years agoAdded MPI_Finalized
Chao Huang [Tue, 4 Jan 2005 22:48:30 +0000 (22:48 +0000)]
Added MPI_Finalized

16 years agomany changes:
Gengbin Zheng [Tue, 4 Jan 2005 04:27:49 +0000 (04:27 +0000)]
many changes:
1. interface with LB instrumentation for logging multicast
2. provide resetSection() call for rebuild multicast spanning tree when multicast root migrates
3. fix a performance bug when after load balancing, rebuilding spanning tree was not optimized, this is because the new root processor of the tree does not know the whereabouts of all group members, and the tree built does not match the real locations.

16 years agocreate new message sending options: CK_MSG_LB_NOTRACE
Gengbin Zheng [Tue, 4 Jan 2005 04:15:37 +0000 (04:15 +0000)]
create new message sending options: CK_MSG_LB_NOTRACE
which disable the trace in LB for array send.

16 years agoFully implemented functions for instrument multicast communication.
Gengbin Zheng [Tue, 4 Jan 2005 04:06:50 +0000 (04:06 +0000)]
Fully implemented functions for instrument multicast communication.

16 years agoavoid expensive copy constructor by using passing by reference.
Gengbin Zheng [Sun, 2 Jan 2005 09:09:51 +0000 (09:09 +0000)]
avoid expensive copy constructor by using passing by reference.

16 years agofixed several missing implementation for multicast communication in lbdb database.
Gengbin Zheng [Sun, 2 Jan 2005 08:52:30 +0000 (08:52 +0000)]
fixed several missing implementation for multicast communication in lbdb database.

16 years agoOriginal: use CK_MSG_KEEP and call inline send for local multicast
Gengbin Zheng [Sat, 1 Jan 2005 23:23:46 +0000 (23:23 +0000)]
Original:  use CK_MSG_KEEP  and call inline send for local multicast
The drawback of this scheme bypassing queue is that if # of local element is huge, this leads to a long time occupying CPU.
also load balancer seems not be able to correctly instrument the load.
Now: only call inline when entry is marked "nokeep", and send through scheduler queue in other case.

16 years agoremoved and use common one
Gengbin Zheng [Fri, 31 Dec 2004 19:43:41 +0000 (19:43 +0000)]
removed and use common one

16 years agocleanup for common use.
Gengbin Zheng [Fri, 31 Dec 2004 19:43:16 +0000 (19:43 +0000)]
cleanup for common use.

16 years agoremoved and turn to the default one in common/
Gengbin Zheng [Fri, 31 Dec 2004 19:42:17 +0000 (19:42 +0000)]
removed and turn to the default one in common/

16 years agopgcc generally has problem dealing with QT, use gnu compiler instead
Gengbin Zheng [Fri, 31 Dec 2004 19:24:44 +0000 (19:24 +0000)]
pgcc generally has problem dealing with QT, use gnu compiler instead

16 years agogm2 option for mpi-linux-amd64
Gengbin Zheng [Fri, 31 Dec 2004 06:04:23 +0000 (06:04 +0000)]
gm2 option for mpi-linux-amd64

16 years agoclustermatic option for x86_64.
Gengbin Zheng [Fri, 31 Dec 2004 00:36:33 +0000 (00:36 +0000)]
clustermatic option for x86_64.

16 years agouse GetTime() for retrieving all time from database including total wall/cpu time...
Gengbin Zheng [Thu, 30 Dec 2004 07:13:49 +0000 (07:13 +0000)]
use GetTime() for retrieving all time from database including total wall/cpu time, idle time and background wall/cpu time.

16 years agobackground load as calculated by substracting objtime and idletime from total walltim...
Gengbin Zheng [Thu, 30 Dec 2004 07:12:02 +0000 (07:12 +0000)]
background load as calculated by substracting objtime and idletime from total walltime sometime can lead to negative number due to floating errors. Fixed by making it 0 when negative number occurs.

16 years agoprint alpha and beta in LB debug mode.
Gengbin Zheng [Thu, 30 Dec 2004 07:09:46 +0000 (07:09 +0000)]
print alpha and beta in LB debug mode.

16 years agomodified to take background load into account
Gengbin Zheng [Thu, 30 Dec 2004 07:05:43 +0000 (07:05 +0000)]
modified to take background load into account

16 years agomake per message overhead (alpha) and per byte overhead (beta) command line option.
Gengbin Zheng [Wed, 29 Dec 2004 05:04:59 +0000 (05:04 +0000)]
make per message overhead (alpha) and per byte overhead (beta) command line option.

16 years agoimproved LBInfo print for maiximum obj load and communicaiton load.
Gengbin Zheng [Wed, 29 Dec 2004 02:17:48 +0000 (02:17 +0000)]
improved LBInfo print for maiximum obj load and communicaiton load.

16 years agoimproved LBInfo usage so that it is easier for any LB strategy to print the summary...
Gengbin Zheng [Wed, 29 Dec 2004 00:48:04 +0000 (00:48 +0000)]
improved LBInfo usage so that it is easier for any LB strategy to print the summary infomration for evaluating the LB results.

16 years agofixed bugs about memory leak, improve support for non-migratable objects and fixed...
Gengbin Zheng [Wed, 29 Dec 2004 00:44:15 +0000 (00:44 +0000)]
fixed bugs about memory leak, improve support for non-migratable objects and fixed a bug that prevent obj load from assigning to processors before first_avail_proc.

16 years agodefine LBInfo to contain information for evaluating LB strategy
Gengbin Zheng [Tue, 28 Dec 2004 22:35:39 +0000 (22:35 +0000)]
define LBInfo to contain information for evaluating LB strategy