13-58
2013
[Talk]
Scalable and Asynchronous Algorithms for Structured Adaptive Mesh Refinement [HiPC 2013]
13-57
2013
[Talk]
Parallel Branch-and-Bound for Two-stage Stochastic Integer Optimization [HiPC 2013]
13-55
2013
[Talk]
Performance Optimization Under Thermal and Power Constraints For High Performance Computing Data Centers [Thesis 2013]
13-54
2013
[Talk]
ERS: Techniques for Improving Observed Network Performance [SC 2013]
13-53
2013
[Poster]
Fast Prediction of Network Performance: k-packet Simulation [SC 2013]
13-52
2013
[Poster]
ACM SRC: Structure-Aware Parallel Algorithm for Solution of Sparse Triangular Linear Systems [SC 2013]
| Ehsan Totoni | Michael Heath | Laxmikant Kale
13-50
2013
[Talk]
A `Cool' Way of Improving the Reliability of HPC Machines [SC 2013]
13-49
2013
[Talk]
ACR: Automatic Checkpoint/Restart for Soft and Hard Error Protection [SC 2013]
13-48
2013
[Talk]
Predicting application performance using supervised learning on communication features [SC 2013]
13-47
2013
[Talk]
A Distributed Dynamic Load Balancer for Iterative Applications [SC 2013]
13-35
2013
[Poster]
Steal Tree: Low-Overhead Tracing of Work Stealing Schedulers [PLDI 2013]
| Jonathan Lifflander | Sriram Krishnamoorthy | Laxmikant Kale
13-34
2013
[Talk]
Projections: Scalable Performance Analysis and Visualization [VAPLS 2013]
13-32
2013
[Talk]
Keynote: The Coming Era of Adaptive Control Systems in HPC [ICPP 2013]
13-31
2013
[Poster]
Towards Efficient Mapping, Scheduling, and Execution of HPC Applications on Platforms in Cloud [IPDPS PhD Forum 2013]
13-28
2012
[Poster]
Understanding Network Contention on Blue Gene Supercomputers [LLNL Poster Symposium 2012]
13-27
2013
[Poster]
Chizu: A Framework to Enable Topology Aware Task Mapping [LLNL Poster Symposium 2013]
13-23
2013
[Talk]
Tutorial: Programming with Parallel Migratable Objects [ATPESC 2013]
13-15
2012
[Talk]
Collectives on Two-tier Direct Networks [EuroMPI 2012]
13-14
2013
[Talk]
Charm++ Interoperability [Charm++ Workshop 2013]
13-13
2013
[Talk]
Predicting application performance using supervised learning on communication features [Lawrence Livermore Talk 2013]
13-12
2013
[Talk]
Steal Tree: Low-Overhead Tracing of Work Stealing Schedulers [PLDI 2013]
| Jonathan Lifflander | Sriram Krishnamoorthy | Laxmikant Kale
13-11
2013
[Talk]
Characteristics of Adaptive Runtime Systems in HPC [ROSS 2013]
13-10
2013
[Talk]
Toward Runtime Power Management of Exascale Networks by On/Off Control of Links [HPPAC 2013]
13-06
2013
[Talk]
Adoption Protocols for Fanout-Optimal Fault-Tolerant Termination Detection [PPoPP 2013]
13-03
2013
[Poster]
Charm++: Migratable Objects + Active Messages + Adaptive Runtime = Productivity + Performance [PSAAP Site-visit 2013]
13-02
2013
[Poster]
Scalable Algorithms for Distributed-Memory Adaptive Mesh Refinement [PSAAP Site-visit 2013]
12-53
2012
[Talk]
Automated Load Balancing Invocation based on Application Characteristics [Cluster 2012]
12-52
2012
[Poster]
Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications [HPDC 2012]
| Jonathan Lifflander | Sriram Krishnamoorthy | Laxmikant Kale
12-51
2012
[Talk]
Scalable Algorithms for Distributed-Memory Adaptive Mesh Refinement [SBAC-PAD 2012]
| Akhil Langer | Jonathan Lifflander | Phil Miller | Kuo-Chuan Pan | Laxmikant Kale | Paul Ricker
12-49
2012
[Talk]
Performance Optimization of a Parallel, Two Stage Stochastic Linear Program: The Military Aircraft Allocation Problem [ICPADS 2012]
| Akhil Langer | Ramprasad Venkataraman | Udatta Palekar | Laxmikant Kale | Steven Baker
12-48
2012
[Talk]
Optimizing Fine-grained Communication in a Biomolecular Simulation Application on Cray XK6 [SC 2012]
12-43
2012
[Talk]
Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems [SBAC-PAD 2012]
12-40
2012
[Talk]
Scalable Algorithms for Constructing Balanced Spanning Trees on System-ranked Process Groups [EuroMPI 2012]
12-34
2012
[Talk]
A Scalable Double In-memory Checkpoint and Restart Scheme towards Exascale [FTXS 2012]
12-30
2012
[Talk]
A Message-Logging Protocol for Multicore Systems [FTXS 2012]
12-26
2012
[Talk]
A uGNI-Based Asynchronous Message-driven Runtime System for Cray Supercomputers with Gemini Interconnect [IPDPS 2012]
12-25
2012
[Talk]
Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications [HPDC 2012]
| Sriram Krishnamoorthy | Jonathan Lifflander
12-24
2012
[Talk]
Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters [Workshop on Multicore and GPU Programming Models, Languages and Compilers at IPDPS 2012]
12-23
2012
[Talk]
Mapping Dense LU Factorization on Multicore Supercomputer Nodes [IPDPS 2012]
12-16
2011
[Talk]
Charm++ Tutorial for UIUC SIAM [No Conference 2011]
12-10
2012
[Talk]
Comparing the Power and Performance of Intel’s SCC to State-of-the-Art CPUs and GPUs [ISPASS 2012]
12-09
2012
[Talk]
Composable and modular exascale programming models with intelligent runtime systems [Sandia Talk 2012]
12-05
2012
[Talk]
Composable Libraries for Parallel Programming [SIAM Conference on Parallel Processing for Scientific Computing 2012]
12-01
2012
[Talk]
Performance Issues and Techniques in Scalable Parallel Programming [C-DAC 2012]
11-56
2011
[Talk]
Charm++ Tutorial [ICS 2011]
11-52
2012
[Poster]
Collective Algorithms for Sub-communicators [PPoPP 2012]
| Anshul Mittal | Nikhil Jain | Thomas George | Yogish Sabharwal | Sameer Kumar
11-48
2011
[Talk]
Charm++ for Productivity and Performance: A Submission to the 2011 HPC Class II Challenge [SC 2011]
11-47
2011
[Talk]
Avoiding hot-spots on two-level direct networks [SC 2011]
11-46
2011
[Talk]
ACM SRC: Optimizing AlltoAll Algorithm for PERCS Network Using Simulation [SC 2011]
11-45
2011
[Talk]
HPC Runtime System Software [SC 2011]
11-44
2011
[Poster]
Optimizing All-to-All Algorithm for PERCS Network Using Simulation [SC 2011]
11-43
2011
[Poster]
Tune Up for Blue Waters Before It Arrives [No Conference 2011]
11-42
2011
[Talk]
Large Scale Simulations Enabled by BigSim [Charm++ Workshop 2011]
11-38
2011
[Talk]
Dynamic Load Balance for Optimized Message Logging in Fault Tolerant HPC Applications [Cluster 2011]
| Esteban Meneses | Greg Bronevetsky | Laxmikant Kale
11-36
2011
[Talk]
Heuristic-based techniques for mapping irregular communication graphs to mesh topologies [ESCAPE 2011]
11-33
2011
[Poster]
Enabling Massive Parallelism for Stochastic Optimization [SC 2011]
| Akhil Langer | Ramprasad Venkataraman | Gagan Gupta | Laxmikant Kale | Udatta Palekar | Steven Baker | Mark Surina
11-31
2011
[Talk]
Composable and Modular Exascale Programming Models with Intelligent Runtime Systems [ASCR Programming Challenges 2011]
11-20
2011
[Poster]
Scaling NAno Molecular Dynamic(NAMD) on Petascale machines using Charm++ [PPL Talk 2011]
11-19
2011
[Talk]
An Adaptive Framework for Large-scale State Space Search [LSPP 2011]
11-16
2011
[Talk]
Architectural constraints required to attain 1 Exaflop/s for scientific applications [IPDPS 2011]
11-15
2011
[Talk]
Techniques for effective Petascale Application Development based on adaptive runtime systems [PPL Talk 2011]
11-14
2011
[Poster]
Molecular dynamics simulations on supercomputers performing 10^18 flop/s [UIUC Postdoc Symposium 2011]
| Abhinav Bhatele | William Gropp | Laxmikant Kale
10-44
2010
[Talk]
Hierarchical Load Balancing for Charm++ Applications on Large Supercomputers [P2S2 2010]
10-41
2010
[Talk]
A Study of Memory-Aware Scheduling in Message Driven Parallel Programs [HiPC 2010]
10-40
2010
[Talk]
Automated Mapping of Regular Communication Graphs on Mesh Interconnects [HiPC 2010]
10-39
2010
[Talk]
Topology Aware Mapping [PPL Talk 2010]
10-38
2010
[Talk]
Using BigSim to Estimate Application Performance [PPL Talk 2010]
10-37
2010
[Talk]
Clustering Parallel Applications to Enhance Message Logging Protocols [PPL Talk 2010]
10-36
2010
[Talk]
Mapping your application on interconnect topologies: Effort versus benefits [SC 2010]
10-35
2010
[Talk]
Debugging Large Scale Applications in a Virtualized Environment [LCPC 2010]
10-34
2010
[Talk]
Debugging Large Scale Applications with Virtualization [PPL Talk 2010]
10-33
2010
[Talk]
Optimizing a Parallel Runtime System for Multicore Clusters: A Case Study [TeraGrid 2010]
10-32
2010
[Talk]
Mapping parallel applications on the machine topology: Lessons learned [TeraGrid 2010]
10-31
2010
[Talk]
Biomolecular Simulations using NAMD on TeraGrid machines [TeraGrid 2010]
10-30
2010
[Talk]
Robust Non-Intrusive Record-Replay with Processor Extraction [PADTAD 2010]
10-29
2010
[Talk]
Team-based Message Logging: Preliminary Results [Resilience 2010]
10-28
2010
[Talk]
Highly Scalable Parallel Sorting [IPDPS 2010]
10-27
2010
[Talk]
Static Macro Data Flow: Compiling Global Control into Local Control [HIPS 2010]
09-29
2009
[Talk]
Techniques in Scalable and Effective Performance Analysis [PPL Talk 2009]
09-28
2009
[Talk]
Automating Topology Aware Task Mapping for Large Supercomputers [SC 2009]
09-27
2009
[Talk]
Adaptive Runtime Support for Fault Tolerance [PPL Talk 2009]
09-26
2010
[Talk]
Scalable Fault Tolerance Schemes Using Adaptive Runtime Support [No Conference 2010]
09-25
2010
[Talk]
Load Balancing and Topology Aware Mapping for Petascale Machines [No Conference 2010]
09-24
2010
[Talk]
A Case Study of Communication Optimizations on 3D Interconnects [No Conference 2010]
09-23
2010
[Talk]
Load Balancing Techniques for Asynchronous Spacetime Discontinuous Galerkin Methods [No Conference 2010]
09-22
2010
[Talk]
Scalable Interaction with Parallel Applications [No Conference 2010]
09-21
2010
[Talk]
Dynamic High-Level Scripting in Parallel Applications [No Conference 2010]
09-20
2010
[Talk]
Dynamic Topology Aware Load Balancing Algorithms for MD Applications [No Conference 2010]
09-19
2010
[Talk]
Object-based Over-Decomposition can enable powerful Fault Tolerance Schemes [No Conference 2010]
09-18
2010
[Talk]
An Evaluative Study on the Effects of Contention on Message Latencies in Large Supercomputers [No Conference 2010]
09-17
2010
[Talk]
The Charm++ Programming Model and NAMD [No Conference 2010]
09-16
2009
[Poster]
Topology Aware Task Mapping Techniques: An API and Case Study [PPoPP 2009]
09-15
2009
[Poster]
Performance Comparison of Intrepid, Jaguar and Ranger using Scientific Applications [SC 2009]
08-30
2010
[Talk]
IS TOPOLOGY IMPORTANT AGAIN?<BR>- Effects of Contention on Message Latencies in Large Supercomputers [No Conference 2010]
08-29
2010
[Talk]
Topology Aware Mapping for Performance Optimization of Science Applications [No Conference 2010]
08-28
2010
[Talk]
Dynamic Topology Aware Load Balancing Algorithms for MD Applications [No Conference 2010]
08-27
2010
[Talk]
Preparing for Petascale and Beyond [No Conference 2010]
08-26
2010
[Talk]
Simplifying Parallel Programming with Incomplete Parallel Languages [No Conference 2010]
08-25
2010
[Talk]
A Case Study in Tightly-Coupled Multiparadigm Parallel Programming [No Conference 2010]
08-24
2010
[Talk]
Some Essential Techniques for Developing Efficient Peta scale Applications [No Conference 2010]
08-23
2010
[Talk]
Memory Tagging in Charm++ [No Conference 2010]
08-22
2008
[Talk]
Massively Parallel Cosmological Simulations with ChaNGa [IPDPS 2008]
08-21
2010
[Talk]
Towards Scalable Performance Analysis and Visualization through Data Reduction [No Conference 2010]
08-20
2010
[Talk]
Application-specific Topology-aware Mapping for Three Dimensional Topologies [No Conference 2010]
08-19
2010
[Talk]
Overcoming Scaling Challenges in Biomolecular Simulations across Multiple Platforms [No Conference 2010]
08-18
2008
[Poster]
Effects of Contention on Message Latencies in Large Supercomputers [SC 2008]
08-17
2008
[Poster]
Automatic Topology-Aware Task Mapping for Parallel Applications Running on Large Parallel Machines [IPDPS 2008]
07-16
2010
[Talk]
Charisma: Orchestrating Migratable Parallel Objects [No Conference 2010]
06-24
2010
[Talk]
Scalable Cosmological Simulations on Parallel Machines [No Conference 2010]
06-23
2010
[Talk]
Support for Adaptivity in ARMCI Using Migratable Objects [No Conference 2010]
06-22
2010
[Talk]
Performance Evaluation of Adaptive MPI [No Conference 2010]
06-21
2006
[Poster]
Cosmological Simulations on Supercomputers [SC 2006]
06-20
2006
[Poster]
Charm++ on Cell [PPL Poster 2006]
06-19
2006
[Poster]
Charm++ Simplifies Programming for the Cell Processor [SC 2006]
05-29
2010
[Talk]
Performance Degradataion in the Presence of Subnormal Floating-Point Values [No Conference 2010]
05-28
2010
[Talk]
Parallelization Of The Spacetime Discontinuous Galerkin Method Using The Charm++ ParFUM Framework [No Conference 2010]
05-27
2010
[Talk]
Adaptive MPI: Intelligent runtime strategies and performance prediction via simulation [No Conference 2010]
05-26
2005
[Poster]
Speeding Up Parallel Simulation with Automatic Load Balancing [PPL Poster 2005]
| Hari Govind | Gengbin Zheng | Laxmikant Kale | Michael Breitenfeld | Philippe Geubelle
05-25
2005
[Poster]
Parallel VHDL Simulation [PPL Poster 2005]
04-26
2010
[Talk]
An Orchestration Language for Parallel Objects [No Conference 2010]
04-25
2010
[Talk]
Scaling Collective Multicast on Fat-tree Networks [No Conference 2010]
04-24
2010
[Talk]
Opportunity and Challanges of Modern Communication Architectures: Case Study with QsNet [No Conference 2010]
04-23
2010
[Talk]
FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI [No Conference 2010]
04-22
2010
[Talk]
BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines [No Conference 2010]
04-21
2010
[Talk]
A Fault tolerant protocol for massively parallel machines [No Conference 2010]
04-20
2010
[Talk]
Using Multiphase Shared Arrays [No Conference 2010]
04-19
2004
[Poster]
Salsa: a parallel, interactive, particle-based analysis tool [SC 2004]
| Thomas Quinn | Laxmikant Kale | Filippo Gioachin | Orion Lawlor | Graeme Lufkin | Gregory Stinson
03-29
2010
[Talk]
Survey on Mesh formats for Scientific Computing [No Conference 2010]
03-28
2010
[Talk]
Introduction to Performance Tuning [No Conference 2010]
03-27
2010
[Talk]
Charm++/AMPI Tutorial [No Conference 2010]
03-26
2010
[Talk]
Faucets Tutorial [No Conference 2010]
03-25
2010
[Talk]
Quantum Chemistry Presentation at Lyon [No Conference 2010]
03-24
2010
[Talk]
Adaptive MPI [No Conference 2010]
03-23
2010
[Talk]
Parallel Spacetime Discontinuous Galerkin Methods [No Conference 2010]
03-22
2010
[Talk]
Parallelization of CPAIMD [No Conference 2010]
03-21
2010
[Talk]
A Framework for Collective Personalized Communication [No Conference 2010]
03-20
2010
[Talk]
Bluegene Timestamp Correction [No Conference 2010]
02-17
2010
[Talk]
Adaptive MPI [No Conference 2010]
02-16
2010
[Talk]
Molecular Dynamics on Thousands of Processors [No Conference 2010]
02-15
2010
[Talk]
Runtime Optimizations via Processor Virtualization [No Conference 2010]
02-14
2010
[Talk]
Charm++ Internals [No Conference 2010]
02-13
2010
[Talk]
Charm++ Internals - Introduction to Charm++ Machine Layer [No Conference 2010]
02-12
2010
[Talk]
Charm++ Overview and Simple Examples [No Conference 2010]
02-11
2010
[Talk]
Faucets: Efficient Utilization of Multiple Clusters [No Conference 2010]
01-14
2010
[Talk]
Charm++ Arrays, Parameter Marshalling, and Load Balancing [No Conference 2010]
01-13
2010
[Talk]
Faucets [No Conference 2010]
01-12
2010
[Talk]
Faucets Queueing Systems [No Conference 2010]
01-11
2010
[Talk]
BlueGene Emulator [No Conference 2010]
01-10
2010
[Talk]
NAMD [No Conference 2010]
01-09
2010
[Talk]
Adaptive Mesh Refinement (AMR) [No Conference 2010]
01-08
2010
[Talk]
Parallel Object-oriented Simulation Environment (POSE) [No Conference 2010]
01-07
2010
[Talk]
Component Frameworks for Parallel Applications [No Conference 2010]
98-11
2010
[Talk]
Load Balancing in Parallel Molecular Dynamics [No Conference 2010]
96-23
2010
[Talk]
Charm++: A Portable Concurrent Object Oriented System Based on C++ [No Conference 2010]
96-22
2010
[Talk]
CONVERSE: An Interoperable Framework for Parallel Programming [No Conference 2010]
96-21
2010
[Talk]
Efficient Parallel Graph Coloring with Prioritization [No Conference 2010]
| Laxmikant Kale | Ben H.Richards | Terry Allen
96-20
2010
[Talk]
Towards Automatic Performance Analysis of Parallel Programs [No Conference 2010]
96-19
2010
[Talk]
Automatic Parallel Runtime Optimizations using Post-Mortem Analysis [No Conference 2010]
96-18
2010
[Talk]
Charm++: What Have We Learned? [No Conference 2010]
96-17
2010
[Talk]
Threads for Interoperable Parallel Programming [No Conference 2010]
96-16
2010
[Talk]
A Parallel Array Abstraction for Data-Driven Objects [No Conference 2010]
95-19
2010
[Talk]
Efficient Implementation of High Performance Fortran via Adaptive Scheduling - An Overview [No Conference 2010]
95-18
2010
[Talk]
Modularity, Reuse and Efficiency with Message-Driven Libraries [No Conference 2010]
95-17
2010
[Talk]
Agents: An Undistorted Representation of Problem Structure [No Conference 2010]
| J.Yelon J.Yelon | Laxmikant Kale
94-06
2010
[Talk]
Dagger: Combining Benefits of Synchronous and Asynchronous Communication Styles [No Conference 2010]
91-10
2010
[Talk]
Supporting Machine Independent Parallel Programming on Diverse Parallel Architectures [No Conference 2010]
| Wayne Fenton | B. Ramkumar | Vikram Saletore | Amitabh Sinha | Laxmikant Kale