Scaling Implicit Parallelism via Dynamic Control Replication (PPoPP 2021 - Main Conference)

Who

Michael Bauer, Wonchan Lee, Elliott Slaughter, Zhihao Jia, Mario Di Renzo, Manolis Papadakis, Galen Shipman, Patrick McCormick, Michael Garland, Alex Aiken

Track

PPoPP 2021 Main Conference

Time Zone

The program is currently displayed in (GMT-05:00) Eastern Time (US & Canada).

When

Mon 1 Mar 2021 11:55 - 12:10, Session 2. Compilers, Analysis, Synthesis. Chair(s): Milind Chabbi

Abstract

We present dynamic control replication, a run-time program analysis that enables scalable execution of implicitly parallel programs on large machines through a distributed and efficient dynamic dependence analysis. Dynamic control replication distributes dependence analysis by executing multiple copies of an implicitly parallel program while ensuring that they still collectively behave as a single execution. By distributing and parallelizing the dependence analysis, dynamic control replication supports efficient, on-the-fly computation of dependences for programs with arbitrary control flow at scale. We describe an asymptotically scalable algorithm for implementing dynamic control replication that maintains the sequential semantics of implicitly parallel programs.

An implementation of dynamic control replication in the Legion runtime delivers the same programmer productivity as writing in other implicitly parallel programming models, such as Dask or TensorFlow, while providing better performance (11.4X and 14.9X respectively in our experiments), and scalability to hundreds of nodes. We also show that dynamic control replication provides good absolute performance and scaling for HPC applications, competitive in many cases with explicitly parallel programming systems.

Link to Publication

https://dl.acm.org/doi/10.1145/3437801.3441587

Michael Bauer

NVIDIA

Wonchan Lee

NVIDIA

Elliott Slaughter

SLAC National Accelerator Laboratory

Zhihao Jia

Carnegie Mellon University

Mario Di Renzo

Sapienza University of Rome

Manolis Papadakis

NVIDIA

Galen Shipman

Los Alamos National Laboratory

Patrick McCormick

Los Alamos National Laboratory

Michael Garland

NVIDIA

Alex Aiken

Stanford University

Session Program

Mon 1 Mar
Displayed time zone: Eastern Time (US & Canada)

11:10 - 12:10	Session 2. Compilers, Analysis, Synthesis (Main Conference). Chair(s): Milind Chabbi (Uber Technologies)

11:10 15m Talk	Synthesizing Optimal Collective Algorithms (Main Conference). Zixian Cai (Australian National University), Zhengyang Liu (University of Utah), Saeed Maleki (Microsoft Research), Madan Musuvathi (Microsoft Research), Todd Mytkowicz (Microsoft Research), Jacob Nelson (Microsoft Research), Olli Saarikivi (Microsoft Research, Redmond)
11:25 15m Talk	Parallel Binary Code Analysis (Main Conference). Xiaozhu Meng (Rice University), Jonathon Anderson (Rice University), John Mellor-Crummey (Rice University), Mark W. Krentel (Rice University), Barton P. Miller (University of Wisconsin - Madison), Srđan Milaković (Rice University)
11:40 15m Talk	Compiler Support for Near Data Computing (Main Conference). Mahmut Taylan Kandemir (Penn State University, USA), Jihyun Ryoo (Penn State University, USA), Xulong Tang (University of Pittsburgh, USA), Mustafa Karakoy (TUBITAK-BILGEM, Turkey)
11:55 15m Talk	Scaling Implicit Parallelism via Dynamic Control Replication (Main Conference). Michael Bauer (NVIDIA), Wonchan Lee (NVIDIA), Elliott Slaughter (SLAC National Accelerator Laboratory), Zhihao Jia (Carnegie Mellon University), Mario Di Renzo (Sapienza University of Rome), Manolis Papadakis (NVIDIA), Galen Shipman (Los Alamos National Laboratory), Patrick McCormick (Los Alamos National Laboratory), Michael Garland (NVIDIA), Alex Aiken (Stanford University)