ApproxTuner: A Compiler and Runtime System for Adaptive Approximations (PPoPP 2021 - Main Conference)

Who

Hashim Sharif, Yifan Zhao, Maria Kotsifakou, Akash Kothari, Ben Schreiber, Elizabeth Wang, Yasmin Sarita, Nathan Zhao, Keyur Joshi, Vikram S. Adve, Sasa Misailovic, Sarita Adve

Track

PPoPP 2021 Main Conference

Time Zone

The program is currently displayed in (GMT-05:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-05:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 2 Mar 2021 11:50 - 12:10 - Session 5. Auto Tuning Chair(s): Saeed Maleki

Abstract

Manually optimizing the tradeoffs between accuracy, performance and energy for resource-intensive applications with flexible accuracy or precision requirements is extremely difficult. We present ApproxTuner, an automatic framework for accuracy-aware optimization of tensor-based applications while requiring only high-level end-to-end quality specifications. ApproxTuner implements and manages approximations in algorithms, system software, and hardware.

The key contribution in ApproxTuner is a novel three-phase approach to approximation-tuning that consists of development-time, install-time, and run-time phases. Our approach decouples tuning of hardware-independent and hardware-specific approximations, thus providing retargetability across devices. To enable efficient autotuning of approximation choices, we present a novel accuracy-aware tuning technique called predictive approximation-tuning, which speeds up autotuning by analytically predicting the accuracy impacts of approximations.

We evaluate ApproxTuner across 10 convolutional neural networks (CNNs) and a combined CNN and image processing benchmark. For the evaluated CNNs, using only hardware-independent approximation choices we achieve a mean speedup of 2.1x (max 2.7x) on a GPU, and 1.3x mean speedup (max 1.9x) on the CPU, while staying within 1 percentage point of inference accuracy loss. For two different accuracy-prediction models, ApproxTuner speeds up tuning by 12.8x and 20.4x compared to conventional empirical tuning while achieving comparable benefits.

Link to Publication

https://dl.acm.org/doi/10.1145/3437801.3446108

Hashim Sharif

University of Illinois at Urbana Champaign

Yifan Zhao

University of Illinois at Urbana Champaign

Maria Kotsifakou

Runtime Verification, Inc.

Akash Kothari

University of Illinois at Urbana Champaign

Ben Schreiber

University of Illinois at Urbana Champaign

Elizabeth Wang

University of Illinois at Urbana Champaign

Yasmin Sarita

Cornell University

Nathan Zhao

University of Illinois at Urbana-Champaign

Keyur Joshi

University of Illinois at Urbana-Champaign

United States

Vikram S. Adve

University of Illinois at Urbana-Champaign

Sasa Misailovic

University of Illinois at Urbana-Champaign

Sarita Adve

University of Illinois at Urbana-Champaign

Time Zone

The program is currently displayed in (GMT-05:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-05:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 2 Mar
Displayed time zone: Eastern Time (US & Canada) change

11:10 - 12:10	Session 5. Auto TuningMain Conference Chair(s): Saeed Maleki Microsoft Research

11:10 20m Talk		GPTune: Multitask Learning for Autotuning Exascale Applications Main Conference Yang Liu , Wissam M. Sid-Lakhdar Lawrence Berkeley National Laboratory, Osni Marques Lawrence Berkeley National Laboratory, Xinran Zhu Cornell University, Chang Meng Emory University, James W. Demmel UC Berkeley, Xiaoye S. Li Lawrence Berkeley National Laboratory Link to publication
11:30 20m Talk		I/O Lower Bounds for Auto-tuning of Convolutions in CNNs Main Conference Xiaoyang Zhang Institute of Computing Technology, Chinese Academy of Sciences, Junmin Xiao Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Institute of Computing Technology, Chinese Academy of Sciences Link to publication
11:50 20m Talk		ApproxTuner: A Compiler and Runtime System for Adaptive Approximations Main Conference Hashim Sharif University of Illinois at Urbana Champaign, Yifan Zhao University of Illinois at Urbana Champaign, Maria Kotsifakou Runtime Verification, Inc., Akash Kothari University of Illinois at Urbana Champaign, Ben Schreiber University of Illinois at Urbana Champaign, Elizabeth Wang University of Illinois at Urbana Champaign, Yasmin Sarita Cornell University, Nathan Zhao University of Illinois at Urbana-Champaign, Keyur Joshi University of Illinois at Urbana-Champaign, Vikram S. Adve University of Illinois at Urbana-Champaign, Sasa Misailovic University of Illinois at Urbana-Champaign, Sarita Adve University of Illinois at Urbana-Champaign Link to publication