Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory
Sparse tensor contractions appear commonly in many applications. Efficiently computing a two sparse tensor product is challenging: It not only inherits the challenges from common sparse matrix-matrix multiplication (SpGEMM), i.e., indirect memory access and unknown output size before computation, but also raises new challenges because of high dimensionality of tensors, expensive multi-dimensional index search, and massive intermediate and output data. To address the above challenges, we introduce three optimization techniques by using multi-dimensional, efficient hash table representation for the accumulator and larger input tensor, and all-stage parallelization. Evaluating with 15 datasets, we show that Sparta brings 28 − 576 x speedup over traditional sparse tensor contraction with SPA. With our proposed algorithm- and memory heterogeneity-aware data management, Sparta brings extra performance improvement on the heterogeneous memory with DRAM and Intel Optane DC Persistent Memory Module (PMM) over a state-of-the-art software-based data management solution, a hardware-based data management solution, and PMM-only by 30.7% (up to 98.5%), 10.7% (up to 28.3%) and 17% (up to 65.1%) respectively.
Wed 3 MarDisplayed time zone: Eastern Time (US & Canada) change
10:00 - 11:00 | |||
10:00 15mTalk | EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision Main Conference Boyuan Feng UC Santa Barbara, Yuke Wang UC Santa Barbara, Guoyang Chen Alibaba Group US Inc., Weifeng Zhang Alibaba Group US Inc., Yuan Xie UCSB, Yufei Ding UC Santa Barbara Link to publication | ||
10:15 15mTalk | Efficiently Running SpMV on Long Vector Architectures Main Conference Constantino Gómez Barcelona Supercomputing Center, Filippo Mantovani Barcelona Supercomputing Center, Erich Focht NEC, Marc Casas Barcelona Supercomputing Center Link to publication | ||
10:30 15mTalk | Improving Communication by Optimizing On-Node Data Movement with Data Layout Main Conference Tuowen Zhao University of Utah, Mary Hall University of Utah, Hans Johansen Lawrence Berkeley National Laboratory, Samuel Williams Lawrence Berkeley National Laboratory Link to publication | ||
10:45 15mTalk | Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory Main Conference Jiawen Liu University of California, Merced, Jie Ren University of California, Merced, Roberto Gioiosa Pacific Northwest National Laboratory, Dong Li University of California, Merced, Jiajia Li Pacific Northwest National Laboratory Link to publication |