ShadowVM: Accelerating Data Plane for Data Analytics with Bare Metal CPUs and GPUs
With the development of the big data ecosystem, large-scale data analytics has become more prevalent in the past few years. Apache Spark, etc., provide a flexible approach for scalable processing upon massive data. However, they are not designed for handling computing-intensive workloads due to the restrictions of JVM runtime. In contrast, GPU has been the de facto accelerator for graphics rendering and deep learning in recent years. Nevertheless, the current architecture makes it difficult to take advantage of GPUs and other accelerators in the big data world.
Now, it is time to break down this obstacle by changing the fundamental architecture. To integrate accelerators efficiently, we decouple the control plane and the data plane within big data systems via action shadowing. The control plane keeps logic information to fit well with the host systems like Spark, while the data plane holds data and performs execution upon bare metal CPUs and GPUs. Under this decoupled architecture, both the control plane and the data plane could leverage the appropriate approaches without breaking existing mechanisms. Based on this idea, we implement an accelerated data plane, namely ShadowVM. In our experiments on the SSB benchmark, ShadowVM lifts the JVM-based Spark with up to 14.7x speedup. Furthermore, ShadowVM could also outperform the GPU-only fashion by adopting mixed CPU-GPU execution.
Mon 1 MarDisplayed time zone: Eastern Time (US & Canada) change
12:30 - 13:30 | |||
12:30 15mTalk | Understanding and Bridging the Gaps in Current GNN Performance Optimizations Main Conference Kezhao Huang Tsinghua University, Jidong Zhai Tsinghua University, Zhen Zheng Alibaba Group, Youngmin Yi University of Seoul, Xipeng Shen North Carolina State University Link to publication | ||
12:45 15mTalk | A Fast Work-Efficient SSSP Algorithm for GPUs Main Conference Kai Wang University of Texas at Austin, Donald Fussell University of Texas at Austin, Calvin Lin University of Texas at Austin Link to publication | ||
13:00 15mTalk | ShadowVM: Accelerating Data Plane for Data Analytics with Bare Metal CPUs and GPUs Main Conference Zhifang Li East China Normal University, Mingcong Han East China Normal University, Shangwei Wu East China Normal University, Chuliang Weng East China Normal University Link to publication | ||
13:15 15mTalk | BiPart: A Parallel and Deterministic Hypergraph Partitioner Main Conference Sepideh Maleki The University of Texas at Austin, Udit Agarwal UT Austin, Martin Burtscher Texas State University, Keshav Pingali The University of Texas at Austin Link to publication |