Proteus is a database engine designed for today's heterogeneous environments. Proteus adapts to variable data, hardware and workloads through a combination of GPU acceleration, data virtualization, and adaptive scheduling.
HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines
VLDB 2019.Abstract
Modern server hardware is increasingly heterogeneous as hardware accelerators, such as GPUs, are used together with multicore CPUs to meet the computational demands of modern data analytics workloads. Unfortunately, query parallelization techniques used by analytical database engines are designed for homogeneous multicore servers, where query plans are parallelized across CPUs to process data stored in cache coherent shared memory. Thus, these techniques are unable to fully exploit available heterogeneous hardware, where one needs to exploit task-parallelism of CPUs and data-parallelism of GPUs for processing data stored in a deep, noncache-coherent memory hierarchy with widely varying access latencies and bandwidth.
In this paper, we introduce HetExchange-a parallel query execution framework that encapsulates the heterogeneous parallelism of modern multi-CPU-multi-GPU servers and enables the parallelization of (pre-)existing sequential relational operators. In contrast to the interpreted nature of traditional Exchange, HetExchange is designed to be used in conjunction with JIT compiled engines in order to allow a tight integration with the proposed operators and generation of efficient code for heterogeneous hardware. We validate the applicability and efficiency of our design by building a prototype that can operate over both CPUs and GPUs, and enables its operators to be parallelism- and data-location-agnostic. In doing so, we show that efficiently exploiting CPU-GPU parallelism can provide 2.8x and 6.4x improvement in performance compared to state-of-the-art CPU-based and GPU-based DBMS.
Links
@article{DBLP:journals/pvldb/ChrysogelosKAA19, author = {Periklis Chrysogelos and Manos Karpathiotakis and Raja Appuswamy and Anastasia Ailamaki}, title = {HetExchange: Encapsulating heterogeneous {CPU-GPU} parallelism in {JIT} compiled engines}, journal = {Proc. {VLDB} Endow.}, volume = {12}, number = {5}, pages = {544--556}, year = {2019}, url = {http://www.vldb.org/pvldb/vol12/p544-chrysogelos.pdf}, doi = {10.14778/3303753.3303760}, timestamp = {Sat, 25 Apr 2020 13:59:33 +0200}, biburl = {https://dblp.org/rec/journals/pvldb/ChrysogelosKAA19.bib}, bibsource = {dblp computer science bibliography, https://dblp.org} }