Department of Computer Science

PhD defence: Adaptive Lightweight Compression Acceleration on Hybrid CPU-FPGA System

On Thursday December 10, 2020, Nusrat Jahan Lisa will defend her PhD thesis "Adaptive Lightweight Compression Acceleration on Hybrid CPU-FPGA System".

Time

10.12.2020 kl. 16.30 - 18.30

Description

On Thursday, December 10, Nusrat Jahan Lisa will defend her PhD thesis. Due to the current circumstance, the defense will be carried out online via Zoom, using this link: 

https://tu-dresden.zoom.us/j/6975581484?pwd=MDZ4R1A2ZVJLMks3aFhLSUVJb21tQT09

Online rules:

  • Be aware that you must be muted during the entire session
  • Please leave your camera off
  • For questions, please use the chat function
  • The defense will start at precisely 16:30. The session is open from 16:00. You are not allowed to join after the defense starts, neither during the break or examination

Abstract

With an increasingly large amount of data being collected in numerous application areas, the importance of online analytical processing (OLAP) workloads increases constantly. OLAP queries typically access only a small number of columns but a high number of rows and are, thus, most efficiently executed by column-stores. With the significant developments in the main memory domain even large datasets can be entirely held in the main memory.

Thus, main memory column-stores have been established as state-of-the-art for OLAP scenarios. In these systems, all values of every column are encoded as a sequence of integer values and, thus, query processing is completely done on these integer sequences.

To improve query processing, vectorization based the Single Instruction Multiple Data (SIMD) parallel paradigm is a state-of-the-art technique. Aside from vectorization, lightweight integer compression algorithms also play an important role to reduce the necessary memory space.

Unfortunately, there is no single-best lightweight integer compression algorithm, and the algorithm selection decision depends most importantly on the data characteristics. Nevertheless, vectorization and integer compression complement each other, and the combined usage improves the query performance.

Unfortunately, the benefits of vectorization are limited on modern x86-processors due to predefined and fixed SIMD instruction set extensions. Nowadays, the Field Programmable Gate Array (FPGA) offers a novel opportunity with regard to hardware reconfigurable capability. For example, we can use an arbitrary length of processor word in FPGA leading to a higher performance, we can prepare proper pipeline-based custom-made database accelerators, and we can develop embedded systems through utilizing such accelerators.

Moreover, modern hybrid CPU-FPGA systems have a direct data communication channel between the main memory and FPGA which is useful for throughput acceleration. Based on these advantages, this thesis examines the utilization of FPGA for main memory column-stores. This examination is two-fold.

First, we investigate the column scan on compressed data as important operation and second, we systematically look at lightweight integer compression. These two aspects are considered from the hardware perspective to guarantee a certain level of query performance acceleration. In particular, this thesis explores different embedded design options and proposes an adaptive lightweight integer compression system.

Based on a comprehensive evaluation, we find out the optimal design constraint as per implementation mechanism for column scan and lightweight integer compression. Finally, we conclude this thesis by mentioning our upcoming research activities.

 

Host

Dresden University/Aalborg University

More information

https://tu-dresden.zoom.us/j/6975581484?pwd=MDZ4R1A2ZVJLMks3aFhLSUVJb21tQT09

Go to event list

Department of Computer Science, Aalborg University

Selma Lagerlöfs Vej 300
9220 Aalborg East, Denmark

Questions and inputs to the intranet?