QuIT your B+-tree for the Quick Insertion Tree

Aneesh Raman; Konstantinos Karatsenidis; Shaolin Xie; Matthaios Olma; Subhadeep Sarkar; Manos Athanassoulis

doi:10.48786/edbt.2025.36

Back

Conference paper

QuIT your B+-tree for the Quick Insertion Tree

Aneesh Raman, Konstantinos Karatsenidis, Shaolin Xie, Matthaios Olma, Subhadeep Sarkar and Manos Athanassoulis

In Proceedings of the International Conference on Extending Database Technology (EDBT) (Barcelona, Spain)

DOI: https://doi.org/10.48786/edbt.2025.36

Abstract

Database Technology

Indexing Data

Data Structures & Algorithms

Big Data

Data Systems

Search trees, like B⁺-trees, are often used as index structures in data systems to improve query performance at the cost of index construction and maintenance. For state-of-the-art B⁺-tree designs used in commercial data systems, this cost is negligible if the data arrives as fully sorted on the index attribute. Further, production systems employ a fast-path ingestion technique for B⁺-trees that directly appends the incoming entries to the tail leaf if the data is fully sorted, drastically reducing the index construction cost. However, this is only effective if the incoming data arrives fully sorted or with an extremely small number of out-of-order entries. In addition, the state-of-the-art sortedness-aware design (SWARE) navigates a tradeoff between reads and writes by buffering incoming data to absorb near-sortedness, which comes at the cost of slower query performance and increased overall design complexity.

To address these challenges, we present Quick Insertion Tree (QuIT), a sortedness-aware indexing data structure that improves ingestion performance with minimal design complexity and no read overhead. QuIT maintains in memory a pointer to the predicted ordered-leaf (pole) that provides a sortedness-aware fast-path optimization, and facilitates faster index ingestion. The key benefit comes from accurately predicting pole throughout data ingestion. Further, QuIT achieves high memory utilization by maintaining tightly packed leaf nodes when the ingested data arrives as near-sorted. This, in turn, helps improve performance during range lookups. Overall, we demonstrate that QuIT outperforms B⁺-tree (SWARE) by up to 3× (2×) for ingestion, while maintaining the same point lookup performance (up to 1.23× faster). QuIT also accesses up to 2× fewer leaf nodes than the B⁺-tree during range lookups.

Files and links (1)

pdf

QuIT_your_B+_tree_for_the_Quick_Insertion_Tree4.55 MBDownload View

Open Access

Metrics

1 Record Views

Details

Title: QuIT your B+-tree for the Quick Insertion Tree
Creators: Aneesh Raman (Corresponding Author)
Konstantinos Karatsenidis (Author)
Shaolin Xie (Author)
Matthaios Olma (Author)
Subhadeep Sarkar (Author)
Manos Athanassoulis (Author)
Conference: In Proceedings of the International Conference on Extending Database Technology (EDBT) (Barcelona, Spain)
Identifiers: 9924457450801921
Academic Unit: Michtom School of Computer Science
Language: English
Resource Type: Conference paper

QuIT your B+-tree for the Quick Insertion Tree

Abstract

Files and links (1)

Metrics

Details

Brandeis University Social media