PAT is a page-affinity-based transaction routing scheme for shared-cache cloud-native databases.
Cloud-native databases decouple compute from storage to achieve scalability, elasticity, and high availability. Most such databases adopt a shared-cache architecture, in which compute nodes jointly maintain a shared cache that buffers pages from shared storage. During transaction execution, local cache misses incur significant page-transfer overhead, while exclusive page access for row writes causes expensive inter-node transaction blocking; both increase transaction latency. Existing shared-cache databases overlook the distribution of cached pages across compute nodes and route transactions based solely on the workload's access patterns, which aggravates these overheads.
We propose PAT, a novel transaction routing mechanism that uses page affinity to route transactions, reducing local cache misses and page contention and significantly improving system performance. PAT uses key ranges to identify page affinity and routes transactions that access pages with strong affinity to the same compute node. Through periodic page reorganization, rows accessed by the same node are stored in the same set of pages, allowing different compute nodes to cache distinct pages and further reducing page contention. Experiments show that PAT achieves 2.42x to 14.36x higher throughput than state-of-the-art approaches under the TPC-C and YCSB workloads.
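For illustration, here is a minimal C++ sketch of key-range-based affinity routing, assuming affinity is tracked as disjoint key ranges mapped to compute nodes (the names RangeRouter, assign, and route are hypothetical, not PAT's actual API):

```cpp
#include <cstdint>
#include <map>
#include <utility>

// Hypothetical sketch: map disjoint key ranges to compute nodes so that
// transactions touching page-affine keys are routed to the same node.
class RangeRouter {
 public:
  // Assign the key range [lo, hi) to a compute node.
  void assign(uint64_t lo, uint64_t hi, int nodeId) {
    ranges_[lo] = {hi, nodeId};
  }

  // Route a transaction by a key it accesses: find the range whose
  // lower bound is the greatest one <= key, then check coverage.
  int route(uint64_t key) const {
    auto it = ranges_.upper_bound(key);
    if (it == ranges_.begin()) return 0;  // no covering range: fallback node
    --it;
    return key < it->second.first ? it->second.second : 0;
  }

 private:
  std::map<uint64_t, std::pair<uint64_t, int>> ranges_;  // lo -> (hi, node)
};

int main() {
  RangeRouter r;
  r.assign(0, 1000, /*nodeId=*/1);     // keys [0, 1000) have affinity to node 1
  r.assign(1000, 2000, /*nodeId=*/2);  // keys [1000, 2000) to node 2
  return r.route(1500) == 2 ? 0 : 1;   // a transaction on key 1500 -> node 2
}
```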
PAT is developed and tested on Ubuntu 20.04 with Linux kernel version 5.15.0-101-generic. It should also work on other Unix-like distributions. Building PAT requires the following software installed as dependencies:
- gflags
- libaio
- ibverbs
- tabulate
- rdma-cm
- cmake
- spdlog
We are using huge pages for the memory buffers. Reserve them with the following command, replacing N with the desired number of 2 MB huge pages:

```
echo N | sudo tee /sys/devices/system/node/node0/hugepages/hugepages-2048kB/nr_hugepages
```
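As a minimal illustration (an assumption about usage, not PAT's actual allocator), such a buffer can be backed by the reserved huge pages via mmap with MAP_HUGETLB:

```cpp
#include <sys/mman.h>
#include <cstddef>
#include <cstdio>
#include <cstdlib>

int main() {
  // Illustrative only: back a 1 GiB buffer with the 2 MB huge pages
  // reserved via nr_hugepages above.
  const std::size_t size = 1ull << 30;
  void* buf = mmap(nullptr, size, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS | MAP_HUGETLB, -1, 0);
  if (buf == MAP_FAILED) {
    std::perror("mmap(MAP_HUGETLB) failed; check nr_hugepages");
    return EXIT_FAILURE;
  }
  // ... use buf as a page buffer ...
  munmap(buf, size);
  return EXIT_SUCCESS;
}
```

To configure the servers and their IPs, the following configuration in ScaleStore/shared-headers/Defs.hpp and Proxy/shared-headers/Defs.hpp needs to be adapted: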
```cpp
const std::vector<std::vector<std::string>> NODES{
    {""}, // 0 to allow direct offset
    {"172.18.94.80"}, // 1
    {"172.18.94.80", "172.18.94.70"}, // 2
    {"172.18.94.80", "172.18.94.70", "172.18.94.10"}, // 3
    {"172.18.94.80", "172.18.94.70", "172.18.94.10", "172.18.94.20"}, // 4
    {"172.18.94.80", "172.18.94.70", "172.18.94.10", "172.18.94.20", "172.18.94.40"}, // 5
    {"172.18.94.80", "172.18.94.70", "172.18.94.10", "172.18.94.20", "172.18.94.40", "172.18.94.30"}, // 6
};
```
The last column indicates the IP of the router; the other columns indicate the IPs of the servers.
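For illustration only, here is a sketch of how a process might resolve the server and router IPs from NODES for a given --nodes value (this helper is hypothetical, not code from the repository):

```cpp
#include <cstddef>
#include <string>
#include <vector>

struct ClusterConfig {
  std::vector<std::string> servers;  // server IPs (all but the last column)
  std::string router;                // router IP (the last column)
};

// Assumes row k of NODES lists the IPs used when running with --nodes=k.
ClusterConfig resolve(const std::vector<std::vector<std::string>>& nodes,
                      std::size_t numNodes) {
  const auto& row = nodes.at(numNodes);  // row index matches --nodes
  return ClusterConfig{
      std::vector<std::string>(row.begin(), row.end() - 1),  // servers
      row.back()};                                           // router
}
```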
The main configuration files for our experiments can be found in *.ini, and the execution scripts in run*.sh.
```
mkdir build
cd build
```

We can build the executable either in debug mode with address sanitizers enabled:

```
cmake -D CMAKE_C_COMPILER=gcc-10 -D CMAKE_CXX_COMPILER=g++-10 -DCMAKE_BUILD_TYPE=Debug -DSANI=On .. && make -j
```

or in release mode:

```
cmake -D CMAKE_C_COMPILER=gcc-10 -D CMAKE_CXX_COMPILER=g++-10 -DCMAKE_BUILD_TYPE=Release .. && make -j
```

For example, a single-node YCSB run:

```
numactl --membind=0 --cpunodebind=0 ./frontend/ycsb --ownIp=10.0.0.89 --nodes=1 --dramGB=4 --port=1401 --messageHandlerThreads=20 --YCSB_read_ratio=50 --YCSB_tuple_count=1000000000 --sqlSendThreads=4 --route_mode=3 --partition_mode=2 --YCSB_zipf_factor=0.99
```

And a two-node YCSB run with the proxy enabled:

```
numactl --membind=0 --cpunodebind=0 ./frontend/ycsb --ownIp=172.18.94.80 --nodes=2 --port=1400 --worker=20 --dramGB=150 --ssd_path=/dev/md0 --ssd_gib=400 --pageProviderThreads=4 --YCSB_run_for_seconds=300 --messageHandlerThreads=4 --sqlSendThreads=4 --use_proxy=true --use-codesign=true --YCSB_tuple_count=1000000000 --YCSB_zipf_factor=0.99
```