PARALLEL DATA LAB 

PDL Abstract

TABLEFS: Embedding a NoSQL Database Inside the Local File System

Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-103. May 2012.

Kai Ren, Garth A. Gibson

School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213

http://www.pdl.cmu.edu/

Conventional file systems are optimzed for large file transfers instead of workloads that are dominated by metadata and small file accesses. This paper examines using techniques adopated from NoSQL databases to manage file system metadata and small files, which feature high rate of changes and efficient out-of-core data representation. A FUSE file system prototype was built by storing file system metadata and small files into a modern key-value store LevelDB. We demonstrate that such techniques can improve the performance of modern local file systems in Linux as much as an order of magnitude for workloads dominated by metadata and tiny files.

KEYWORDS: TableFS, File System, File System Metadata, NoSQL Database, LSM Tree

FULL TR: pdf