PARALLEL DATA LAB 

PDL Code Distribution

Ext4-lazy

The original ext4-lazy patch for Linux kernel version 4.6 is here.

It has since been integrated into the unstable portion of the ext4 patch queue in the Ext4 filesystem tree. The most up-to-date version can be found here: http://git.kernel.org/cgit/linux/kernel/git/tytso/ext4.git

IterStore

IterStore is a parameter server library for iterative convergent machine learning applications. Our example applications include matrix factorization, LDA, multi-class logistic regression, and PageRank.

GeePS

GeePS is a parameter server library that scales single-machine GPU machine learning applications (such as Caffe) to a cluster of machines.

Query/Update Protocol

Process Scheduling

  • Cluster Scheduling for Explicitly-speculative Tasks
    David Petrou. Carnegie Mellon University Ph.D. Dissertation. CMU-PDL-04-112, September 2004.
    Abstract / PDF [4.2M] / Simulator Code

DiskSim

NASD

RAIDframe

TIP (Transparent Informed Prefetching)