PDL Technical Reports

2020

  • CMU-PDL-20-101: Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM. Rajat Kateja, Andy Pavlo, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-20-101, April 2020.
    Abstract / PDF [665K]

2019

  • CMU-PDL-19-105: TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
    Abstract / PDF [975K]

  • CMU-PDL-19-104: Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
    Abstract / PDF [574K]

  • CMU-PDL-19-103: This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • CMU-PDL-19-102: Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Greg Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
    Abstract / PDF [735K]

  • CMU-PDL-19-101: Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
    Abstract / PDF [800K]

2018

  • CMU-PDL-18-102: Tributary: Spot-dancing for elastic services with latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102, Jan. 2018. Superseded by 2018 USENIX Annual Technical Conference. July 11–13, 2018, Boston, MA, USA.
    Abstract / PDF [990K]

  • CMU-PDL-18-101: Addressing the Long-Lineage Bottleneck in Apache Spark. Haoran Wang, Jinliang Wei, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-101, January 2018.
    Abstract / PDF [250K]

2017

  • CMU-PDL-17-107: 3Sigma: Distribution-based cluster scheduling for runtime uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-107, November 2017. Superseded by EuroSys ’18, April 23–26, 2018, Porto, Portugal.
    Abstract / PDF [800K]

  • CMU-PDL-17-106: Aging Gracefully with Geriatrix: A File System Aging Tool. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-106, October 2017. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [560K]

  • CMU-PDL-17-105: A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-105, October 2017.
    Abstract / PDF [280K]

  • CMU-PDL-17-104: Bigger, Longer, Fewer: What do cluster jobs look like outside Google? George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-104, October 2017.
    Abstract / PDF [360K]

  • CMU-PDL-17-103: Litz: An Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-17-103. June 2017.
    Abstract / PDF [424K]

  • CMU-PDL-17-102: Not Published

  • CMU-PDL-17-101: Not Published

2016

  • CMU-PDL-16-109: A Survey of Security Vulnerabilities in Bluetooth Low Energy Beacons. Hui Jun Tay, Jiaqi Tan, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-109. November 2016.
    Abstract / PDF [110K]

  • CMU-PDL-16-108: MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, and Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-108, October 2016.
    Abstract / PDF [900K]

  • CMU-PDL-16-107: Benchmarking Apache Spark with Machine Learning Applications. Jinliang Wei, Jin Kyu Kim, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-107, October 2016.
    Abstract / PDF [360K]

  • CMU-PDL-16-106: Ext4-zcj: An Evolved Journal Optimized for Drive-Managed Shingled Magnetic Recording Disks. Abutalib Aghayev, Theodore Ts’o, Garth Gibson, Peter Desnoyers. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-106. October 2016. Superseded by 15th USENIX Conference on File and Storage Technologies (FAST '17), Feb 27–Mar 2, 2017. Santa Clara, CA.
    Abstract / PDF [1.1M]

  • CMU-PDL-16-105: Aging Gracefully with Geriatrix: A File System Aging Suite. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [503K]

  • CMU-PDL-16-104: JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes. Alexey Tumanov, Angela Jiang, Jun Woo Park, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-104. September 2016.
    Abstract / PDF [1.6M]

  • CMU-PDL-16-103: Soundness Proofs for Iterative Deepening. Ben Blum. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-103, September 6, 2016.
    Abstract / PDF [356K]

  • CMU-PDL-16-102: TierML: Using Tiers of Reliability for Agile Elasticity in Machine Learning. Aaron Harlap, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016. Superseded by "Proteus: Agile ML Elasticity through Tiered Reliability in Dynamic Resource Markets" (EuroSys'17).
    Abstract / PDF [590K]

  • CMU-PDL-16-101: Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

2015

  • CMU-PDL-15-107: Scalable Deep Learning on Distributed GPUs with a GPU-specialized Parameter Server. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-107, October 2015.
    Abstract / PDF [537K]

  • CMU-PDL-15-106: Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy, Krishnamurthy Viswanathan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-106. April 2015. Extended version of the 2015 SoCC paper.

  • CMU-PDL-15-105: Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-105. April 2015.
    Abstract / PDF [2.62M]

  • CMU-PDL-15-104: ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-104, April 2015.
    Abstract / PDF [696K]

  • CMU-PDL-15-103: SMPFRAME: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth Gibson, Eric Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-103, May 2015.
    Abstract / PDF [1.57M]

  • CMU-PDL-15-102: Addressing the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2016. Oct 5-7, Santa Clara, CA. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • CMU-PDL-15-101: Caveat-Scriptor: Write Anywhere Shingled Disks. Saurabh Kadekodi, Swapnil Pimpale, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-101. Shorter version published in Proc. of the Seventh USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’15), Santa Clara, CA, July 2015.
    Abstract / PDF [672K]

  • CMU-CS-15-132: Resource-Efficient Data-Intensive System Designs for High Performance and Capacity. Hyeontaek Lim. Carnegie Mellon University PhD Dissertation CMU-CS-15-132, September 2015.
    Abstract / PDF [3.1M]

2014

  • CMU-PDL-14-109: A Comparative Study of Baremetal Provisioning Frameworks. Ashok Chandrasekar, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-14-109, December 2014.
    Abstract / PDF [447K]

  • CMU-PDL-14-108: Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Jin Li, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-108. December 2014.
    Abstract / PDF [646K]

  • CMU-PDL-14-107: Exploiting Iterative-ness for Parallel ML Computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Greg R. Ganger, Phil B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-107.
    Abstract / PDF [609K]

  • CMU-PDL-14-106: Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David G. Andersen. ACM SIGCOMM 2014. Chicago, Illinois, August 17-22, 2014. Supersedes CMU-PDL-14-106, June 2014.
    Abstract / PDF [462K]

  • CMU-PDL-14-105: Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-105. May 2014.
    Abstract / PDF [444K]

  • CMU-PDL-14-104: Superseded by CMU-PDL-14-107.

  • CMU-PDL-14-103: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-103. May 2014.
    Abstract / PDF [763K]

  • CMU-PDL-14-102: So, You Want To Trace Your Distributed System? Key Design Insights from Years of Practical Experience. Raja R. Sambasivan, Rodrigo Fonseca, Ilari Shafer, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-102, April 2014.
    Abstract / PDF [870K]

  • CMU-PDL-14-101: Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar, Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 USENIX Annual Technical Conference (ATC'14). June 19-20, 2014. Philadelphia, PA. Supersedes CMU-PDL-14-101.
    Abstract / PDF [731K]

  • CMU-CS-14-144: Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • CMU-CS-14-133: Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

2013

  • CMU-PDL-13-113: Challenges in Security and Privacy for Mobile Edge-Clouds. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-113. October, 2013.
    Abstract / PDF [212K]

  • CMU-PDL-13-112: Tetrisched: Space-Time Scheduling for Heterogeneous Datacenters. Alexey Tumanov, Timothy Zhu, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-112, December, 2013.
    Abstract / PDF [716K]

  • CMU-PDL-13-111: A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-111. August 2013.
    Abstract / PDF [2.3M]

  • CMU-PDL-13-110: Toward Strong, Usable Access Control for Shared Distributed Data. Michelle L. Mazurek, Yuan Liang, Manya Sleeper, Lujo Bauer, Gregory R. Ganger, Nitin Gupta, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-110. June 2013.
    Abstract / PDF [1M]

  • CMU-PDL-13-109: Automated Diagnosis of Chronic Performance Problems in Production Systems. Soila P. Kavulya. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-109, May 2013.
    Abstract / PDF [12.6M]

  • CMU-PDL-13-108: Metadata Optimization for Shingled Disks. Pavan K Alampalli. Information Networking Institute Master of Science Thesis. CMU-PDL-13-108, May, 2013.
    Abstract / PDF [500K]

  • CMU-PDL-13-107: Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • CMU-PDL-13-106: PETAL: Preset Encoding Table Information Leakage. Jiaqi Tan, Jayvardhan Nahata. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-106, April 2013.
    Abstract / PDF [294K]

  • CMU-PDL-13-105: Diagnosing Performance Changes in Distributed Systems by Comparing Request Flows. Raja R. Sambasivan. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-105, May 2013.
    Abstract / PDF [3.9M]

  • CMU-PDL-13-104: Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-104 (supersedes CMU-PDL-12-102), April 2013.
    Abstract / PDF [1.93M]

  • CMU-PDL-13-103: Not Published

  • CMU-PDL-13-102: TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-102, January 2013. Revised version of CMU-PDL-12-110.
    Abstract / PDF [798K]

  • CMU-PDL-13-101: Giga+TableFS on PanFS: Scaling Metadata Performance on Cluster File Systems. Kartik Kulkarni, Kai Ren, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-101, January 2013.
    Abstract / PDF [679K]

2012

  • CMU-PDL-12-116: MemC3: Compact and Concurrent MemCache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012.
    Abstract / PDF [824K]

  • CMU-PDL-12-115: HPC Computation on Hadoop Storage with PLFS. Chuck Cranor, Milo Polte, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-115. November 2012.
    Abstract / PDF [170K]

  • CMU-PDL-12-114: Not Published

  • CMU-PDL-12-113: Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • CMU-PDL-12-112: JackRabbit: Improved Agility In Elastic Distributed Storage. James Cipar, Lianghong Xu, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-112, October 2012.
    Abstract / PDF [395K]

  • CMU-PDL-12-111: A Case for Scaling HPC Metadata Performance through De-specialization. Swapnil Patil, Kai Ren, Garth Gibson. 7th Petascale Data Storage Workshop held in conjunction with Supercomputing '12, November 12, 2012. Salt Lake City, UT. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-111, November 2012.
    Abstract / PDF [512K]

  • CMU-PDL-12-110: TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-110, September 2012.
    Abstract / PDF [1.43M]

  • CMU-PDL-12-109: A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-109. September 2012. Superseded by CMU-PDL-13-111, August 2013.
    Abstract / PDF [2.3M]

  • CMU-PDL-12-108: Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-108. July 2012.
    Abstract / PDF [363K]

  • CMU-PDL-12-107: Light-weight Black-box Failure Detection for Distributed Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-107. July 2012.
    Abstract / PDF [300K]

  • CMU-PDL-12-106: Hadoop's Adolescence: A Comparative Workload Analysis from Three Research Clusters. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-106. June 2012.
    Abstract / PDF [1.76M]

  • CMU-PDL-12-105: Shingled Magnetic Recording for Big Data Applications. Anand Suresh, Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-105. May 2012.
    Abstract / PDF [561K]

  • CMU-PDL-12-104: SkyeFS: Distributed Directories using Giga+ and PVFS. Anthony Chivetta, Swapnil Patil & Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-104, May 2012.
    Abstract / PDF [398K]

  • CMU-PDL-12-103: TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-103, May 2012.
    Abstract / PDF [339K]

  • CMU-PDL-12-102: Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-102. May 2012. Superseded by CMU-PDL-13-104.
    Abstract / PDF [1.93M]

  • CMU-PDL-12-101: Concurrent Systematic Testing at Scale. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-101. May 2012.
    Abstract / PDF [397K]

  • CMU-CS-12-118: Landslide: Systematic Dynamic Race Detection in Kernel Space. Ben Blum. Carnegie Mellon University School of Computer Science MS Thesis CMU-CS-12-118. May 2012.
    Abstract / PDF [1.7M]

  • CMU-CS-12-111R: SOFTScale: Stealing Opportunistically For Transient Scaling. Anshul Gandhi, Timothy Zhu, Mor Harchol-Balter, Michael Kozuch. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-111R, August 2012.
    Abstract / PDF [477K]

  • CMU-CS-12-103: Near-Real-Time Inference of File-Level Mutations from Virtual Disk Writes. Wolfgang Richter, Mahadev Satyanarayanan, Jan Harkes, Benjamin Gilbert. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-103. February 2012.
    Abstract / PDF [343K]

  • ISTC-CC-TR-12-101: Towards Understanding Heterogeneous Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. Intel Science and Technology Center for Cloud Computing Technical Report ISTC-CC-TR-12-101, April 27, 2012.
    Abstract / PDF [876K]

2011

  • CMU-PDL-11-115: Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • CMU-PDL-11-114: Persistent, Protected and Cached: Building Blocks for Main Memory Data Stores. Iulian Moraru, David G. Andersen, Michael Kaminsky, Nathan Binkert, Niraj Tolia, Reinhard Munz, Parthasarathy Ranganathan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-114v2, Nov. 2012. Supersedes CMU-PDL-11-114. Dec. 2011.
    Abstract / PDF [1.0M]

  • CMU-PDL-11-113: Efficient Exploratory Testing of Concurrent Systems. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey (Google). Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-113, November 2011.
    Abstract / PDF [786K]

  • CMU-PDL-11-112: DiskReduce: Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-112, October, 2011.
    Abstract / PDF [897K]

  • CMU-PDL-11-111: YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Swapnil Patil, Milo Polte, Kai Ren, Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, Garth Gibson, Adam Fuchs, Billie Rinaldi. Proc. of the 2nd ACM Symposium on Cloud Computing (SOCC '11), October 27–28, 2011, Cascais, Portugal. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-111, August 2011.
    Abstract / PDF [1.2M]

  • CMU-PDL-11-110: Diagnosis in Automotive Systems: A Survey. Patrick E. Lanigan, Soila Kavulya, Priya Narasimhan, Thomas E. Fuhrman, Mutasim A. Salman. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-110. June 2011.
    Abstract / PDF [369K]

  • CMU-PDL-11-109: Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-109, April 2011.
    Abstract / PDF [787K]

  • CMU-PDL-11-108: On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • CMU-PDL-11-107: Principles of Operation for Shingled Disk Devices. Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • CMU-PDL-11-106: Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011. Superseded by MapReduce'11, June 8, 2011, San Jose, California, USA.
    Abstract / PDF [2.5M]

  • CMU-PDL-11-105: Exertion-based Billing for Cloud Storage Access. Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-105. March 2011. Superseded by Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '11). June 14-15, 2011, Portland, OR.
    Abstract / PDF [181K]

  • CMU-PDL-11-104: Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-104. April 2011.
    Abstract / PDF [6.5M]

  • CMU-PDL-11-103: Applying Simple Performance Models to Understand Inefficiencies in Data-Intensive Computing. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-103. February 2011.
    Abstract / PDF [476K]

  • CMU-PDL-11-102: Disks Are Like Snowflakes: No Two Are Alike. Elie Krevat, Joseph Tucek, Gregory R. Ganger. 13th Workshop on Hot Topics in Operating Systems (HotOS 2011), Napa Valley, CA. May 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-102, February 2011.
    Abstract / PDF [1.8M]

  • CMU-PDL-11-101: Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
    Abstract / PDF [336K]

  • CMU-CS-11-134: Performance Insulation: More Predictable Shared Storage. Matthew Wachs. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-11-134. September 2011.
    Abstract / PDF [2.65M]

  • CMU-CS-11-120: End-to-end Tracing in HDFS. William Wang. Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
    Abstract / PDF [489K]

2010

  • CMU-PDL-10-113: Improving Storage Bandwidth Guarantees with Performance Insulation. Matthew Wachs, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-113, October 2010.
    Abstract / PDF [285K]

  • CMU-PDL-10-112: SmartScan: Efficient Metadata Crawl for Storage Management Metadata Querying in Large File Systems. Likun Liu, Lianghong Xu, Yongwei Wu, Guangwen Yang, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-112, Oct. 2010.
    Abstract / PDF [366K]

  • CMU-PDL-10-111: Not Published

  • CMU-PDL-10-110: Scale and Concurrency of GIGA+: File System Directories with Millions of Files. Swapnil Patil, Garth Gibson. Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose CA, February 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-110, Sept. 2010.
    Abstract / PDF [508K]

  • CMU-PDL-10-109: Speeding Up Finite Element Wave Propagation for Large-Scale Earthquake Simulations. Ricardo Taborda, Julio López, Haydar Karaoglu, John Urbanic, Jacobo Bielak. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-109, October 2010.
    Abstract / PDF [4.4M]

  • CMU-PDL-10-108: Applying Performance Models to Understand Data-intensive Computing Efficiency. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-108. May 2010.
    Abstract / PDF [304K]

  • CMU-PDL-10-107: Diagnosing Performance Changes by Comparing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-107. July 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-103.
    Abstract / PDF [503K]

  • CMU-PDL-10-106: Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010. Superseded by ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN.
    Abstract / PDF [944K]

  • CMU-PDL-10-105: File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-105, April 2010.
    Abstract / PDF [513K]

  • CMU-PDL-10-104: DiscFinder: A data-intensive scalable cluster finder for astrophysics. Bin Fu, Kai Ren, Julio López, Eugene Fink, and Garth Gibson. In Proceedings of the ACM International Symposium on High Performance Distributed Computing (HPDC), Chicago, IL. June, 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-104.
    Abstract / PDF [372K]

  • CMU-PDL-10-103: Diagnosing Performance Problems by Visualizing and Comparing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Elie Krevat, Spencer Whitman, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-103, February 2010. Superseded by Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-107.
    Abstract / PDF [503K]

  • CMU-PDL-10-102: A Transparently-Scalable Metadata Service for the Ursa Minor Storage System. Shafeeq Sinnamohideen, Raja R. Sambasivan, James Hendricks, Likun Liu, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-102. March 2010. Superseded by USENIX Annual Technical Conference, Boston, MA, June 23-25, 2010.
    Abstract / PDF [400K]

  • CMU-PDL-10-101: Not Published

  • CMU-CS-10-141: Reusing Migration to Simply and Efficiently Implement Multi-server Operations in Transparently Scalable Storage Systems. Shafeeq Sinnamohideen. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-10-141. May 2010.
    Abstract / PDF [926K]

2009

  • CMU-PDL-09-113: Putting Home Data Management into Perspective. Brandon Watts Salmon. Carnegie Mellon University ECE Ph.D. Dissertation, CMU-PDL-09-113, August 17, 2009.
    Abstract / PDF [3.6M]

  • CMU-PDL-09-112: DiskReduce: RAID for Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth Gibson. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-112, November 2009.
    Abstract / PDF [304K]

  • CMU-PDL-09-111: ...And eat it too: High read performance in write-optimized HPC I/O middleware file formats. Milo Polte, Jay Lofstead, John Bent, Garth Gibson, Scott A. Klasky, Qing Liu, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Meghan Wingate, Matthew Wolf. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-111, November 2009.
    Abstract / PDF [388K]

  • CMU-PDL-09-110: Access Control for Home Data Sharing: Attitudes, Needs and Practices. Michelle L. Mazurek, J.P. Arsenault, Joanna Bresee, Nitin Gupta, Iulia Ion, Christina Johns, Daniel Lee, Yuan Liang, Jenny Olsen, Brandon Salmon, Richard Shay, Kami Vaniea, Lujo Bauer, Lorrie Faith Cranor, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-110. October 2009. Superseded by CHI 2010, April 10 – 15, 2010, Atlanta, Georgia.
    Abstract / PDF [250K]

  • CMU-PDL-09-109: File System Virtual Appliances. Michael Abd-El-Malek. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-109, August 2009.
    Abstract / PDF [1.15M]

  • CMU-PDL-09-108: Delayed Instantiation Bulk Operations for Management of Distributed, Object-based Storage Systems. Andrew J. Klosterman. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-108, August 2009.
    Abstract / PDF [2M]

  • CMU-PDL-09-107: An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-107. December 2009. Superseded by 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010). May 17-20, 2010, Melbourne, Victoria, Australia.
    Abstract / PDF [832K]

  • CMU-PDL-09-106: No Downtime for Data Conversions: Rethinking Hot Upgrades. Tudor Dumitraş, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-106. July 2009.
    Abstract / PDF [855K]

  • CMU-PDL-09-105: Not Published

  • CMU-PDL-09-104: Directions for Shingled-Write and Two-Dimensional Magnetic Recording System Architectures: Synergies with Solid-State Disks. Garth Gibson, Milo Polte. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-104. May 2009.
    Abstract / PDF [70K]

  • CMU-PDL-09-103: Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Workshop on Hot Topics in Cloud Computing (HotCloud '09), San Diego, CA, on June 15, 2009. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-103, May 2009.
    Abstract / PDF [373K]

  • CMU-PDL-09-102: File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-102. May 2009.
    Abstract / PDF [486K]

  • CMU-PDL-09-101: A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-101, Feb. 2009.
    Abstract / PDF [317K]

  • CMU-CS-09-146: Efficient Byzantine Fault Tolerance for Scalable Storage and Services. James Hendricks. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-09-146. July 2009.
    Abstract / PDF [1.1M]

2008

  • CMU-PDL-08-115: A Fault Model for Upgrades in Distributed Systems. Tudor Dumitraş, Soila Kavulya, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-115, December 2008.
    Abstract / PDF [275K]

  • CMU-PDL-08-114: Data-intensive file systems for Internet services: A rose by any other name ... Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-114. October 2008.
    Abstract / PDF [350K]

  • CMU-PDL-08-113: Co-scheduling of Disk Head Time in Cluster-based Storage. Matthew Wachs, Gregory R. Ganger. 28th International Symposium On Reliable Distributed Systems, September 27-30, 2009. Niagara Falls, New York, U.S.A. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-113. October 2008.
    Abstract / PDF [245K]

  • CMU-PDL-08-112: Ganesha: Black-Box Fault Diagnosis for MapReduce Systems. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-112. September 2008. Superseded by Workshop on Hot Topics in Measurement and Modeling of Computer Systems (HotMetrics 2009), Seattle, WA (June 2009).
    Abstract / PDF [180K]

  • CMU-PDL-08-111: SALSA: Analyzing Logs as StAte Machines. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Analysis of System Logs (WASL), San Diego, CA (December 2008). Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-111. September 2008.
    Abstract / PDF [630K]

  • CMU-PDL-08-110: GIGA+: Scalable Directories for Shared File Systems. Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-110. October 2008.
    Abstract / PDF [400K]

  • CMU-PDL-08-109: Characterizing HEC Storage Systems at Rest. Shobhit Dayal. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-109, July 2008.
    Abstract / PDF [603K]

  • CMU-PDL-08-108: FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

  • CMU-PDL-08-107: User Level Implementation of Scalable Directories (GIGA+). Sanket Hase, Aditya Jayaraman, Vinay K. Perneti, Sundararaman Sridharan, Swapnil V. Patil, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-107, May 2008.
    Abstract / PDF [1.67M]

  • CMU-PDL-08-106: File System Virtual Appliances: Third-party File System Implementations without the Pain. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-106, May 2008.
    Abstract / PDF [508K]

  • CMU-PDL-08-105: Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. 7th USENIX Conference on File and Storage Technologies (FAST '09). Feb. 24-27, 2009. San Francisco, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-105, May 2008.
    Abstract / PDF [275K]

  • CMU-PDL-08-104: ASDF: Automated, Online Fingerpointing for Hadoop. Keith Bare, Michael P. Kasick, Soila Kavulya, Eugene Marinelli, Xinghao Pan, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-104. May 2008.
    Abstract / PDF [650K]

  • CMU-PDL-08-103: RAMS and BlackSheep: Inferring White-box Application Behavior Using Black-box Techniques. Jiaqi Tan, Priya Narasimhan. School of Computer Science Senior Honors Thesis and Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-103, May, 2008.
    Abstract / PDF [1.7M]

  • CMU-PDL-08-102: Using Utility Functions to Control a Distributed Storage System. John D. Strunk. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-08-102, May 2008.
    Abstract / PDF [940K]

  • CMU-PDL-08-101: The DiskSim Simulation Environment Version 4.0 Reference Manual. John S. Bucy, Jiri Schindler, Steven W. Schlosser, Gregory R. Ganger, and Contributors. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-101, May 2008.
    Abstract / PDF [704K] / Code Release

2007

  • CMU-PDL-07-108: On Modeling the Relative Fitness of Storage. Michael P. Mesnier. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-108, December 19, 2007.
    Abstract / PDF [1.16M]

  • CMU-PDL-07-107: Learning to Share: A Study of Sharing Among Home Storage Devices. Brandon Salmon, Frank Hady, Jay Melican. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-107, October, 2007.
    Abstract / PDF [726K]

  • CMU-PDL-07-106: Using Utility to Provision Storage Systems. John D. Strunk, Eno Thereska, Christos Faloutsos, Gregory R. Ganger. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-106, September 2007.
    Abstract / PDF [310K]

  • CMU-PDL-07-105: Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

  • CMU-PDL-07-104: Not Published

  • CMU-PDL-07-103: Enabling What-if Explorations in Systems. Eno Thereska. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-103, August 2007.
    Abstract / PDF [2.35M]

  • CMU-PDL-07-102: Not Published

  • CMU-PDL-07-101: Observer: Keeping System Models from Becoming Obsolete. Eno Thereska, Dushyanth Narayanan, Anastassia Ailamaki, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-101, January 2007.
    Abstract / PDF [75K]

2006

  • CMU-PDL-06-111: Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You? Bianca Schroeder, Garth A. Gibson. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-111, September 2006.
    Abstract / PDF [272K]

  • CMU-PDL-06-110: Putting Home Storage Management into Perspective. Brandon Salmon, Steven W. Schlosser, Lily B. Mummert, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-110, September 2006.
    Abstract / PDF [382K]

  • CMU-PDL-06-109: Routing, Disjoint Paths, and Classification. Shuheng Zhou. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-06-109, August 2006.
    Abstract / PDF [971K] / Postscript [1.1M]

  • CMU-PDL-06-108: //TRACE: Parallel Trace Replay with Approximate Causal Events. Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio Lopez, James Hendricks, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-108, September 2006.
    Abstract / PDF [187K]

  • CMU-CS-06-147: Using Context to Assist in Personal File Retrieval. Craig A. N. Soules. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-06-147, August 25, 2006.
    Abstract / PDF [681K]

  • CMU-PDL-06-107: Group Communication: Helping or Obscuring Failure Diagnosis? Soila Pertet, Rajeev Gandhi and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-107, June, 2006.
    Abstract / PDF [591K]

  • CMU-PDL-06-106: Argon: Performance Insulation for Shared Storage Servers. Matthew Wachs, Michael Abd-El-Malek, Eno Thereska, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-106, May 2006.
    Abstract / PDF [167K]

  • CMU-PDL-06-105: Eliminating Cross-server Operations in Scalable File Systems. James Hendricks, Shafeeq Sinnamohideen, Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-105, May 2006.
    Abstract / PDF [254K]

  • CMU-PDL-06-104: Improving Small File Performance in Object-based Storage. James Hendricks, Raja R. Sambasivan, Shafeeq Sinnamohideen, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-104, May 2006.
    Abstract / PDF [1.45M]

  • CMU-PDL-06-103: Towards Efficient Semantic Object Storage for the Home. Brandon Salmon, Steven W. Schlosser, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-103, May 2006.
    Abstract / PDF [297K]

  • CMU-CS-06-123: Dimorphic Computing. H. Andres Lagar-Cavilla, Niraj Tolia, Rajesh Balan, Eyal de Lara, M. Satyanarayanan, David O'Hallaron. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-06-123, April 2006.
    Abstract / PDF [1.7M]

  • CMU-PDL-06-102: Challenges and Opportunities in Internet Data Mining. David G. Andersen, Nick Feamster. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-06-102, Jan 2006.
    Abstract / PDF [104K]

  • CMU-PDL-06-101: Informed Data Distribution Selection in a Self-predicting Storage System. Eno Thereska, Michael Abd-El-Malek, Jay J. Wylie, Dushyanth Narayanan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-101, January 2006. Superseded in Proceedings of the International Conference on Autonomic Computing (ICAC-06), Dublin, Ireland. June 12th-16th 2006.
    Abstract / PDF [196K]

2005

  • CMU-PDL-05-112: A Large-scale Study of Failures in High-performance-computing Systems. Bianca Schroeder, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-112, December, 2005. Superseded by Proceedings of the International Conference on Dependable Systems and Networks (DSN2006), Philadelphia, PA, USA, June 25-28, 2006.
    Abstract / PDF [570K]

  • CMU-PDL-05-111: D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • CMU-PDL-05-110: Not Published

  • CMU-PDL-05-109: Causes of Failure in Web Applications. Soila Pertet and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-109. December 2005.
    Abstract / PDF [367K]

  • CMU-PDL-05-108: A read/write protocol family for versatile storage infrastructures. Jay J. Wylie. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-05-108, October 2005.
    Abstract / Postscript [1.95M] / PDF [1.5M]

  • CMU-PDL-05-107: Correctness of the Read/Conditional-Write and Query/Update Protocols. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-107, September, 2005.
    Abstract / PDF [392K]

  • CMU-PDL-05-106: Modeling the Relative Fitness of Storage Devices. Michael Mesnier, Matthew Wachs, Gregory Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-106, August, 2005.
    Abstract / PDF [190K]

  • CMU-PDL-05-105: Connections: Using Context to Enhance File Search. Craig A. N. Soules, Gregory R. Ganger. SOSP'05, October 23–26, 2005, Brighton, United Kingdom. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-105, April 2005.
    Abstract / PDF [300K]

  • CMU-PDL-05-104: Ursa Minor: Versatile Cluster-based Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad, Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-104, April, 2005.
    Abstract / PDF [490K]

  • CMU-CS-05-151: The Seductive Appeal of Thin Clients. Niraj Tolia, David G. Andersen, M. Satyanarayanan. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-05-151. February 2005.
    Abstract / Postscript [250K] / PDF [300K]

  • CMU-CS-05-140: Explicit Compiler-based Memory Management for Out-of-core Applications. Angela Demke Brown. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-05-140, May 2005.
    Abstract / PDF [1.3M]

  • CMU-PDL-05-103: Empirical Analysis of Rate Limiting Mechanisms. Cynthia Wong, Stan Bielski, Ahren Studer, Chenxi Wang. 8th International Symposium on Recent Advances in Intrusion Detection (RAID 2005), September 7-9, 2005, Seattle, Washington. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-103, March 2005.
    Abstract / PDF [207K]

  • CMU-PDL-05-102: MultiMap: Preserving disk locality for multidimensional datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Christos Faloutsos, and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005. Superseded by IEEE 23rd International Conference on Data Engineering (ICDE 2007) Istanbul, Turkey, April 2007.
    Abstract / PDF [318K]

  • CMU-PDL-05-101: Towards self-predicting systems: What if you could ask “what-if”? Eno Thereska, Dushyanth Narayanan, Gregory R. Ganger. 3rd International Workshop on Self-adaptive and Autonomic Computing Systems. Copenhagen, Denmark, August 2005. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-05-101, February 2005.
    Abstract / PDF [110K]

2004

  • CMU-PDL-04-112: Cluster Scheduling for Explicitly-speculative Tasks. David Petrou. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-112, December 2004.
    Abstract / PDF [4.2M] / Simulator Code

  • CMU-PDL-04-111: Efficient, scalable consistency for highly fault-tolerant storage. Garth Goodson. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-111, August 2004.
    Abstract / PDF [1.5M]

  • CMU-PDL-04-109: Challenges in Building a Two-Tiered Learning Architecture for Disk Layout. Brandon Salmon, Eno Thereska, Craig A.N. Soules, John D. Strunk, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-109. August, 2004.
    Abstract / Postscript [6.8M] / PDF [150K]

  • CMU-PDL-04-108: Timing-accurate Storage Emulation: Evaluating hypothetical storage components in real computer systems. John Linwood Griffin. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-108, September 2004.
    Abstract / PDF [16M]

  • CMU-PDL-04-106: On Hierarchical Routing in Doubling Metrics. Anupam Gupta, Bruce M. Maggs, Shuheng Zhou. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-106. December, 2004.
    Abstract / Postscript [914K] / PDF [344K]

  • CMU-PDL-04-105: Toward Automatic Context-based Attribute Assignment for Semantic File Systems. Craig A. N. Soules, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-105. June 2004.
    Abstract / PDF [290K]

  • CMU-PDL-04-104: Using MEMS-based Storage Devices in Computer Systems. Steven W. Schlosser. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-104, May 2004.
    Abstract / PDF [1.1M]

  • CMU-PDL-04-103: Storage Device Performance Prediction with CART Models. Mengzhi Wang, Kinman Au, Anastassia Ailamaki, Anthony Brockwell, Christos Faloutsos, and Gregory R. Ganger. Proc. 12th Annual Meeting of the IEEE/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). Volendam, The Netherlands. October 5-7, 2004. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-103, March 2004.
    Abstract / Postscript [908K] / PDF [122K]

  • CMU-PDL-04-102: Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • CMU-CS-04-119: Decentralized Recovery for Survivable Storage Systems. Theodore Ming-Tao Wong. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-04-119. May 2004.
    Abstract / Postscript [1.6M] / PDF [715K]

  • CMU-PDL-04-101: File Classification in Self-* Storage Systems. Michael Mesnier, Eno Thereska, Daniel Ellard, Gregory R. Ganger, Margo Seltzer. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-101, January 2004. Superseded in Proceedings of the First International Conference on Autonomic Computing (ICAC-04). New York, NY. May 2004.
    Abstract / Postscript [1.6M] / PDF [80K]

2003

  • CMU-PDL-03-109: Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • CMU-PDL-03-108: Dynamic Quarantine of Internet Worms. Cynthia Wong, Chenxi Wang, Dawn Song, Stan Bielski, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-108, December 2003. Superseded by Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004.
    Abstract / Postscript [1.4M] / PDF [224K]

  • CMU-PDL-03-107: Design and Implementation of a Freeblock Subsystem. Eno Thereska, Jiri Schindler, Christopher R. Lumb, John Bucy, Brandon Salmon, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-107, December, 2003.
    Abstract / Postscript [6.5M] / PDF [165K]

  • CMU-PDL-03-106: On the Feasibility of Intrusion Detection Inside Workstation Disks. John Linwood Griffin, Adam Pennington, John S. Bucy, Deepa Choundappan, Nithya Muralidharan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-106. December, 2003.
    Abstract / Postscript [1.12M] / PDF [215K]

  • CMU-PDL-03-105: The Safety and Liveness Properties of a Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-03-105. March 2004.
    Abstract / Postscript [922K] / PDF [227K]

  • CMU-PDL-03-104: Efficient Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-104, December 2003. Superseded by Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004.
    Abstract / PDF [340K]

  • CMU-PDL-03-103: A Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-103, December 2003.
    Abstract / Postscript [925K] / PDF [321K]

  • CMU-PDL-03-102: MEMS-based storage devices and standard disk interfaces: A square peg in a round hole? Steven W. Schlosser, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-102, December, 2003. Superseded in Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [2.8M] / PDF [156K]

  • CMU-PDL-03-101: Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003. Superseded in Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / PDF [281K]

  • HCS-TR-14-03: Attribute-Based Prediction of File Properties. Daniel Ellard, Michael Mesnier, Eno Thereska, Gregory R. Ganger, Margo Seltzer. Harvard Computer Science Group Technical Report TR-14-03, December 2003.
    Abstract / Postscript [850K] / PDF [127K]

  • CMU-CS-03-204: Scheduling Explicitly-speculative Tasks. David Petrou, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Technical Report CMU-CS-03-204, November 2003.
    Abstract / Postscript [2.0M] / PDF [400K]

  • CMU-CS-03-203: Balancing Locality and Randomness in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-203, November 2003.
    Abstract / Postscript [489K] / PDF [511K]

  • CMU-CS-03-202: D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • CMU-CS-03-200: A Prototype User Interface for Coarse-Grained Desktop Access Control. A. Chris Long, Courtney Moskowitz, and Greg Ganger. Carnegie Mellon University Technical Report CMU-CS-03-200, November 2003.
    Abstract / Postscript [900K] / PDF [535K]

  • CMU-CS-03-192: A Framework for Building Unobtrusive Disk Maintenance Applications. Eno Thereska, Jiri Schindler, John Bucy, Brandon Salmon, Christopher R. Lumb, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-192, October 2003. Superseded by Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [5.1M] / PDF [148K]

  • CMU-CS-03-187: Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-187, September 2003.
    Abstract / Postscript [272K] / PDF [479K]

  • CMU-CS-03-178: Self-* Storage: Brick-based Storage with Automated Administration. Gregory R. Ganger, John D. Strunk, Andrew J. Klosterman. Published as Carnegie Mellon University Technical Report, CMU-CS-03-178, August 2003.
    Abstract / Postscript [2.2M] / PDF [558K]

  • CMU-CS-03-171: Location-based Node IDs: Enabling Explicit Locality in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-171, September 2003.
    Abstract / Postscript [282K] / PDF [160K]

  • CMU-CS-03-130: A Two-Tiered Software Architecture for Automated Tuning of Disk Layouts. Brandon Salmon, Eno Thereska, Craig A.N. Soules, Gregory R. Ganger. Carnegie Mellon University SCS Technical Report CMU-CS-03-130. Superseded by Proceedings of the First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003.
    Abstract / Postscript [718K] / PDF [405K]

  • CMU-CS-03-129: A Human Organization Analogy for Self-* Systems. John D. Strunk, Gregory R. Ganger. Carnegie Mellon University SCS Technical Report CMU-CS-03-129. Superseded by Proceedings of the First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003.
    Abstract / Postscript [273K] / PDF [68K]

  • CMU-CS-03-127: Efficient Consistency for Erasure-coded Data via Versioning Servers. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-127, April 2003.
    Abstract / Postscript [290K] / PDF [160K]

  • CMU-CS-03-125: Exposing and Exploiting Internal Parallelism in MEMS-based Storage. Steven W. Schlosser, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-125, March 2003.
    Abstract / Postscript [1.67M] / PDF [136K]

  • CMU-CS-03-124: Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003. Superseded by Proceedings, VLDB 03, Berlin, Germany, Sept 9-12, 2003.
    Abstract / Postscript [510K] / PDF [152K]

  • CMU-CS-03-116: Why Can’t I Find My Files? New methods for automating attribute assignment. Craig A.N. Soules, Greg Ganger. Carnegie Mellon University Technical Report CMU-CS-03-116, February 2003. Superseded by Proceedings of the Ninth Workshop on Hot Topics in Operating systems, USENIX Association, May 2003.
    Abstract / PDF [75K]

  • CMU-CS-03-109: Finding and Containing Enemies Within the Walls with Self-securing Network Interfaces. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. Carnegie Mellon University Technical Report CMU-CS-03-109, January 2003.
    Abstract / Postscript [963K] / PDF [118K]

2002

  • CMU-CS-02-205: Adaptive, Hands-Off Stream Mining. Spiros Papadimitriou, Anthony Brockwell, Christos Faloutsos. Carnegie Mellon University SCS Technical Report CMU-CS-02-205. Superseded by Proceedings VLDB 03, Berlin, Germany, Sept 9-12, 2003.
    Abstract / Postscript [3.6M] / PDF [767K]

  • CMU-CS-02-186: My Cache or Yours? Making Storage More Exclusive. Theodore M. Wong, John Wilkes. CMU SCS Tech. Report CMU-CS-02-186, which supersedes CMU-CS-00-157, originally published in November 2000. Both superseded by Proceedings USENIX Annual Technical Conference (USENIX 2002), pp. 161-175, 10-15 June 2002, Monterey, CA.
    Abstract / Postscript [759K] / PDF [253K]

  • CMU-CS-02-183: Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • CMU-CS-02-180: Decentralized Storage Consistency via Versioning Servers. Garth Goodson, Jay Wylie, Greg Ganger & Mike Reiter. Carnegie Mellon University Technical Report CMU-CS-02-180, September 2002.
    Abstract / PDF [210K]

  • CMU-CS-02-179: Storage-based Intrusion Detection: Watching Storage Activity For Suspicious Behavior. Adam Pennington, John Strunk, John Griffin, Craig Soules, Garth Goodson, Greg Ganger. Carnegie Mellon University Technical Report CMU-CS-02-179, September 2002. Superseded by Proceedings 12th USENIX Security Symposium, Washington, D.C., Aug 4-8, 2003.
    Abstract / Postscript [727K] / PDF [138K]

  • CMU-CS-02-145: Metadata Efficiency in a Comprehensive Versioning File System. Craig A. N. Soules, Garth R. Goodson, John D. Strunk, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-145, May 2002. Superseded by Proceedings 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, Mar 31 - Apr 2, 2003.
    Abstract / Postscript [817K] / PDF [178K]

  • CMU-CS-02-144: Self-Securing Network Interfaces: What, Why and How. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. CMU SCS Technical Report CMU-CS-02-144, May 2002.
    Abstract / Postscript [952K] / PDF [472K]

  • CMU-CS-02-140: Intrusion Detection, Diagnosis, and Recovery with Self-Securing Storage. John D. Strunk, Garth R. Goodson, Adam G. Pennington, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-140, May 2002.
    Abstract / Postscript [1.1M] / PDF [119K]

  • CMU-CS-02-129: On Correlated Failures in Survivable Storage Systems. Mehmet Bakkaloglu, Jay J. Wylie, Chenxi Wang, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-129. May 2002.
    Abstract / Postscript [1.8M] / PDF [475K]

  • CMU-CS-02-115: Fractal Prefetching B+trees: Optimizing Both Cache and Disk Performance. Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, and Gary Valentin. CMU SCS Technical Report CMU-CS-02-115. Superseded by Proceedings SIGMOD 2002, June 2002, Madison, Wisc.
    Abstract / Postscript [2.2M] / PDF [335K]

  • CMU-CS-02-114: Verifiable Secret Redistribution for Threshold Sharing Schemes. Theodore M. Wong, Chenxi Wang, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-02-114, February 2002 (revised).
    Abstract / Postscript [156K] / PDF [210K]

  • CMU-CS-02-113: Affinity Scheduling in Staged Server Architectures. Stavros Harizopoulos and Anastassia Ailamaki. Carnegie Mellon University Technical Report CMU-CS-02-113, March 2002.
    Abstract / Postscript [982K] / PDF [186K]

  • CMU-CS-02-103: Examining Semantics In Multi-Protocol Network File Systems. Edward P. A. Hogan, Garth A. Gibson, and Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-103, January 2002.
    Abstract / Postscript [981K] / PDF [408K]

2001

  • CMU-CS-01-172: Using Speculative Execution to Automatically Hide I/O Latency. Fay W. Chang. Carnegie Mellon Ph.D Dissertation CMU-CS-01-172, December 2001.
    Abstract / Postscript [4.9M] / PDF [945K]

  • CMU-CS-01-166: Blurring the Line Between OSes and Storage Devices. Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-166, December 2001.
    Abstract / Postscript [2.3M] / PDF [974K]

  • CMU-CS-01-155: Verifiable Secret Redistribution. Theodore M. Wong, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-01-155, October 2001.
    Abstract / Postscript [198K] / PDF [168K]

  • CMU-CS-01-149: Freeblock Scheduling Outside of Disk Firmware. Christopher R. Lumb, Jiri Schindler, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-149. Superseded by Proceedings Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA.
    Abstract / Postscript [643K] / PDF [150K]

  • CMU-CS-01-146: Timing-accurate Storage Emulation. John Linwood Griffin, Jiri Schindler, Steven W. Schlosser, John S. Bucy, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-146. Superseded by Proceedings Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA.
    Abstract / Postscript [464K] / PDF [102K]

  • CMU-CS-01-123: Authentication Confidences. Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-123, May 2001.
    Abstract / Postscript [335K] / PDF [42K]

  • CMU-CS-01-120: Selecting the Right Data Distribution Scheme for a Survivable Storage System. Jay J. Wylie, Mehmet Bakkaloglu, Vijay Pandurangan, Michael W. Bigrigg, Semih Oguz, Ken Tew, Cory Williams, Gregory R. Ganger, Pradeep K. Khosla. CMU SCS Technical Report CMU-CS-01-120, May 2001.
    Abstract / Postscript [10.9M] / PDF [1.4M]

  • CMU-CS-01-119: Track-aligned Extents: Matching Access Patterns to Disk Drive Characteristics. Jiri Schindler, John Linwood Griffin, Christopher R. Lumb, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-119. Superseded by Proceedings Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA.
    Abstract / Postscript [682K] / PDF [159K]

  • CMU-CS-01-105: PASTENSE: a Fast Start-up Algorithm for Scalable Video Libraries. Stavros Harizopoulos and Garth A. Gibson. CMU SCS Technical Report CMU-CS-01-105, March 2001.
    Abstract / Postscript [784K] / PDF [184K]

  • CMU-CS-01-101: Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic. M. Wang, T. Madhyastha, N.H. Chan, S. Papadimitriou, C. Faloutsos. Carnegie Mellon University SCS Technical Report CMU-CS-01-101. Superseded by Proceedings 18th International Conference on Data Engineering, February 26-March 1, 2002, San Jose, California.
    Abstract / Postscript [2.25M] / PDF [358K]

2000

  • CMU-CS-00-178: Scalable and Manageable Storage Systems. Khalil S. Amiri. Carnegie Mellon Ph.D. Dissertation, CMU-CS-00-178, December, 2000.
    Abstract / Postscript [M] / PDF [1.7M]

  • CMU-CS-00-177: Improving Index Performance through Prefetching. Shimin Chen, Phillip B. Gibbons, and Todd C. Mowry. CMU SCS Technical Report CMU-CS-00-177. Superseded by Proceedings of SIGMOD 2001, Santa Barbara, CA, May 2001.
    Abstract / Postscript [1.8M] / PDF [323K]

  • CMU-CS-00-174: Enabling Dynamic Security Management of via Device-Embedded Security. Gregory R. Ganger and David F. Nagle. CMU SCS Technical Report CMU-CS-00-174, December 2000.
    Abstract / PDF [607K]

  • CMU-CS-00-173: Building Firewalls with Intelligent Network Interface Cards. David Friedman and David Nagle. CMU SCS Technical Report CMU-CS-00-173, May 2001.
    Abstract / Postscript [540K] / PDF [229K]

  • CMU-CS-00-157: My Cache or Yours? Making Storage More Exclusive. T. Wong, G. R. Ganger, J. Wilkes. CMU-CS-00-157, CMU SCS Technical Report, November 2000. Superseded by CMU SCS Tech. Report CMU-CS-02-186, and Proceedings USENIX Annual Technical Conference (USENIX 2002), pp. 161-175, 10-15 June 2002, Monterey, CA.
    Abstract / PDF [163K]

  • CMU-CS-00-145: Active Disk Architecture for Databases. Riedel, E., Faloutsos, C., Nagle, D.F. Carnegie Mellon Technical Report CMU-CS-00-145, May 2000.
    Abstract / Postscript [948K] / PDF [201K]

  • CMU-CS-00-137: Designing Computer Systems with MEMS-based Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. CMU SCS Technical Report CMU-CS-00-137, May 2000. Superseded by Proceedings 9th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-IX), Cambridge, MA, November 13-15, 2000.
    Abstract / Postscript [2.7M] / PDF [485K]

  • CMU-CS-00-136: Operating System Management of MEMS-based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. CMU SCS Technical Report CMU-CS-00-136, May 2000. Superseded by Proceedings 4th Symposium on Operating Systems Design & Implementation (OSDI 2000), San Diego, CA, October 23-25, 2000.
    Abstract / Postscript [2.2M] / PDF [336K]

  • CMU-CS-00-134: Secure Continuous Biometric-Enhanced Authentication. Andrew J. Klosterman and Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-134, May 2000.
    Abstract / Postscript [1.1M] / PDF [245K]

  • CMU-CS-00-130: Towards Higher Disk Head Utilization: Extracting "Free" Bandwidth From Busy Disk Drives. Lumb, C., Schindler, J., Ganger, G.R., Nagle, D.F. and Riedel, E. CMU SCS Technical Report CMU-CS-00-130, May 2000. Superseded by Proceedings of the 4th Symposium on Operating Systems Design and Implementation, 2000.
    Abstract / Postscript [2.3M] / PDF [422K]

  • CMU-CS-00-129: Design and Implementation of a Self-Securing Storage Device. John D. Strunk, Garth R. Goodson, Michael L. Scheinholtz, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-129, May 2000.
    Abstract / Postscript [401K] / PDF [330K]

  • CMU-CS-00-117: Fast and Flexible Application-level Networking on Exokernel Systems. Gregory R. Ganger, Dawson R. Engler, M. Frans Kaashoek, Héctor M. Briceño, Russell Hunt, Thomas Pinckney. CMU SCS Technical Report CMU-CS-00-117, March 2000.
    Abstract / PDF [363K]

  • CMU-CS-00-100: Modeling and Performance of MEMS-Based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. CMU SCS Technical Report CMU-CS-00-100. Superseded by Proceedings of ACM SIGMETRICS 2000, Santa Clara, California, June 17-21, 2000.
    Abstract / Postscript [3.4M] / PDF [311K]

1999

  • CMU-CS-99-177: Active Disks - Remote Execution for Network-Attached Storage. Erik Riedel. Carnegie Mellon Ph.D. Dissertation CMU-CS-99-177, November 1999.
    Abstract / Postscript.Z [5.7M] / PDF [2.7M]

  • CMU-CS-99-176: Automated Disk Drive Characterization. Schindler, J. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-176, December 1999.
    Abstract / Postscript [341K] / PDF [282K]

  • CMU-CS-99-174: Filling the Memory Access Gap: A Case for On-Chip Magnetic Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-174, December 1999.
    Abstract / Postscript [3.1M] / PDF [398K]

  • CMU-CS-99-160: Security for a High Performance Commodity Storage Subsystem. Howard Gobioff. Carnegie Mellon Ph.D. Dissertation, CMU-CS-99-160, July 1999.
    Abstract / Postscript [8.4M] / PDF [1.8M]

  • CMU-CS-99-154: Embedded Security for Network-Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-154, June 1999.
    Abstract / Postscript [5.1M] / PDF [761K]

  • CMU-CS-99-151: Data Mining on an OLTP System (Nearly) for Free. Riedel, E., Faloutsos, C., Ganger, G.R. and Nagle, D.F. CMU SCS Technical Report CMU-CS-99-151. Superseded by Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, May 14-19, 2000.
    Abstract / Postscript [1.0M] / PDF [171K]

  • CMU-CS-99-140: Dynamic Function Placement in Active Storage Clusters. Amiri, K., Petrou, D., Ganger, G.R. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-140, June 1999.
    Abstract / Postscript [730K] / PDF [220K]

  • CMU-CS-99-111: Scalable Concurrency Control and Recovery for Shared Storage Arrays. Amiri, K., Gibson, G.A. and Golding, R. CMU SCS Technical Report CMU-CS-99-111, February 1999.
    Abstract / Postscript [480K] / PDF [108K]

1998

  • CMU-CS-98-182: Integrity and Performance in Network Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-98-182, December 1998.
    Abstract / Postscript [534K] / PDF [142K]

  • CMU-CS-98-103: Selected Reports: Fall 1997 Software Systems Course. CMU SCS Technical Report CMU-CS-98-103, April 4, 1998.
    Abstract / Postscript [5.5M] / PDF [3.3M]

1997

  • CMU-CS-97-204: Informed Prefetching and Caching. Hugo Patterson. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-204, December 1997.
    Abstract / Postscript [4.8M] / PDF [2.3M]

  • CMU-CS-97-198: Active Disks: Remote Execution for Network-Attached Storage. Riedel, E. and Gibson, G.A. CMU SCS Technical Report CMU-CS-97-198, December 1997.
    Abstract / Postscript [2.1M] / PDF [123K]

  • CMU-CS-97-185: Security for Network Attached Storage Devices. Gobioff, H., Gibson, G.A. and Tygar, D. CMU SCS Technical Report CMU-CS-97-185, October, 1997.
    Abstract / Postscript [143K] / PDF [135K]

  • CMU-CS-97-181: Practical and Theoretical Issues in Prefetching and Caching. Andrew Tomkins. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-181, October 1997.
    Abstract / GZ Postscript [827K] / PDF [918K]

  • CMU-CS-97-142: RAIDframe: A Rapid Prototyping Tool for RAID Systems. William V. Courtright II, Garth Gibson, Mark Holland, LeAnn Neal Reilly, Jim Zelenka. Carnegie Mellon University Technical Report CMU-CS-97-142. June 4, 1997.
    Abstract / Postscript [980K] / PDF [620K]

  • CMU-CS-97-141: A Transactional Approach to Redundant Disk Array Implementation. William Courtright, II. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-141, April 1997.
    Abstract / Postscript [1.7M] / PDF [1.1M]

  • CMU-CS-97-118: Filesystems for Network-Attached Secure Disks. Gibson, G., Nagle, D.F., Amiri, K., Chang, F.W., Gobioff, H., Riedel, E., Rochberg, D. and Zelenka, J. CMU SCS Technical Report CMU-CS-97-118, July, 1997.
    Abstract / Postscript [220K] / PDF [94K]

1996

  • CMU-CS-96-174: A Trace-Driven Comparison of Algorithms for Parallel Prefetching and Caching. Kimbrel, T., Tomkins, A., Patterson, R.H., Bershad, B., Cao, P., Felten, E.W., Gibson, G.A., Karlin, A.R. and Li, K. CMU SCS Technical Report CMU-CS-96-174. Superseded by Proceedings of the 2nd USENIX Symposium on Operating Systems Design and Implementation (OSDI), Seattle, WA, October 28-31, 1996, pp. 19-34.
    Abstract / Postscript [670K] / PDF [344K]

  • CMU-CS-96-158: Understanding Customer Dissatisfaction with Underutilized Distributed File Servers. Riedel, E. and Gibson, G.A. CMU SCS Technical Report CMU-CS-96-158. Superseded by Proceedings of the 5th NASA Conference on Mass Storage Systems and Technologies, College Park, MD. September 1996.
    Abstract / Postscript [898K] / PDF [258K]

  • CMU-CS-96-142: A Case for Network-Attached Secure Disks. Gibson, G.A., Nagle, D.F., Amiri, K., Chang, F.W., Feinberg, E.M., Gobioff, H., Lee, C., Ozceri, B., Riedel, E. and Rochberg, D. CMU SCS Technical Report CMU-CS-96-142, September 1996.
    Abstract / Postscript [380K] / PDF [151K]

  • CMU-CS-96-137: A Structured Approach to Redundant Disk Array Implementation. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. CMU SCS Technical Report CMU-CS-96-137. Superseded by Proceedings of the International Computer Performance and Dependability Symposium (IPDS), Sept. 4-6, 1996.
    Abstract / Postscript [212K] / PDF [85K]

1995

  • CMU-CS-95-200: RAIDframe: Rapid Prototyping for Disk Arrays. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. CMU SCS Technical Report CMU-CS-95-200. Superseded by Proceedings of the 1996 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), May 1996, Vol. 24 No. 1, pp. 268-269.
    Abstract / Postscript [74K] / PDF [25K]

  • CMU-CS-95-107: The Scotch Parallel Storage Systems. Gibson, G.A., Stodolsky, D., Chang, F.W., Courtright II, W.V., Demetriou, C.G., Ginting, E., Holland, M., Ma, Q., Neal, L., Patterson, R.H., Su, J., Youssef, R. and Zelenka, J. CMU SCS Technical Report CMU-CS-95-107. Superseded by Proceedings of the IEEE CompCon Conference, March 5-8, 1995.
    Abstract / Postscript [112K] / PDF [64K]

1994

  • CMU-CS-94-193: Backward Error Recovery in Redundant Disk Arrays. Courtright II, W.V. and Gibson, G.A. CMU SCS Technical Report CMU-CS-94-193. Superseded by Proceedings of the 1994 Computer Measurement Group (CMG) Conference, Dec. 1994, Vol. 1, pp. 63-74.
    Abstract / HTML / Postscript [134K] / PDF [69K]

  • CMU-CS-94-170: A Redundant Disk Array Architecture for Efficient Small Writes. CMU SCS Technical Report CMU-CS-94-170, 1994. Superseded by Parity-Logging Disk Arrays. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. ACM Transactions on Computer Systems, 12(3):206-235, August 1994.
    Abstract / Postscript [313K] / PDF [164K]

  • CMU-CS-94-164: On-Line Data Reconstruction in Redundant Disk Arrays. Mark Holland. Carnegie Mellon Ph.D. Dissertation CMU-CS-94-164, April 1994, 232 pages.
    Abstract / Table of Contents / Postscript [1.3M] / PDF [769K]

1993

  • CMU-CS-93-200: A Redundant Disk Array Architecture for Efficient Small Writes. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. CMU SCS Technical Report CMU-CS-93-200, October, 1993.
    Abstract / Postscript [356K] / PDF [196K]

  • CMU-CS-93-113: A Status Report on Research in Transparent Informed Prefetching. Patterson, R.H., Gibson, G.A. and Satyanarayanan, M. CMU SCS Technical Report CMU-CS-93-113. Superseded by ACM Operating Systems Review, Vol. 27(2), April 1993, pp. 21-34.
    Abstract / Postscript [94K] / PDF [50K]