Recent Publications
-
Characterizing HEC Storage Systems at Rest. Shobhit Dayal. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-109, July 2008.
Abstract / PDF [603K] - IRONModel: Robust Performance Models in the Wild. Eno Thereska, Gregory R. Ganger. SIGMETRICS’08, June 2–6, 2008, Annapolis, Maryland, USA.
Abstract / PDF [813K]
- FAWN: A Fast Array of Wimpy Nodes.
David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
Abstract / PDF [875K]
- User Level Implementation of Scalable Directories (GIGA+). Sanket Hase, Aditya Jayaraman, Vinay K. Perneti, Sundararaman Sridharan, Swapnil V. Patil, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-107, May 2008.
Abstract / PDF [1.67M]
- File System Virtual Appliances: Third-party File System Implementations without the Pain. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-106, May 2008.
Abstract / PDF [508K]
- Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. Carnegie
Mellon University Parallel Data Lab Technical Report CMU-PDL-08-105, May
2008.
Abstract / PDF [1.65M]
- ASDF: Automated, Online Fingerpointing for Hadoop. Keith Bare, Michael P. Kasick, Soila Kavulya, Eugene Marinelli, Xinghao Pan, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie
Mellon University Parallel Data Lab Technical Report
CMU-PDL-08-104.
May 2008.
Abstract / PDF [650K]
- The DiskSim Simulation Environment Version 4.0 Reference Manual.John S. Bucy, Jiri Schindler, Steven W. Schlosser, Gregory R. Ganger, and Contributors. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-101, May 2008.
Abstract / PDF [704K] / Code Release
- Using Utility Functions to Control a Distributed Storage System.
John D. Strunk. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-08-102,
May 2008.
Abstract / PDF [940K]
- RAMS and BlackSheep: Inferring White-box
Application Behavior Using Black-box Techniques. Jiaqi Tan, Priya Narasimhan. School of Computer Science Senior Honors Thesis and Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-103, May, 2008.
Abstract / PDF [1.7M]
- Using Utility to Provision Storage Systems. John D. Strunk, Eno Thereska, Christos Faloutsos, Gregory R. Ganger. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-106, September 2007.
Abstract / PDF [310K]
- Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems.
Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
Abstract / PDF [374K]
- On Application-level Approaches to Avoiding TCP Throughput Collapse in Cluster-Based Storage Systems. E. Krevat, V. Vasudevan, A. Phanishayee, D. Andersen, G. Ganger, G. Gibson, S. Seshan. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07)
held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
Abstract / PDF [124K]
- GIGA+: Scalable Directories for Shared File Systems. Swapnil V. Patil, Garth A. Gibson, Sam Lang, Milo Polte. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07)
held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
Abstract / PDF [114K]
- Failure Tolerance in Petascale Computers. Garth Gibson, Bianca Schroeder, Joan Digney. CTWatch Quarterly,
vol. 3 no. 4. Volume on Software Enabling Technologies for Petascale Science. November 2007. www.ctwatch.org
PDF [686K]
- Learning to Share: A Study of Sharing Among Home Storage Devices. Brandon Salmon, Frank Hady, Jay Melican. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-107, October, 2007.
Abstract / PDF [726K]
- Low-overhead Byzantine Fault-tolerant Storage. James Hendricks, Gregory R. Ganger, Michael K. Reiter. Proceedings of the Twenty-First ACM Symposium on Operating Systems Principles (SOSP 2007), Stevenson, WA, October 2007.
Abstract / PDF [280K]
- Understanding Failures in Petascale Computers. Bianca Schroeder, Garth A. Gibson. SciDAC 2007. Journal of Physics: Conf. Ser. 78.
Abstract / PDF [712K]
- To Share Or Not To Share? Ryan Johnson, Stavros Harizopoulos, Nikos Hardavellas, Kivanc Sabirli, Ippokratis Pandis, Anastasia Ailamaki, Naju G. Mancheril, Babak Falsafi. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
Abstract / PDF [366K]
- Efficient Use of the Query Optimizer for Automated Physical Design. Stratos Papadomanolakis, Debabrata Dash, Anastasia Ailamaki. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
Abstract / PDF [2.4M]
- Enabling What-if Explorations in Systems. Eno Thereska. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-103, August 2007.
Abstract / PDF [2.35M]
- Verifying Distributed Erasure-coded Data. James Hendricks, Gregory R. Ganger, Michael K. Reiter. To appear in Proceedings of the Twenty-Sixth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2007), Portland, August 2007.
Abstract / PDF [193K]
- An Analysis of Database System Performance on Chip Multiprocessors. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Stavros Harizopoulos, Anastasia Ailamaki and Babak Falsafi. Proceedings of the 6th Hellenic Data Management Symposium (HDMS2007), Athens, Greece, July 2007.
Abstract / PDF [308K]
- Lessons Learned From the Deployment of a Smartphone-Based Access-Control System. Lujo Bauer, Lorrie Faith Cranor, Michael K. Reiter, Kami Vaniea. Symposium On Usable Privacy and Security (SOUPS) 2007, July 18-20, 2007, Pittsburgh, PA, USA.
Abstract / PDF [984K]
- Categorizing and Differencing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Eno Thereska, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL.
Abstract / PDF [120K]
- Observer: Keeping System Models from Becoming Obsolete.
Eno Thereska, Dushyanth Narayanan, Anastassia Ailamaki, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007.
Jacksonville, FL. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-101, January 2007.
Abstract / PDF[ 75K]
- Using Provenance to Aid in Personal File Search. Sam Shah, Craig A. N. Soules, Gregory R. Ganger, Brian D. Noble. USENIX '07 Annual Technical Conference, Santa Clara, CA, June 17–22, 2007.
Abstract / PDF [225K]
- Improving Mobile Database Access Over Wide-Area Networks Without Degrading Consistency. Niraj Tolia, M. Satyanarayanan, Adam Wolbach. MobiSys’07, June 11–13, 2007, San Juan, Puerto Rico, USA.
Abstract / PDF [1.1M]
- VMM-Independent Graphics Acceleration. H. Andres Lagar-Cavilla, Niraj Tolia, M. Satyanarayanan, Eyal de Lara. VEE’07, June 13–15, 2007, San Diego, California, USA.
Abstract / PDF [1.4M]
- Modeling the Relative Fitness of Storage.
Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, Gregory R. Ganger.
SIGMETRICS’07, June 12–16, 2007, San Diego, California, USA.
Abstract / PDF [235K]
- Scheduling Threads for Constructive Cache Sharing on CMPs. Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Vasileios Liaskovitis, Anastassia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Todd C. Mowry, Chris Wilkerson. SPAA'07, June 9-11, 2007, San Diego, California, USA.
Abstract / PDF [293K]
- Consistency-preserving Caching of Dynamic Database Content. Niraj Tolia and M. Satyanarayanan. International World Wide Web Conference (WWW 2007), May 8-12, 2007, Banff, Alberta, Canada.
Abstract / PDF [888K]
- Fingerpointing Correlated Failures in Replicated Systems. Soila Pertet, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), Cambridge, MA (April 2007).
Abstract / PDF [100K]
- Exploiting Similarity for Multi-Source Downloads Using File Handprints. Himabindu Pucha, David G. Andersen, Michael Kaminsky. Proceedings of the 4th Symposium on Networked Systems Design and Implementation (NSDI ’07), Cambridge, Massachusetts, April 2007.
Abstract / PDF [579K]
- MultiMap: Preserving Disk Locality for Multidimensional Datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. IEEE 23rd International Conference on Data Engineering (ICDE 2007) Istanbul, Turkey, April 2007. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005.
Abstract / PDF [203K]
- The Computer Failure Data Repository. Bianca Schroeder, Garth Gibson. Invited contribution to the Workshop on Reliability Analysis of System Failure Data (RAF'07) MSR Cambridge, UK, March 2007.
Abstract / PDF [42K]
- //TRACE: Parallel Trace Replay with Approximate Causal Events. Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio Lopez, James Hendricks, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07),
February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-108, September 2006.
Abstract / PDF[ 187K]
- Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You? Bianca Schroeder, Garth A. Gibson. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07),
February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-111, September 2006.
Abstract / PDF[ 272K]
- Argon: Performance Insulation for Shared Storage Servers. Matthew Wachs, Michael Abd-El-Malek, Eno Thereska, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07),
February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-106, May 2006.
Abstract / PDF [ 167K]
- Database Servers on Chip Multiprocessors: Limitations and Opportunities. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Anastassia Ailamaki and Babak Falsafi. 3rd Biennial Conference on Innovative Data Systems Research (CIDR), January 7-10, 2007, Asilomar, California, USA.
Abstract / PDF [111K]
- Static Analysis Meets Distributed Fault-Tolerance: Enabling State-Machine Replication with Nondeterminism. Joseph Slember, Priya Narasimhan. Proceedings of the 2nd Workshop on Hot Topics in System Dependability (HotDep '06), Seattle, WA. Nov. 8, 2006.
Abstract / PDF [98K]
- Living with Nondeterminism in Replicated Middleware Applications. Joseph Slember, Priya Narasimhan. Middleware 2006, ACM/IFIP/USENIX, 6th International Middleware Conference, Melbourne, Australia, November 27 - December 1, 2006, Proceedings. Lecture Notes in Computer Science 4290 Springer 2006.
Abstract / PDF [387K]
- Towards Fingerpointing in the Emulab Dynamic Distributed System. Michael P. Kasick, Priya Narasimhan, Kevin Atkinson, Jay Lepreau. Proceedings of the 3rd USENIX Workshop on Real, Large Distributed Systems (WORLDS '06), Seattle, WA. Nov. 5, 2006.
Abstract / PDF [311K]
- Early Experiences on the Journey Towards Self-* Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad,
Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, September 2006.
Abstract / PDF [113K] / Postscript [745K]
- Putting Home Storage Management into Perspective. Brandon Salmon, Steven W. Schlosser, Lily B. Mummert, Gregory R. Ganger.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-110, September 2006.
Abstract / PDF [382K]