Recent Publications
- Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
Abstract / PDF [174K]
- Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. USENIX ;login:, 38(2), April 2013.
Abstract / PDF [961K]
- Asymmetry-aware Execution Placement on Manycore Chips. Alexey Tumanov, Joshua Wise, Onur Mutlu, Gregory R. Ganger. In Proc. of the 3rd Workshop on Systems for Future Multicore Architectures (SFMA'13), EuroSys'13, Apr. 14-17, 2013, Prague, Czech Republic.
Abstract / PDF [703K]
- Visualizing Request-flow Comparison
to Aid Performance Diagnosis in Distributed Systems.
Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report
CMU-PDL-13-104 (supersedes CMU-PDL-12-102),
April 2013.
Abstract / PDF [1.93M]
- The Impact of Length and Mathematical Operators on the Usability and
Security of System-assigned One-time PINs.
Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Richard Shay,
Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor.
In 2013 Workshop on Usable Security (USEC), April 2013.
Abstract / PDF [802K]
- MemC3: Compact and Concurrent Memcache with Dumber Caching and Smarter Hashing.
Bin Fan, David G. Andersen and Michael Kaminsky.
In Proc. 10th USENIX NSDI, Apr 2013. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116.
November 2012. Source code: https://github.com/efficient/libcuckoo
Abstract / PDF [280K]
- Application-to-Core Mapping Policies to Reduce Memory System Interference in Multi-Core Systems. Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
Abstract / PDF [623K]
- MISE: Providing Performance Predictability and Improving Fairness in Shared Main Memory Systems. Lavanya Subramanian, Vivek Seshadri, Yoongu Kim, Ben Jaiyen, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
Abstract / PDF [607K]
- Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture.
Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen, China, February 2013.
Abstract / PDF [3.17M]
- Practical Batch-Updatable External Hashing with Sorting.
Hyeontaek Lim, David G. Andersen and Michael Kaminsky.
In Proc. Meeting on Algorithm Engineering and Experiments (ALENEX), Jan 2013.
Abstract / PDF [536K]
- TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-102, January 2013. Revised version of CMU-PDL-12-110.
Abstract / PDF [798K]
- Giga+TableFS on PanFS: Scaling Metadata Performance on Cluster File Systems. Kartik Kulkarni, Kai Ren, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-101, January 2013.
Abstract / PDF [679K]
- vQuery: A Platform for Connecting Configuration and Performance.
Ilari Shafer, Snorri Gylfason, Gregory R. Ganger.
VMware Labs Technical Report, Palo Alto, CA. December 2012.
Abstract / PDF [288K]
- Helping Users Create Better Passwords.
Blase Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass,
Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo
Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman, and Julio
López.
USENIX ;login:, 37(6), December 2012.
Abstract / PDF [970K]
- Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 26th Usenix Large Installation System Administration Conference (LISA'12), Dec. 9-14, San Diego, CA.
Abstract / PDF [913K]
- Failure Diagnosis of Complex Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Felicita Di Giandomenico (ISTI-CNR, Pisa, Italy), Priya Narasimhan. Chapter in "Resilience Assessment and Evaluation". Editors. Katinka Wolter, Alberto Avritzer, Marco Vieira, Aad van Moorsel. Springer Verlag, December 2012.
Abstract / PDF [288K]
- Runtime Estimation and Resource Allocation for
Concurrency Testing.
Jiri Simsa, Randy Bryant, Garth Gibson.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113.
December 2012.
Abstract / PDF [490K]
- TABLEFS: Embedding a NoSQL Database inside the Local File System. Kai Ren, Garth Gibson. 1st Storage System, Hard Disk and Solid State Technologies Summit, IEEE Asia-Pacific Magnetic Recording Conference (APMRC), November 2012, Singapore.
Abstract / PDF [399K]
- MemC3: Compact and Concurrent MemCache with Dumber Caching and
Smarter Hashing.
Bin Fan, David G. Andersen, Michael Kaminsky.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116.
November 2012.
Abstract / PDF [824K]
- HPC Computation on Hadoop Storage with PLFS.
Chuck Cranor, Milo Polte, Garth Gibson.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-115.
Nov. 2012.
Abstract / PDF [170K]
- A Case for Scaling HPC Metadata Performance through
De-specialization.
Swapnil Patil,
Kai Ren,
Garth Gibson. 7th Petascale Data Storage Workshop held in conjunction with Supercomputing '12, November 12, 2012. Salt Lake City, UT. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-111, November 2012.
Abstract / PDF [512K]
- JackRabbit: Improved Agility In Elastic Distributed Storage. James Cipar, Lianghong Xu, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-112, October 2012.
Abstract / PDF [395K]
- HAT: Heterogeneous Adaptive Throttling for On-Chip Networks. Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu. SBAC-PAD 2012, New York, NY, October 24-26, 2012.
Abstract / PDF [259K]
- alsched: Algebraic Scheduling of Mixed Workloads in Heterogeneous Clouds. Alexey Tumanov, James Cipar, Michael A. Kozuch, Gregory R. Ganger. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
Abstract / PDF [379K]
- Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
Abstract / PDF [3.1M]
- Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
Abstract / PDF [648K]
- Scalable Dynamic Partial Order Reduction. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey. Third Int. Conf. on Runtime Verification (RV2012), 25-28 September 2012, Istanbul, Turkey.
Abstract / PDF [331K]
- The Evicted-Address Filter: A Unified Mechanism to Address Both Cache Pollution and Thrashing. Vivek Seshadri, Onur Mutlu, Michael A Kozuch, Todd C Mowry. PACT'12, September 19–23, 2012, Minneapolis, Minnesota, USA.
Abstract / PDF [2M]
- TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-110, September 2012.
Abstract / PDF [1.43M]
- A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-109. September 2012.
Abstract / PDF [314K]
- Indexing and Fast Near-Matching of Billions of Astronomical Objects. Bin Fu, Eugene Fink, Garth Gibson and Jaime Carbonell. In Proceedings of the Fourth Workshop on Interfaces and Architecture for Scientific Data Storage, 2012 (IASDS12).
September 24, 2012, Beijing, China.
Abstract / PDF [303K]
- Row Buffer Locality Aware Caching Policies for Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael A. Harding, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Montreal, Quebec, Canada, September 2012.
Best paper award in Computer Systems and Applications track.
Abstract / PDF [577K]
- A Case for Small Row Buffers in Non-Volatile Main Memories. Justin Meza, Jing Li, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Poster Session, Montreal, Quebec, Canada, September 2012.
Abstract / PDF [172K]
- RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yoshihisa Abe, Greg Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
Abstract / PDF [1.5M]
- How Does Your Password Measure Up? The effect of strength meters on password creation. Blase Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In the 2012 USENIX Security Symposium, August 2012.
Abstract / PDF [1.2M]
- On-Chip Networks from a Networking Perspective: Congestion and Scalability in Many-core Interconnects. George Nychis, Chris Fallin, Thomas Moscibroda, Onur Mutlu, Srinivasan Seshan.
In SIGCOMM 2012, Helsinki, Finland, Aug 2012.
Abstract / PDF [628K]
- Light-weight Black-box Failure Detection for
Distributed Systems.
Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report
CMU-PDL-12-107.
July 2012.
Abstract / PDF [300K]
- Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-108. July 2012.
Abstract / PDF [363K]
- Correct Horse Battery Staple: Exploring the usability of system-assigned passphrases. Richard Shay, Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Blase Ur, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In SOUPS 2012: Symposium on Usable Privacy and Security, July 2012.
Abstract / PDF [549K]
- Hadoop's Adolescence: A Comparative Workload Analysis from Three Research Clusters. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-106. June 2012.
Abstract / PDF [1.76M]
- A Case for Exploiting Subarray-level Parallelism (SALP) in DRAM. Yoongu Kim, Vivek Seshadri, Donghyuk Lee, Jamie Liu, Onur Mutlu. Proceedings of the 39th International Symposium on Computer Architecture, June 2012.
Abstract / PDF [927K]
- RAIDR: Retention-Aware Intelligent DRAM Refresh. Jamie Liu, Ben Jaiyen, Richard Veras, Onur Mutlu. In Proceedings of the 39th International Symposium on Computer Architecture, Portland, Oregon, June 9-13th, 2012.
Abstract / PDF [480K]
- Staged Memory Scheduling: Achieving High Performance and Scalability in Heterogeneous Systems.
Rachata Ausavarungnirun, Kevin Kai-Wei Chang, Lavanya Subramanian, Gabriel H. Loh, Onur Mutlu. The 39th International Symposium on Computer Architecture (ISCA), Portland, Oregon, June 9-13th, 2012.
Abstract / PDF [700K]
- Exact and Approximate Computation of a Histogram of Pairwise Distances between Astronomical Objects. Bin Fu, Eugene Fink, Garth Gibson and Jaime Carbonell. First Workshop on High Performance Computing in Astronomy (AstroHPC 2012), held in conjunction with the 21st International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC 2012), June 18-19, 2012, Delft, the Netherlands.
Abstract / PDF [309K]
- Automated Diagnosis without Predictability is a Recipe for Failure. Raja R. Sambasivan & Gregory R. Ganger. Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '12), June 12-13, 2012, Boston, MA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-101.
Abstract / PDF [368K]
- Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. Soila P. Kavulya, Scott Daniels (AT&T), Kaustubh Joshi (AT&T), Matti Hiltunen (AT&T), Rajeev Gandhi, Priya Narasimhan. IEEE/IFIP Conference on Dependable Systems and Networks (DSN), June 2012.
Abstract / PDF [859K]
- File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. ACM Transactions on Storage, Vol. 8, No. 3, Article 39, May 2012.
Abstract / PDF [518K]
- MinBD: Minimally-Buffered Deflection Routing for Energy-Efficient Interconnect. Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, Onur Mutlu. In NOCS 2012, Lyngby, Denmark, May 2012.
(One of five papers nominated for the Best Paper Award by the Program
Committee.)
Abstract / PDF [369K]
- Shingled Magnetic Recording for Big Data Applications. Anand Suresh, Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-105. May 2012.
Abstract / PDF [561K]
- SkyeFS: Distributed Directories using Giga+ and PVFS. Anthony Chivetta, Swapnil Patil & Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-104, May 2012.
Abstract / PDF [398K]
- Tag, You Can See It! Using tags for access control in photo sharing. Peter F. Klemperer, Yuan Liang, Michelle L. Mazurek, Manya Sleeper, Blase Ur, Lujo Bauer, Lorrie Faith Cranor, Nitin Gupta, and Michael K. Reiter. In CHI 2012: Conference on Human Factors in Computing Systems, May 2012.
Abstract / PDF [560K]
- Guess Again (and Again and Again): Measuring password strength by simulating password-cracking algorithms. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Rich Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Julio López. In the 2012 IEEE Symposium on Security and Privacy, May 2012.
Abstract / PDF [2.8M]
- Concurrent Systematic Testing at Scale. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-101. May 2012.
Abstract / PDF [397K]
- Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-102. May 2012.
Abstract / PDF [1.13M]
- Landslide: Systematic Dynamic Race Detection in Kernel Space. Ben Blum. Carnegie Mellon University School of Computer Science MS Thesis CMU-CS-12-118. May 2012.
Abstract / PDF [1.7M]
- Towards Understanding Heterogeneous Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. Intel Science and Technology Center for Cloud Computing Technical Report ISTC-CC-TR-12-101, April 27, 2012.
Abstract / PDF [876K]
- TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-103. May 2012.
Abstract / PDF [339K]
- A Statistical Study for File System Meta Data On High Performance Computing Sites. Yifan Wang. M.S. Thesis, Information Networking Institute, Carnegie Mellon University. May 2012.
Abstract / PDF [5.3M]
- Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
Abstract / PDF [184K]
- LazyBase: Trading Freshness for Performance in a Scalable Database. James Cipar, Greg Ganger, Kimberly Keeton, Charles B. Morrey III, Craig A. N. Soules, Alistair Veitch. EuroSys 2012 April 10-13, 2012, Bern, Switzerland.
Abstract / PDF [236K]
- Bottleneck Identification and Scheduling in Multithreaded Applications. José A. Joao, M. Aater Suleman, Onur Mutlu, Yale N. Patt. Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), London, UK, March 2012.
Abstract / PDF [828K]
- Near-Real-Time Inference of File-Level Mutations from Virtual Disk Writes. Wolfgang Richter, Mahadev Satyanarayanan, Jan Harkes, Benjamin Gilbert. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-103. February 2012.
Abstract / PDF [343K]
- ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gunawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
Abstract / PDF [567K]
- Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
Abstract / PDF [989K]
- Persistent, Protected and Cached: Building Blocks for Main Memory Data Stores. Iulian Moraru, David G. Andersen, Michael Kaminsky, Nathan Binkert, Niraj Tolia, Reinhard Munz, Parthasarathy Ranganathan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-114v2, Nov. 2012. Supersedes CMU-PDL-11-114. Dec. 2011.
Abstract / PDF [1.0M]
- Efficient Exploratory Testing of Concurrent Systems. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey (Google). Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-113,
November 2011.
Abstract / PDF [786K]
- A Cyber-Physical-System Approach to Data Center Modeling and Control for Energy Efficiency. Luca Parolini, Bruno Sinopoli, Bruce H. Krogh, Zhikui Wang. Proceedings of the IEEE, Special Issue on Cyber-Physical Systems, December 2011.
Abstract / PDF [1.76M]
- Reducing Memory Interference in Multicore Systems via Application-Aware Memory Channel Partitioning. Sai Prashanth Muralidhara, Lavanya Subramanian, Onur Mutlu, Mahmut Kandemir, Thomas Moscibroda. Proceedings of the 44th International Symposium on Microarchitecture
(MICRO), Porto Alegre, Brazil, December 2011.
Abstract / PDF [232K]
- Understanding and Improving the Diagnostic Workflow of MapReduce Users. Jason D. Campbell (Intel Labs Pittsburgh), Arun B. Ganesan, Ben Gotow, Soila P. Kavulya, James Mulholland, Priya Narasimhan, Sriram Ramasubramanian, Mark Shuster, Jiaqi Tan (DSO National Laboratories, Singapore). ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, December 2011.
Abstract / PDF [775K]
- On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
Abstract / PDF [459K]
- The Case for Sleep States in Servers. Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. HotPower'11, October 23, 2011, Cascais, Portugal.
Abstract / PDF [621K]
- Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. SLAML 2011, October 23, 2011, Cascais, Portugal.
Abstract / PDF [500K]
- DiskReduce: Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-112, October, 2011.
Abstract / PDF [897K]
- SILT: A Memory-Efficient, High-Performance Key-Value Store. Hyeontaek Lim, Bin Fan, David Andersen and Michael Kaminsky. ACM Symposium on Operating Systems Principles (SOSP'11), Cascais, Portugal, October 2011.
Abstract / PDF [1.15M]
- Small Cache, Big Effect: Provable Load Balancing for Randomly Partitioned Cluster Services. Bin Fan, Hyeontaek Lim, David Andersen and Michael Kaminsky. ACM Symposium on Cloud Computing (SOCC'11), Cascais, Portugal, October, 2011.
Abstract / PDF [336K]
- Switching the Optical Divide: Fundamental Challenges for Hybrid Electrical/Optical Datacenter Networks. Hamid Hajabdolali Bazzaz, Malveeka Tewari, Guohui Wang, George Porter, T. S. Eugene Ng, David G. Andersen, Michael Kaminsky, Michael A. Kozuch, Amin Vahdat. Proc. 2nd ACM Symposium on Cloud Computing (SOCC), Oct 2011.
Abstract / PDF [190K]
- Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS.
Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, David G. Andersen. Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP), Oct 2011.
Abstract / PDF [689K]
- Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya (CMU), Kaustubh Joshi, Matti Hiltunen, Scott Daniels (AT&T Labs, Research), Rajeev Gandhi and Priya Narasimhan (CMU). Workshop on System Logs and the Application of Machine Learning Techniques (SLAML), Cascais, Portugal, October 2011.
Abstract / PDF [524K]
- Performance Insulation: More Predictable Shared Storage. Matthew Wachs. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-11-134. September 2011.
Abstract / PDF [2.65M]
- Row Buffer Locality-Aware Data Placement in Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael Harding, Onur Mutlu. SAFARI Technical Report, TR-SAFARI-2011-005, Carnegie Mellon University, September 2011.
Abstract / PDF [272K]
- Improving Cache Performance Using Victim Tag Stores. Vivek Seshadri, Onur Mutlu, Todd Mowry, Michael A. Kozuch. SAFARI Technical Report, TR-SAFARI-2011-009, Carnegie Mellon University, September 2011.
Abstract / PDF [242K]
- ThermoCast: A Cyber-Physical Forecasting Model for Data Centers. Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, Christos Faloutsos. In KDD '11: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 21-24, San Diego, CA.
Abstract / PDF [1.32M]
- YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Swapnil Patil, Milo Polte, Kai Ren, Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, Garth Gibson, Adam Fuchs, Billie Rinaldi. Proc. of the 2nd ACM Symposium on Cloud Computing (SOCC '11), October 27–28, 2011, Cascais, Portugal. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-111, August 2011.
Abstract / PDF [1.2M]
- Minimizing Data Center SLA Violations and Power Consumption via Hybrid Resource Provisioning. Anshul Gandhi, Yuan Chen, Daniel Gmach, Martin Arlitt, Manish Marwah. 2nd IGCC 2011 (IEEE International Green Computing Conference 2011) July 25-28, 2011 Orlando, Florida, USA. -- BEST PAPER AWARD
Abstract / PDF [503K]
- End-to-end Tracing in HDFS. William Wang. Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
Abstract / PDF [489K]
- dBug: Systematic Testing of Distributed and Multi-threaded Systems. Jiri Simsa, Randy Bryant, Garth Gibson. 18th International Workshop on Model Checking of Software (SPIN'11), Snowbird UT, July 2011.
Abstract / PDF [149K]
- Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, and Garth Gibson. Proceedings of the Twenty-Third Scientific and Statistical Database Management Conference (SSDBM 2011), 20-22 July 2011.
Abstract / PDF [5.5M]
- Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael A. Kozuch. 5th International Open Cirrus Summit. June 01 – 03, 2011, Moscow, Russia.
Abstract / PDF [317K]
- Applying Idealized Lower-bound Runtime Models to Understand Inefficiencies in Data-intensive Computing (Extended Abstract). Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. SIGMETRICS 2011, pp. 125-126, San Jose, CA, June 7-11, 2011.
Abstract / PDF [297K]
- Privacy-Sensitive VM Retrospection. Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode, Nilton Bila, Eyal De Lara, Vas Bala, Mahadev Satyanarayanan. HotCloud 2011 3rd USENIX Workshop on Hot Topics in Cloud Computing. Portland, OR, June 14-17, 2011.
Abstract / PDF [1.97M]
- Six Degrees of Scientific Data: Reading Patterns for Extreme Scale Science IO. Jay Lofstead, Milo Polte, Garth Gibson, Scott A. Klasky, Karsten Schwan, Ron Oldfield, Matthew Wolf, Qing Liu. 20th ACM Int. Symp. On High-Performance Parallel and Distributed Computing (HPDC'11), June 2011.
Abstract / PDF [595K]
- Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
Abstract / PDF [463K]
- Time Series Clustering: Complex is Simpler! Lei Li, B. Aditya Prakash. In Proceedings of the 28th International Conference on Machine learning, June 28 - July 2, 2011, Bellevue, WA.
Abstract / PDF [631K]
- Diagnosis in Automotive Systems: A Survey. Patrick E. Lanigan, Soila Kavulya, Priya Narasimhan, Thomas E. Fuhrman, Mutasim A. Salman. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-110. June 2011.
Abstract / PDF [369K]
- Principles of Operation for Shingled Disk Devices. Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
Abstract / PDF [500K]
- Exertion-based Billing for Cloud Storage Access. Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger. Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '11). June 14-15, 2011, Portland, OR. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-105. March 2011.
Abstract / PDF [65K]
- Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth Gibson.
MapReduce'11, June 8, 2011, San Jose, California, USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
Abstract / PDF [2.5M]
- Exploring Reactive Access Control.
Michelle L. Mazurek, Peter F. Klemperer, Richard Shay, Hassan Takabi, Lujo Bauer, Lorrie Faith Cranor. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
Abstract / PDF [293K]
- Of Passwords and People: Measuring the Effect of Password-Composition Policies. Saranga Komanduri, Richard Shay, Patrick Gage Kelley, Michelle L. Mazurek, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
Abstract / PDF [405K]
- Disks Are Like Snowflakes: No Two Are Alike. Elie Krevat, Joseph Tucek, Gregory R. Ganger. 13th Workshop on Hot Topics in Operating Systems (HotOS 2011), Napa Valley, CA. May 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-102, February 2011.
Abstract / PDF [1.8M]
- The Case for VOS: The Vector Operating System. Vijay Vasudevan, David Andersen, Michael Kaminsky.
In 13th Workshop on Hot Topics in Operating Systems (HotOS 2011). May 2011.
Abstract / PDF [430K]
- WindMine: Fast and Effective Mining of Web-click Sequences. Yasushi Sakurai, Lei Li, Yasuko Matsubara, Christos Faloutsos. 2011 SIAM International Conference on Data Mining, April 28-30, 2011, Mesa, AZ.
Abstract / PDF [968K]
- Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-109, April 2011.
Abstract / PDF [787K]
- Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-104. April 2011.
Abstract / PDF [6.5M]
- The Case for Content Search of VM Clouds.
Mahadev Satyanarayanan, Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode.
34th Annual IEEE Computer Software and Applications Conference Workshops (COMPSACW), July 19-23, 2010, Seoul, Korea.
Abstract / PDF [831K]
- Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI'11). March 30 - April 1, 2011. Boston, MA.
Abstract / PDF [388K]
- Scale and Concurrency of GIGA+: File System Directories with Millions of Files. Swapnil Patil, Garth Gibson. Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose CA, February 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-110, Sept. 2010.
Abstract / PDF [508K]
- Applying Simple Performance Models to Understand Inefficiencies in Data-Intensive Computing. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-103. February 2011.
Abstract / PDF [476K]
- Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
Abstract / PDF [336K]
- Storage-Based Intrusion Detection. Adam G. Pennington, John Linwood Griffin, John S. Bucy, John D. Strunk, Gregory R. Ganger. ACM Transactions on Information and System Security, Vol. 13, No. 4, Article 30, Pub. date: December 2010.
Abstract / PDF [333K]
- Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior. Yoongu Kim, Michael Papamichael, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 43rd International Symposium on Microarchitecture (MICRO), Atlanta, GA, December 2010.
Abstract / PDF [478K]
- Improving Storage Bandwidth Guarantees with Performance Insulation. Matthew Wachs, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-113, October 2010.
Abstract / PDF [285K]
- SmartScan: Efficient Metadata Crawl for Storage Management Metadata Querying in Large File Systems. Likun Liu, Lianghong Xu, Yongwei Wu, Guangwen Yang, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-112, Oct. 2010.
Abstract / PDF [366K]
- Speeding Up Finite Element Wave Propagation for Large-Scale Earthquake Simulations. Ricardo Taborda, Julio López, Haydar Karaoglu, John Urbanic, Jacobo Bielak. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-109, October 2010.
Abstract / PDF [4.4M]
- Behavior-Based Problem Localization for Parallel File Systems. Michael P. Kasick, Rajeev Gandhi, Priya Narasimhan. HotDep '10. October 3, 2010, Vancouver, BC, Canada.
Abstract / PDF [149K]
- To Upgrade or Not to Upgrade: Impact of Online Upgrades across Multiple Administrative Domains. T. Dumitras, E. Tilevich, P. Narasimhan. ACM Onward! Conference, Oct. 2010.
Abstract / PDF [425K]
- dBug: Systematic Evaluation of Distributed Systems. Jiri Simsa, Randy Bryant, Garth Gibson. 5th Int. Workshop on Systems Software Verification (SSV’10), co-located with 9th USENIX Symp. on Operating Systems Design and Implementation (OSDI’10), Vancouver BC, October 2010.
Abstract / PDF [168K]
- pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
Abstract / PDF [321K]
- Token Attempt: The Misrepresentation of Website Privacy Policies through the Misuse of P3P Compact Policy Tokens. Pedro Giovanni Leon, Lorrie Faith Cranor, Aleecia M. McDonald, Robert McGuire. CyLab Technical Report CMU-CyLab-10-014, September 10, 2010.
Abstract / PDF [305K]
- Parsimonious Linear Fingerprinting for Time Series. Lei Li, B. Aditya Prakash, Christos Faloutsos. Proceedings of the VLDB Endowment, Vol. 3, No. 1, September 2010.
Abstract / PDF [684K]
- FAWNSort: Energy-efficient Sorting of 10GB.
Vijay Vasudevan, Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai.
Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010.
Abstract / PDF [90K]
- Phase Change Memory Architecture and the Quest for Scalability. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Communications of the ACM (CACM), Research Highlight, Vol. 53, No. 7, pages 99-106, July 2010.
Abstract / PDF [1.34M]
- Diagnosing Performance Changes by Comparing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-107. July 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-103.
Abstract / PDF [503K]
- BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf. on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
Abstract / PDF [311K]
- OddBall: Spotting Anomalies in Weighted Graphs. Leman Akoglu, Mary McGlohon, Christos Faloutsos. PAKDD 2010, Hyderabad, India, 21-24 June 2010. Best Paper Award!
Abstract / PDF [3.0M]
- A Transparently-Scalable Metadata Service for the Ursa Minor Storage System. Shafeeq Sinnamohideen, Raja R. Sambasivan, James Hendricks, Likun Liu, Gregory R. Ganger. USENIX Annual Technical Conference, Boston, MA, June 23-25, 2010. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-102. March 2010.
Abstract / PDF [230K]
- Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. 30th IEEE International Conference on Distributed Computing Systems (ICDCS) 2010, Genoa, Italy, Jun 2010.
Abstract / PDF [2.1M]
- Zzyzx: Scalable Fault Tolerance Through Byzantine Locking. James Hendricks, Shafeeq Sinnamohideen, Gregory R. Ganger, Michael K. Reiter. Proceedings of the 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. Chicago, Illinois, June 2010.
Abstract / PDF [231K]
- DiscFinder: A data-intensive scalable cluster finder for astrophysics. Bin Fu, Kai Ren, Julio López, Eugene Fink, and Garth Gibson.
In Proceedings of the ACM International Symposium on High Performance
Distributed Computing (HPDC), Chicago, IL. June, 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-104.
Abstract / PDF [372K]
- Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
Abstract / PDF [944K]
- Reusing Migration to Simply and Efficiently Implement Multi-server Operations in Transparently Scalable Storage Systems. Shafeeq Sinnamohideen. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-10-141. May 2010.
Abstract / PDF [926K]
- Applying Performance Models to Understand Data-intensive Computing Efficiency. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-108. May 2010.
Abstract / PDF [304K]
- An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010). May 17-20, 2010, Melbourne, Victoria, Australia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-107, December, 2009.
Abstract / PDF [832K]
- Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan, David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru.
Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
Abstract / PDF [645K]
- Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
Abstract / PDF [1.1M]
- File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-105, April 2010.
Abstract / PDF [513K]
- Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments. Jiaqi Tan, Xinghao Pan, Eugene Marinelli, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 12th IEEE/IFIP Network Operations and Management Symposium (NOMS) 2010, Osaka, Japan, Apr 2010.
Abstract / PDF [2.8M]
- Access Control for Home Data Sharing: Attitudes, Needs and Practices. Michelle L. Mazurek, J.P. Arsenault, Joanna Bresee, Nitin Gupta, Iulia Ion, Christina Johns, Daniel Lee, Yuan Liang, Jenny Olsen, Brandon Salmon, Richard Shay, Kami Vaniea, Lujo Bauer, Lorrie Faith Cranor, Gregory R. Ganger, Michael K. Reiter. CHI 2010, April 10 – 15, 2010, Atlanta, Georgia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-110, October 2009.
Abstract / PDF [250K]