PDL Publications by Project

Big Learning

  • Parity Models: Erasure-Coded Resilience for Prediction Serving Systems. Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • PipeDream: Generalized Pipeline Parallelism for DNN Training. Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil R. Devanur, Gregory R. Ganger, Phillip B. Gibbons, Matei Zaharia. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • Rateless Codes for Distributed Computations with Sparse Compressed Matrices. Ankur Mallick, Gauri Joshi. IEEE International Symposium on Information Theory (ISIT), July 7-12, 2019, Paris, France.
    Abstract / PDF [672K]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Fast and Efficient Distributed Matrix-Vector Multiplication Using Rateless Fountain Codes. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 12 - 17 May, 2019 · Brighton, UK.
    Abstract / PDF [485K]

  • Towards Lightweight and Robust Machine Learning for CDN Caching. Daniel S. Berger. HotNets-XVII, November 15–16, 2018, Redmond, WA, USA.
    Abstract / PDF [610K]

  • Focus: Querying Large Video Datasets with Low Latency and Low Cost. Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Oct. 8–10, 2018, Carlsbad, CA.
    Abstract / PDF [1.2M]

  • Tributary: Spot-dancing for Elastic Services with Latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. 2018 USENIX Annual Technical Conference. July 11–13, 2018 Boston, MA, USA. Supersedes Carnagie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102.
    Abstract / PDF [1.25M]

  • Geriatrix: Aging What You See and What You Don’t See -- A File System Aging Approach for Modern Storage Systems. Saurabh Kadekodi, Vaishnavh Nagarajan, Gregory R. Ganger, Garth A. Gibson. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.44M]

  • Cavs: An Efficient Runtime System for Dynamic Neural Networks. Shizhen Xu, Hao Zhang, Graham Neubig, Wei Dai, Jin Kyu Kim, Zhijie Deng, Qirong Ho, Guangwen Yang, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.7M]

  • Litz: Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [298K]

  • Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. arXiv:1806.01259v1 [cs.LG], 4 Jun 2018
    Abstract / PDF [575K]

  • The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality in GPUs. Nandita Vijaykumar, Eiman Ebrahimi, Kevin Hsieh, Phillip B. Gibbons, Onur Mutlu. The 45th International Symposium on Computer Architecture - June 2-6, ISCA 2018. Los Angeles, California, USA.
    Abstract / PDF [3.1M]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. arXiv:1803.07445v1 [cs.LG] 20 Mar 2018.
    Abstract / PDF [1M]

  • 3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. arXiv:1802.07389v1 [cs.LG] 21 Feb 2018.
    Abstract / PDF [586K]

  • PipeDream: Fast and Efficient Pipeline Parallel DNN Training. Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Greg Ganger, Phil Gibbons. SysML '18, Feb. 15-16, 2018 , Stanford, CA.
    Abstract / PDF [615K]

  • Intermittent Deep Neural Network Inference. Graham Gobieski, Nathan Beckmann, Brandon Lucia. SysML 2018, February 15-16, 2018, Stanford, CA.
    Abstract / PDF [450K]

  • Tributary: Spot-dancing for elastic services with latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102, Jan. 2018.
    Abstract / PDF [990K]

  • Aging Gracefully with Geriatrix: A File System Aging Tool. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-106, October 2017. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [560K]

  • Litz: An Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. Carnegie Mellon Univedrsity Parallel Data Laboratory Technical Report CMU-PDL-17-103. June 2017.
    Abstract / PDF [424K]

  • Proteus: Agile ML Elasticity through Tiered Reliability in Dynamic Resource Markets. Aaron Harlap, Alexey Tumanov, Andrew Chung, Greg Ganger, Phil Gibbons. ACM European Conference on Computer Systems, 2017 (EuroSys'17), 23rd-26th April, 2017, Belgrade, Serbia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [743K]

  • Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds. Kevin Hsieh, Aaron Harlap, Nandita Vijaykumar, Dimitris Konomis, Gregory R. Ganger, Phillip B. Gibbons, Onur Mutlu. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI), March 27–29, 2017, Boston, MA.
    Abstract / PDF [1.5M]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, and Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-108, October 2016.
    Abstract / PDF [900K]

  • Benchmarking Apache Spark with Machine Learning Applications. Jinliang Wei, Jin Kyu Kim, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-107 October 2016.
    Abstract / PDF [360K]

  • Addressing the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2016. Oct 5-7, Santa Clara, CA. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • TierML: Using Tiers of Reliability for Agile Elasticity in Machine Learning. Aaron Harlap, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [590K]

  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [617K]

  • Scalable Deep Learning on Distributed GPUs with a GPU-specialized Parameter Server. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-107, October 2015.
    Abstract / PDF [537K]

  • SMPFRAME: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Hoy, Seunghak Lee Xun Zheng, Wei Dai, Garth Gibson, Eric Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-103, May 2015.
    Abstract / PDF [1.57M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho*, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-105. April 2015.
    Abstract / PDF [2.62M]

  • Solving the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • High-Performance Distributed ML at Scale through Parameter Server Consistency Models. Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho, Garth Gibson, Eric P. Xing. 29th AAAI Conf. on Artificial Intelligence (AAAI-15), Jan 25-29, 2015, Austin, Texas.
    Abstract / PDF [733K]

  • Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • On Model Parallelization and Scheduling Strategies for Distributed Machine Learning. S. Lee, J. K. Kim, X. Zheng, Q. Ho, G. A. Gibson, E. P. Xing. Proceedings of 2014 Neural Information Processing Systems (NIPS’14), December 2014.
    Abstract / PDF [336K]

  • Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 USENIX Annual Technical Conference (ATC'14). June 19-20, 2014. Philadelphia, PA. Supersedes CMU-PDL-14-101.
    Abstract / PDF [731K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
    Abstract / PDF [174K]

Cloud Computing

  • Peering through the Dark: An Owl’s View of Inter-job Dependencies and Jobs’ Impact in Shared Clusters. Andrew Chung, Carlo Curino, Subru Krishnan, Konstantinos Karanasos, Panagiotis Garefalakis, Gregory R. Ganger. SIGMOD ’19, June 30–July 5, 2019, Amsterdam, Netherlands.
    Abstract / PDF [1.6M]

  • Distribution-based Cluster Scheduling. Jun Woo Park. Carnegie Mellon University School of Computer Science PhD Dissertation, June 2019.
    Abstract / PDF [1.47M]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Greg Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
    Abstract / PDF [735K]

  • Scaling Video Analytics on Constrained Edge Nodes. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. 2nd SysML Conference (SysML ’19). March 31-April 2, 2019, Palo Alto, CA.
    Abstract / PDF [8.5M]

  • Datacenter RPCs can be General and Fast. Anuj Kalia Michael, Kaminsky, David G. Andersen. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Feb. 26–28, 2019, Boston, MA. Best Paper award!
    Abstract / PDF [555K]

  • Scaling Embedded In-Situ Indexing with DeltaFS. Qing Zheng, Charles D. Cranor, Danhao Guo, Gregory R. Ganger, George Amvrosiadis, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider, Fan Guo. SC18, November 11-16, 2018, Dallas, Texas, USA.
    Abstract / PDF [927K]

  • Stratus: Cost-aware Container Scheduling in the Public Cloud. Andrew Chung, Jun Woo Park, Gregory R. Ganger. ACM Symposium on Cloud Computing, 2018 (SoCC’18), Carlsbad, CA October 11-13, 2018.
    Abstract / PDF [1.5M]

  • RobinHood: Tail Latency Aware Caching—Dynamic Reallocation from Cache-Rich to Cache-Poor. Daniel S. Berger, Benjamin Berg, Timothy Zhu, Siddhartha Sen, Mor Harchol-Balter. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’18). October 8–10, 2018 • Carlsbad, CA, USA.
    Abstract / PDF [2.9M]

  • Putting the “Micro” Back in Microservice. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [740K]

  • Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela H. Jiang, Daniel L.K. Wong, Christopher Canel, Lilia Tang, Ishan Misra, Michael Kaminsky*, Michael A. Kozuch*, Padmanabhan Pillai*, David G. Andersen Gregory R. Ganger. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA, USA.
    Abstract / PDF [1.5M]

  • On the Diversity of Cluster Workloads and its Impact on Research Results. George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. 2018 USENIX Annual Technical Conference (ATC '18), Boston, MA, July 11-13, 2018.
    Abstract / PDF [285K]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson, Gregory R. Ganger. 10th USENIX Workshop on Hot Topics in Cloud Computing, July 9, 2018, Boston, MA. Supersedes CMU-PDL-17-105.
    Abstract / PDF [250K]

  • Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela Jiang, Christopher Canel, Daniel Wong, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, David G. Andersen, Gregory R. Ganger. SysML 18, February 15–16, 2018. Stanford, CA.
    Abstract / PDF [450K]

  • Picking Interesting Frames in Streaming Video. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [913K]

  • 3Sigma: Distribution-based Cluster Scheduling for Runtime Uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. EuroSys ’18, April 23–26, 2018, Porto, Portugal. Supersedes CMU-PDL-17-107, Nov. 2017.
    Abstract / PDF [1.4M]

  • 3Sigma: Distribution-based cluster scheduling for runtime uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-107, November 2017.
    Abstract / PDF [800K]

  • Software-Defined Storage for Fast Trajectory Queries using a DeltaFS Indexed Massive Directory. Qing Zheng, George Amvrosiadis, Saurabh Kadekodi, Garth Gibson, Chuck Cranor, Brad Settlemyer, Gary Grider, Fan Guo. PDSW-DISCS 2017: 2nd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems held in conjunction with SC17, Denver, CO, November 2017.
    Abstract / PDF [1.25M]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-105, October 2017.
    Abstract / PDF [280K]

  • Bigger, Longer, Fewer: What do cluster jobs look like outside Google? George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-104, October 2017.
    Abstract / PDF [360K]

  • WorkloadCompactor: Reducing datacenter cost while providing tail latency SLO guarantees. Timothy Zhu, Michael A. Kozuch & Mor Harchol-Balter. ACM Symposium on Cloud Computing (SoCC'17) , Santa Clara, Oct 2017.
    Abstact / PDF [3.25M]

  • Principled Workflow-centric Tracing of Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Jonathan Mace, Benjamin H. Sigelman, Rodrigo Fonseca, Gregory R. Ganger. ACM Symposium on Cloud Computing 2016 (SoCC ’16) October 5-7, 2016, Santa Clara, CA, USA.
    Abstract / PDF [590K]

  • SNC-Meister: Admitting More Tenants with Tail Latency SLOs. Timothy Zhu, Daniel S. Berger, Mor Harchol-Balter. SoCC ’16, October 05-07, 2016, Santa Clara, CA, USA.
    Abstract / PDF [500K]

  • Online Deduplication for Distributed Databases. Lianghong Xu. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, September 2016.
    Abstract / PDF [1.8M]

  • JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes. Alexey Tumanov, Angela Jiang, Jun Woo Park, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-104. September 2016.
    Abstract / PDF [1.6M]

  • Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [617K]

  • TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters. Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [8M]

  • DeltaFS: Exascale File Systems Scale Better Without Dedicated Servers. Qing Zheng, Kai Ren, Garth Gibson, Bradley W. Settlemyer, Gary Grider. PDSW2015: 10th Parallel Data Storage Workshop, held in conjunction with SC15, Austin, TX, November 16, 2015.
    Abstract / PDF [930K]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth Gibson. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [275K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy, Krishnamurthy Viswanathan, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI. See the extended Technical Report for more information.
    Abstract / PDF [1.3M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [369K]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Jin Li, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [501K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy Krishnamurthy Viswanathan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-106. April 2015. Extended version of the 2015 SoCC paper.
    Abstract / PDF [925K]

  • A Cloud Computing Course: From Systems To Services. M. Suhail Rehman, Jason Boles, Mohammad Hammoud, Majd F. Sakr. Proceedings of the 46th ACM Special Interest Group on Computer Science Education Conference (SIGCSE 2015), Kansas City, USA, March 2015.
    Abstract / PDF [356K]

  • STOVE: Strict, Observable, Verifiable Data and Execution Models for Untrusted Applications. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014 (Doctoral Symposium), pp.644,649, 15-18 Dec. 2014.
    Abstract / PDF [541K]

  • STOVEPipe: Observable Access Control of User Data for Untrusted Applications on Mobile Devices. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. Poster at the IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014, 15-18 Dec. 2014.
    Abstract / PDF [149K]

  • Exploiting Iterative-ness for Parallel ML Computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Greg R. Ganger, Phil B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. Supersedes Carnegie Mellon University Parallel Data Technical Report CMU-PDL-14-107.
    Abstract / PDF [609K]

  • PriorityMeister: Tail Latency QoS for Shared Networked Storage. Timothy Zhu, Alexey Tumanov, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014.
    prioritymeister-SoCC14.pdf
    Abstract / PDF [940K]

  • Cloudlets: at the Leading Edge of Mobile-Cloud Convergence. M. Satyanarayanan, Z. Chen, K. Ha, W. Hu, W. Richter, P. Pillai. Proceedings of MobiCASE 2014: Sixth International Conference on Mobile Computing, Applications and Services, Austin, TX, November 2014.
    Abstract / PDF [859K]

  • A Brief History of Cloud Offload. M. Satyanarayanan. GetMobile, Volume 18, Issue 4, October 2014.
    Abstract / PDF [360K]

  • Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, And Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. ACM Transactions on Storage, Vol. 10, No. 4, Article 16, Publication date: October 2014.
    Abstact / PDF [1.34M]

  • Towards Wearable Cognitive Assistance. Kiryong Ha, Zhuo Chen, Wenlu Hu, Wolfgang Richter, Padmanabhan Pillai, Mahadev Satyanarayanan. Proceedings of the 12th ACM International Conference on Mobile Computing, Systems and Services (MobiSys’14), June 2014.
    Abstract / PDF [1.54M]

  • QuiltView: Glass-Sourced Video for Google Maps Queries. Zhuo Chen, Wenlu Hu, Kiryong Ha, Jan Harkes, Benjamin Gilbert, Jason Hong, Asim Smailagic, Dan Siewiorek, Mahadev Satyanarayanan. The 15th International Workshop on Mobile Computing Systems and Applications (HotMobile'14), February 2014.
    Abstract / PDF [4.51M]

  • Agentless Cloud-wide Streaming of Guest File System Updates. Wolfgang Richter, Canturk Isci, Jan Harkes, Benjamin Gilbert, Vasanth Bala, Mahadev Satyanarayanan. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. Best Paper.
    Abstract / PDF [978K]

  • SpringFS: Bridging Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. 12th USENIX Conference on File and Storage Technologies (FAST '14), Santa Clara, CA, February 17–20, 2014.
    Abstract / PDF [319K]

  • Tetrisched: Space-Time Scheduling for Heterogeneous Datacenters. Alexey Tumanov, Timothy Zhu, Michael A. Kozuch†, Mor Harchol-Balter, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-112, December, 2013.
    Abstract / PDF [716K]

  • Just-in-Time Provisioning for Cyber Foraging. Kiryong Ha, Padmanabhan Pillai, Wolfgang Richter, Yoshihisa Abe, Mahadev Satyanarayanan The 11th International Conference on Mobile Systems, Applications, and Services (MobiSys'13), June 25–28, 2013, Taipei, Taiwan.
    ha-mobisys-vmsynthesis-2013.pdf cloud
    Abstract / PDF [2.29M]

  • vQuery: A Platform for Connecting Configuration and Performance. Ilari Shafer, Snorri Gylfason, Gregory R. Ganger. vwWare Labs Technical Report, Palo Alto, CA. December 2012.
    Abstract / PDF [288K]

  • alsched: Algebraic Scheduling of Mixed Workloads in Heterogeneous Clouds. Alexey Tumanov, James Cipar, Michael A. Kozuch, Gregory R. Ganger. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
    Abstract / PDF [379K]

  • Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
    Abstract / PDF [3.1M]

  • Saving Cash by Using Less Cache. Timothy Zhu, Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. 4th USENIX Workshop of Hot Topics in Cloud Computing (Hotcloud 2012). June 12-13, 2012, Boston, MA.
    Abstract / PDF [177K]

  • Towards Understanding Heterogeneous Clouds at Scale: Google Trace Analysis Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. Intel Science and Technology Center for Cloud Computing Technical Report ISTC-CC-TR-12-101, April 27, 2012.
    Abstract / PDF [876K]

  • Near-Real-Time Inference of File-Level Mutations from Virtual Disk Writes. Wolfgang Richter, Mahadev Satyanarayanan, Jan Harkes, Benjamin Gilbert. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-103. February 2012.
    Abstract / PDF [343K]

  • ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gundawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
    Abstract / PDF [567K]

  • Privacy-Sensitive VM Retrospection. Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode, Nilton Bila, Eyal De Lara, Vas Bala, Mahadev Satyanarayanan. HotCloud 2011 3rd USENIX Workshop on Hot Topics in Cloud Computing. Portland, OR, June 14-17, 2011.
    Abstract / PDF [1.97M]

  • On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • Exertion-based Billing for Cloud Storage Access. Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger. Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '11). June 14-15, 2011, Portland, OR. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-105. March 2011.
    Abstract / PDF [65K]

  • The Case for Content Search of VM Clouds. Mahadev Satyanarayanan, Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode. 34th Annual IEEE Computer Software and Applications Conference Workshops (COMPSACW), July 19-23, 2010, Seoul, Korea.
    Abstract / PDF [831K]

  • Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
    Abstract / PDF [1.1M]

Database Systems

  • Query-based Workload Forecasting for Self-Driving Database Management Systems. Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, Geoffrey J. Gordon. SIGMOD/PODS '18 International Conference on Management of Data, Houston, TX, USA, June 10 - 15, 2018.
    Abstract / PDF [1.25M]

  • Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last. Prashanth Menon, Todd C. Mowry & Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 11, No. 1, 2017.
    Abstact / PDF [970K]

  • Automatic Database Management System Tuning Through Large-scale Machine Learning. Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, Bohan Zhang. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017. Chicago, IL, USA.
    Abstract / PDF [760K]

  • Online Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017.
    Abstract / PDF [890K]

  • An Empirical Evaluation of In-Memory Multi-Version Concurrency Control. Yingjun Wu, Joy Arulraj, Jiexi Lin, Ran Xian, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 10, iss. 7, pages. 781—792, March 2017.
    Abstract / PDF [660K]

  • An Evaluation of Distributed Concurrency Control. Rachael Harding, Dana Van Aken, Andrew Pavlo, Michael Stonebraker. Proceedings of the VLDB Endowment, vol. 10, iss. 5, pages. 553—564, January 2017.
    Abstract / PDF [421K]

  • Self-Driving Database Management Systems. A. Pavlo, G. Angulo, J. Arulraj, H. Lin, J. Lin, L. Ma, P. Menon, T. Mowry, M. Perron, I. Quah, S. Santurkar, A. Tomasic, S. Toor, D. V. Aken, Z. Wang, Y. Wu, R. Xian, and T. Zhang. In CIDR 2017, Conference on Innovative Data Systems Research. January 8-11, 2017, Chaminade, CA.
    Abstract / PDF [680K]

  • Write-Behind Logging. J. Arulraj, M. Perron, A. Pavlo. Proc. VLDB Endow., vol. 10, pp. 337-348, December, 2016.
    Abstract / PDF [931K]

  • Online Deduplication for Distributed Databases. Lianghong Xu. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, September 2016.
    Abstract / PDF [1.8M]

  • Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems. Lin Ma, Joy Arulraj, Sam Zhao, Andrew Pavlo, Subramanya R. Dulloor, Michael J. Giardino, Jeff Parkhurst, Jason L. Gardner, Kshitij Dosh*, Col. Stanley Zdonik. DaMoN’16, June 26-July 01 2016, San Francisco, CA, USA.
    Abstract / PDF [1.25M]

  • Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. Joy Arulraj, Andrew Pavlo, Prashanth Menon. SIGMOD’16, June 26-July 01, 2016, San Francisco, CA, USA.
    Abstract / PDF [575K]

  • Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

  • Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes. Huanchen Zhang, Andy Pavlo, David G. Andersen, Michael Kaminsky, Lin Ma, Rui Shen. ACM SIGMOD International Conference on Management of Data 2016 (SIGMOD'16), June 2016.
    Abstract / PDF [715K]

  • BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed. D. Van Aken, D. E. Difallah, A. Pavlo, C. Curino, and P. Cudré-Mauroux. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, 2015, pp. 1069-1073.
    Abstract / PDF [1.2M]

  • Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. Joy Arulraj, Andrew Pavlo, Subramanya R. Dulloor. Proceedings ACM SIGMOD, Melbourne, Victoria, Australia, May 31-June 4, 2015.
    Abstract / PDF [1M]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta Jin Li, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-108. December 2014.
    Abstract / PDF [646K]

Decentralized Caching

  • Cuckoo Linear Algebra. Li Zhou, David G. Andersen, Mu Li, Alexander J. Smola. KDD’15, August 10-13, 2015, Sydney, NSW, Australia.
    Abstract / PDF [611K]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014.
    Abstract / PDF [343K]


  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Integrating Portable and Distributed Storage. Niraj Tolia, Jan Harkes, Michael Kozuch, and M. Satyanarayanan. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [881K] / PDF [211K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • Opportunistic Use of Content Addressable Storage for Distributed File Systems. Niraj Tolia, Michael Kozuch, Mahadev Satyanarayanan, Brad Karp, Thomas Bressoud, and Adrian Perrig. Proceedinge USENIX Annual Technical Conference, General Track 2003: 127-140, June 9-14, San Antonio, TX.
    Abstract / Postscript [1M] / PDF [284K]

  • Data Staging on Untrusted Surrogates. Jason Flinn, Shafeeq Sinnamohideen, Niraj Tolia, M. Satyanarayanan. Proceedings 2nd USENIX Conference on File and Storage Technologies (FAST03), Mar31-Apr2, 2003, San Francisco, CA.
    Abstract / Postscript [1.5M] / PDF [325K]

  • Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • My Cache or Yours? Making Storage More Exclusive. Theodore M. Wong, John Wilkes. USENIX Annual Technical Conference (USENIX 2002), pp. 161-175, 10-15 June 2002, Monterey, CA. Supercedes CMU SCS Tech. Report CMU-CS-02-186, which supercedes CMU-CS-00-157, originally published in November 2000.
    Abstract / Postscript [759K] / PDF [253K]

Energy Efficiency

  • A Scalable Priority-Aware Approach to Managing Data Center Server Power. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu. HPCA 2019: The 25th International Symposium on High-Performance Computer Architecture, February 16 - 20, 2019, Washington D.C.
    Abstract / PDF [610K]

  • LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching. Mohammad Sadrosadati, Amirhossein Mirhosseini, Seyed Borna Ehsani, Hamid Sarbazi-Azad, Mario Drumond, Babak Falsafi, Rachata Ausavarungnirun, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.M]

  • Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability. Maciej Besta, Syed Minhaj Hassan, Sudhakar Yalamanchili, Rachata Ausavarungnirun, Onur Mutlu, Torsten Hoefler. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.6M]

  • MASK: Redesigning the GPU Memory Hierarchy to Support Multi-Application Concurrency. Rachata Ausavarungnirun, Vance Miller, Joshua Landgraf, Saugata Ghose, Jayneel Gandhi, Adwait Jog, Christopher J. Rossbach, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.1M]

  • μC-States: Fine-grained GPU Datapath Power Management. Onur Kayıran, Adwait Jog, Ashutosh Pattnaik, Rachata Ausavarungnirun, Xulong Tang, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das. Proceedings of the The 25th International Conference on Parallel Architectures and Compilation Techniques (PACT 2016), Haifa, Israel, September 2016.
    Abstract / PDF [823K]

  • SizeCap: Efficiently Handling Power Surges in Fuel Cell Powered Data Centers. Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu. 22nd International Symposium on High Performance Computer Architecture (HPCA), March 12-16, Barcelona, Spain, 2016.
    Abstract / PDF [1.32M]

  • A Case for Toggle-Aware Compression for GPU Systems. Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [713K]

  • Low-Cost Inter-Linked Subarrays (LISA): Enabling Fast Inter-Subarray Data Movement in DRAM. Kevin K. Chang, Prashant J. Nair, Donghyuk Lee, Saugata Ghose, Moinuddin K. Qureshi, and Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [768K]

  • Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture. Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen China, February 2013.
    Abstract / PDF [3.17M]

  • Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
    Abstract / PDF [184K]

  • Bottleneck Identification and Scheduling in Multithreaded Applications. José A. Joao, M. Aater Suleman, Onur Mutlu, Yale N. Patt. Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), London, UK, March 2012.
    Abstract / PDF [828K]

  • ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gundawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
    Abstract / PDF [567K]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park , Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • The Case for Sleep States in Servers. Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. HotPower'11, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [621K]

  • Minimizing Data Center SLA Violations and Power Consumption via Hybrid Resource Provisioning. Anshul Gandhi, Yuan Chen, Daniel Gmach, Martin Arlitt, Manish Marwah. 2nd IGCC 2011 (IEEE International Green Computing Conference 2011) July 25-28, 2011 Orlando, Florida, USA. -- BEST PAPER AWARD
    Abstract / PDF [503K]

  • Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
    Abstract / PDF [463K]

  • Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael A. Kozuch. 5th International Open Cirrus Summit. June 01 – 03, 2011, Moscow, Russia.
    Abstract / PDF [317K]

FAWN

  • SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data. Kai Ren, Qing Zheng, Joy Arulraj, Garth Gibson. Proceedings of the VLDB Endowment, Vol. 10, No. 13, 2017.
    Abstract / PDF [2.15M]

  • FaSST: Fast, Scalable and Simple Distributed Transactions with Two-sided (RDMA) Datagram RPCs. Anuj Kalia, Michael Kaminsky, David G. Andersen.12th USENIX Symposium on Operating Systems Design and Implementation November 2–4, 2016, Savannah, GA, USA.
    Abstract / PDF [608K]

  • Achieving One Billion Key-Value Requests Per Second on a Single Server. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. IEEE Micro's Top Picks from the Computer Architecture Conferences 2016, May/June 2016. Top Picks 2016 Award!
    Abstract / PDF [176K]

  • Design Guidelines for High Performance RDMA Systems. Anuj Kalia, Michael Kaminsky, David G. Andersen. 2016 USENIX Annual Technical Conference (USENIX ATC'16), June 2016.
    Abstract / PDF [553K]

  • Cuckoo Linear Algebra. Li Zhou, David G. Andersen, Mu Li, Alexander J. Smola. KDD’15, August 10-13, 2015, Sydney, NSW, Australia.
    Abstract / PDF [611K]

  • Full-Stack Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. ACM Transactions on Computer Systems (TOCS), Vol. 34, No. 2, April 2016.
    Abstract / PDF [1.14M]

  • Be Fast, Cheap and in Control with SwitchKV. Xiaozhou Li, Raghav Sethi, Michael Kaminsky, David G. Andersen, Michael J. Freedman. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI'16), Santa Clara, CA, March 2016.
    Abstract / PDF [594K]

  • Towards Accurate and Fast Evaluation of Multi-Stage Log-Structured Designs. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. In 14th USENIX Conference on File and Storage Technologies (FAST'16), Santa Clara, CA, February 2016.
    Abstract / PDF [2M]

  • Resource-Efficient Data-Intensive System Designs for High Performance and Capacity. Hyeontaek Lim. Carnegie Mellon University PhD Dissertation CMU-CS-15-132, September 2015.
    Abstract / PDF [3.1M]

  • Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. In Proceedings of the 42nd International Symposium on Computer Architecture (ISCA 2015), Portland, OR, June 2015. Fast-tracked to Transactions on Computer Systems (TOCS).
    Abstract / PDF [350K]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014. Abstract / PDF [343K]
    Abstract / PDF [343K]

  • Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David G. Andersen. ACM SIGCOMM 2014. Chicago, Illinois, August 17-22, 2014. Supersedes CMU-PDL-14-106, June 2014.
    Abstract / PDF [462K]

  • Algorithmic Improvements for Fast Concurrent Cuckoo Hashing. Xiaozhou Li, David G. Andersen, Michael Kaminsky, Michael J. Freedman. Proceedings of the European Conference on Computer Systems (EuroSys '14), April 2014.
    Abstract / PDF [4.3M]

  • MICA: A Holistic Approach to Fast In-Memory Key-Value Storage. Hyeontaek Lim, Dongsu Han, David G. Andersen, Michael Kaminsky. 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI'14), April 2014.
    Abstract / PDF [1.36M]

  • Scalable, High Performance Ethernet Forwarding with CuckooSwitch. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky. Proc. 9th International Conference on emerging Networking EXperiments and Technologies (CoNEXT), Dec. 2013.
    Abstract / PDF [479K]

  • Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
    Abstract / PDF [648K]

  • FAWNSort: Energy-efficient Sorting of 10GB. Vijay Vasudevan Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010
    Abstract / PDF [90K]

  • Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru . Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
    Abstract / PDF [645K]

  • FAWN: A Fast Array of Wimpy Nodes. David Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Proc. 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), Big Sky, MT. October 2009. BEST PAPER AWARD!
    Abstract / PDF [332K]

  • FAWNdamentally Power-efficient Clusters. Vijay Vasudevan, Jason Franklin, David Andersen, Amar Phanishayee, Lawrence Tan, Michael Kaminsky, Iulian Moraru. 12th Workshop on Hot Topics in Operating Systems (HotOS XII). May 2009.
    Abstract / PDF [236K]

File System Virtual Appliances (FSVA)

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek , Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. ACM Transactions on Storage, Vol. 8, No. 3, Article 39, May 2012.
    Abstract / PDF [518K]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-105, April 2010.
    Abstract / PDF [513K]

  • File System Virtual Appliances. Michael Abd-El-Malek. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-109, August 2009.
    Abstract / PDF [1.15M]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-102. May 2009.
    Abstract / PDF [486K]

  • File System Virtual Appliances: Third-party File System Implementations without the Pain. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-106, May 2008.
    Abstract / PDF [508K]

  • FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

New Storage Interfaces

  • File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [870K]

  • STRADS-AP: Simplifying Distributed Machine Learning Programming without Introducing a New Programming Model. Jin Kyu Kim, Abutalib Aghayev, Garth A. Gibson, Eric P. Xing. Proceedings of the 2019 USENIX Annual Technical Conference, July 10–12, 2019 • Renton, WA.
    Abstract / PDF [490K]

  • SuRF: Practical Range Query Filtering with Fast Succinct Tries. Huanchen Zhang, Hyeontaek Lim, Viktor Leis, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [1.9M]

  • Building a Bw-Tree Takes More Than Just Buzz Words. Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis, Huanchen Zhang, Michael Kaminsky, David G. Andersen. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [2.2M]

  • Addressing the Long-Lineage Bottleneck in Apache Spark. Haoran Wang, Jinliang Wei, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-101, January 2018.
    Abstract / PDF [250K]

  • Evolving Ext4 for Shingled Disks. Abutalib Aghayev, Theodore Ts’o, Garth Gibson, Peter Desnoyers. 15th USENIX Conference on File and Storage Technologies (FAST '17), Feb 27–Mar 2, 2017. Santa Clara, CA.
    Abstract / PDF [1.4M]

  • Aging Gracefully with Geriatrix: A File System Aging Suite. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [503K]

  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth A. Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [1.6M]

  • Scaling Up Clustered Network Appliances with ScaleBricks. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Michael Mitzenmacher, Ren Wang, Ajaypal Singh. Proc. ACM SIGCOMM 2015, August 17-21, 2015, London, United Kingdom.
    Abstract / PDF [626K]

  • Exploiting Compressed Block Size as an Indicator of Future Reuse. Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip P. Gibbons, Michael A. Kozuch, and Todd C. Mowry. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [2.4M]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014.
    Abstract / PDF [343K]

  • Comparing Performance of Different Cleaning Algorithms for SMR Disks. Mukul Kumar Singh. M.S. Thesis: Master of Science in Information Networking, April 2014.
    Abstract / PDF [623K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. 2013 ACM Symposium on Cloud Computing (SoCC'13), Oct. 01-03 2013, Santa Clara, CA, USA.
    Abstract / PDF [944K]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin, Gregory R. Ganger. Proceedings of the ACM Int'l Conference on Supercomputing (ICS), Eugene, OR, June 2013.
    Abstract / PDF [677K]

  • Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • Specialized Storage for Big Numeric Time Series. Ilari Shafer, Raja R. Sambasivan, Anthony Rowe, Gregory R. Ganger. Proceedings of the 5th Workshop on Hot Topics in Storage and File Systems, June 2013.
    Abstract / PDF [161K]

  • MemC3: Compact and Concurrent Memcache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen and Michael Kaminsky. In Proc. 10th USENIX NSDI, Apr 2013. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012. Source code: https://github.com/efficient/libcuckoo
    Abstract / PDF [280K]

  • Practical Batch-Updatable External Hashing with Sorting. Hyeontaek Lim and David G. Andersen and Michael Kaminsky. In Proc. Meeting on Algorithm Engineering and Experiments (ALENEX), Jan 2013.
    Abstract / PDF [536K]

  • Memory-Efficient Group-By-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. Georgia Tech Center for Experimental Research in Computer Systems Technical Report GIT-CERCS-12-08.
    Abstract / PDF [450K]

  • MemC3: Compact and Concurrent MemCache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012.
    Abstract / PDF [824K]

  • JackRabbit: Improved Agility In Elastic Distributed Storage. James Cipar, Lianghong Xu, Elie Krevat, Alexey Tumanov Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-112, October 2012.
    Abstract / PDF [395K]

  • SILT: A Memory-Efficient, High-Performance Key-Value Store. Hyeontaek Lim, Bin Fan, David Andersen and Michael Kaminsky. ACM Symposium on Operating Systems Principles (SOSP'11), Cascais, Portugal, October 2011.
    Abstract / PDF [1.15M]

  • Switching the Optical Divide: Fundamental Challenges for Hybrid Electrical/Optical Datacenter Networks. Hamid Hajabdolali Bazzaz, Malveeka Tewari, Guohui Wang, George Porter, T. S. Eugene Ng, David G. Andersen, Michael Kaminsky, Michael A. Kozuch, Amin Vahdat. Proc. 2nd ACM Symposium on Cloud Computing (SOCC), Oct 2011.
    Abstract / PDF [190K]

  • Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS.
    Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, David G. Andersen. Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP), Oct 2011.
    Abstract / PDF [689K]

  • RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yashihisa Abe, Greg Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
    Abstract / PDF [1.5M]

  • The Case for VOS: The Vector Operating System. Vijay Vasudevan, David Andersen, Michael Kaminsky. In 13th Workshop on Hot Topics in Operating Systems (HotOS 2011). May 2011.
    Abstract / PDF [430K]

  • Principles of Operation for Shingled Disk Devices. Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth Gibson. MapReduce'11, June 8, 2011, San Jose, California, USA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
    Abstract / PDF [2.5M]

  • pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
    Abstract / PDF [321K]

  • FAWNSort: Energy-efficient Sorting of 10GB. Vijay Vasudevan Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010
    Abstract / PDF [90K]

  • Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru . Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
    Abstract / PDF [645K]

  • Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
    Abstract / PDF [1.1M]

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
    Abstract / PDF [944K]

  • FAWN: A Fast Array of Wimpy Nodes. David Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Proc. 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), Big Sky, MT. October 2009. BEST PAPER AWARD!
    Abstract / PDF [332K]

  • Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Brian Mueller. SIGCOMM’09, August 17–21, 2009, Barcelona, Spain.
    Abstract / PDF [755K]

  • Tashi: Location-aware Cluster Management. Michael A. Kozuch, Michael P. Ryan, Richard Gass, Steven W. Schlosser, David O’Hallaron, James Cipar, Elie Krevat, Julio López, Michael Stroucken, Gregory R. Ganger. First Workshop on Automated Control for Datacenters and Clouds (ACDC'09), Barcelona, Spain, June 2009.
    Abstract / PDF [160K]

  • FAWNdamentally Power-efficient Clusters. Vijay Vasudevan, Jason Franklin, David Andersen, Amar Phanishayee, Lawrence Tan, Michael Kaminsky, Iulian Moraru. 12th Workshop on Hot Topics in Operating Systems (HotOS XII). May 2009.
    Abstract / PDF [236K]

  • Enabling Enterprise Solid State Disks Performance. Milo Polte, Jiri Simsa, Garth Gibson. 1st Workshop on Integrating Solid-state Memory into the Storage Hierarchy, March 7, 2009, Washington DC.
    Abstract / PDF [302K]

  • Solving TCP Incast in Cluster Storage Systems. Vijay Vasudevan, Hiral Shah, Amar Phanishayee, Elie Krevat, David Andersen, Greg Ganger, Garth Gibson. FAST 2009 Work in Progress Report. 7th USENIX Conference on File and Storage Technologies. Feb 24-27, 2009, San Francisco, CA.
    PDF [70K]

  • A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-101, Feb. 2009.
    Abstact / PDF [317K]

  • FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

  • Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • On Multidimensional Data and Modern Disks. Steven W. Schlosser, Jiri Schindler, Stratos Papadomanolakis , Minglong Shao Anastassia Ailamaki, Christos Faloutsos, Gregory R. Ganger. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005.
    Abstract / PDF [220K]

  • Replication Policies for Layered Clustering of NFS Servers. Raja R. Sambasivan, Andrew J. Klosterman, Gregory R. Ganger. 13th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). September 26 - 29, 2005, Atlanta, GA.
    Abstract / PDF [199K]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003.
    Abstract / PDF [281K]

  • A Framework for Building Unobtrusive Disk Maintenance Applications. Eno Thereska, Jiri Schindler, John Bucy, Brandon Salmon, Christopher R. Lumb, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-192, October 2003.
    Abstract / Postscript [5.1M] / PDF [148K]

  • Design and Implementation of a Freeblock Subsystem. Eno Thereska, Jiri Schindler, Christopher R. Lumb, John Bucy, Brandon Salmon, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-107, December, 2003.
    Abstract / Postscript [6.5M] / PDF [165K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • Object-Based Storage. Mike Mesnier, Gregory R. Ganger, Erik Riedel. IEEE Communications Magazine, v.41 n.8 pp 84-90, August 2003.
    Abstract / PDF [85K]

  • Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003. To appear in VLDB 03, Berlin, Sept 9-12, 2003.
    Abstract / PDF [152K]

  • Exposing and Exploiting Internal Parallelism in MEMS-based Storage. Steven W. Schlosser, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-125, March 2003.
    Abstract / Postscript [1.67M] / PDF [136K]

  • Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • Examining Semantics In Multi-Protocol Network File Systems. Edward P. A. Hogan, Garth A. Gibson, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-103, January 2002.
    Abstract / Postscript [981K] / PDF [408K]

  • Blurring the Line Between Oses and Storage Devices. Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-166, December 2001.
    Abstract / Postscript [2.3M] / PDF [974K]

  • Freeblock Scheduling Outside of Disk Firmware. Christopher R. Lumb, Jiri Schindler, Gregory R. Ganger. Conference on File and Storage Technologies (FAST), January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-149.
    Abstract / Postscript [643K] / PDF [150K]

  • Towards Higher Disk Head Utilization: Extracting "Free" Bandwidth From Busy Disk Drives. Lumb, C., Schindler, J., Ganger, G.R., Nagle, D.F. and Riedel, E. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-130, May 2000.
    Abstract / Postscript [2.3M] / PDF [422K]

Non-volatile Memory

  • TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
    Abstract / PDF [975K]

  • Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Greg Ganger Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
    Abstract / PDF [800K]

  • Improving 3D NAND Flash Memory Lifetime by Tolerating Early Retention Loss and Process Variation. Y. Luo, S. Ghose, Y. Cai, E. F. Haratsch, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [3.2M]

  • The Parallel Persistent Memory Model. Guy E. Blelloch, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. SPAA ’18, July 16–18, 2018, Vienna, Austria.
    Abstract / PDF [760K]

  • FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. A. Tavakkol, M. Sadrosadati, S. Ghose, J. Kim, Y. Luo, Y. Wang, N. M. Ghiasi, L. Orosa, J. Gómez-Luna, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Los Angeles, CA, June 2018.
    Abstract / PDF [888K]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • Implicit Decomposition for Write-Efficient Connectivity Algorithms. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, and Julian Shun. 2018 International Parallel and Distributed Processing Symposium (IPDPS '18). May 21-25, 2018, Vancouver, BC, Canada.
    Abstract / PDF [716K]

  • MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. A. Tavakkol, J. Gómez-Luna, M. Sadrosadati, S. Ghose, and O. Mutlu. USENIX Conference on File and Storage Technologies (FAST), Oakland, CA, February 2018.
    Abstract / PDF [2.25M]

  • Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach & Onur Mutlu. Proc. of the International Symposium on Microarchitecture (MICRO), Cambridge, MA, October 2017.
    Abstact / PDF [1.32M]

  • Ambit: In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology. Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons & Todd C. Mowry. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [2.5M]

  • Detecting and Mitigating Data-Dependent DRAM Failures by Exploiting Current Memory Content. Samira Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee & Onur Mutlu. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [1.5M]

  • Utility-Based Hybrid Memory Management. Yang Li, Saugata Ghose, Jongmoo Choi, Jin Sun, Hui Wang & Onur Mutlu. In Proc. of the IEEE Cluster Conference (CLUSTER), Honolulu, HI, September 2017.
    Abstact / PDF [588K]

  • Error Characterization, Mitigation, and Recovery in Flash-Memory-Based Solid-State Drives. Yu Cai, Saugata Ghose, Erich F. Haratsch, Yixin Luo & Onur Mutlu. Proceedings of the IEEE Volume: 105, Issue: 9, Sept. 2017.
    Abstact / PDF [5.3M]

  • Viyojit: Decoupling Battery and DRAM Capacities for Battery-Backed DRAM. Rajat Kateja, Anirudh Badam, Sriram Govindan, Bikash Sharma, Greg Ganger. ISCA ’17, June 24-28, 2017, Toronto, ON, Canada.
    Abstract / PDF [1M]

  • Understanding Reduced-Voltage Operation in Modern DRAM Devices: Experimental Characterization, Analysis, and Mechanisms. Kevin K. Chang, A. Giray Yaglikçi, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O’Connor, Hasan Hassan & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [4M]

  • Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms. Donghyuk Lee, Samira Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [2.5M]

  • Improving the Reliability of Chip-off Forensic Analysis of NAND Flash Memory Devices. Aya Fukami, Saugata Ghose, Yixin Luo, Yu CaI, Onur Mutlu. DFRWS Digital Forensics Research Conference Europe (DFRWS EU), March 21 - 23, 2017 Lake Constance, Germany.
    Abstract / PDF [1.5M]

  • Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques. Yu Cai, Saugata Ghose, Yixin Luo, Ken Mai, Onur Mutlu, Erich F. Haratsch. 23rd IEEE Symposium on High Performance Computer Architecture, Industrial session, February 2017.
    Abstract / PDF [8.4M]

  • SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies. Hasan Hassan,Nandita Vijaykumar, Samira Khan, Saugata Ghose, Kevin Chang, Gennady Pekhimenko, Donghyuk Lee, Oguz Ergin, Onur Mutlu. International Symposium on High-Performance Computer Architecture (HPCA), February 2017.
    Abstract / PDF [1.6M]

  • Efficient Algorithms with Asymmetric Read and Write Costs. Guy E Blelloch, Jeremy T Fineman, Phillip B Gibbons, Yan Gu, Julian Shun. 24th European Symposium on Algorithms (ESA’16). August, 2016.
    Abstract / PDF [623K]

  • PARBOR: An Efficient System-Level Technique to Detect Data-Dependent Failures in DRAM. Samira Khan, Donghyuk Lee, Onur Mutlu. Proceedings of the 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Toulouse, France, June 28 - July 1 2016.
    Abstract / PDF [630K]

  • Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems. Kevin Hsieh, Eiman Ebrahimi, Gwangsun Kim, Niladrish Chatterjee, Mike O'Connor, Nandita Vijaykumar, Onur Mutlu§, Stephen W. Keckler. Proceedings of the 43rd International Symposium on Computer Architecture (ISCA), Seoul, South Korea, June 18 - 22, 2016.
    Abstract / PDF [1M]

  • Understanding Latency Variation in Modern DRAM Chips: Experimental Characterization, Analysis, and Optimization. Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Khan, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), Antibes Juan-Les-Pins, France, June 14 - 18, 2016.
    Abstract / PDF [3M]

  • A Case for Toggle-Aware Compression for GPU Systems. Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [713K]

  • ChargeCache: Reducing DRAM Latency by Exploiting Row Access Locality. Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [2M]

  • Low-Cost Inter-Linked Subarrays (LISA): Enabling Fast Inter-Subarray Data Movement in DRAM. Kevin K. Chang, Prashant J. Nair, Donghyuk Lee, Saugata Ghose, Moinuddin K. Qureshi, and Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [768K]

  • A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Saugata Ghose, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu. arXiv:1602.01348v1 [cs.AR]. 3 Feb 2016.
    Abstract / PDF [1.87M]

  • Simultaneous Multi-Layer Access: Improving 3D-Stacked Memory Bandwidth at Low Cost. Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Khan, Onur Mutlu. ACM Transactions on Architecture and Code Optimization (TACO), Vol. 12, January 2016. Presented at the 11th HiPEAC Conference, Prague, Czech Republic, January 2016.
    Abstract / PDF [2M]

  • Enabling Accurate and Practical Online Flash Channel Modeling for Modern MLC NAND Flash Memory. Yixin Luo, Saugata Ghose, Yu Cai, Erich F. Haratsch, Onur Mutlu JSAC Special Issue, 2016.
    Abstract / PDF [4.2M]

  • ThyNVM: Enabling Software-Transparent Crash Consistency in Persistent Memory Systems. Jinglei Ren, Jishen Zhao, Samira Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [460K]

  • The Application Slowdown Model: Quantifying and Controlling the Impact of Inter-Application Interference at Shared Caches and Main Memory. Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Khan, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [604K]

  • Gather-Scatter DRAM: In-DRAM Address Translation to Improve the Spatial Locality of Non-unit Strided Accesses. Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [874K]

  • High-Performance and Lightweight Transaction Support in Flash-Based SSDs. Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, Onur Mutlu. IEEE Transactions on Computers (TC), October 2015.
    Abstract / PDF [1.4M]

  • WARM: Improving NAND Flash Memory Lifetime with Write-hotness Aware Retention Management. Yixin Luo, Yu Cai, Saugata Ghose, Jongmoo Choi, Onur Mutlu.MSST 2015: 31st International Conference on Massive Storage Systems and Technologies, Jun 1, 2015 - Jun 5, 2015, Santa Clara, CA.
    Abstract / PDF [1.5M]

  • Page Overlays: An Enhanced Virtual Memory Framework to Enable Fine-grained Memory Management. Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul Chilimbi. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [2.1M]

  • Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. Joy Arulraj, Andrew Pavlo, Subramanya R. Dulloor. Proceedings ACM SIGMOD, Melbourne, Victoria, Australia, May 31-June 4, 2015.
    Abstract / PDF [1M]

  • Data Retention in MLC NAND Flash Memory: Characterization, Optimization and Recovery. Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu. HPCA-21, February 7-11, 2015 — Best Paper Runner Up.
    Abstract / PDF [1.6M]

  • Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case. Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Khan, Vivek Seshadri, Kevin Chang, Onur Mutlu. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [1.67M]

  • Research Problems and Opportunities in Memory Systems. Onur Mutlu, Lavanya Subramanian. Invited Article in Supercomputing Frontiers and Innovations (SUPERFRI), 2015.
    Abstract / PDF [1.72M]

  • The Main Memory System: Challenges and Opportunities. Onur Mutlu, Justin Meza, Lavanya Subramanian. Invited Article in Communications of the Korean Institute of Information Scientists and Engineers (KIISE), 2015.
    Abstract / PDF [813K]

  • Main Memory Scaling: Challenges and Solution Directions. Onur Mutlu. Invited Book Chapter in More than Moore Technologies for Next Generation Computer Design, pp. 127-153, Springer, 2015.
    Abstract / PDF [1.02M]

  • Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories. Hanbin Yoon, Justin Meza, Naveen Mural Imanohar, Norman P. Jouppi, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 11, No. 4, Article 40, December 2014.
    Abstract / PDF [1.06M]

  • FIRM: Fair and High-Performance Memory Control for Persistent Memory Systems. Jishen Zhao, Onur Mutlu, Yuan Xie. Proceedings of the 47th International Symposium on Microarchitecture (MICRO), Cambridge, UK, December 2014.
    Abstract / PDF [626K]

  • Loose-Ordering Consistency for Persistent Memory. Youyou Lu, Jiwu Shu, Long Sun, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [389K]

  • The Blacklisting Memory Scheduler: Achieving High Performance and Fairness at Low Cost. Lavanya Subramanian, Donghyuk Lee, Vivek Seshadri, Harsha Rastogi, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [240K]

  • Characterizing Application Memory Error Vulnerability to Optimize Datacenter Cost via Heterogeneous- Reliability Memory. Yixin Luo, Sriram Govindan, Bikash Sharma, Mark Santaniello, Justin Meza, Aman Kansal, Jie Liu, Badriddine Khessib, Kushagra Vaid, Onur Mutlu Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Atlanta, GA, June 2014.
    Abstract / PDF [1.58]

  • The Efficacy of Error Mitigation Techniques for DRAM Retention Failures: A Comparative Experimental Study. Samira Khan, Donghyuk Lee, Yoongu Kim, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’14), June 2014.
    Abstract / PDF [8M]

  • Bounding Memory Interference Delay in COTS-based Multi-Core Systems. Hyoseung Kim, Dionisio de Niz, Björn Andersson, Mark Klein, Onur Mutlu, Ragunathan (Raj) Rajkumar. Proceedings of the 20th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), Berlin, Germany, April 2014.
    Abstract / PDF [2.5M]

  • Memory Systems. Yoongu Kim, Onur Mutlu. Invited Book Chapter in Computing Handbook, Third Edition: Computer Science and Software Engineering, CRC Press, April 2014.
    Abstract / PDF [453K]

  • Improving DRAM Performance by Parallelizing Refreshes with Accesses. Kevin Chang, Donghyuk Lee, Zeshan Chishti, Chris Wilkerson, Alaa Alameldeen, Yoongu Kim, Onur Mutlu. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA'14), February 2014.
    Abstract / PDF [2.86M]

  • Consistent, Durable, and Safe Memory Management for Byte-addressable Non Volatile Main Memory. Iulian Moraru, David G. Andersen, Michael Kaminsky, Niraj Tolia, Nathan Binkert, Parthasarathy Ranganathan. TRIOS: Conference on Timely Results in Operating Systems. Held in conjunction with SOSP '13. Farmington, PA, November 3, 2013.
    Abstract / PDF [967K]

  • RowClone: Fast and Energy-Efficient In-DRAM Bulk Data Copy and Initialization. Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, and Todd C. Mowry, 46th IEEE/ACM International Symposium on Microarchitecture (MICRO-46), December 2013.
    Abstract / PDF [2.42M]

  • LightTx: A Lightweight Transactional Design in Flash-based SSDs to Support Flexible Transactions. Youyou Lu, Jiwu Shuy, Jia Guo, Shuai Li, Onur Mutlu. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [262K]

  • Program Interference in MLC NAND Flash Memory: Characterization, Modeling, and Mitigation. Yu Cai, Onur Mutlu, Erich F. Haratsch, Ken Mai. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [1.18M]

  • Threshold Voltage Distribution in MLC NAND Flash Memory: Characterization, Analysis, and Modeling. Yu Cai, Erich F. Haratsch, Onur Mutlu and Ken Mai. Design Automation and Test in Europe (DATE 2013), Mar 19-22, 2013, Grenoble, France.
    Abstract / PDF [1.44M]

  • Memory Scaling: A Systems Architecture Perspective. Onur Mutlu. MemCon 2013 (MEMCON), Santa Clara, CA, August 2013.
    Abstract / PDF [114K]

  • A Case for Efficient Hardware/Software Cooperative Management of Storage and Memory. Justin Meza, Yixin Luo, Samira Khan, Jishen Zhao, Yuan Xie, Onur Mutlu. Fifth Workshop on Energy-Efficient Design (WEED 2013). Held in conjunction with the 2013 International Symposium on Computer Architecture (ISCA-40). June 24, 2013, Tel-Aviv, Israel.
    Abstract / PDF [667K]

  • Evaluating STT-RAM as an Energy-Efficient Main Memory Alternative. Emre Kultursay, Mahmut Kandemir, Anand Sivasubramaniam, and Onur Mutlu. 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2013), April 21-23, 2013, Austin, TX.
    Abstract / PDF [1.83M]

  • Error Analysis and Retention-Aware Error Management for NAND Flash Memory. Yu Cai, Gulay Yalcin, Onur Mutlu, Erich F. Haratsch, Adrian Cristal, Osman Unsal, Ken Mai. Intel Technology Journal (ITJ) Special. Issue on Memory Resiliency, 2013.
    Abstract / PDF [270K]

  • Asymmetry-aware Execution Placement on Manycore Chips. Alexey Tumanov, Joshua Wise, Onur Mutlu, Gregory R. Ganger. In Proc. of the 3rd Workshop on Systems for Future Multicore Architectures (SFMA'13), EuroSys'13, Apr. 14-17, 2013, Prague, Czech Republic.
    Abstract / PDF [703K]

  • Application-to-Core Mapping Policies to Reduce Memory System Interference in Multi-Core Systems. Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [623K]

  • MISE: Providing Performance Predictability and Improving Fairness in Shared Main Memory Systems. Lavanya Subramanian, Vivek Seshadri, Yoongu Kim, Ben Jaiyen, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [607K]

  • Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture. Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen China, February 2013.
    Abstract / PDF [3.17M]

  • Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
    Abstract / PDF [648K]

  • Row Buffer Locality Aware Caching Policies for Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael A. Harding, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Montreal, Quebec, Canada, September 2012. Best paper award in Computer Systems and Applications track.
    Abstract / PDF [577K]

  • A Case for Small Row Buffers in Non-Volatile Main Memories. Justin Meza, Jing Li, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Poster Session, Montreal, Quebec, Canada, September 2012.
    Abstract / PDF [172K]

  • Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
    Abstract / PDF [184K]

  • Persistent, Protected and Cached: Building Blocks for Main Memory Data Stores. Iulian Moraru, David G. Andersen, Michael Kaminsky, Nathan Binkert, Niraj Tolia, Reinhard Munz,Parthasarathy Ranganathan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-114v2, Nov. 2012. Supersedes CMU-PDL-11-114. Dec. 2011.
    Abstract / PDF [1.0M]

  • Row Buffer Locality-Aware Data Placement in Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael Harding, Onur Mutlu. SAFARI Technical Report, TR-SAFARI-2011-005, Carnegie Mellon University, September 2011.
    Abstract / PDF [272K]

  • A Case for Exploiting Subarray-level Parallelism (SALP) in DRAM. Yoongu Kim, Vivek Seshadri, Donghyuk Lee, Jamie Liu, Onur Mutlu. Proceedings of the 39th International Symposium on Computer Architecture, June 2012.
    Abstract / PDF [927K]

  • RAIDR: Retention-Aware Intelligent DRAM Refresh. Jamie Liu, Ben Jaiyen, Richard Veras, Onur Mutlu. In Proceedings of the 39th International Symposium on Computer Architecture, Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [480K]

  • Staged Memory Scheduling: Achieving High Performance and Scalability in Heterogeneous Systems. Rachata Ausavarungnirun, Kevin Kai-Wei Chang, Lavanya Subramanian, Gabriel H. Loh, Onur Mutlu. The 39th International Symposium on Computer Architecture (ISCA), Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [700K]

  • Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
    Abstract / PDF [463K]

  • Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior. Yoongu Kim, Michael Papamichael, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 43rd International Symposium on Microarchitecture (MICRO), Atlanta, GA, December 2010.
    Abstract / PDF [478K]

  • Phase Change Memory Architecture and the Quest for Scalability. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Communications of the ACM (CACM), Research Highlight, Vol. 53, No. 7, pages 99-106, July 2010.
    Abstract / PDF [1.34M]

  • Phase Change Technology and the Future of Main Memory. Benjamin C. Lee, Ping Zhou, Jun Yang, Youtao Zhang, Bo Zhao, Engin Ipek, Onur Mutlu, Doug Burger. IEEE Micro, Special Issue: Micro's Top Picks from 2009 Computer Architecture Conferences (MICRO TOP PICKS), Vol. 30, No. 1, pages 60-70, January/February 2010.
    Abstract / PDF [600K]

  • ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers. Yoongu Kim, Dongsu Han, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 16th International Symposium on High-Performance Computer Architecture (HPCA), Bangalore, India, January 2010.
    Abstract / PDF [333K]

  • Architecting Phase Change Memory as a Scalable DRAM Alternative. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Proceedings of the 36th International Symposium on Computer Architecture (ISCA), pages 2-13, Austin, TX, June 2009.
    Abstract / PDF [2.6M]

Paxos

  • Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-105. May 2014.
    Abstract / PDF [444K]

  • There Is More Consensus in Egalitarian Parliaments. Iulian Moraru, David G. Andersen, Michael Kaminsky. Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP'13), November 3-6, 2013, Nemacolin Woodlands Resort, Farmington, PA.
    Abstract / PDF [713K]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-111. August 2013.
    Abstract / PDF [2.3M]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-109. September 2012. Superseded by CMU-PDL-13-111, August 2013.
    Abstract / PDF [2.3M]

  • Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-108. July 2012.
    Abstract / PDF [363K]

Problem Analysis

  • So, You Want To Trace Your Distributed System? Key Design Insights from Years of Practical Experience. Raja R. Sambasivan, Rodrigo Fonseca, Ilari Shafer, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-102, April 2014.
    Abstract / PDF [870K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. IEEE Transactions on Visualization and Computer Graphics (Proceedings Information Visualization 2013), vol. 19, no. 12, Dec. 2013.
    Abstract / PDF [1.9M] / TRAILER VIDEO [5.6M] / VIDEO [17.9M]

  • Making Problem Diagnosis Work for Large-Scale, Production Storage Systems. Michael P. Kasick, Priya Narasimhan, Kevin Harms. Proceedings of the 27th Large Installation System Administration Conference (LISA '13), Washington, DC, November 2013.
    Abstract / PDF [2.23M]

  • Automated Diagnosis of Chronic Performance Problems in Production Systems. Soila P. Kavulya. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-109, May 2013.
    Abstract / PDF [12.6M]

  • Diagnosing Performance Changes in Distributed Systems by Comparing Request Flows. Raja R. Sambasivan. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-105, May 2013.
    Abstract / PDF [3.9M]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. USENIX ;login, 38(2), April 2013.
    Abstract / PDF [961K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-104 (supersedes CMU-PDL-12-102), April 2013.
    Abstract / PDF [1.93M]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 26th Usenix Large Installation System Administration Conference (LISA'12), Dec. 9-14, San Diego, CA. Best Student Paper.
    Abstract / PDF [913K]

  • Failure Diagnosis of Complex Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Felicita Di Giandomenico (ISTI-CNR, Pisa, Italy), Priya Narasimhan. Chapter in "Resilience Assessment and Evaluation". Editors. Katinka Wolter, Alberto Avritzer, Marco Vieira, Aad van Moorsel. Springer Verlag, December 2012.
    Abstract / PDF [288K]

  • Light-weight Black-box Failure Detection for Distributed Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-107. July 2012
    Abstract / PDF [300K]

  • Automated Diagnosis without Predictability is a Recipe for Failure. Raja R. Sambasivan & Gregory R. Ganger. Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '12), June 12-13, 2012, Boston, MA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-101.
    Abstract / PDF [368K]

  • Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. Soila P. Kavulya, Scott Daniels (AT&T), Kautubh Joshi (AT&T), Matti Hiltunen (AT&T), Rajeev Gandhi, Priya Narasimhan.IEEE/IFIP Conference on Dependable Systems and Networks (DSN), June 2012.
    Abstract / PDF [859K]

  • End-to-end Tracing in HDFS. William Wang Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
    Abstract / PDF [489K]

  • Diagnosis in Automotive Systems: A Survey. Patrick E. Lanigan, Soila Kavulya, Priya Narasimhan, Thomas E. Fuhrman, Mutasim A. Salman. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-110. June 2011.
    Abstract / PDF [369K]

  • Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
    Abstract / PDF [336K]

  • Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-109, April 2011.
    Abstract / PDF [787K]

  • Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI'11). March 30 - April 1, 2011. Boston, MA.
    Abstract / PDF [388K]

  • Behavior-Based Problem Localization for Parallel File Systems. Michael P. Kasick, Rajeev Gandhi, Priya Narasimhan. HotDep '10. October 3, 2010, Vancouver, BC, Canada.
    Abstract / PDF [149K]

  • To Upgrade or Not to Upgrade: Impact of Online Upgrades across Multiple Administrative Domains. T. Dumitras, E. Tilevich, P.Narasimhan. ACM Onward! Conference, Oct. 2010.
    Abstract / PDF [425K]

  • Why Do Upgrades Fail And What Can We Do About It? Toward Dependable, Online Upgrades in Enterprise Systems. T. Dumitras, P. Narasimhan. ACM/IFIP/USENIX Middleware Conference, Nov-Dec. 2009.
    Abstract / PDF [835K]

  • Toward Upgrades-as-a-Service in Distributed Systems. T. Dumitras, P. Narasimhan. Poster Session at Middleware 2009. 10th International Middleware Conference Urbana Champaign, Illinois, USA.
    Abstract / PDF [602K]

Storage for High-End Computing

  • Mochi: Composing Data Services for High-Performance Computing Environments. Robert B. Ross, George Amvrosiadis, Philip Carns, Charles D. Cranor, Matthieu Dorier, Kevin Harms, Greg Ganger, Garth Gibson, Samuel K. Gutierrez, Robert Latham, Bob Robey, Dana Robinson, Bradley Settlemyer, Galen Shipman, Shane Snyder, Jerome Soumagne, Qing Zheng. Journal of Computer Science and Technology 35(1): 121–144 Jan. 2020.
    Abstract / PDF [1.3M]

  • Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. Ziqi Wang, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri. 28th Parallel Architecture and Compiler Technologies 2019 (PACT'19), Sept 21-25, 2019, Seattle, WA.
    Abstract / PDF [475K]

  • Compact Filters for Fast Online Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. IEEE CLUSTER 2019. September 23 - 26, 2019, Albuquerque, New Mexico, USA.
    Abstract / PDF [1M]

  • Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
    Abstract / PDF[574K]

  • Cluster Storage Systems Gotta Have HeART: Improving Storage Efficiency by Exploiting Disk-reliability Heterogeneity. Saurabh Kadekodi, K. V. Rashmi, Gregory R. Ganger. 17th USENIX Conference on File and Storage Technologies (FAST '19) Feb. 25–28, 2019 Boston, MA.
    Abstract / PDF [1.1M]

  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth A. Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [1.6M]

  • DeltaFS: Exascale File Systems Scale Better Without Dedicated Servers. Qing Zheng, Kai Ren, Garth Gibson, Bradley W. Settlemyer, Gary Grider. PDSW2015: 10th Parallel Data Storage Workshop, held in conjunction with SC15, Austin, TX, November 16, 2015.
    Abstract / PDF [930K]

  • High-Performance and Lightweight Transaction Support in Flash-Based SSDs. Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, Onur Mutlu. IEEE Transactions on Computers (TC), October 2015.
    Abstract / PDF [1.4M]

  • Caveat-Scriptor: Write Anywhere Shingled Disks. Saurabh Kadekodi, Swapnil Pimpale, Garth Gibson. Proc. Of the Seventh USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’15), Santa Clara, CA, July 2015. Expanded paper available: Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-101.
    Abstract / PDF [3.4M]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-104, April 2015.
    Abstract / PDF [696K]

  • Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth Gibson. ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis (SC'14), November 16-21, 2014, New Orleans, LA. BEST PAPER AWARD!
    Abstract / PDF [939K] / Slides [1M]

  • BatchFS: Scaling the File System Control Plane with Client-Funded Metadata Servers. Qing Zheng, Kai Ren, Garth Gibson. Proceedings of the 9th international Petascale Data Storage Workshop (PDSW '14) held in conjunction with Supercomputing '14. November 16, 2014, New Orleans, LA.
    Abstract / PDF [651K]

  • Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage. Ellis H. Wilson III, Mahmut T. Kandemir, Garth Gibson. The 34th International Conference on Distributed Computing Systems, ICDCS 2014, June 30 - July 3, 2014, Madrid, Spain.
    Abstract / PDF [332K]

  • Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-103. May 2014.
    Abstract / PDF [763K]

  • SpringFS: Bridging Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. 12th USENIX Conference on File and Storage Technologies (FAST '14), Santa Clara, CA, February 17–20, 2014.
    Abstract / PDF [319K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. 2013 ACM Symposium on Cloud Computing (SoCC'13), Oct. 01-03 2013, Santa Clara, CA, USA.
    Abstract / PDF [944K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. 2013 USENIX Annual Technical Conference, June 26-28, 2013, San Jose, CA.
    Abstract / PDF [867K]

  • PRObE: A Thousand-Node Experimental Cluster for Computer Systems Research. Garth Gibson, Gary Grider, Andree Jacobson, Wyatt Lloyd. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.5M]

  • Shingled Magnetic Recording: Areal Density Increase Requires New Data Management. Tim Feldman, Garth Gibson. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.17M]

  • I/O Acceleration with Pattern Detection. Jun He, John Bent, Aaron Torres, Gary Grider, Garth Gibson, Carlos Maltzahn, Xian-He Sun. The 22nd Int. ACM Symposium on High Performance Parallel and Distributed Computing (HPDC'13), New York City, June 17-21, 2013.
    Abstract / PDF [458K]

  • Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-102, January 2013. Revised version of CMU-PDL-12-110.
    Abstract / PDF [798K

  • Giga+TableFS on PanFS: Scaling Metadata Performance on Cluster File Systems. Kartik Kulkarni, Kai Ren, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-101, January 2013.
    Abstract / PDF [679K]

  • Memory-Efficient Group-By-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. Georgia Tech Center for Experimental Research in Computer Systems Technical Report GIT-CERCS-12-08.
    Abstract / PDF [450K]

  • Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • HPC Computation on Hadoop Storage with PLFS. Chuck Cranor, Milo Polte, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-115. Nov. 2012.
    Abstract / PDF [170K]

  • A Case for Scaling HPC Metadata Performance through De-specialization. Swapnil Patil, Kai Ren, Garth Gibson. 7th Petascale Data Storage Workshop held in conjunction with Supercomputing '12, November 12, 2012. Salt Lake City, UT. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-111, November 2012.
    Abstract / PDF [512K]

  • RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yashihisa Abe, Greg Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
    Abstract / PDF [1.5M]

  • Shingled Magnetic Recording for Big Data Applications. Anand Suresh, Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-105. May 2012.
    Abstract / PDF [561K]

  • SkyeFS: Distributed Directories using Giga+ and PVFS. Anthony Chivetta, Swapnil Patil & Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-104, May 2012.
    Abstract / PDF [398K]

  • A Statistical Study for File System Meta Data On High Performance Computing Sites. Yifan Wang. M.S. Thesis, Information Networking Institute, Carnegie Mellon University. May 2012.
    Abstract / PDF [5.3M]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park , Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • LazyBase: Trading Freshness for Performance in a Scalable Database. James Cipar, Greg Ganger, Kimberly Keeton, Charles B. Morrey III, Craig A. N. Soules, Alistair Veitch. EuroSys 2012 April 10-13, 2012, Bern, Switzerland.
    Abstract / PDF [236K]

  • DiskReduce: Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth Gibson. Carnegie Mellon Univsersity Parallel Data Laboratory Technical Report CMU-PDL-11-112, October, 2011.
    Abstract / PDF [897K]

  • Small Cache, Big Effect: Provable Load Balancing for Randomly Partitioned Cluster Services. Bin Fan, Hyeontaek Lim, David Andersen and Michael Kaminsky. ACM Symposium on Cloud Computing (SOCC'11), Cascais, Portugal, October, 2011.
    Abstract / PDF [336K]

  • Applying Idealized Lower-bound Runtime Models to Understand Inefficiencies in Data-intensive Computing (Extended Abstract). Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger: SIGMETRICS 2011: 125-126, San Jose, CA, June 7-11, 2011.
    Abstract / PDF [297K]

  • Six Degrees of Scientific Data: Reading Patterns for Extreme Scale Science IO. Lofstead, Jay, Milo Polte, Garth Gibson, Scott A. Klasky, Karsten Schwan, Ron Oldfield, Matthew Wolf, Qing Liu. 20th ACM Int. Symp. On High-Performance Parallel and Distributed Computing (HPDC'11), June 2011.
    Abstract / PDF [595K]

  • On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Swapnil Patil, Milo Polte, Kai Ren, Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, Garth Gibson, Adam Fuchs, Billie Rinaldi. Proc. of the 2nd ACM Symposium on Cloud Computing (SOCC '11), October 27–28, 2011, Cascais, Portugal. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-111, August 2011.
    Abstract / PDF [1.2M]

  • Principles of Operation for Shingled Disk Devices. Garth Gibson, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth Gibson. MapReduce'11, June 8, 2011, San Jose, California, USA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
    Abstract / PDF [2.5M]

  • Disks Are Like Snowflakes: No Two Are Alike. Elie Krevat, Joseph Tucek, Gregory R. Ganger. 13th Workshop on Hot Topics in Operating Systems (HotOS 2011), Napa Valley, CA. May 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-102, February 2011.
    Abstract / PDF [1.8M]

  • Applying Simple Performance Models to Understand Inefficiencies in Data-Intensive Computing. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-103. February 2011.
    Abstract / PDF [476K]

  • Scale and Concurrency of GIGA+: File System Directories with Millions of Files. Swapnil Patil, Garth Gibson. Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose CA, February 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-110, Sept. 2010.
    Abstract / PDF [508K]

  • pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
    Abstract / PDF [321K]

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
    Abstract / PDF [944K]

  • Applying Performance Models to Understand Data-intensive Computing Efficiency. Elie Krevat, Tomer Shiran, Eric Anderson†, Joseph Tucek†, Jay J. Wylie†, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-108. May 2010.
    Abstract / PDF [304K]

  • ...And eat it too: High read performance in write-optimized HPC I/O middleware file formats. Milo Polte, Jay Lofstead, John Bent, Garth Gibson, Scott A. Klasky, Qing Liu, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Meghan Wingate, Matthew Wolf. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-111, November 2009.
    Abstract / PDF [388K]

  • PLFS: A Checkpoint Filesystem for Parallel Applications. John Bent, Garth Gibson, Gary Grider, Ben McClelland, Paul Nowoczynski, James Nunez, Milo Polte, Meghan Wingate. Supercomputing '09, November 15, 2009. Portland, Oregon.
    Abstract / PDF [388K]

  • DiskReduce: RAID for Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth Gibson. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-112, November 2009.
    Abstract / PDF [304K]

  • Understanding and Maturing the Data-Intensive Scalable Computing Storage Substrate. Garth Gibson, Bin Fan, Swapnil Patil, Milo Polte, Wittawat Tantisiriroj, Lin Xiao. Microsoft Research eScience Workshop 2009, Pittsburgh, PA, October 16-17, 2009.
    Abstract / PDF [520K]

  • In Search of an API for Scalable File Systems: Under the table or above it? Swapnil Patil, Garth A. Gibson, Gregory R. Ganger, Julio Lopez, Milo Polte, Wittawat Tantisiroj, and Lin Xiao. USENIX HotCloud Workshop 2009. June 2009, San Diego CA.
    Abstract / PDF [260K]

  • System-Call Based Problem Diagnosis for PVFS. Michael P. Kasick, Keith A. Bare, Eugene E. Marinelli III, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep '09). Lisbon, Portugal. June 2009.
    Abstract / PDF [117K]

  • Directions for Shingled-Write and Two-Dimensional Magnetic Recording System Architectures: Synergies with Solid-State Disks. Garth Gibson, Milo Polte. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-104. May 2009.
    Abstract / PDF [70K]

  • Enabling Enterprise Solid State Disks Performance. Milo Polte, Jiri Simsa, Garth Gibson. 1st Workshop on Integrating Solid-state Memory into the Storage Hierarchy, March 7, 2009, Washington DC.
    Abstract / PDF [302K]

  • Fast Log-based Concurrent Writing of Checkpoints. Milo Polte, Jiri Simsa, Wittawat Tantisiriroj, Garth Gibson, Shobhit Dayal, Mikhail Chainani, Dilip Kumar Uppugandla. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [262K]

  • Comparing Performance of Solid State Devices and Mechanical Disks. Milo Polte, Jiri Simsa, Garth Gibson. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [99K]

  • Data-intensive file systems for Internet services: A rose by any other name ... Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-114. October 2008
    Abstract / PDF [350K]

  • GIGA+ : Scalable Directories for Shared File Systems. Swapnil Patil, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-110. October 2008.
    Abstract / PDF [400K]

  • Characterizing HEC Storage Systems at Rest. Shobhit Dayal. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-109, July 2008.
    Abstract / PDF [603K]
  • User Level Implementation of Scalable Directories (GIGA+). Sanket Hase, Aditya Jayaraman, Vinay K. Perneti, Sundararaman Sridharan, Swapnil V. Patil, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-107, May 2008.
    Abstract / PDF [1.67M]

  • Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

  • On Application-level Approaches to Avoiding TCP Throughput Collapse in Cluster-Based Storage Systems. E. Krevat, V. Vasudevan, A. Phanishayee, D. Andersen, G. Ganger, G. Gibson, S. Seshan. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [124K]

  • GIGA+: Scalable Directories for Shared File Systems. Swapnil V. Patil, Garth A. Gibson, Sam Lang, Milo Polte. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [114K]

Associated Publications

  • Processing-in-Memory: A Workload-Driven Perspective. S. Ghose, A. Boroumand, J. S. Kim, J. Gómez-Luna, O. Mutlu. To appear in IBM Journal of Research and Development (JRD), November 2019.
    Abstract / PDF [2.1M]

  • Enabling Practical Processing in and Near Memory for Data-Intensive Computing. O. Mutlu, S. Ghose, J. Gómez-Luna, R. Ausavarungnirun. Proc. of the Design Automation Conference (DAC), Las Vegas, NV, June 2019.
    Abstract / PDF [477K]

  • CROW: A Low-Cost Substrate for Improving DRAM Performance, Energy Efficiency, and Reliability. H. Hassan, M. Patel, J. S. Kim, A. G. Yaglikçi, N. Vijaykumar, N. Mansouri Ghiasi, S. Ghose, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.45M]

  • CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators. A. Boroumand, S. Ghose, M. Patel, H. Hassan, B. Lucia, R. Ausavarungnirun, K. Hsieh, N. Hajinazar, K. T. Malladi, H. Zheng, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.1M]

  • Understanding the Interactions ofWorkloads and DRAM Types: A Comprehensive Experimental Study. S. Ghose, T. Li, N. Hajinazar, D. Senol Cali, O. Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019; To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2019.
    Abstract / PDF [2M]

  • Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems. Graham Gobieski, Brandon Lucia, Nathan Beckmann Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’19), April 13th – April 17th, Providence, RI.
    Abstract / PDF [3.35M]

  • What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study. S. Ghose, A. G. Yaglikçi, R. Gupta, D. Lee, K. Kudrolli, W. X. Liu, H. Hassan, K. K. Chang, N. Chatterjee, A. Agrawal, M. O'Connor, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [2.6M]

  • SRPT for Multiserver Systems. Isaac Grosof, Ziv Scully, Mor Harchol-Balter. Performance Evaluation , vol. 127-128, Nov. 2018, pp. 154-175. Also in Proc. 36th International Symposium on Computer Performance, Modeling, Measurements, and Evaluation (Performance 2018) , Toulouse, France, December 2018. Best Student Paper Award.
    Abstract / PDF [780K]

  • SOAP Bubbles: Robust Scheduling Under Adversarial Noise. Ziv Scully, Mor Harchol-Balter. 56th Annual Allerton Conference on Communication, Control, and Computing, 2-5 Oct. 2018. Monticello, IL.
    Abstract / PDF [245K]

  • Exploiting Locality in Graph Analytics through Hardware-Accelerated Traversal Scheduling. Anurag Mukkara, Nathan Beckmann, Maleen Abeydeera, Xiaosong Ma, Daniel Sanchez. 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 20-24 Oct. 2018, Fukuoka, Japan.
    Abstract / PDF [660K]

  • Practical Bounds on Optimal Caching with Variable Object Sizes. Daniel S. Berger, Nathan Beckmann, Mor Harchol-Balter. Proceedings of the ACM on Measurement and Analysis of Computing Systems. Vol. 2, No. 2, Article 32, June 2018.
    Abstract / PDF [1.2M]

  • Practical Bounds on Offline Caching with Variable Object Sizes. Daniel Berger, Nathan Beckmann, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 2, Article 32. June 2018. POMACS 2018.
    Abstract / PDF [1.2M]

  • LHD: Improving Cache Hit Rate by Maximizing Hit Density. Nathan Beckmann, Haoxian Chen, Asaf Cidon. 15th USENIX Symposium on Networked Systems Design and Implementation ({NSDI} 18), April 9-11, 2018, Renton, WA..
    Abstract / PDF [1.1M]

  • GoogleWorkloads for Consumer Devices: Mitigating Data Movement Bottlenecks. Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu. ASPLOS’18, March 24–28, 2018, Williamsburg, VA, USA.
    Abstract / PDF [885K]

  • SOAP: One Clean Analysis of All Age-Based Scheduling Policies. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 1, Article 16, March 2018.
    Abstract / PDF [885K]

  • Efficient Multi-Tenant Inference on Video using Microclassifiers. Giulio Zhou, Thomas Kim, Christopher Canel, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [1.5M]

  • Towards Optimality in Parallel Job Scheduling. Benjamin Berg, Jan-Pieter Dorsman, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 1, No. 2, Article 40. Publication date: December 2017.
    Abstract / PDF [4.3M]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size. Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf & Benny Van Houdt. Transactions on Networking, September 2017.
    Abstact / PDF [544K]

  • Workload Analysis and Caching Strategies for Search Advertising Systems. Conglong Li, David G. Andersen, Qiang Fu, Sameh Elnikety, Yuxiong He. SoCC ’17, September 24–27, 2017, Santa Clara, CA, USA.
    Abstract / PDF [650K]

  • Scheduling for Efficiency and Fairness in Systems with Redundancy. Kristen Gardner, Mor Harchol-Balter, Esa Hyyti & Rhonda Righter. Performance Evaluation, July 2017.
    Abstact / PDF [784K]

  • Carpool: A Bufferless On-Chip Network Supporting Adaptive Multicast and Hotspot Alleviation. Xiyue Xiang, Wentao Shi, Saugata Ghose, Lu Peng, Onur Mutlu & Nian-Feng Tzeng. In Proc. of the International Conference on Supercomputing (ICS), Chicago, IL, June 2017.
    Abstact / PDF [6.7M]

  • Cachier: Edge-caching for Recognition Applications. Utsav Drolia, Katherine Guo (Bell Labs), Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. The 37th IEEE International Conference on Distributed Computing Systems (ICDCS 2017), June 5 – 8, 2017, Atlanta, GA, USA.
    Abstract / PDF [5.4M]

  • Efficient Redundancy Techniques for Latency Reduction in Cloud Systems. Gauri Joshi, Emina Soljanin & Gregory Wornell. ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS) Volume 2 Issue 2, May 2017.
    Abstact / PDF [1.38M]

  • AdaptSize: Orchestrating the Hot Object Memory Cache in a Content Delivery Network. Daniel S. Berger, Ramesh K. Sitaraman, Mor Harchol-Balter. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17). March 27–29, 2017, Boston, MA.
    Abstract / PDF [560K]

  • Towards Edge-caching for Image Recognition. Utsav Drolia, Katherine Guo, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. First Workshop on Smart Edge Computing and Networking (SmartEdge) '17, held in conjunction with PerCom 2017, March 13 - 17, 2017, Hawaii, USA.
    Abstract / PDF [5.1M]

  • Prescriptive Safety-Checks through Automated Proofs for Control-Flow Integrity. Jiaqi Tan. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, November 2016.
    Abstract / PDF [5.75M]

  • A Survey of Security Vulnerabilities in Bluetooth Low Energy Beacons. Hui Jun Tay, Jiaqi Tan, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-109. November 2016.
    Abstract / PDF [110K]

  • AUSPICE-R: Automatic Safety-Property Proofs for Realistic Features in Machine Code. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, Priya Narasimhan.14th Asian Symposium on Programming Languages and Systems (APLAS), November 2016.
    Abstract / PDF [325K]

  • EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding. K. V. Rashmi, Mosharaf Chowdhury, Jack Kosaian, Ion Stoica & Kannan Ramchandran. 12th USENIX Symposium on Operating Systems Design and Implementation, Nov. 2–4, 2016, Savannah, GA.
    Abstract / PDF [830K]

  • Stateless Model Checking with Data-Race Preemption Points. Ben Blum, Garth Gibson. SPLASH 2016 OOPSLA, Oct 30 - Nov 4, 2016, Amsterdam, Netherlands.
    Abstract / PDF [704K]

  • Zorua: A Holistic Approach to Resource Virtualization in GPUs. Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha,Saugata Ghose, Adwait Jogu, Phillip B. Gibbons, Onur Mutlu. 49th IEEE/ACM International Symposium on Microarchitecture (MICRO’16), October 15-19, 2016, Taipei, Taiwan.
    Abstract / PDF [1.5M]

  • A Model for Application Slowdown Estimation in On-Chip Networks and Its Use for Improving System Fairness and Performance. Xiyue Xiang, Saugata Ghose, Onur Mutlu, Nian-Feng Tzeng. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [399K]

  • Accelerating Pointer Chasing in 3D-Stacked Memory: Challenges, Mechanisms, Evaluation.Kevin Hsieh, Samira Khan, Nandita Vijaykumar, Kevin K. Chang, Amirali Boroumand, Saugata Ghose, Onur Mutlu. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [1.67M]

  • PCFIRE: Towards Provable Preventative Control-Flow Integrity Enforcement for Realistic Embedded Software. Jiaqi Tan, Hui Jun Tay, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. EMSOFT’16, October 01-07, 2016, Pittsburgh, PA, USA.
    Abstract / PDF [722K]

  • Poster Abstract: BUFS: Towards Bottom-Up Foundational Security for Software in the Internet-of-Things. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 1st IEEE/ACM Symposium on Edge Computing (SEC 2016), October 2016.
    Abstract / PDF [682K]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf. IEEE Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS 2016), London, UK, September 2016.
    Abstract / PDF [244K]

  • Soundness Proofs for Iterative Deepening. Ben Blum. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-103, September 6, 2016.
    Abstract / PDF [356K]

  • Parallel Algorithms for Asymmetric Read-Write Costs. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. 28th ACM Symposium on Parallelism in Algorithms and Architectures Jul 11, 2016 - Jul 13, 2016. Asilomar State Beach, California, USA.
    Abstract / PDF [386K]

  • A Case for Hierarchical Rings with Deflection Routing: An energy-efficient on-chip communication substrate. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu, Parallel Computing, Volume 54, May 2016, Pages 29-45, ISSN 0167-8191.
    Abstract / PDF [2M]

  • Achieving both High Energy Efficiency and High Performance in On-Chip Communication using Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu. arXiv:1602.06005v1 [cs.DC], 18 Feb 2016.
    Abstract / PDF [576K]

  • Scheduling Techniques for Hybrid Circuit/Packet Networks. He Liu, Matthew K. Mukerjee, Conglong Li, Nicolas Feltman, George Papen, Stefan Savage, Srinivasan Seshan, Geoffrey M. Voelker, David G. Andersen, Michael Kaminsky, George Porter, Alex C. Snoeren. In 11th International Conference on emerging Networking EXperiments and Technologies (CoNEXT 2015), Heidelberg, Germany, December 2015. Nominated for Best Paper.
    Abstract / PDF [510K]

  • Decoupled Direct Memory Access: Isolating CPU and IO Traffic by Leveraging a Dual-Data-Port DRAM. Donghyuk Lee, Lavanya Subramanian, Rachata Ausavarungnirun, Jongmoo Choi, Onur Mutlu. Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques (PACT), San Francisco, CA, USA, October 2015.
    Abstract / PDF [1.8M]

  • Tracking and Reducing Uncertainty in Dataflow Analysis-Based Dynamic Parallel Monitoring. Michelle Goodstein, Phillip Gibbons, Michael Kozuch, Todd Mowry. International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), Oct 18, 2015 - Oct 21, 2015, San Francisco, CA.
    Abstract / PDF [341K]

  • Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance. Rachata Ausavarungnirun, Saugata Ghose, Onur Kayiran, Gabriel H. Loh, Chita R. Das, Mahmut T. Kandemir, Onur Mutlu. Proceedings of the The 24th International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), San Francisco, October 2015.
    Abstract / PDF [556K]

  • Krowd: A Key-Value Store for Crowded Venues. Utsav Drolia, Nathan Mickulicz, Rajeev Gandhi, Priya Narasimhan.10th ACM Workshop on Mobility in the Evolving Internet Architecture (MobiArch), held in Paris, France in September 2015. Best Paper.
    Abstract / PDF [696K]

  • A Low-Overhead, Fully-Distributed, Guaranteed-Delivery Routing Algorithm for Faulty Network-on-Chips. Mohammad Fattah, Antti Airola, Rachata Ausavarungnirun, Nima Mirzaei, Pasi Liljeberg, Juha Plosila, Siamak Mohammadi, Tapio Pahikkala, Onur Mutlu, Hannu Tenhunen. Proceedings of the 9th ACM/IEEE International Symposium on Networks on Chip (NOCS), Vancouver, BC, Canada, September 2015.
    Abstract / PDF [1M]

  • AUSPICE: Automated Safety Property Verification for Unmodified Executables. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, and Priya Narasimhan. In 7th Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), July 2015.
    Abstract / PDF [390K]

  • Reducing Latency via Redundant Requests: Exact Analysis. Kristen Gardner, Sam Zbarsky, Sherwin Doroudi, Mor Harchol-Balter, Esa Hyytia, Alan Scheller-Wolf. Proceedings of ACM Sigmetrics/Performance 2015 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 15), Portland, OR. June 2015.
    Abstract / PDF [725K]

  • A Case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling Efficient Data Compression. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita Das, Mahmut Kandemir, Todd C. Mowry, Onur Mutlu. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [1M]

  • PocketTrend: Timely Identification and Delivery of Trending Search Content to Mobile Users. Gennady Pekhimenko, Dimitrios Lymberopoulos, Oriana Riva, Karin Strauss, Doug Burger. Proceedings of the 24th International World Wide Web Conference (WWW), Florence, Italy, May 2015.
    Abstract / PDF [504K]

  • Raising the Bar for Using GPUs in Software Packet Processing. Anuj Kalia, Dong Zhou, Michael Kaminsky, David G. Andersen. 12th Usenix Symposium on Networked Systems Design (NSDI'15). May 4-6, 2015, Oakland, CA.
    Abstract / PDF [386K]

  • Efficient Hypervisor Based Malware Detection. Peter Friedrich Klemperer. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, May 2015.
    Abstract / PDF [1.3M]

  • Optimal Scheduling for Jobs with Progressive Deadlines. Kristen Gardner, Sem Borst, Mor Harchol-Balter. IEEE INFOCOM 15, Hong Kong, April, 2015.
    Abstract / PDF [558K]

  • Mitigating Prefetcher-Caused Pollution Using Informed Caching Policies for Prefetched Blocks Vivek Seshadri, Samihan Yedkar, Hongyi Xin, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. ACM Transactions on Architecture and Code Optimization (TACO), Volume 11 Issue 4, January 2015, Article No. 51.
    Abstract / PDF [1.1M]

  • Having Your Cake and Eating It Too: Jointly Optimal Erasure Codes for I/O, Storage, and Network-bandwidth. KV Rashmi, Preetum Nakkiran, Jingyan Wang, Nihar B. Shah & Kannan Ramchandran. USENIX FAST, Feb 2015, Santa Clara, CA. Best paper.
    Abstract / PDF [560K]

  • Toggle-Aware Compression for GPUs. Gennady Pekhimenko, Evgeny Bolotin, Mike O'Connor, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. IEEE Computer Architecture Letters (CAL).
    Abstract / PDF [346K]

  • A Comparative Study of Baremetal Provisioning Frameworks. Ashok Chandrasekar, Garth Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-14-109, December 2014.
    Abstract / PDF [447K]

  • Managing GPU Concurrency in Heterogeneous Architectures. Onur Kayiran, Nachiappan Chidambaram Nachiappan, Adwait Jog, Rachata Ausavarungnirun, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, and Chita R. Das. Proceedings of 47th International Symposium on Microarchitecture (MICRO’14), December 2014.
    Abstract / PDF [2.38M]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. BEST PAPER AWARD!
    Abstract / PDF [287K]

  • The Heterogeneous Block Architecture. Chris Fallin, Chris Wilkerson, Onur Mutlu. Proceedings of 32nd IEEE International Conference on Computer Design (ICCD’14), October 2014.
    Abstract / PDF [308K]

  • Design and Evaluation of Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Chang, Greg Nazario, Reetuparna Das, Gabriel Loh, Onur Mutlu. Proceedings of the 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD’14), October 2014.
    Abstract / PDF [325K]

  • Fast and Accurate Mapping of Complete Genomics Reads. Donghyuk Lee, Farhad Hormozdiari, Hongyi Xin, Faraz Hach, Onur Mutlu, Can Alkan. Methods, Elsevier, October 2014.
    Abstract / PDF [1.25M]

  • Value Driven Load Balancing. Sherwin Doroudi, Esa Hyytia, Mor Harchol-Balter. Performance Evaluation, vol. 79, September 2014.
    Abstract / PDF [258K]

  • Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

  • Towards Secure Execution of Untrusted Code for Mobile Edge-Clouds. Jiaqi Tan, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. Poster at 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [116K]

  • CHIPS: Content-based Heuristics for Improving Photo Privacy for Smartphones. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [1.4M]

  • Exact Analysis of the M/M/k/setup Class of Markov Chains via Recursive Renewal Reward. Anshul Gandhi, Sherwin Doroudi, Mor Harchol-Balter, Alan Scheller-Wolf. Queueing Systems: Theory and Applications vol. 77, no. 2, 2014, pp. 177-209. June 2014.
    Abstract / PDF [4K]

  • The Dirty-Block Index. Vivek Seshadri, Abhishek Bhowmick, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. 41st International Symposium on Computer Architecture, June, 2014.
    Abstract / PDF [2.32M]

  • Improving Cache Performance by Exploiting Read-Write Disparity. Samira Khan, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu, Daniel Jimenez. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA), Orlando, FL, February 2014. Best paper session.
    Abstract / PDF [355K]

  • Linearly Compressed Pages: A Low-Complexity, Low-Latency Main Memory Compression Framework. Gennady Pekhimenko, Vivek Seshadri, Yoongu Kim, Hongyi Xin, Onur Mutlu, Philip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 46th International Symposium on Microarchitecture (MICRO), Davis, CA, December 2013.
    Abstract / PDF [525K]

  • Measuring Password Guessability for an Entire University. Michelle L. Mazurek, Saranga Komanduri, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Patrick Gage Kelley, Richard Shay, Blase Ur. In CCS 2013: ACM Conference on Computer and Communications Security, November 2013.
    Abstract / PDF [2.19M]

  • Challenges in Security and Privacy for Mobile Edge-Clouds. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-113. October, 2013.
    Abstract / PDF [212K]

  • Hadoop's Adolescence: An Analysis of Hadoop Usage in Scientific Workloads. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Very Large Data Bases (VLDB), August, 2013.
    Abstract / PDF [986K]

  • Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
    Abstract / PDF [174K]

  • The Impact of Length and Mathematical Operators on the Usability and Security of System-assigned One-time PINs. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Richard Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In 2013 Workshop on Usable Security (USEC), April 2013.
    Abstract / PDF [802K]

  • PETAL: Preset Encoding Table Information Leakage. Jiaqi Tan, Jayvardhan Nahata. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-106, April 2013.
    Abstract / PDF [291K]

  • Helping Users Create Better Passwords. Blase Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman, and Julio López. USENIX ;login:, 37(6), December 2012.
    Abstract / PDF [970K]

  • AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael Kozuch. Transactions on Computer Systems, Volume 30, Issue 4, Article 14. November 2012.
    Abstract / PDF [1.77M]

  • TABLEFS: Embedding a NoSQL Database inside the Local File System. Ren, Kai, Garth Gibson. 1st Storage System, Hard Disk and Solid State Technologies Summit, IEEE Asia-Pacific Magnetic Recording Conference (APMRC), November 2012, Singapore.
    Abstract / PDF [399K]

  • HAT: Heterogeneous Adaptive Throttling for On-Chip Networks. Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu. SBAC-PAD 2012, New York, NY, October 24-26, 2012.
    Abstract / PDF [259K]

  • Scalable Dynamic Partial Order Reduction. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey. Third Int. Conf. on Runtime Verification (RV2012), 25-28 September 2012, Istanbul, Turkey.
    Abstract / PDF [331K]

  • The Evicted-Address Filter: A Unified Mechanism to Address Both Cache Pollution and Thrashing. Vivek Seshadri, Onur Mutlu, Michael A Kozuch, Todd C Mowry. PACT'12, September 19–23, 2012, Minneapolis, Minnesota, USA.
    Abstract / PDF [2M]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-110, September 2012.
    Abstract / PDF [1.43M]

  • SOFTScale: Stealing Opportunistically For Transient Scaling. Anshul Gandhi, Timothy Zhu, Mor Harchol-Balter, Michael Kozuchy. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-111R, August 2012.
    Abstract / PDF [477K]

  • Hadoop's Adolescence: A Comparative Workload Analysis from Three Research Clusters. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-106. June 2012.
    Abstract / PDF [1.76M]

  • MinBD: Minimally-Buffered Deflection Routing for Energy-Efficient Interconnect. Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, Onur Mutlu. In NOCS 2012, Lyngby, Denmark, May 2012. (One of five papers nominated for the Best Paper Award by the Program Committee.)
    Abstract / PDF [369K]

  • Guess Again (and Again and Again): Measuring password strength by simulating password-cracking algorithms. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Rich Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Julio López. In the 2012 IEEE Symposium on Security and Privacy, May 2012.
    Abstract / PDF [2.8M]

  • Concurrent Systematic Testing at Scale. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-101. May 2012.
    Abstract / PDF [397K]

  • Landslide: Systematic Dynamic Race Detection in Kernel Space. Ben Blum. Carnegie Mellon University School of Computer Science MS Thesis CMU-CS-12-118. May 2012.
    Abstract / PDF [1.7M]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report. CMU-PDL-12-103 May 2012.
    Abstract / PDF [339K]

  • Efficient Exploratory Testing of Concurrent Systems. Jiri Simsa, Randy Bryant, Garth Gibson, Jason Hickey (Google). Carnegie Mellon University Parallel Data Laboratory Techical Report CMU-PDL-11-113, November 2011.
    Abstract / PDF [786K]

  • A Cyber-Physical-System Approach to Data Center Modeling and Control for Energy Efficiency. Luca Parolini, Bruno Sinopoli, Bruce H. Krogh, Zhikui Wang. Proceedings of the IEEE, Special Issue on Cyber-Physical Systems, December 2011.
    Abstract / PDF [1.76M]

  • Reducing Memory Interference in Multicore Systems via Application-Aware Memory Channel Partitioning. Sai Prashanth Muralidhara, Lavanya Subramanian, Onur Mutlu, Mahmut Kandemir, Thomas Moscibroda. Proceedings of the 44th International Symposium on Microarchitecture
    (MICRO), Porto Alegre, Brazil, December 2011.
    Abstract / PDF [232K]

  • Understanding and Improving the Diagnostic Workflow of MapReduce Users. Jason D. Campbell (Intel Labs Pittsburgh), Arun B. Ganesan, Ben Gotow, Soila P. Kavulya, James Mulholland, Priya Narasimhan, Sriram Ramasubramanian, Mark Shuster, Jiaqi Tan (DSO National Laboratories, Singapore), ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, December 2011.
    Abstract / PDF [775K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. SLAML 2011, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [500K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya (CMU), Kaustubh Joshi, Matti Hiltunen , Scott Daniels (AT&T Labs, Research), Rajeev Gandhi and Priya Narasimhan (CMU). Workshop on System Logs and the Application of Machine Learning Techniques (SLAML), Cascais, Portugal, October 2011.
    Abstract / PDF [524K]

  • Improving Cache Performance Using Victim Tag Stores. Vivek Seshadri, Onur Mutlu, Todd Mowry, Michael A. Kozuch. SAFARI Technical Report, TR-SAFARI-2011-009, Carnegie Mellon University, September 2011.
    Abstract / PDF [242K]

  • How Does Your Password Measure Up? The effect of strength meters on password creation. Blaser Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In the 2012 USENIX Security Symposium, August 2012.
    Abstract / PDF [1.2M]

  • On-Chip Networks from a Networking Perspective: Congestion and Scalability in Many-core Interconnects. George Nychis, Chris Fallin, Thomas Moscibroda, Onur Mutlu, Srinivasan Seshan.
    In SIGCOMM 2012, Helsinki, Finland, Aug 2012.
    Abstract / PDF [628K]

  • ThermoCast: A Cyber-Physical Forecasting Model for Data Centers. Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, Christos Faloutsos. In KDD '11: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 21-24, San Diego, CA.
    Abstract / PDF [1.32M]

  • dBug: Systematic Testing of Distributed and Multi-threaded Systems. Jiri Simsa, Randy Bryant, Garth Gibson.18th International Workshop on Model Checking of Software (SPIN'11), Snowbird UT, July 2011.
    Abstract / PDF [149K]

  • Time Series Clustering: Complex is Simpler! Lei Li, B. Aditya Prakash. In Proceedings of the 28th International Conference on Machine learning, June 28 - July 2, 2011, Bellevue, WA.
    Abstract / PDF [631K]

  • WindMine: Fast and Effective Mining of Web-click Sequences. Yasushi Sakurai, Lei Li, Yasuko Matsubara, Christos Faloutsos. 2011 SIAM International Conference on Data Mining, April 28-30, 2011, Mesa, AZ.
    Abstract / PDF [968K]

  • SmartScan: Efficient Metadata Crawl for Storage Management Metadata Querying in Large File Systems. Likun Liu, Lianghong Xu, Yongwei Wu, Guangwen Yang, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-112, Oct. 2010.
    Abstract / PDF [366K]

  • dBug: Systematic Evaluation of Distributed Systems. Jiri Simsa, Randy Bryant, Garth Gibson. 5th Int. Workshop on Systems Software Verification (SSV’10), co-located with 9th USENIX Symp. On Operating Systems Design and Implementation (OSDI’10), Vancouver BC, October 2010.
    Abstract / PDF [168K]

  • Token Attempt: The Misrepresentation of Website Privacy Policies through the Misuse of P3P Compact Policy Tokens. Pedro Giovanni Leon, Lorrie Faith Cranor, Aleecia M. McDonald, Robert McGuire. Cylab Technical Report CMU-CyLab-10-014, September 10, 2010.
    Abstract / PDF [305K]

  • Parsimonious Linear Fingerprinting for Time Series. Lei Li, B. Aditya Prakash, Christos Faloutsos. Proceedings of the VLDB Endowment, Vol. 3, No. 1, September 2010.
    Abstract / PDF [684K]

  • Correct Horse Battery Staple: Exploring the usability of system-assigned passphrases. Richard Shay, Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Blase Ur, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In SOUPS 2012: Symposium on Usable Privacy and Security, July 2012.
    Abstract / PDF [549K]

  • OddBall: Spotting Anomalies in Weighted Graphs. Leman Akoglu, Mary McGlohon, Christos Faloutsos. PAKDD 2010, Hyderabad, India, 21-24 June 2010. Best Paper Award!
    Abstract / PDF [3.0M]

  • Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. Jiaqi Tan*, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. 30th IEEE International Conference on Distributed Computing Systems (ICDCS) 2010, Genoa, Italy, Jun 2010.
    Abstract / PDF [2.1M]

  • An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010). May 17-20, 2010, Melbourne, Victoria, Australia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-107, December, 2009.
    Abstract / PDF [832K]

  • Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments. Jiaqi Tan, Xinghao Pan, Eugene Marinelli, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 12th IEEE/IFIP Network Operations and Management Symposium (NOMS) 2010, Osaka, Japan, Apr 2010.
    Abstract / PDF [2.8M]

  • Black-Box Problem Diagnosis in Parallel File Systems. Michael P. Kasick, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST '10), San Jose, CA, February 2010.
    Abstract / PDF [533K]

  • Journaling versus Soft Updates: Asynchronous Meta-data Protection in File Systems. Margo I. Seltzer, Gregory R. Ganger, M. Kirk McKusick, Keith A. Smith, Craig A. N. Soules, Christopher A. Stein. Proceedings of the USENIX Technical Conference, June, 2000.
    Abstract / PDF [120K]