Recent Publications

  • Unearthing Inter-job Dependencies for Better Cluster Scheduling. Andrew Chung, Subru Krishnan, Konstantinos Karanasos, Carlo Curino, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.0M] / Slides / Talk Video

  • PACEMAKER: Avoiding HeART Attacks in Storage Clusters with Disk-adaptive Redundancy. Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [2.1M] / Slides / Talk Video

  • The CacheLib Caching Engine: Design and Experiences at Scale. Benjamin Berg, Daniel S. Berger, Sara McAllister, Isaac Grosof, Sathya Gunasekar, Jimmy Lu, Michael Uhlar, Jim Carrig, Nathan Beckmann, Mor Harchol-Balter, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [606K] / Slides / Talk Video

  • A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. Juncheng Yang, Yao Yue, K. V. Rashmi. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.6M] / Slides / Talk Video

  • Jumanji: The Case for Dynamic NUCA in the Datacenter. Brian Schwedock, Nathan Beckmann. MICRO '53: Proceedings of the 53nd Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Athens, Greece, October 17-21, 2020.
    Abstract / PDF [2.3M]

  • Challenges and Solutions for Fast Remote Persistent Memory Access. Anuj Kalia, David Andersen, Michael Kaminsky. SoCC ’20, October 19–21, 2020, Virtual Event, USA. BEST PAPER AWARD!
    Abstract / PDF [710K] / Talk Video

  • High Availability in Cheap Distributed Key Value Storage. Thomas Kim, Daniel Lin-Kit Wong, Gregory R. Ganger, Michael Kaminsky, David G. Andersen. SoCC ’20, October 19–21, 2020, Virtual Event, USA.
    Abstract / PDF [2.6M] / Talk Video

  • Caching with Delayed Hits. Nirav Atre, Justine Sherry, Weina Wang, Daniel S. Berger. SIGCOMM ’20, August 10–14, 2020, Virtual Event, NY, USA.
    Abstract / PDF [2.7M] / Talk Video

  • Lightweight Preemptible Functions. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [1M] / Talk Video / Slides

  • Fast Software Cache Design for Network Appliances. Dong Zhou, Huacheng Yu, Michael Kaminsky, David Andersen. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [11M] / Talk Video / Slides

  • More IOPS for Less: Exploiting Burstable Storage in Public Clouds. Hojin Park, Gregory R. Ganger, George Amvrosiadis. 12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud ’20). Virtual Boston, MA, July 13-14, 2020.
    Abstract / PDF [600K] / Talk Video / Slides

  • DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift. Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phillip B. Gibbons. arXiv:2003.06508 [cs.LG], August, 2020.
    Abstract / PDF [1.2M]

  • Machine Learning on Volatile Instances. Xiaoxi Zhang, Jianyu Wang, Gauri Joshi, Carlee Joe-Wong. IEEE Intl. Conf. on Computer Communications (INFOCOM). Virtual Toronto, Canada, July 6-9, 2020.
    Abstract / PDF [516K]

  • Simple Near-Optimal Scheduling for the M/G/1. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proceedings of the ACM Measurement and Analysis of Computer Systems - SIGMETRICS, June 2020, Boston, MA.
    Abstract / PDF [885K] / Talk Video

  • Order-Preserving Key Compression for In-Memory Search Trees. Huanchen Zhang, Xiaoxuan Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo
    SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.15M]

  • Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. Conglong Li, Minjia Zhang, David G. Andersen, Yuxiong He. SIGMOD ’20, June 14–19, 2020, Virtual Portland, OR, USA.
    Abstract / PDF [800K]

  • Active Learning for ML Enhanced Database Systems. Lin Ma, Bailu Ding, Sudipto Das, Adith Swaminathan. SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.4M]

  • TVARAK: Software-Managed Hardware Offload for Redundancy in Direct-Access NVM Storage. Rajat Kateja, Nathan Beckmann, Greg Ganger. 47th International Symposium on Computer Architecture, May 30 – June 3, 2020, Virtual Valencia, Spain.
    Abstract / PDF [1.6M]

  • Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD. Jianyu Wang, Hao Liang, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [442K]

  • Correlated Multi-armed Bandits with a Latent Random Source. Samarth Gupta, Gauri Joshi, Osman Yagan. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [1.1M]

  • Lookahead Converges to Stationary Points of Smooth Non-Convex Functions. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICASSP 2020: 45th International Conference on Acoustics, Speech, and Signal Processing. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [242K]

  • SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICLR 2020: International Conference on Learning Representations, Apr 26-May 1, 2020, Virtual Addis Ababa, Ethiopia.
    Abstract / PDF [640K] / Talk Video & Slides

  • Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM. Rajat Kateja, Andy Pavlo, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-20-101, April 2020.
    Abstract / PDF [665K]

  • Learning-Based Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. IEEE Journal on Selected Areas in Information Theory, March 2020.
    Abstract / PDF [654K]

  • The Case for Custom Storage Backends in Distributed Storage Systems. A. Aghayev, S. Weil, M. Kuchnik, M. Nelson, G. Ganger, G. Amvrosiadis. To appear in ACM Transactions on Storage, Volume 16, Issue 1, March 2020.
    Abstract / PDF [2.6M]

  • Livia: Data-Centric Computing Throughout the Memory Hierarchy. Elliot Lockerman, Axel Feldmann, Mohammad Bakhshalipour, Alexandru Stanescu, Shashwat Gupta, Daniel Sanchez, Nathan Beckmann. ASPLOS '20: Proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Lausanne, Switzerland, March 16-20, March 2020.
    Abstract / PDF [1.6M] / Talk Video

  • Learning Relaxed Belady for Content Distribution Network Caching. Zhenyu Song, Daniel S. Berger, Kai Li, Wyatt Lloyd. 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’20). February 25–27, 2020. Santa Clara, CA.
    Abstract / PDF [2.25M]

  • Scalable Pointer Analysis of Data Structures using Semantic Models. Pratik Fegade, Christian Wimmer. 29th Conference on Compiler Construction (CC ’20), February 22–23, 2020, San Diego, CA, USA.
    Abstract / PDF [700K]

  • Convertible Codes: New Class of Codes for Efficient Conversion of Coded Data in Distributed Storage. Francisco Maturana, K. V. Rashmi. 11th Innovations in Theoretical Computer Science Conference (ITCS 2020). Seattle, WA, January 12-14, 2020.
    Abstract / PDF [687K]

  • Mochi: Composing Data Services for High-Performance Computing Environments. Robert B. Ross, George Amvrosiadis, Philip Carns, Charles D. Cranor, Matthieu Dorier, Kevin Harms, Greg Ganger, Garth Gibson, Samuel K. Gutierrez, Robert Latham, Bob Robey, Dana Robinson, Bradley Settlemyer, Galen Shipman, Shane Snyder, Jerome Soumagne, Qing Zheng. Journal of Computer Science and Technology 35(1): 121–144 Jan. 2020.
    Abstract / PDF [1.3M]

  • Writeback-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Bernhard Haeupler, Charles McGuffey. Society for Industrial and Applied Mathematics. 2020.
    Abstract / PDF [847K]

  • MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. NeurIPS workshop of Federated Learning for Data Privacy and Confidentiality, Dec 13, 2019. Vancouver, BC, Canada. Distinguished Student Paper Award.
    Abstract / PDF [1.1M]

  • Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-vector Multiplication. Ankur Mallick, Malhar Chaudhari, Ganesh Palanikumar, Utsav Sheth, Gauri Joshi. Proc. ACM Meas. Anal. Comput. Syst., Vol. 3, No. 3, Article 58. December 2019. BEST PAPER SIGMETRICS'20.
    Abstract / PDF [1.9M]

  • Demystifying Complex Workload–DRAM Interactions: An Experimental Study. S. Ghose, T. Li, N. Hajinazar, D. Senol Cali, O. Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019. To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 3, No. 3, December 2019.
    Abstract / PDF [4M]

  • Processing-in-Memory: A Workload-Driven Perspective. S. Ghose, A. Boroumand, J. S. Kim, J. Gómez-Luna, O. Mutlu. IBM Journal of Research and Development (JRD), Vol. 63, No. 6, November/December 2019.
    Abstract / PDF [2.1M]

  • MANIC: A Vector-Dataflow Architecture for Ultra-Low-Power Embedded Systems. G Gobieski, A Nagi, N Serafin, MM Isgenc, N Beckmann, B Lucia. MICRO '52: Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Columbus, OH, October 2019.
    Abstract / PDF [1.2M]

  • Processing-in-Memory: A Workload-Driven Perspective. S. Ghose, A. Boroumand, J. S. Kim, J. Gómez-Luna, O. Mutlu. To appear in IBM Journal of Research and Development (JRD), November 2019.
    Abstract / PDF [2.1M]

  • File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [870K]

  • Parity Models: Erasure-Coded Resilience for Prediction Serving Systems. Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • PipeDream: Generalized Pipeline Parallelism for DNN Training. Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil R. Devanur, Gregory R. Ganger, Phillip B. Gibbons, Matei Zaharia. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. Ziqi Wang, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri. 28th Parallel Architecture and Compiler Technologies 2019 (PACT'19), Sept 21-25, 2019, Seattle, WA.
    Abstract / PDF [475K]

  • Compact Filters for Fast Online Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. IEEE CLUSTER 2019. September 23 - 26, 2019, Albuquerque, New Mexico, USA.
    Abstract / PDF [1M]

  • Vantage: Optimizing Video Upload for Time-shifted Viewing of Social Livestreams. Devdeep Ray, Jack Kosaian, K. V. Rashmi, Srini Seshan. ACM SIGCOMM, August 19-24, 2019, Beijing, China.
    Abstract / PDF [6.75M]

  • TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Greg Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
    Abstract / PDF [975K]

  • STRADS-AP: Simplifying Distributed Machine Learning Programming without Introducing a New Programming Model. Jin Kyu Kim, Abutalib Aghayev, Garth A. Gibson, Eric P. Xing. Proceedings of the 2019 USENIX Annual Technical Conference, July 10–12, 2019 • Renton, WA.
    Abstract / PDF [490K]

  • Rateless Codes for Distributed Computations with Sparse Compressed Matrices. Ankur Mallick, Gauri Joshi. IEEE International Symposium on Information Theory (ISIT), July 7-12, 2019, Paris, France.
    Abstract / PDF [672K]

  • Peering through the Dark: An Owl’s View of Inter-job Dependencies and Jobs’ Impact in Shared Clusters. Andrew Chung, Carlo Curino, Subru Krishnan, Konstantinos Karanasos, Panagiotis Garefalakis, Gregory R. Ganger. SIGMOD ’19, June 30–July 5, 2019, Amsterdam, Netherlands.
    Abstract / PDF [1.6M]

  • Distribution-based Cluster Scheduling. Jun Woo Park. Carnegie Mellon University School of Computer Science PhD Dissertation, June 2019.
    Abstract / PDF [1.47M]

  • Enabling Practical Processing in and Near Memory for Data-Intensive Computing. O. Mutlu, S. Ghose, J. Gómez-Luna, R. Ausavarungnirun. Proc. of the Design Automation Conference (DAC), Las Vegas, NV, June 2019.
    Abstract / PDF [477K]

  • CROW: A Low-Cost Substrate for Improving DRAM Performance, Energy Efficiency, and Reliability. H. Hassan, M. Patel, J. S. Kim, A. G. Yaglikçi, N. Vijaykumar, N. Mansouri Ghiasi, S. Ghose, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.45M]

  • CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators. A. Boroumand, S. Ghose, M. Patel, H. Hassan, B. Lucia, R. Ausavarungnirun, K. Hsieh, N. Hajinazar, K. T. Malladi, H. Zheng, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.1M]

  • Understanding the Interactions ofWorkloads and DRAM Types: A Comprehensive Experimental Study. S. Ghose, T. Li, N. Hajinazar, D. Senol Cali, O. Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019; To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2019.
    Abstract / PDF [2M]

  • Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
    Abstract / PDF[574K]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Fast and Efficient Distributed Matrix-Vector Multiplication Using Rateless Fountain Codes. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 12 - 17 May, 2019 · Brighton, UK.
    Abstract / PDF [485K]

  • Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Greg Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
    Abstract / PDF [735K]

  • Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems. Graham Gobieski, Brandon Lucia, Nathan Beckmann Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’19), April 13th – April 17th, Providence, RI.
    Abstract / PDF [3.35M]
  • Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Greg Ganger Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
    Abstract / PDF [800K]

  • Scaling Video Analytics on Constrained Edge Nodes. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. 2nd SysML Conference (SysML ’19). March 31-April 2, 2019, Palo Alto, CA.
    Abstract / PDF [8.5M]

  • Automating Dependence-Aware Parallelization of Machine Learning Training on Distributed Shared Memory. Jinliang Wei, Garth A. Gibson, Phillip B. Gibbons, Eric P. Xing. EuroSys '19: Proceedings of the Fourteenth EuroSys Conference, March 2019, Dresden, Germany.
    Abstract / PDF [1.1M]

  • Non-Volatile Memory Database Management Systems. Joy Arulraj, Andrew Pavlo. Synthesis Lectures on Data Management, Morgan & Claypool Publishers, February 2019.
    Abstract / PDF currently unavailable

  • Datacenter RPCs can be General and Fast. Anuj Kalia Michael, Kaminsky, David G. Andersen. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Feb. 26–28, 2019, Boston, MA. Best Paper award!
    Abstract / PDF [555K]

  • Cluster Storage Systems Gotta Have HeART: Improving Storage Efficiency by Exploiting Disk-reliability Heterogeneity. Saurabh Kadekodi, K. V. Rashmi, Gregory R. Ganger. 17th USENIX Conference on File and Storage Technologies (FAST '19) Feb. 25–28, 2019 Boston, MA.
    Abstract / PDF [1.1M]

  • A Scalable Priority-Aware Approach to Managing Data Center Server Power. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu. HPCA 2019: The 25th International Symposium on High-Performance Computer Architecture, February 16 - 20, 2019, Washington D.C.
    Abstract / PDF [610K]

  • External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems. Andrew Pavlo, Matthew Butrovich, Ananya Joshi, Lin Ma, Prashanth Menon, Dana Van Aken, Lisa Lee, Ruslan Salakhutdinov. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 42(2): 32-46 (2019).
    Abstract / PDF [555K]

  • What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study. S. Ghose, A. G. Yaglikçi, R. Gupta, D. Lee, K. Kudrolli, W. X. Liu, H. Hassan, K. K. Chang, N. Chatterjee, A. Agrawal, M. O'Connor, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [2.6M]

  • Improving 3D NAND Flash Memory Lifetime by Tolerating Early Retention Loss and Process Variation. Y. Luo, S. Ghose, Y. Cai, E. F. Haratsch, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [3.2M]

  • SRPT for Multiserver Systems. Isaac Grosof, Ziv Scully, Mor Harchol-Balter. Performance Evaluation , vol. 127-128, Nov. 2018, pp. 154-175. Also in Proc. 36th International Symposium on Computer Performance, Modeling, Measurements, and Evaluation (Performance 2018) , Toulouse, France, December 2018. Best Student Paper Award.
    Abstract / PDF [780K]

  • Towards Lightweight and Robust Machine Learning for CDN Caching. Daniel S. Berger. HotNets-XVII, November 15–16, 2018, Redmond, WA, USA.
    Abstract / PDF [610K]

  • Scaling Embedded In-Situ Indexing with DeltaFS. Qing Zheng, Charles D. Cranor, Danhao Guo, Gregory R. Ganger, George Amvrosiadis, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider, Fan Guo. SC18, November 11-16, 2018, Dallas, Texas, USA.
    Abstract / PDF [927K]

  • Stratus: Cost-aware Container Scheduling in the Public Cloud. Andrew Chung, Jun Woo Park, Gregory R. Ganger. ACM Symposium on Cloud Computing, 2018 (SoCC’18), Carlsbad, CA October 11-13, 2018.
    Abstract / PDF [1.5M]

  • Focus: Querying Large Video Datasets with Low Latency and Low Cost. Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Oct. 8–10, 2018, Carlsbad, CA.
    Abstract / PDF [1.2M]

  • RobinHood: Tail Latency Aware Caching—Dynamic Reallocation from Cache-Rich to Cache-Poor. Daniel S. Berger, Benjamin Berg, Timothy Zhu, Siddhartha Sen, Mor Harchol-Balter. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’18). October 8–10, 2018 • Carlsbad, CA, USA.
    Abstract / PDF [2.9M]

  • SOAP Bubbles: Robust Scheduling Under Adversarial Noise. Ziv Scully, Mor Harchol-Balter. 56th Annual Allerton Conference on Communication, Control, and Computing, 2-5 Oct. 2018. Monticello, IL.
    Abstract / PDF [245K]

  • Exploiting Locality in Graph Analytics through Hardware-Accelerated Traversal Scheduling. Anurag Mukkara, Nathan Beckmann, Maleen Abeydeera, Xiaosong Ma, Daniel Sanchez. 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 20-24 Oct. 2018, Fukuoka, Japan.
    Abstract / PDF [660K]

  • The Parallel Persistent Memory Model. Guy E. Blelloch, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. SPAA ’18, July 16–18, 2018, Vienna, Austria.
    Abstract / PDF [760K]

  • Putting the “Micro” Back in Microservice. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [740K]

  • Geriatrix: Aging What You See and What You Don’t See -- A File System Aging Approach for Modern Storage Systems. Saurabh Kadekodi, Vaishnavh Nagarajan, Gregory R. Ganger, Garth A. Gibson. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.44M]

  • Cavs: An Efficient Runtime System for Dynamic Neural Networks. Shizhen Xu, Hao Zhang, Graham Neubig, Wei Dai, Jin Kyu Kim, Zhijie Deng, Qirong Ho, Guangwen Yang, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.7M]

  • Litz: Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [298K]

  • Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela H. Jiang, Daniel L.K. Wong, Christopher Canel, Lilia Tang, Ishan Misra, Michael Kaminsky*, Michael A. Kozuch*, Padmanabhan Pillai*, David G. Andersen Gregory R. Ganger. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA, USA.
    Abstract / PDF [1.5M]

  • Tributary: Spot-dancing for Elastic Services with Latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. 2018 USENIX Annual Technical Conference. July 11–13, 2018 Boston, MA, USA. Supersedes Carnagie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102.
    Abstract / PDF [1.25M]

  • On the Diversity of Cluster Workloads and its Impact on Research Results. George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. 2018 USENIX Annual Technical Conference (ATC '18), Boston, MA, July 11-13, 2018.
    Abstract / PDF [285K]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson, Gregory R. Ganger. 10th USENIX Workshop on Hot Topics in Cloud Computing, July 9, 2018, Boston, MA. Supersedes CMU-PDL-17-105.
    Abstract / PDF [250K]

  • FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. A. Tavakkol, M. Sadrosadati, S. Ghose, J. Kim, Y. Luo, Y. Wang, N. M. Ghiasi, L. Orosa, J. Gómez-Luna, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Los Angeles, CA, June 2018.
    Abstract / PDF [888K]

  • Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. arXiv:1806.01259v1 [cs.LG], 4 Jun 2018
    Abstract / PDF [575K]

  • Practical Bounds on Offline Caching with Variable Object Sizes. Daniel Berger, Nathan Beckmann, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 2, Article 32. June 2018. POMACS 2018.
    Abstract / PDF [1.2M]

  • Query-based Workload Forecasting for Self-Driving Database Management Systems. Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, Geoffrey J. Gordon. SIGMOD/PODS '18 International Conference on Management of Data, Houston, TX, USA, June 10 - 15, 2018.
    Abstract / PDF [1.25M]

  • Building a Bw-Tree Takes More Than Just Buzz Words. Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis, Huanchen Zhang, Michael Kaminsky, David G. Andersen. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [2.2M]

  • SuRF: Practical Range Query Filtering with Fast Succinct Tries. Huanchen Zhang, Hyeontaek Lim, Viktor Leis, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [1.9M]

  • The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality in GPUs. Nandita Vijaykumar, Eiman Ebrahimi, Kevin Hsieh, Phillip B. Gibbons, Onur Mutlu. The 45th International Symposium on Computer Architecture - June 2-6, ISCA 2018. Los Angeles, California, USA.
    Abstract / PDF [3.1M]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • Practical Bounds on Optimal Caching with Variable Object Sizes. Daniel S. Berger, Nathan Beckmann, Mor Harchol-Balter. Proceedings of the ACM on Measurement and Analysis of Computing Systems. Vol. 2, No. 2, Article 32, June 2018.
    Abstract / PDF [1.2M]

  • Implicit Decomposition for Write-Efficient Connectivity Algorithms. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, and Julian Shun. 2018 International Parallel and Distributed Processing Symposium (IPDPS '18). May 21-25, 2018, Vancouver, BC, Canada.
    Abstract / PDF [716K]

  • 3Sigma: Distribution-based Cluster Scheduling for Runtime Uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. EuroSys ’18, April 23–26, 2018, Porto, Portugal. Supersedes CMU-PDL-17-107, Nov. 2017.
    Abstract / PDF [1.4M]

  • LHD: Improving Cache Hit Rate by Maximizing Hit Density. Nathan Beckmann, Haoxian Chen, Asaf Cidon. 15th USENIX Symposium on Networked Systems Design and Implementation ({NSDI} 18), April 9-11, 2018, Renton, WA..
    Abstract / PDF [1.1M]

  • Better Caching in Search Advertising Systems with Rapid Refresh Predictions. Conglong Li, David G. Andersen, Qiang Fu, Sameh Elnikety, Yuxiong He. Proceedings of the 2018 World Wide Web Conference, Lyon, France, April 23 - 27, 2018.
    Abstract / PDF [1.1M]

  • Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-Vector Multiplication. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. arXiv:1804.10331v2 [cs.DC] 30 Apr 2018.
    Abstract / PDF [1.1M]

  • GoogleWorkloads for Consumer Devices: Mitigating Data Movement Bottlenecks. Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu. ASPLOS’18, March 24–28, 2018, Williamsburg, VA, USA.
    Abstract / PDF [885K]

  • LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching. Mohammad Sadrosadati, Amirhossein Mirhosseini, Seyed Borna Ehsani, Hamid Sarbazi-Azad, Mario Drumond, Babak Falsafi, Rachata Ausavarungnirun, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.M]

  • MASK: Redesigning the GPU Memory Hierarchy to Support Multi-Application Concurrency. Rachata Ausavarungnirun, Vance Miller, Joshua Landgraf, Saugata Ghose, Jayneel Gandhi, Adwait Jog, Christopher J. Rossbach, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.1M]

  • Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability. Maciej Besta, Syed Minhaj Hassan, Sudhakar Yalamanchili, Rachata Ausavarungnirun, Onur Mutlu, Torsten Hoefler. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.6M]

  • SOAP: One Clean Analysis of All Age-Based Scheduling Policies. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 1, Article 16, March 2018.
    Abstract / PDF [885K]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. arXiv:1803.07445v1 [cs.LG] 20 Mar 2018.
    Abstract / PDF [1M]

  • Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela Jiang, Christopher Canel, Daniel Wong, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, David G. Andersen, Gregory R. Ganger. SysML 18, February 15–16, 2018. Stanford, CA.
    Abstract / PDF [450K]

  • MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. A. Tavakkol, J. Gómez-Luna, M. Sadrosadati, S. Ghose, and O. Mutlu. USENIX Conference on File and Storage Technologies (FAST), Oakland, CA, February 2018.
    Abstract / PDF [2.25M]

  • 3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. arXiv:1802.07389v1 [cs.LG] 21 Feb 2018.
    Abstract / PDF [586K]

  • Efficient Multi-Tenant Inference on Video using Microclassifiers. Giulio Zhou, Thomas Kim, Christopher Canel, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [1.5M]

  • PipeDream: Fast and Efficient Pipeline Parallel DNN Training. Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Greg Ganger, Phil Gibbons. SysML '18, Feb. 15-16, 2018 , Stanford, CA.
    Abstract / PDF [615K]

  • Intermittent Deep Neural Network Inference. Graham Gobieski, Nathan Beckmann, Brandon Lucia. SysML 2018, February 15-16, 2018, Stanford, CA.
    Abstract / PDF [450K]

  • Picking Interesting Frames in Streaming Video. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [913K]

  • Tributary: Spot-dancing for elastic services with latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102, Jan. 2018.
    Abstract / PDF [990K]

  • Addressing the Long-Lineage Bottleneck in Apache Spark. Haoran Wang, Jinliang Wei, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-101, January 2018.
    Abstract / PDF [250K]