Conferences

Search results

    Title | Conference | Speaker(s)
    Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs | OSDI '24 | Nikita Lazarev, Varun Gohil, James Tsai, Andy Anderson, Bhushan Chitlur, Zhiru Zhang, Christina Delimitrou
    Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration | OSDI '24 | Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang
    Managing Memory Tiers with CXL in Virtualized Environments | OSDI '24 | Yuhong Zhong, Daniel S. Berger, Carl Waldspurger, Ryan Wee, Ishwar Agarwal, Rajat Agarwal, Frank Hady, Karthik Kumar, Mark D. Hill, Mosharaf Chowdhury, Asaf Cidon
    Harvesting Memory-bound CPU Stall Cycles in Software with MSH | OSDI '24 | Zhihong Luo, Sam Son, Sylvia Ratnasamy, Scott Shenker
    DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency | OSDI '24 | Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu
    Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | OSDI '24 | Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav Gulavani, Alexey Tumanov, Ramachandran Ramjee
    ServerlessLLM: Low-Latency Serverless Inference for Large Language Models | OSDI '24 | Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai
    InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management | OSDI '24 | Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim
    Llumnix: Dynamic Scheduling for Large Language Model Serving | OSDI '24 | Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, Wei Lin
    DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving | OSDI '24 | Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang
    ACCL+: an FPGA-Based Collective Engine for Distributed Applications | OSDI '24 | Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso
    Beaver: Practical Partial Snapshots for Distributed Cloud Services | OSDI '24 | Liangcheng Yu, Xiao Zhang, Haoran Zhang, John Sonchack, Dan Ports, Vincent Liu
    Fast and Scalable In-network Lock Management Using Lock Fission | OSDI '24 | Hanze Zhang, Ke Cheng, Rong Chen, Haibo Chen
    Chop Chop: Byzantine Atomic Broadcast to the Network Limit | OSDI '24 | Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron
    Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning | OSDI '24 | Yi Zhai, Sijia Yang, Keyu Pan, Renwei Zhang, Shuo Liu, Chao Liu, Zichun Ye, Jianmin Ji, Jie Zhao, Yu Zhang, Yanyong Zhang
    Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation | OSDI '24 | Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang
    Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents | OSDI '24 | Qizheng Zhang, Ali Imran, Enkeleda Bardhi, Tushar Swamy, Nathan Zhang, Muhammad Shahbaz, Kunle Olukotun
    nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training | OSDI '24 | Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou
    ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications | OSDI '24 | Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire
    SquirrelFS: using the Rust compiler to check file-system crash consistency | OSDI '24 | Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram
    High-throughput and Flexible Host Networking for Accelerated Computing | OSDI '24 | Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal, Shrijeet Mukherjee, Christos Kozyrakis
    IntOS: Persistent Embedded Operating System and Language Support for Multi-threaded Intermittent Computing | OSDI '24 | Yilun Wu, Byounguk Min, Mohannad Ismail, Wenjie Xiong, Changhee Jung, Dongyoon Lee
    Data-flow Availability: Achieving Timing Assurance in Autonomous Systems | OSDI '24 | Ao Li, Ning Zhang
    Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel | OSDI '24 | Haibo Chen, Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, Fengwei Xu
    Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences | OSDI '24 | Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, Chunqiang Tang