OSDI '24 Poster Session

The following posters will be presented at the OSDI '24 Poster Session and Reception on Wednesday, July 10, from 6:00 pm–7:30 pm in the Santa Clara Ballroom.

The OSDI '24 Poster Session and Reception is sponsored by Amazon.

Unpublished Work

Posters of unpublished research.


A Heterogeneous Accelerator System for Retrieval-Augmented Generation
Wenqi Jiang and Gustavo Alonso, ETH Zurich

An Isolation Metric
Anjali and Michael Swift, University of Wisconsin-Madison

Auto-Whittaker: Automatic Rewrites for Scaling Distributed Protocols
David C.Y. Chu, Natacha Crooks, and Joseph M. Hellerstein, UC Berkeley; Heidi Howard, Azure Research, Microsoft

Autobahn: Seamless high speed BFT
Neil Giridharan, University of California, Berkeley; Florian Suri-Payer, Cornell University; Itai Abraham, Intel; Lorenzo Alvisi, Cornell University; Natacha Crooks, University of California, Berkeley, and Azure Systems Research

Benchmarking Video Processing Systems in the Cloud
Zhiqi Li, Carleton University; Ruiqi Yu, Beijing University of Posts and Telecommunications; Jianshu Liu, Boise State University

Demographic Bias in Web Scheduling Systems
Sara Mahdizadeh Shahri and Akshitha Sriraman, Carnegie Mellon University

Design of Trust Chained IoT Key Management using OP-TEE
Atsuko Takefusa and Yutaka Ishikawa, National Institute of Informatics; Yasushi Ono, Institute of Information Security

Enzian fast RPC: merging OS and NIC on coherent interconnects
Pengcheng Xu, ETH Zurich

Eureka! We Can Let Your System Decide Its Consensus Needs
Reginald Frank and Soujanya Ponnapalli, University of California, Berkeley; Natacha Crooks, University of California, Berkeley, and Azure Systems Research

Graceful Termination under ECC-uncorrectable Memory Errors using Crane
Daichi Hatayama and Hiroshi Yamada, TUAT

IT-OAB: Intelligent Tiering based on One-off Access Block for Tiered Deduplication Storage System
Zilu Yao, National University of Defense Technology; Yinjin Fu, Sun Yat-sen University; Nong Xiao, National University of Defense Technology and Sun Yat-sen University

Kirsch: a foundational OS for heterogenous SoCs
Ben Fiedler and Zikai Liu, ETH Zurich

Kratos: In Search of a Distributed Control Plane for LEO Satellite Networks
Vaibhav Bhosale, Ketan Bhardwaj, and Ada Gavrilovska, Georgia Institute of Technology

LightSched: Scheduling Deep Learning Tasks on Photonic Computing Hardware
Zhizhen Zhong, Massachusetts Institute of Technology

Memory Usage Interfaces for Serverless Functions
Yonghao Zou, EPFL; David Hua, University of Waterloo; George Candea, EPFL

Mochi: Consensus-Aware Auto Scaling for State Machine Replication
Harald Ng, Ermias Habtegabr, and Paris Carbone, KTH Royal Institute of Technology

PipeDecode: Efficient LLM Inference using Pipelines within Decoding
Yunkai Liang, Sun Yat-sen University; Bin Gao, National University of Singapore; Pengfei Zuo, Huawei Cloud; Zhi Zhou and Xu Chen, Sun Yat-sen University

SAILOR: fast, cost-effective ML training in the cloud
Foteini Strati and Ixeia Sánchez Périz, ETH Zurich; Qinghao Hu, Nanyang Technological University; Ana Klimovic, ETH Zurich

Stateless SFI: Faster and Safer Machine Code Validation
Zachary Yedidia, Matthew Sotoudeh, and David Mazieres, Stanford University

Towards performance interfaces for SMT solvers
Can Cebeci, George Candea, and Clément Pit-Claudel, EPFL

Published Work

Posters of papers published at OSDI '24.


ACCL+: an FPGA-Based Collective Engine for Distributed Applications
Zhenhao He, Dario Korolija, Yu Zhu, and Benjamin Ramhorst, Systems Group, ETH Zurich; Tristan Laan, University of Amsterdam; Lucian Petrica and Michaela Blott, AMD Research; Gustavo Alonso, Systems Group, ETH Zurich

Anvil: Verifying Liveness of Cluster Management Controllers
Xudong Sun, Wenjie Ma, Jiawei Tyler Gu, and Zicheng Ma, University of Illinois Urbana-Champaign; Tej Chajed, University of Wisconsin-Madison; Jon Howell, Andrea Lattuada, and Oded Padon, VMware Research; Lalith Suresh, Feldera; Adriana Szekeres, VMware Research; Tianyin Xu, University of Illinois Urbana-Champaign

Automatically Reasoning About How Systems Code Uses the CPU Cache
Rishabh Iyer, Katerina Argyraki, and George Candea, EPFL

Beaver: Practical Partial Snapshots for Distributed Cloud Services
Liangcheng Yu, University of Pennsylvania; Xiao Zhang, Shanghai Jiao Tong University; Haoran Zhang, University of Pennsylvania; John Sonchack, Princeton University; Dan Ports, Microsoft / University of Washington; Vincent Liu, University of Pennsylvania

Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents
Qizheng Zhang, Stanford University; Ali Imran, Purdue University; Enkeleda Bardhi, Sapienza University of Rome; Tushar Swamy and Nathan Zhang, Stanford University; Muhammad Shahbaz, Purdue University and University of Michigan; Kunle Olukotun, Stanford University

ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications
Yuhan Liu, University of Chicago; Chengcheng Wan, East China Normal University; Kuntai Du, Henry Hoffmann, and Junchen Jiang, University of Chicago; Shan Lu, University of Chicago and Microsoft Research; Michael Maire, University of Chicago

Data-flow Availability: Achieving Timing Assurance in Autonomous Systems
Ao Li and Ning Zhang, Washington University in St. Louis

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
Yinmin Zhong and Shengyu Liu, Peking University; Junda Chen, UC San Diego; Jianbo Hu, Peking University; Yibo Zhu, StepFun; Xuanzhe Liu and Xin Jin, Peking University; Hao Zhang, UC San Diego

DSig: Breaking the Barrier of Signatures in Data Centers
Marcos K. Aguilera, VMware Research Group; Clément Burgelin, Rachid Guerraoui, and Antoine Murat, École Polytechnique Fédérale de Lausanne (EPFL); Athanasios Xygkis, Oracle Labs; Igor Zablotchi, Mysten Labs

Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning
Yi Zhai, University of Science and Technology of China; Sijia Yang, Huawei Technologies Co., Ltd.; Keyu Pan, ByteDance Ltd.; Renwei Zhang, Huawei Technologies Co., Ltd.; Shuo Liu, University of Science and Technology of China; Chao Liu and Zichun Ye, Huawei Technologies Co., Ltd.; Jianmin Ji, University of Science and Technology of China; Jie Zhao, Hunan University; Yu Zhang and Yanyong Zhang, University of Science and Technology of China

Fairness in Serving Large Language Models
Ying Sheng, UC Berkeley and Stanford University; Shiyi Cao, Dacheng Li, Banghua Zhu, and Zhuohan Li, UC Berkeley; Danyang Zhuo, Duke University; Joseph E. Gonzalez and Ion Stoica, UC Berkeley

FairyWREN: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces
Sara McAllister and Yucong "Sherry" Wang, Carnegie Mellon University; Benjamin Berg, UNC Chapel Hill; Daniel S. Berger, Microsoft Azure and University of Washington; George Amvrosiadis, Nathan Beckmann, and Gregory R. Ganger, Carnegie Mellon University

Flock: A Framework for Deploying On-Demand Distributed Trust
Darya Kaviani and Sijun Tan, UC Berkeley; Pravein Govindan Kannan, IBM Research; Raluca Ada Popa, UC Berkeley

High-throughput and Flexible Host Networking for Accelerated Computing
Athinagoras Skiadopoulos, Zhiqiang Xie, and Mark Zhao, Stanford University; Qizhe Cai and Saksham Agarwal, Cornell University; Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, and Daniel Walton, Enfabrica; Rachit Agarwal, Cornell University; Shrijeet Mukherjee, Enfabrica; Christos Kozyrakis, Stanford University

Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples
Minwoo Ahn and Jeongmin Han, Sungkyunkwan University; Youngjin Kwon, Korea Advanced Institute of Science and Technology (KAIST); Jinkyu Jeong, Yonsei University

Inductive Invariants That Spark Joy: Using Invariant Taxonomies to Streamline Distributed Protocol Proofs
Tony Nuda Zhang, University of Michigan; Travis Hance, Carnegie Mellon University; Manos Kapritsos, University of Michigan; Tej Chajed, University of Wisconsin–Madison; Bryan Parno, Carnegie Mellon University

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management
Wonbeom Lee, Jungi Lee, Junghwan Seo, and Jaewoong Sim, Seoul National University

IronSpec: Increasing the Reliability of Formal Specifications
Eli Goldweber, Weixin Yu, Seyed Armin Vakil Ghahani, and Manos Kapritsos, University of Michigan

Llumnix: Dynamic Scheduling for Large Language Model Serving
Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, and Wei Lin, Alibaba Group

Managing Memory Tiers with CXL in Virtualized Environments
Yuhong Zhong, Columbia University, Microsoft Azure; Daniel S. Berger, Microsoft Azure, University of Washington; Carl Waldspurger, Carl Waldspurger Consulting; Ryan Wee, Columbia University; Ishwar Agarwal, Rajat Agarwal, Frank Hady, and Karthik Kumar, Intel; Mark D. Hill, University of Wisconsin–Madison; Mosharaf Chowdhury, University of Michigan; Asaf Cidon, Columbia University

Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel
Haibo Chen, Huawei Central Software Institute and Shanghai Jiao Tong University; Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, and Fengwei Xu, Huawei Central Software Institute

Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration
Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, and Jia Rao, The University of Texas at Arlington; Yifan Yuan and Ren Wang, Intel Labs

Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences
Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, and Chunqiang Tang, Meta Platforms

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Chaofan Lin, Shanghai Jiao Tong University; Zhenhua Han, Chengruidong Zhang, Yuqing Yang, and Fan Yang, Microsoft Research; Chen Chen, Shanghai Jiao Tong University; Lili Qiu, Microsoft Research

Performance Interfaces for Hardware Accelerators
Jiacheng Ma, Rishabh Iyer, Sahand Kashani, Mahyar Emami, Thomas Bourgeat, and George Candea, EPFL

Secret Key Recovery in a Global-Scale End-to-End Encryption System
Graeme Connell, Signal Messenger; Vivian Fang, UC Berkeley; Rolfe Schmidt, Signal Messenger; Emma Dauterman and Raluca Ada Popa, UC Berkeley

ServerlessLLM: Low-Latency Serverless Inference for Large Language Models
Yao Fu, Leyang Xue, Yeqi Huang, and Andrei-Octavian Brabete, University of Edinburgh; Dmitrii Ustiugov, NTU Singapore; Yuvraj Patel and Luo Mai, University of Edinburgh

ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing
Mike Chow, Meta Platforms; Yang Wang, Meta Platforms and The Ohio State University; William Wang, Ayichew Hailu, Rohan Bopardikar, Bin Zhang, Jialiang Qu, David Meisner, Santosh Sonawane, Yunqi Zhang, Rodrigo Paim, Mack Ward, Ivor Huang, Matt McNally, Daniel Hodges, Zoltan Farkas, Caner Gocmen, Elvis Huang, and Chunqiang Tang, Meta Platforms

Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Amey Agrawal, Georgia Institute of Technology; Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, and Bhargav Gulavani, Microsoft Research India; Alexey Tumanov, Georgia Institute of Technology; Ramachandran Ramjee, Microsoft Research India

When will my ML Job finish? Toward providing Completion Time Estimates through Predictability-Centric Scheduling
Abdullah Bin Faisal, Noah Martin, Hafiz Mohsin Bashir, Swaminathan Lamelas, and Fahad R. Dogar, Tufts University

μSlope: High Compression and Fast Search on Semi-Structured Logs
Rui Wang, YScope; Devin Gibson, YScope and University of Toronto; Kirk Rodrigues, YScope; Yu Luo, YScope, Uber, and University of Toronto; Yun Zhang, Kaibo Wang, Yupeng Fu, and Ting Chen, Uber; Ding Yuan, YScope and University of Toronto