dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving

TitledLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
Publication TypeConference Paper
Year of Publication2024
AuthorsWu B, Zhu R, Zhang Z, Sun P, Liu X, Jin X
Conference Name18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24)
Date Published07/2024
PublisherUSENIX Association
Conference LocationSanta Clara, CA
ISBN Number978-1-939133-40-3
URLhttps://www.usenix.org/conference/osdi24/presentation/wu-bingyang