Linked Presentation: Llumnix: Dynamic Scheduling for Large Language Model ServingMonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures