PPoPP 2026
Sat 31 January - Wed 4 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
Tue 3 Feb 2026 16:50 - 17:10 at Balmoral - ML Inference Chair(s): Hailong Yang

Diffusion models have become the dominant approach for generative tasks in images, videos, and other domains. However, diverse data properties in generation requests, which are critical for efficient serving, remain underexploited. To address this issue, we propose a diffusion model serving system ChituDiffusion. ChituDiffusion leverages the locality of data properties to recompose a diffusion pipeline into dGraphs with shared optimization opportunities, enabling thorough compile-time and runtime co-optimizations. During compilation, ChituDiffusion compiles each dGraph into multiple execution engines optimized for specific data properties. At runtime, heterogeneous requests are elaborately reorganized into fine-grained batching tasks with similar properties and then efficiently executed by matched engines. Evaluation on five diffusion applications shows that ChituDiffusion improves the throughput by up to 2.13$\times$ (1.58$\times$ on average) on A100 and 2.19$\times$ (1.51$\times$ on average) on H100 compared with existing frameworks. The code for ChituDiffusion and the production traces have been made open-source at https://github.com/thu-pacman/chitu/tree/Diffusion.

Tue 3 Feb

Displayed time zone: Hobart change

15:50 - 17:10
ML InferenceMain Conference at Balmoral
Chair(s): Hailong Yang Beihang University
15:50
20m
Talk
BEEMS: Boosting Machine Vision Efficiency via Computation Graph-Based Memory Smoothing
Main Conference
Hanjing Shen Shanghai Jiao Tong University, Fangxin Liu Shanghai Jiao Tong University, Jian Liu Beijing University of Aeronautics and Astronautics, Li Jiang Shanghai Jiaotong University, Haibing Guan Shanghai Jiao Tong University
DOI
16:10
20m
Talk
Laser: Unlocking Layer-Level Scheduling for Efficient Multi-SLO LLM Serving
Main Conference
Jianxiong Liao Sun Yat-sen University, ​​Quanxing​ Dong​ Sun Yat-sen University​, Yunkai Liang Sun Yat-sen University, Zhi Zhou Sun Yat-sen University, Xu Chen Sun Yat-sen University
DOI
16:30
20m
Talk
MixFusion: A Patch-Level Parallel Serving System for Mixed-Resolution Diffusion Models
Main Conference
Desen Sun University of Waterloo, Zepeng Zhao Carnegie Mellon University, Yuke Wang Rice University
DOI
16:50
20m
Talk
ChituDiffusion: A Data-Characteristic-Aware Serving System for Diffusion Models
Main Conference
Chengzhang Wu Tsinghua University, Liyan Zheng Tsinghua University, Haojie Wang Tsinghua University, Kezhao Huang Tsinghua University, Zixuan Ma Tsinghua University, Dong Dong , Jidong Zhai Tsinghua University
DOI