PPoPP 2026
Sat 31 January - Wed 4 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026

This program is tentative and subject to change.

You're viewing the program in a time zone which is different from your device's time zone change time zone

Sat 31 Jan

Displayed time zone: Hobart change

08:45 - 10:30
11:00 - 12:45

Sun 1 Feb

Displayed time zone: Hobart change

08:45 - 10:30
11:00 - 12:45
13:45 - 15:30
13:45 - 15:30
16:00 - 17:45
16:00 - 17:45

Mon 2 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Concurrency ControlMain Conference at Pyrmont
09:50
20m
Talk
Binary Compatible Critical Section Delegation
Main Conference
Junyao Zhang , Zhuo Wang Alibaba Group, Zhe Zhou Fudan University
10:10
20m
Talk
Hapax Locks : Scalable Value-Based Mutual Exclusion
Main Conference
Dave Dice Oracle Labs, Alex Kogan Oracle Labs
10:30
20m
Talk
Fixing non-blocking data structures for better compatibility with memory reclamation schemes
Main Conference
Md Amit Hasan Arovi Pennsylvania State University, Ruslan Nikolaev Pennsylvania State University
10:50
20m
Talk
Multiverse: Transactional Memory with Dynamic Multiversioning
Main Conference
Gaetano Coccimiglio University of Waterloo, Trevor Brown University of Waterloo, Srivatsan Ravi University of Southern California
11:30 - 12:50
Scheduling and Load BalancingMain Conference at Pyrmont
11:30
20m
Talk
Rethinking Thread Scheduling Under Oversubscription: A User-Space Framework for Coordinating Multi-Runtime and Multi-Process Workloads
Main Conference
Aleix Roca Barcelona Supercomputing Center, Vicenç Beltran Barcelona Supercomputing Center
11:50
20m
Talk
Waste-Efficient Work Stealing
Main Conference
Kyle Singer Massachusetts Institute of Technology, Kunal Agrawal Washington University in St. Louis, TB Schardl Massachusetts Institute of Technology
12:10
20m
Talk
DiggerBees: Depth First Search Leveraging Hierarchical Block-Level Stealing on GPUs
Main Conference
Yuyao Niu Barcelona Supercomputing Center, Yuechen Lu China University of Petroleum-Beijing, Weifeng Liu China University of Petroleum-Beijing, Marc Casas Barcelona Supercomputing Center
12:30
20m
Talk
PANA: A Fine-Grained Runtime-Adaptive Load Balancing for Parallel SpMV on multicore CPUs
Main Conference
Haodong Bian Tsinghua University, Youhui Zhang Tsinghua University, Xiang Fei Tsinghua University, Jianqiang Huang Qinghai University, Xiaoying Wang Qinghai University
14:10 - 15:30
Concurrent Data StructuresMain Conference at Pyrmont
14:10
20m
Talk
UFO Trees: Practical and Provably-Efficient Parallel Batch-Dynamic Trees
Main Conference
Quinten De Man University of Maryland, Atharva Sharma University of Maryland, Kishen N Gowda University of Maryland, Laxman Dhulipala University of Maryland, College Park
14:30
20m
Talk
Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks
Main Conference
Ajay Singh FORTH ICS and University of Waterloo, Nikos Metaxakis , Panagiota Fatourou FORTH ICS and University of Crete, Greece
14:50
20m
Talk
Concurrent Balanced Augmented Trees
Main Conference
Panagiota Fatourou University of Crete & FORTH, Siddhartha Jayanti Google Research, Younghun Roh Massachusetts Institute of Technology, Eric Ruppert York University, Ajay Singh FORTH ICS and University of Waterloo, Yuanhao Wei University of British Columbia, Evan Wrench University of British Columbia
15:10
20m
Talk
Parallel Dynamic Spatial Indexes
Main Conference
Ziyang Men University of California, Riverside, Bo Huang University of California, Riverside, Yan Gu University of California, Riverside, Yihan Sun University of California, Riverside
15:50 - 17:10
GPU and Heterogeneous ComputingMain Conference at Pyrmont
15:50
20m
Talk
PRISM: An Efficient GPU-Based Lossy Compression Framework for Progressive Data Retrieval with Multi-Level Interpolation
Main Conference
bing lu , Zedong Liu University of Chinese Academy of Sciences, Hairui Zhao Jilin University, Dejun Luo University of Chinese Academy of Sciences, Wenjing Huang University of Chinese Academy of Sciences, Yida Gu University of Chinese Academy of Sciences, Jinyang Liu University of Houston, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences
16:10
20m
Talk
Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications
Main Conference
Luke Marzen Iowa State University, Junhyung Shim Iowa State University, Ali Jannesari Iowa State University
16:30
20m
Talk
Root-Down Exposure for Maximal Clique Enumeration on GPUs
Main Conference
Zhe Pan Tsinghua University, Peng Qu Tsinghua University, Youhui Zhang Tsinghua University
16:50
20m
Talk
ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities
Main Conference
Weile Luo The Hong Kong University of Science and Technology, Guangzhou, Yuhan Chen The Hong Kong University of Science and Technology, Guangzhou, Xiangrui Yu The Hong Kong University of Science and Technology, Guangzhou, Qiang Wang Harbin Institute of Technology, Shenzhen, Ruibo Fan The Hong Kong University of Science and Technology, Guangzhou, Hongyuan Liu Stevens Institute of Technology, Xiaowen Chu The Hong Kong University of Science and Technology, Guangzhou
17:30 - 19:00
Business MeetingMain Conference at Cronulla
17:30
90m
Meeting
Business Meeting
Main Conference

Tue 3 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Stencil and Sparse Matrix ComputationMain Conference at Pyrmont
09:50
20m
Talk
SPIDER: Unleashing Sparse Tensor Cores for Stencil Computation via Strided Swapping
Main Conference
Qiqi Gu Shanghai Jiao Tong University, Chenpeng Wu Shanghai Jiao Tong University, Heng Shi , Jianguo Yao Shanghai Jiao Tong University; Shanghai Enflame Technology
10:10
20m
Talk
ASM-SpMM: Unleashing the Potential of Arm SME for Sparse Matrix Multiplication Acceleration
Main Conference
Jiazhi Jiang Sun Yat-sen University, Xijia Yao Sun Yat-sen University, Jiayu Chen Sun Yat-sen University, jinhui wei Sun Yat-sen University, Dan Huang , Yutong Lu Sun Yat-sen University
10:30
20m
Talk
Exploiting Efficient Mapping and Pipelined Execution for Accelerating SpMV on Tensor Cores
Main Conference
Kaige Zhang Beihang University, Hailong Yang Beihang University, Xin You Beihang University, Tianyu Feng Beihang University, Yufan Xu Independent Researcher, Zhongzhi Luan Beihang University, Yi Liu Beihang University, Depei Qian Beihang University
10:50
20m
Talk
VDHA: Vector-Driven Hash Aggregation for Sparse Matrix–Sparse Vector Multiplication on GPUs
Main Conference
Yuchen Li Tsinghua University, Zhe Pan Tsinghua University, Peng Qu Tsinghua University, Youhui Zhang Tsinghua University
11:30 - 12:50
Mixed Precision and QuantizationMain Conference at Balmoral
11:30
20m
Talk
RoMeo: Mitigating Dual-dimensional Outliers with Rotated Mixed Precision Quantization
Main Conference
Qihao Zhang Tsinghua University, MingLiang Tang Tsinghua University, Mingshu Zhai Tsinghua University, Kinman Lei Tsinghua University, Jidong Zhai Tsinghua University
11:50
20m
Talk
High-Throughput Non-Uniformly Quantized 3-bit LLM Inference
Main Conference
YuAng Chen Chinese University of Hong Kong, Wenqi Zeng Hong Kong University of Science and Technology, Jeffrey Xu Yu Chinese University of Hong Kong
12:10
20m
Talk
JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-context Inference
Main Conference
Chengyu Sun Wuhan University, Yaqi Xia Wuhan University, Hulin Wang , Donglin Yang Nvidia Corporation, Xiaobo Zhou University of Macau, Dazhao Cheng WuHan University
12:30
20m
Talk
HierCut: Enabling 16-bit Format Mixed Precision for Molecular Dynamics through Hierarchical Cutoff
Main Conference
zeyu song Tsinghua University, Lin Gan Tsinghua University, Xiaohui Duan Shandong University, Jiayu Fu Tsinghua University, Zhengrui Li Tsinghua University, Yinuo Wang Tsinghua University, Guangzhao Li Chinese Academy of Sciences, Guangwen Yang Tsinghua University
11:30 - 12:50
Cluster and Cloud ComputingMain Conference at Pyrmont
11:30
20m
Talk
Cacheman: A Comprehensive Last-Level Cache Management System for Multi-tenant Clouds
Main Conference
Xiaokang Hu Alibaba Cloud Computing, Yuchao Cao Alibaba Cloud Computing, Naixuan Guan Alibaba Cloud Computing, Yifan Wu Alibaba Cloud Computing, Xishi Qiu Alibaba Cloud Computing, Shengdong Dai Alibaba Cloud Computing, Ben Luo Alibaba Cloud Computing, Sanchuan Cheng Alibaba Cloud Computing, Fudong Qiu Alibaba Cloud Computing, Yibin Shen Alibaba Cloud, Jiesheng Wu Alibaba Cloud Computing
11:50
20m
Talk
zBuffer: Zero-Copy and Metadata-Free Serialization for Fast RPC with Scatter-Gather Reflection
Main Conference
Xiangyu Liu Xiamen University, Huiba Li Alibaba, Shun Gai Alibaba, Youmin Chen Shanghai Jiao Tong University, Yiming Zhang Xiamen University
12:10
20m
Talk
Scaling GPU-to-CPU Migration for Efficient Distributed Execution on CPU Clusters
Main Conference
Ruobing Han Georgia Institute of Technology, Hyesoon Kim Georgia Institute of Technology
12:30
20m
Talk
Trojan Horse: Aggregate-and-Batch for Scaling Up Sparse Direct Solvers on GPU Clusters
Main Conference
Yida Li China University of Petroleum-Beijing, Siwei Zhang China University of Petroleum, Yiduo Niu , Yang Du China University of Petroleum, Qingxiao Sun China University of Petroleum-Beijing, Zhou Jin China University of Petroleum-Beijing, Weifeng Liu China University of Petroleum-Beijing
14:10 - 15:30
Distributed TrainingMain Conference at Balmoral
14:10
20m
Talk
COCCL: A Collective Communication Library Supporting Easy Integration and Configuration of Customized Compression for Scalable LLM Training
Main Conference
Xingchen Liu University of Chinese Academy of Sciences, Haoran Kong Chinese University of Hong Kong, Shenzhen, Hairui Zhao Jilin University, Shengkai Lyu University of Chinese Academy of Sciences, Zheng Wei University of Chinese Academy of Sciences, Man Liu University of Chinese Academy of Sciences, Xingjian Tian University of Chinese Academy of Sciences, Liyang Zhao University of Chinese Academy of Sciences, Zhuohan Chen University of Chinese Academy of Sciences, Fakang Wang Ant Group, Zizhong Chen Chinese University of Hong Kong, Shenzhen, Zhan Wang University of Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences
14:30
20m
Talk
Elastor: Elastic and Efficient Model Partitioning and Checkpointing for Fault-tolerant Distributed Training
Main Conference
Xuanyu Wang Peking University, Fangcheng FU Shanghai Jiao Tong University, Haoyang Li Peking University, Hao Ge Peking University, Sheng Lin Peking University, Jiawen Niu Peking University, Bin Cui Peking University
14:50
20m
Talk
HelixPipe: Efficient Distributed Training of Long Sequence Transformers with Attention Parallel Pipeline Parallelism
Main Conference
Geng Zhang National University of Singapore, Shenggan Cheng National University of Singapore, Xuanlei Zhao National University of Singapore, Ziming Liu , Yang You National University of Singapore
15:10
20m
Talk
CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training
Main Conference
Yida Gu University of Chinese Academy of Sciences, Fakang Wang AntGroup, Jianhao Fu AntGroup, Zhenhang Sun Ant Group, Qianyu Zhang Ant Group, Hairui Zhao Jilin University, Xingchen Liu University of Chinese Academy of Sciences, Yang Tian Ant Group, Wenjing Huang University of Chinese Academy of Sciences, Zedong Liu University of Chinese Academy of Sciences, Yifan Chen Ant Group, Jinwu Yang University of Chinese Academy of Sciences, Yueyuan Zhou University of Chinese Academy of Sciences, Qian Zhao Ant Group, Haoxu Li University of Chinese Academy of Sciences, Tao Wang Ant Group, Feng Yu Ant Group, Zhan Wang University of Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences
14:10 - 15:30
Parallel AlgorithmsMain Conference at Pyrmont
14:10
20m
Talk
Pipelonk: Accelerating End-to-End Zero-Knowledge Proof Generation on GPUs for PLONK-based Protocols
Main Conference
Zhiyuan Zhang Shandong University, Yanxin Cai Shandong University, Wenhao Yin Shandong University, Xueyu Wu The University of Hong Kong, Yi Wang Shenzhen University, Lei Ju Shandong University, Zhuoran Ji Shandong University
14:30
20m
Talk
ParDiff: Efficiently Parallelizing Reverse-Mode Automatic Differentiation with Direct Indexing
Main Conference
Shuhong Huang Tsinghua University, Shizhi Tang Qingcheng.AI, Yuan Wen University of Aberdeen, Huanqi Cao Tsinghua University, Ruibai Tang Tsinghua University, yidong chen , Jiping Yu Tsinghua University, Yang Li Lenovo Research, Chao Jiang Lenovo Research, Limin Xiao Lenovo Research, Jidong Zhai Tsinghua University
14:50
20m
Talk
Faster and Cheaper: Pushing the Sequence Alignment Throughput with Commercial CPUs
Main Conference
Zhonghai Zhang Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Yewen Li The Hong Kong University of Science and Technology, Ke Meng Chinese Academy of Sciences, Chunming Zhang Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences
15:10
20m
Talk
PIM-zd-tree: A Fast Space-Partitioning Index Leveraging Processing-in-Memory
Main Conference
Yiwei Zhao Carnegie Mellon University, Hongbo Kang Tsinghua University, Ziyang Men University of California, Riverside, Yan Gu University of California, Riverside, Guy E. Blelloch Carnegie Mellon University, Laxman Dhulipala University of Maryland, College Park, Charles McGuffey Reed College, Phil Gibbons Carnegie Mellon University
15:50 - 17:10
ML InferenceMain Conference at Balmoral
15:50
20m
Talk
BEEMS: Boosting Machine Vision Efficiency via Computation Graph-Based Memory Smoothing
Main Conference
Hanjing Shen Shanghai Jiao Tong University, Fangxin Liu Shanghai Jiao Tong University, Jian Liu Beijing University of Aeronautics and Astronautics, Li Jiang Shanghai Jiaotong University, Haibing Guan Shanghai Jiao Tong University
16:10
20m
Talk
Laser: Unlocking Layer-Level Scheduling for Efficient Multi-SLO LLM Serving
Main Conference
Jianxiong Liao Sun Yat-sen University, ​​Quanxing​ Dong​ Sun Yat-sen University​, Yunkai Liang Sun Yat-sen University, Zhi Zhou Sun Yat-sen University, Xu Chen Sun Yat-sen University
16:30
20m
Talk
MixFusion: A Patch-Level Parallel Serving System for Mixed-Resolution Diffusion Models
Main Conference
Desen Sun University of Waterloo, Zepeng Zhao Carnegie Mellon University, Yuke Wang Rice University
16:50
20m
Talk
Difflow: A Data-Characteristic-Aware Serving System for Diffusion Models
Main Conference
Chengzhang Wu Tsinghua University, Liyan Zheng Tsinghua University, Haojie Wang Tsinghua University, Kezhao Huang Tsinghua University, Zixuan Ma Tsinghua University, Dong Dong , Jidong Zhai Tsinghua University
15:50 - 17:10
Graphs and Graph Neural NetworksMain Conference at Pyrmont
15:50
20m
Talk
ElasGNN: An Elastic Training Framework for Distributed GNN Training
Main Conference
Siqi Wang Beihang University, Hailong Yang Beihang University, Pengbo Wang Beihang University, Hongliang Cao Beihang University, Yufan Xu Independent Researcher, Xuezhu Wang Beihang University, Zhongzhi Luan Beihang University, Yi Liu Beihang University, Depei Qian Beihang University
16:10
20m
Talk
APERTURE: Algorithm-System Co-Optimization for Temporal Graph Network Inference
Main Conference
Yiqing Wang Beihang University, Hailong Yang Beihang University, Enze Yu Beihang University, Qingxiao Sun Beihang University, Kejie Ma Beihang University, Kaige Zhang Beihang University, chenhao xie Beihang University, Depei Qian Beihang University
16:30
20m
Talk
TAC: Cache-based System for Accelerating Billion-Scale GNN Training on Multi-GPU Platform
Main Conference
Zhiqiang Liang , Hongyu Gao​​ , Fang Liu Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences, Jue Wang Computer Network Information Center, Chinese Academy of Sciences;University of Chinese Academy of Sciences, Xingguo Shi University of Chinese Academy of Sciences, Juyu Gu University of Chinese Academy of Sciences, Peng Di Ant Group & UNSW, San Li University of Chinese Academy of Sciences, Lei Tang University of Chinese Academy of Sciences, Chunbao Zhou University of Chinese Academy of Sciences, Lian Zhao University of Chinese Academy of Sciences, yangang wang University of Chinese Academy of Sciences, Xuebin Chi University of Chinese Academy of Sciences
16:50
20m
Talk
DTMiner: A Data-centric System for Efficient Temporal Motif Mining
Main Conference
hou yinbo Huazhong University of Science and Technology, Hao Qi Huazhong University of Science and Technology, Ligang He University of Warwick, Jin Zhao Huazhong University of Science and Technology, Yu Zhang School of Computer Science and Technology, Huazhong University of Science and Technology, Hui Yu Hong Kong University of Science and Technology, Longlong Lin Southwest University, Lin Gu Huazhong University of Science and Technology, Wenbin Jiang Huazhong University of Science and Technology, XIAOFEI LIAO Huazhong University of Science and Technology, Hai Jin Huazhong University of Science and Technology
17:15 - 18:15
Optimizing TransformersMain Conference at Pyrmont
17:15
20m
Talk
FlashAttention-T: Towards Fully Tensorized Attention by Exploiting Tensor-Vector Parallelism
Main Conference
Jianxing Xu University of Science and Technology of China, Yuanbo Wen , Jun Bi Chinese Academy of Sciences, Ruibai Xu University of Science and Technology of China, Guanglin Xu Chinese Academy of Sciences, Rui Zhang Chinese Academy of Sciences, Wei Li Chinese Academy of Sciences, Ling Li Institute of Software, Chinese Academy of Sciences, Tianshi Chen Cambricon Technologies, Qi Guo Chinese Academy of Sciences, Yunji Chen Chinese Academy of Sciences
17:35
20m
Talk
Accelerating Sparse Transformer Inference on GPU
Main Conference
Wenhao Dai China University of Petroleum-Beijing, Haodong Deng China University of Petroleum, Mengfei Rong China University of Petroleum, Xinyu Yang Beihang University, Hongyu Liu Baidu Inc., Fangxin Liu Shanghai Jiao Tong University, Hailong Yang Beihang University, Qianwen Cao China University of Petroleum, Qingxiao Sun Beihang University
17:55
20m
Talk
MetaAttention: A Unified and Performant Attention Framework Across Hardware Backends
Main Conference
Feiyang Chen Shanghai Jiao Tong University, Yu Cheng Peking University, Lei Wang Peking University, Yuqing Xia Microsoft Research, Ziming Miao Microsoft Research, Lingxiao Ma Microsoft Research, Fan Yang Microsoft Research Asia, Jilong Xue Microsoft Research, Zhi Yang Peking University, Mao Yang Microsoft Research, Xingda Wei Shanghai Jiao Tong University, Haibo Chen Shanghai Jiao Tong University

Wed 4 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Matrix and Linear Algebra AlgorithmsMain Conference at Pyrmont
09:50
20m
Talk
Towards Singular Value Decomposition for Rank-Deficient Matrices: An Efficient and Accurate Algorithm on GPU Architectures
Main Conference
Lu Shi University of Electronic Science and Technology of China, WeiWei Xu Nanjing University of Information Science and Technology, Shaoshuai Zhang University of Electronic Science and Technology of China
10:10
20m
Talk
A Diagonal Block Memory-Aware Polynomial Preconditioner for Linear and Eigenvalue Solvers
Main Conference
Xiaojian Yang National University of Defense Technology, Yuhui Ni National University of Defense Technology, Fan Yuan Xiangtan University, Shengguo Li National University of Defense Technology, Dezun Dong NUDT, xuchuanfu National University of Defense Technology, Haipeng Jia Jia, Jie Liu National University of Defense Technology
10:30
20m
Talk
A Distributed Matrix-Block-Vector Multiplication in Presence of System Performance Variability
Main Conference
Yuchen Ma College of William & Mary, Bin Ren College of William & Mary, Andreas Stathopoulos College of William & Mary
10:50
20m
Talk
Characterizing Matrix Multiplication Units Across General Parallel Patterns in Scientific Computing
Main Conference
Yuechen Lu China University of Petroleum-Beijing, Hongwei Zeng , Marc Casas Barcelona Supercomputing Center, Weifeng Liu China University of Petroleum-Beijing