PPoPP 2026 Program
This program is tentative and subject to change.
Sat 31 JanDisplayed time zone: Hobart change
07:45 - 16:00 | |||
08:45 - 10:30 | CACHPWorkshops and Tutorials at Bondi Chair(s): Jose Nelson Amaral University of Alberta, Bruce Hoppe Massachusetts Institute of Technology, Yihan Sun University of California, Riverside Website with schedule: https://fastcode.org/events/coevolution-workshop/ | ||
08:45 - 10:30 | MLIRWorkshops and Tutorials at Curl Curl Chair(s): Kunwar Grover AMD, Mahesh Ravishankar AMD, Saday Sadayappan University of Utah, USA | ||
10:30 - 11:00 | |||
10:30 30mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
11:00 - 12:45 | CACHPWorkshops and Tutorials at Bondi Chair(s): Jose Nelson Amaral University of Alberta, Bruce Hoppe Massachusetts Institute of Technology, Yihan Sun University of California, Riverside Website with schedule: https://fastcode.org/events/coevolution-workshop/ | ||
11:00 - 12:45 | MLIRWorkshops and Tutorials at Curl Curl Chair(s): Kunwar Grover AMD, Mahesh Ravishankar AMD, Saday Sadayappan University of Utah, USA | ||
12:45 - 13:45 | |||
12:45 60mLunch | Lunch HPCA/CGO/PPoPP/CC Catering | ||
13:45 - 15:30 | MLIRWorkshops and Tutorials at Curl Curl Chair(s): Kunwar Grover AMD, Mahesh Ravishankar AMD, Saday Sadayappan University of Utah, USA | ||
15:30 - 16:00 | |||
15:30 30mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
16:00 - 17:45 | MLIRWorkshops and Tutorials at Curl Curl Chair(s): Kunwar Grover AMD, Mahesh Ravishankar AMD, Saday Sadayappan University of Utah, USA | ||
Sun 1 FebDisplayed time zone: Hobart change
07:45 - 19:00 | |||
08:45 - 10:30 | ScaleDNNWorkshops and Tutorials at Curl Curl Chair(s): Dhabaleswar K. Panda Ohio State University, Nawras Alnaasan Ohio State University Website with schedule: https://nowlab.cse.ohio-state.edu/tutorials/hidl_PPoPP26/ | ||
10:30 - 11:00 | |||
10:30 30mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
11:00 - 12:45 | ScaleDNNWorkshops and Tutorials at Curl Curl Chair(s): Dhabaleswar K. Panda Ohio State University, Nawras Alnaasan Ohio State University Website with schedule: https://nowlab.cse.ohio-state.edu/tutorials/hidl_PPoPP26/ | ||
12:45 - 13:45 | |||
12:45 60mLunch | Lunch HPCA/CGO/PPoPP/CC Catering | ||
13:45 - 15:30 | DiffPPWorkshops and Tutorials at Bondi Chair(s): Paul Hovland Argonne National Laboratory, Jan Hueckelheim Argonne National Laboratory Website with schedule: https://diffprog-ppopp.github.io/ | ||
13:45 - 15:30 | DDRPWorkshops and Tutorials at Bungan Chair(s): Umang Mathur National University of Singapore, Andreas Pavlogiannis Aarhus University More information at https://sites.google.com/view/race-prediction-tutorial. | ||
15:30 - 16:00 | |||
15:30 30mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
16:00 - 17:45 | DiffPPWorkshops and Tutorials at Bondi Chair(s): Paul Hovland Argonne National Laboratory, Jan Hueckelheim Argonne National Laboratory Website with schedule: https://diffprog-ppopp.github.io/ | ||
16:00 - 17:45 | DDRPWorkshops and Tutorials at Bungan Chair(s): Umang Mathur National University of Singapore, Andreas Pavlogiannis Aarhus University More information at https://sites.google.com/view/race-prediction-tutorial. | ||
18:00 - 20:00 | Welcome ReceptionHPCA/CGO/PPoPP/CC Catering at Parkside Ballroom All attendees registered for the main conference are invited to attend the welcome reception from 18:00 on Sunday evening, where there will be great food and drink and an opportunity to engage with the vibrant HPCA/CGO/PPoPP/CC community. | ||
18:00 2hSocial Event | Welcome Reception HPCA/CGO/PPoPP/CC Catering | ||
Mon 2 FebDisplayed time zone: Hobart change
07:45 - 16:00 | |||
08:30 - 08:45 | WelcomeHPCA/CGO/PPoPP/CC Plenary Keynotes at Pyrmont Chair(s): Steve Blackburn Google and Australian National University, Tony Hosking Australian National University, Shuaiwen Leon Song Together AI and University of Sydney The conference will formally open with a Welcome to Country from a Traditional Owner of the Eora Nation where the ICC is located. Following that, the General Chairs will welcome you. | ||
08:30 15mDay opening | Welcome HPCA/CGO/PPoPP/CC Plenary Keynotes Steve Blackburn Google and Australian National University, Tony Hosking Australian National University, Shuaiwen Leon Song Together AI and University of Sydney | ||
08:45 - 09:45 | 2025 ACM/IEEE-CS Ken Kennedy AwardHPCA/CGO/PPoPP/CC Plenary Keynotes at Pyrmont Chair(s): Steve Blackburn Google and Australian National University | ||
08:45 60mKeynote | Compiler 2.0: Building the Next Generation Compilers with Machine Learning HPCA/CGO/PPoPP/CC Plenary Keynotes Saman Amarasinghe Massachusetts Institute of Technology | ||
09:50 - 11:10 | |||
09:50 20mTalk | Binary Compatible Critical Section DelegationBest Paper Award Main Conference DOI | ||
10:10 20mTalk | Hapax Locks: Scalable Value-Based Mutual Exclusion Main Conference DOI | ||
10:30 20mTalk | Fixing Non-blocking Data Structures for Better Compatibility with Memory Reclamation Schemes Main Conference DOI | ||
10:50 20mTalk | Multiverse: Transactional Memory with Dynamic Multiversioning Main Conference Gaetano Coccimiglio University of Waterloo, Trevor Brown University of Waterloo, Srivatsan Ravi University of Southern California DOI | ||
11:10 - 11:30 | |||
11:10 20mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
11:30 - 12:50 | |||
11:30 20mTalk | Rethinking Thread Scheduling under Oversubscription: A User-Space Framework for Coordinating Multi-runtime and Multi-process WorkloadsBest Paper Nominee Main Conference DOI | ||
11:50 20mTalk | Waste-Efficient Work Stealing Main Conference Kyle Singer Massachusetts Institute of Technology, Kunal Agrawal Washington University in St. Louis, TB Schardl Massachusetts Institute of Technology DOI | ||
12:10 20mTalk | DiggerBees: Depth First Search Leveraging Hierarchical Block-Level Stealing on GPUs Main Conference Yuyao Niu Barcelona Supercomputing Center, Yuechen Lu China University of Petroleum-Beijing, Weifeng Liu China University of Petroleum-Beijing, Marc Casas Barcelona Supercomputing Center DOI | ||
12:30 20mTalk | PANA: A Fine-Grained Runtime-Adaptive Load Balancing for Parallel SpMV on Multicore CPUs Main Conference Haodong Bian Tsinghua University, Youhui Zhang Tsinghua University, Xiang Fei Tsinghua University, Jianqiang Huang Qinghai University, Xiaoying Wang Qinghai University DOI | ||
12:50 - 14:10 | |||
12:50 80mLunch | Lunch HPCA/CGO/PPoPP/CC Catering | ||
14:10 - 15:30 | |||
14:10 20mTalk | UFO Trees: Practical and Provably-Efficient Parallel Batch-Dynamic TreesBest Paper Nominee Main Conference Quinten De Man University of Maryland, Atharva Sharma University of Maryland, Kishen N Gowda University of Maryland, Laxman Dhulipala University of Maryland, College Park DOI | ||
14:30 20mTalk | Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks Main Conference Ajay Singh FORTH ICS, Nikos Metaxakis , Panagiota Fatourou FORTH ICS and University of Crete, Greece DOI | ||
14:50 20mTalk | Concurrent Balanced Augmented Trees Main Conference Evan Wrench University of British Columbia, Ajay Singh FORTH ICS, Younghun Roh Massachusetts Institute of Technology, Panagiota Fatourou University of Crete & FORTH, Siddhartha Jayanti Google Research, Eric Ruppert York University, Yuanhao Wei University of British Columbia DOI | ||
15:10 20mTalk | Parallel Dynamic Spatial Indexes Main Conference Ziyang Men University of California, Riverside, Bo Huang University of California, Riverside, Yan Gu University of California, Riverside, Yihan Sun University of California, Riverside DOI | ||
15:30 - 15:50 | |||
15:30 20mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
15:50 - 17:10 | GPU and Heterogeneous ComputingMain Conference at Pyrmont Chair(s): Frank Mueller North Carolina State University, USA | ||
15:50 20mTalk | PRISM: An Efficient GPU-Based Lossy Compression Framework for Progressive Data Retrieval with Multi-Level InterpolationBest Paper Nominee Main Conference Bing Lu Institute of Computing Technology of Chinese Academy of Sciences, Zedong Liu University of Chinese Academy of Sciences, Hairui Zhao Jilin University, Dejun Luo University of Chinese Academy of Sciences, Wenjing Huang University of Chinese Academy of Sciences, Yida Gu University of Chinese Academy of Sciences, Jinyang Liu University of Houston, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences DOI | ||
16:10 20mTalk | Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications Main Conference Luke Marzen Iowa State University, Junhyung Shim Iowa State University, Ali Jannesari Iowa State University DOI | ||
16:30 20mTalk | Root-Down Exposure for Maximal Clique Enumeration on GPUs Main Conference DOI | ||
16:50 20mTalk | ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities Main Conference Weile Luo The Hong Kong University of Science and Technology, Guangzhou, Yuhan Chen The Hong Kong University of Science and Technology, Guangzhou, Xiangrui Yu The Hong Kong University of Science and Technology, Guangzhou, Qiang Wang Harbin Institute of Technology, Shenzhen, Ruibo Fan The Hong Kong University of Science and Technology, Guangzhou, Hongyuan Liu Stevens Institute of Technology, Xiaowen Chu The Hong Kong University of Science and Technology, Guangzhou DOI | ||
17:30 - 19:00 | Business MeetingMain Conference at Cronulla Chair(s): Tony Hosking Australian National University, Madan Musuvathi Microsoft Research, Kenjiro Taura The University of Tokyo | ||
17:30 90mMeeting | PPoPP Business Meeting Main Conference | ||
Tue 3 FebDisplayed time zone: Hobart change
08:15 - 16:00 | |||
08:45 - 09:45 | Plenary KeynoteHPCA/CGO/PPoPP/CC Plenary Keynotes at Pyrmont Chair(s): Tony Hosking Australian National University | ||
08:45 60mKeynote | Oracle Parfait – Scaling Vulnerability Detection from Enterprise Systems to Cloud-Scale Systems and Beyond HPCA/CGO/PPoPP/CC Plenary Keynotes Cristina Cifuentes Oracle Software Assurance | ||
09:50 - 11:10 | Stencil and Sparse Matrix ComputationMain Conference at Pyrmont Chair(s): Shoaib Kamil Adobe Research | ||
09:50 20mTalk | SPIDER: Unleashing Sparse Tensor Cores for Stencil Computation via Strided Swapping Main Conference Qiqi Gu Shanghai Jiao Tong University, Chenpeng Wu Shanghai Jiao Tong University, Heng Shi , Jianguo Yao Shanghai Jiao Tong University; Shanghai Enflame Technology DOI | ||
10:10 20mTalk | ASM-SpMM: Unleashing the Potential of Arm SME for Sparse Matrix Multiplication Acceleration Main Conference Jiazhi Jiang Sun Yat-sen University, Xijia Yao Sun Yat-sen University, Jiayu Chen Sun Yat-sen University, jinhui wei Sun Yat-sen University, Dan Huang , Yutong Lu Sun Yat-sen University DOI | ||
10:30 20mTalk | Exploiting Efficient Mapping and Pipelined Execution for Accelerating SpMV on Tensor Cores Main Conference Kaige Zhang Beihang University, Hailong Yang Beihang University, Xin You Beihang University, Tianyu Feng Beihang University, Yufan Xu Independent Researcher, Zhongzhi Luan Beihang University, Yi Liu Beihang University, Depei Qian Beihang University DOI | ||
10:50 20mTalk | VDHA: Vector-Driven Hash Aggregation for Sparse Matrix-Sparse Vector Multiplication on GPUs Main Conference Yuchen Li Tsinghua University, Zhe Pan Tsinghua University, Peng Qu Tsinghua University, Youhui Zhang Tsinghua University DOI | ||
11:10 - 11:30 | |||
11:10 20mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
11:30 - 12:50 | Mixed Precision and QuantizationMain Conference at Balmoral Chair(s): Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences | ||
11:30 20mTalk | RoMeo: Mitigating Dual-dimensional Outliers with Rotated Mixed Precision Quantization Main Conference Qihao Zhang Tsinghua University, MingLiang Tang Tsinghua University, Mingshu Zhai Tsinghua University, Kinman Lei Tsinghua University, Jidong Zhai Tsinghua University DOI | ||
11:50 20mTalk | High-Throughput Non-Uniformly Quantized 3-bit LLM Inference Main Conference YuAng Chen Chinese University of Hong Kong, Wenqi Zeng Hong Kong University of Science and Technology, Jeffrey Xu Yu Chinese University of Hong Kong DOI | ||
12:10 20mTalk | JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-Context Inference Main Conference Chengyu Sun Wuhan University, Yaqi Xia Wuhan University, Hulin Wang , Donglin Yang Nvidia Corporation, Xiaobo Zhou University of Macau, Dazhao Cheng WuHan University DOI | ||
12:30 20mTalk | HierCut: Enabling 16-bit Format Mixed Precision for Molecular Dynamics through Hierarchical Cutoff Main Conference zeyu song Tsinghua University, Lin Gan Tsinghua University, Xiaohui Duan Shandong University, Jiayu Fu Tsinghua University, Zhengrui Li Tsinghua University, Yinuo Wang Tsinghua University, Guangzhao Li Chinese Academy of Sciences, Guangwen Yang Tsinghua University DOI | ||
11:30 - 12:50 | Cluster and Cloud ComputingMain Conference at Pyrmont Chair(s): Ruslan Nikolaev Pennsylvania State University | ||
11:30 20mTalk | Cacheman: A Comprehensive Last-Level Cache Management System for Multi-tenant Clouds Main Conference Xiaokang Hu Alibaba Cloud Computing, Yuchao Cao Alibaba Cloud Computing, Naixuan Guan Alibaba Cloud Computing, Yifan Wu Alibaba Cloud Computing, Xishi Qiu Alibaba Cloud Computing, Shengdong Dai Alibaba Cloud Computing, Ben Luo Alibaba Cloud Computing, Sanchuan Cheng Alibaba Cloud Computing, Fudong Qiu Alibaba Cloud Computing, Yibin Shen Alibaba Cloud, Jiesheng Wu Alibaba Cloud Computing DOI | ||
11:50 20mTalk | zBuffer: Zero-Copy and Metadata-Free Serialization for Fast RPC with Scatter-Gather Reflection Main Conference Xiangyu Liu Xiamen University, Huiba Li Alibaba, Shun Gai Alibaba, Youmin Chen Shanghai Jiao Tong University, Yiming Zhang Xiamen University DOI | ||
12:10 20mTalk | Scaling GPU-to-CPU Migration for Efficient Distributed Execution on CPU Clusters Main Conference DOI | ||
12:30 20mTalk | Trojan Horse: Aggregate-and-Batch for Scaling Up Sparse Direct Solvers on GPU ClustersBest Paper Nominee Main Conference Yida Li China University of Petroleum-Beijing, Siwei Zhang China University of Petroleum-Beijing, Yiduo Niu China University of Petroleum-Beijing, Yang Du China University of Petroleum-Beijing, Qingxiao Sun China University of Petroleum-Beijing, Zhou Jin China University of Petroleum-Beijing, Weifeng Liu China University of Petroleum-Beijing DOI | ||
12:50 - 14:10 | |||
12:50 80mAwards | HPCA Awards Lunch HPCA/CGO/PPoPP/CC Catering | ||
12:50 - 14:10 | |||
12:50 80mLunch | Lunch HPCA/CGO/PPoPP/CC Catering | ||
14:10 - 15:30 | |||
14:10 20mTalk | COCCL: A Collective Communication Library Supporting Easy Integration and Configuration of Customized Compression for Scalable LLM Training Main Conference Xingchen Liu University of Chinese Academy of Sciences, Haoran Kong Chinese University of Hong Kong, Shenzhen, Hairui Zhao Jilin University, Shengkai Lyu University of Chinese Academy of Sciences, Zheng Wei University of Chinese Academy of Sciences, Man Liu University of Chinese Academy of Sciences, Xingjian Tian University of Chinese Academy of Sciences, Liyang Zhao University of Chinese Academy of Sciences, Zhuohan Chen University of Chinese Academy of Sciences, Fakang Wang Ant Group, Zizhong Chen Chinese University of Hong Kong, Shenzhen, Zhan Wang University of Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences DOI | ||
14:30 20mTalk | Elastor: Elastic and Efficient Model Partitioning and Checkpointing for Fault-Tolerant Distributed Training Main Conference Xuanyu Wang Peking University, Fangcheng FU Shanghai Jiao Tong University, Haoyang Li Peking University, Hao Ge Peking University, Sheng Lin Peking University, Jiawen Niu Peking University, Bin Cui Peking University DOI | ||
14:50 20mTalk | HelixPipe: Efficient Distributed Training of Long Sequence Transformers with Attention Parallel Pipeline Parallelism Main Conference Geng Zhang National University of Singapore, Shenggan Cheng National University of Singapore, Xuanlei Zhao National University of Singapore, Ziming Liu , Yang You National University of Singapore DOI | ||
15:10 20mTalk | CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model TrainingBest Paper Nominee Main Conference Yida Gu University of Chinese Academy of Sciences, Fakang Wang AntGroup, Jianhao Fu AntGroup, Zhenhang Sun Ant Group, Qianyu Zhang Ant Group, Hairui Zhao Jilin University, Xingchen Liu University of Chinese Academy of Sciences, Yang Tian Ant Group, Wenjing Huang University of Chinese Academy of Sciences, Zedong Liu University of Chinese Academy of Sciences, Yifan Chen Ant Group, Jinwu Yang University of Chinese Academy of Sciences, Yueyuan Zhou University of Chinese Academy of Sciences, Qian Zhao Ant Group, Haoxu Li University of Chinese Academy of Sciences, Tao Wang Ant Group, Feng Yu Ant Group, Zhan Wang University of Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences DOI | ||
14:10 - 15:30 | |||
14:10 20mTalk | Pipelonk: Accelerating End-to-End Zero-Knowledge Proof Generation on GPUs for PLONK-Based Protocols Main Conference Zhiyuan Zhang Shandong University, Yanxin Cai Shandong University, Wenhao Yin Shandong University, Xueyu Wu The University of Hong Kong, Yi Wang Shenzhen University, Lei Ju Shandong University, Zhuoran Ji Shandong University DOI | ||
14:30 20mTalk | ParDiff: Efficiently Parallelizing Reverse-Mode Automatic Differentiation with Direct Indexing Main Conference Shuhong Huang Tsinghua University, Shizhi Tang Qingcheng.AI, Yuan Wen University of Aberdeen, Huanqi Cao Tsinghua University, Ruibai Tang Tsinghua University, yidong chen , Jiping Yu Tsinghua University, Yang Li Lenovo Research, Chao Jiang Lenovo Research, Limin Xiao Lenovo Research, Jidong Zhai Tsinghua University DOI | ||
14:50 20mTalk | Faster and Cheaper: Pushing the Sequence Alignment Throughput with Commercial CPUs Main Conference Zhonghai Zhang Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Yewen Li The Hong Kong University of Science and Technology, Ke Meng Chinese Academy of Sciences, Chunming Zhang Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan University of Chinese Academy of Sciences DOI | ||
15:10 20mTalk | PIM-zd-tree: A Fast Space-Partitioning Index Leveraging Processing-in-Memory Main Conference Yiwei Zhao Carnegie Mellon University, Hongbo Kang Tsinghua University, Ziyang Men University of California, Riverside, Yan Gu University of California, Riverside, Guy E. Blelloch Carnegie Mellon University, Laxman Dhulipala University of Maryland, College Park, Charles McGuffey Reed College, Phil Gibbons Carnegie Mellon University DOI | ||
15:30 - 15:50 | |||
15:30 20mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||
15:50 - 17:10 | |||
15:50 20mTalk | BEEMS: Boosting Machine Vision Efficiency via Computation Graph-Based Memory Smoothing Main Conference Hanjing Shen Shanghai Jiao Tong University, Fangxin Liu Shanghai Jiao Tong University, Jian Liu Beijing University of Aeronautics and Astronautics, Li Jiang Shanghai Jiaotong University, Haibing Guan Shanghai Jiao Tong University DOI | ||
16:10 20mTalk | Laser: Unlocking Layer-Level Scheduling for Efficient Multi-SLO LLM Serving Main Conference Jianxiong Liao Sun Yat-sen University, Quanxing Dong Sun Yat-sen University, Yunkai Liang Sun Yat-sen University, Zhi Zhou Sun Yat-sen University, Xu Chen Sun Yat-sen University DOI | ||
16:30 20mTalk | MixFusion: A Patch-Level Parallel Serving System for Mixed-Resolution Diffusion Models Main Conference DOI | ||
16:50 20mTalk | ChituDiffusion: A Data-Characteristic-Aware Serving System for Diffusion Models Main Conference Chengzhang Wu Tsinghua University, Liyan Zheng Tsinghua University, Haojie Wang Tsinghua University, Kezhao Huang Tsinghua University, Zixuan Ma Tsinghua University, Dong Dong , Jidong Zhai Tsinghua University DOI | ||
15:50 - 17:10 | Graphs and Graph Neural NetworksMain Conference at Pyrmont Chair(s): Ali Jannesari Iowa State University | ||
15:50 20mTalk | ElasGNN: An Elastic Training Framework for Distributed GNN Training Main Conference Siqi Wang Beihang University, Hailong Yang Beihang University, Pengbo Wang Beihang University, Hongliang Cao Beihang University, Yufan Xu Independent Researcher, Xuezhu Wang Beihang University, Zhongzhi Luan Beihang University, Yi Liu Beihang University, Depei Qian Beihang University DOI | ||
16:10 20mTalk | APERTURE: Algorithm-System Co-optimization for Temporal Graph Network Inference Main Conference Yiqing Wang Beihang University, Hailong Yang Beihang University, Enze Yu Beihang University, Qingxiao Sun Beihang University, Kejie Ma Beihang University, Kaige Zhang Beihang University, chenhao xie Beihang University, Depei Qian Beihang University DOI | ||
16:30 20mTalk | TAC: Cache-Based System for Accelerating Billion-Scale GNN Training on Multi-GPU Platform Main Conference Zhiqiang Liang , Hongyu Gao , Fang Liu Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences, Jue Wang Computer Network Information Center, Chinese Academy of Sciences;University of Chinese Academy of Sciences, Xingguo Shi University of Chinese Academy of Sciences, Juyu Gu University of Chinese Academy of Sciences, Peng Di Ant Group & UNSW, San Li University of Chinese Academy of Sciences, Lei Tang University of Chinese Academy of Sciences, Chunbao Zhou University of Chinese Academy of Sciences, Lian Zhao University of Chinese Academy of Sciences, yangang wang University of Chinese Academy of Sciences, Xuebin Chi University of Chinese Academy of Sciences DOI | ||
16:50 20mTalk | DTMiner: A Data-Centric System for Efficient Temporal Motif Mining Main Conference hou yinbo Huazhong University of Science and Technology, Hao Qi Huazhong University of Science and Technology, Ligang He University of Warwick, Jin Zhao Huazhong University of Science and Technology, Yu Zhang School of Computer Science and Technology, Huazhong University of Science and Technology, Hui Yu Hong Kong University of Science and Technology, Longlong Lin Southwest University, Lin Gu Huazhong University of Science and Technology, Wenbin Jiang Huazhong University of Science and Technology, XIAOFEI LIAO Huazhong University of Science and Technology, Hai Jin Huazhong University of Science and Technology DOI | ||
17:15 - 18:15 | Optimizing TransformersMain Conference at Pyrmont Chair(s): Shaoshuai Zhang University of Electronic Science and Technology of China | ||
17:15 20mTalk | FlashAttention-T: Towards Fully Tensorized Attention by Exploiting Tensor-Vector Parallelism Main Conference Jianxing Xu University of Science and Technology of China, Yuanbo Wen , Jun Bi Chinese Academy of Sciences, Ruibai Xu University of Science and Technology of China, Guanglin Xu Chinese Academy of Sciences, Rui Zhang Chinese Academy of Sciences, Wei Li Chinese Academy of Sciences, Ling Li Institute of Software, Chinese Academy of Sciences, Tianshi Chen Cambricon Technologies, Qi Guo Chinese Academy of Sciences, Yunji Chen Chinese Academy of Sciences DOI | ||
17:35 20mTalk | Accelerating Sparse Transformer Inference on GPU Main Conference Wenhao Dai China University of Petroleum-Beijing, Haodong Deng China University of Petroleum, Mengfei Rong China University of Petroleum, Xinyu Yang Beihang University, Hongyu Liu Baidu Inc., Fangxin Liu Shanghai Jiao Tong University, Hailong Yang Beihang University, Qianwen Cao China University of Petroleum, Qingxiao Sun Beihang University DOI | ||
17:55 20mTalk | MetaAttention: A Unified and Performant Attention Framework Across Hardware Backends Main Conference Feiyang Chen Shanghai Jiao Tong University, Yu Cheng Peking University, Lei Wang Peking University, Yuqing Xia Microsoft Research, Ziming Miao Microsoft Research, Lingxiao Ma Microsoft Research, Fan Yang Microsoft Research Asia, Jilong Xue Microsoft Research, Zhi Yang Peking University, Mao Yang Microsoft Research, Xingda Wei Shanghai Jiao Tong University, Haibo Chen Shanghai Jiao Tong University DOI | ||
18:30 - 21:30 | |||
18:30 3hSocial Event | Excursion HPCA/CGO/PPoPP/CC Catering | ||
Wed 4 FebDisplayed time zone: Hobart change
08:15 - 10:00 | |||
08:30 - 08:45 | |||
08:30 15mDay opening | Didgeridoo Performance HPCA/CGO/PPoPP/CC Plenary Keynotes | ||
08:45 - 09:45 | |||
08:45 60mKeynote | Architecting Resilience at Scale: From Research to Practice HPCA/CGO/PPoPP/CC Plenary Keynotes | ||
09:50 - 11:10 | Matrix and Linear Algebra AlgorithmsMain Conference at Pyrmont Chair(s): Tony Hosking Australian National University | ||
09:50 20mTalk | Towards Singular Value Decomposition for Rank-Deficient Matrices: An Efficient and Accurate Algorithm on GPU Architectures Main Conference Lu Shi University of Electronic Science and Technology of China, WeiWei Xu Nanjing University of Information Science and Technology, Shaoshuai Zhang University of Electronic Science and Technology of China DOI | ||
10:10 20mTalk | A Diagonal Block Memory-Aware Polynomial Preconditioner for Linear and Eigenvalue Solvers Main Conference Xiaojian Yang National University of Defense Technology, Yuhui Ni National University of Defense Technology, Fan Yuan Xiangtan University, Shengguo Li National University of Defense Technology, Dezun Dong NUDT, xuchuanfu National University of Defense Technology, Haipeng Jia Jia, Jie Liu National University of Defense Technology DOI | ||
10:30 20mTalk | A Distributed Matrix-Block-Vector Multiplication in Presence of System Performance Variability Main Conference Yuchen Ma College of William & Mary, Bin Ren College of William & Mary, Andreas Stathopoulos College of William & Mary DOI | ||
10:50 20mTalk | Characterizing Matrix Multiplication Units across General Parallel Patterns in Scientific Computing Main Conference Yuechen Lu China University of Petroleum-Beijing, Hongwei Zeng , Marc Casas Barcelona Supercomputing Center, Weifeng Liu China University of Petroleum-Beijing DOI | ||
11:10 - 11:30 | |||
11:10 20mCoffee break | Break HPCA/CGO/PPoPP/CC Catering | ||