ICDE 2026: Detailed Program

Program last updated on March 23, 2026.

Research Papers

[R-1] Indexing Structures and Physical Database Design
Time: Tuesday, May 5, 10:00 - 12:00
Location: Av. Duluth
Track: Query Processing, Indexing, and Optimization
Session Chair: [To Be Announced]
  • DIndex: An Efficient On-Disk Learned Index for Memory-Constrained Environments
    Jiahuan Shen (Shanghai Jiao Tong University), Chuzhe Tang (Shanghai Jiao Tong University), Haoning Lan (Shanghai Jiao Tong University), Ren Ren (Huawei Technologies Co., Ltd.), Zhaoguo Wang (Shanghai Jiao Tong University)
  • PRO-HNSW: Proactive Repair and Optimization for High-Performance Dynamic HNSW Indexes
    Huijun Jin (Yonsei University), Jieun Lee (Yonsei University), Shengmin Piao (Yonsei University), Sangmin Seo (Yonsei University), Sanghyun Park (Yonsei University)
  • BS-tree: A gapped data-parallel B-tree
    Dimitrios Tsitsigkos (Athena RC), Achilleas Michalopoulos (University of Ioannina), Nikos Mamoulis (University of Ioannina), Manolis Terrovitis (Athena RC)
  • Updatable Balanced Index for Fast On-device Search with Auto-selection Model
    Yushuai Ji (Wuhan University), Sheng Wang (Wuhan University), Zhiyu Chen (Amazon), Yuan Sun (La Trobe University), Zhiyong Peng (Wuhan University)
  • Mitigating Dual Load Imbalance via Dynamic Cooperative Scheduling in Distributed Key-Value Stores
    Jiakun Zhang (University of Science and Technology of China), Patrick P. C. Lee (The Chinese University of Hong Kong), Wenzhe Zhu (University of Science and Technology of China), Yongkun Li (University of Science and Technology of China), Shuyi Zhang (University of Science and Technology of China), Yinlong Xu (University of Science and Technology of China)
  • Fast Content-Aware Influence Maximization Query Answering by labeling Index
    Xingliang Lv (Zhejiang University), Qihao Shi* (Zhejiang University), Can Wang (Zhejiang University), Mingli Song (Zhejiang University), Wenliang Du (Zhejiang University), Wujian Yang (Hangzhou City University), Guanlin Chen (Hangzhou City University)
  • One Size Does NOT Fit All: On the Importance of Physical Representations for Datalog Evaluation [Experiment, Analysis, and Benchmark]
    Nick Rassau (Johannes Gutenberg University Mainz), Felix Schuhknecht* (Johannes Gutenberg University Mainz)
[R-2] Spatiotemporal Prediction and Urban Computing
Time: Tuesday, May 5, 10:00 - 12:00
Location: Rue McGill
Track: Spatial Databases and Temporal Databases
Session Chair: [To Be Announced]
  • Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision
    Yuyang Xia (University of Electronic Science and Technology of China), Zibo Liang (University of Electronic Science and Technology of China), Liwei Deng (University of Electronic Science and Technology of China), Yan Zhao (University of Electronic Science and Technology of China), Han Su (University of Electronic Science and Technology of China), Kai Zheng (University of Electronic Science and Technology of China)
  • SaSPartitioner: A Self-adaptive Streaming Partitioner using Deep Reinforcement Learning
    Shenghao Gong (Zhejiang University), Liu Liu (Zhejiang University), Ziquan Fang (Zhejiang University), Yunjun Gao (Zhejiang University), Yaofeng Tu (ZTE Corporation)
  • Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction
    Rui AN (The Hong Kong Polytechnic University), Yifeng Zhang (The Hong Kong Polytechnic University), Ziran Liang (The Hong Kong Polytechnic University), Wenqi Fan (The Hong Kong Polytechnic University), Yuxuan Liang (The Hong Kong University of Science and Technology (Guangzhou)), Xuequn Shang (Northwestern Polytechnical University), Qing Li (The Hong Kong Polytechnic University)
  • Online Multi-Modal Spatio-Temporal Prediction: A Reinforcement Learning and Dynamic Contrastive Framework
    Ziquan Fang* (Zhejiang University), Tinghui Luo (Zhejiang University), Xiaole Pan (Zhejiang University), Lu Chen (Zhejiang University), Surun Ji (iQIYI Inc), Mingfan Lu (iQIYI Inc)
  • City-wide Origin-destination Matrix Generation via Cascaded Graph Denoising Diffusion
    Can Rong* (Singapore-MIT Alliance for Research and Technology (SMART)), Jingtao Ding (Tsinghua University), Zhicheng Liu (Alibaba Group), Peng Lu (PKU-Wuhan Institute for Artificial Intelligence), Yong Li (Tsinghua University)
  • VisiFold: Long-Term Traffic Forecasting via Temporal Folding Graph and Node Visibility
    Zhiwei Zhang* (Beijing Jiaotong University), Xinyi Du (Beijing Normal University), Weihao Wang (Beijing Jiaotong University), Xuanchi Guo (Beijing Jiaotong University), Wenjuan Han (Beijing Jiaotong University)
  • Data-Segmentation Prompt based Continual Learning Framework for Online Spatio-Temporal Prediction
    Banglie Yang (Sichuan University), Liwei Deng* (Aalborg University), Cheng Dai (Sichuan University), Kai Zheng (University of Electronic Science and Technology of China)
[R-3] Benchmarking, Testing and Evaluation of DB Systems
Time: Tuesday, May 5, 10:00 - 12:00
Location: Rue Sherbrooke
Track: AI-based DB Tuning, Benchmarks and Performances
Session Chair: [To Be Announced]
  • Benchmarking RL-Enhanced Spatial Indices Against Traditional, Advanced, and Learned Counterparts [Experiment, Analysis, and Benchmark]
    Guanli Liu (The University of Melbourne), Renata Borovica-Gajic (The University of Melbourne), Hai Lan (RMIT University), Zhifeng Bao (RMIT University)
  • GPU-Accelerated OLTP: An In-Depth Analysis of Concurrency Control Schemes [Experiment, Analysis, and Benchmark]
    Zihan Sun (Tsinghua University), Yuyu Luo (HKUST(GZ)), Yong Zhang (Tsinghua University), Chao Li (Tsinghua University), Chunxiao Xing (Tsinghua University)
  • Tetris: Lightweight Hyperparameter Auto-Tuning for Mitigating Performance Spikes in LSM-KVS
    YINA LV* (Xiamen University), Wenhao Zhu (Xiamen University), Qiao Li (Mohamed bin Zayed University of Artificial Intelligence), Quanqing Xu (OceanBase, Ant Group), Congming Gao (Xiamen University), Chuanhui Yang (OceanBase, Ant Group), Xiaoli Wang (Xiamen University), Chun Jason Xue (Mohamed bin Zayed University of Artificial Intelligence)
  • Distance Comparison Operations Are Not Silver Bullets in Vector Similarity Search: A Benchmark Study on Their Merits and Limits [Experiment, Analysis, and Benchmark]
    Zhuanglin Zheng (Beihang University), Yuxiang Zeng (Beihang University), Chenchen Liu (Beihang University), Yunzhen Chi (Beihang University), Binhan Yang (Beihang University), Yongxin Tong* (Beihang University)
  • WikiDBGraph: A Data Management Benchmark Suite for Collaborative Learning over Database Silos [Experiment, Analysis, and Benchmark]
    Zhaomin Wu* (National University of Singapore), Ziyang Wang (National University of Singapore), Bingsheng He (National University of Singapore)
  • Vireo: Human-in-the-Loop DBMS Fuzzing with Visualization and LLM Support
    Jie Liang* (Beihang University), Zhiyong Wu (Tsinghua University), Jingzhou Fu (Tsinghua University), Chi Zhang (Tsinghua University), Runpei Miao (Beihang University), Zhuo Su (Beihang University), Yu Jiang (Tsinghua University), Shuai Ma (Beihang University)
  • HCT-QA: A Benchmark for Question Answering on Human-Centric Tables [Experiment, Analysis, and Benchmark]
    Mohammad Shahmeer Ahmad* (QCRI, HBKU), Zan Naeem (QCRI, HBKU), Michael Aupetit (QCRI, HBKU), Ahmed Elmagarmid (QCRI, HBKU), Mohamed Eltabakh (QCRI, HBKU), Xiaosong Ma (MBZUAI), Mourad Ouzzani (QCRI, HBKU), Chaoyi Ruan (NUS), Hani Al-Sayeh (QCRI, HBKU)
[R-4] Data Quality, Repair and Outlier Detection
Time: Tuesday, May 5, 10:00 - 12:00
Location: Rue Mansfield
Track: Information Integration and Data Quality
Session Chair: [To Be Announced]
  • PROCore: Robust Core-set Selection via Pareto Multi-dimensional Optimization from Noisy Data
    Xiaoou Ding (Harbin Institute of Technology), Hongbin Hu (Harbin Institute of Technology), Songnan Jiang (Harbin Institute of Technology), Muyun Zhou (Harbin Institute of Technology), Chen Wang (Tsinghua university), Jingru Yang (National Key Laboratory of Data Space Technology and System), Hongzhi Wang* (Harbin Institute of Technology)
  • Truth ≠ Frequency: Leveraging Dependencies for Subset Repair
    Haoda Li (Nankai University), Jiahui Chen (Nankai University), Yu Sun* (Nankai University), Shaoxu Song (Tsinghua University), Haiwei Zhang (Nankai University), Xiaojie Yuan (Nankai University)
  • RFOD: Random Forest-based Outlier Detection for Mixed-Type Tabular Data
    Yihao Ang (National University of Singapore), Peicheng Yao (National University of Singapore), Yifan Bao (National University of Singapore), Yushuo Feng (Huazhong University of Science and Technology), Qiang Huang* (Harbin Institute of Technology (Shenzhen)), Anthony K. H. Tung (National University of Singapore), Zhiyong Huang (National University of Singapore)
  • TORepair: Diffusion-based Task-Oriented Error Repair via Differentiable Bi-Level Optimization
    Wei Ni (Zhejiang University; City University of Hong Kong), Xiaoye Miao* (Zhejiang University), Xiangyu Zhao (City University of Hong Kong), Yangyang Wu (Zhejiang University), Jianwei Yin
  • EDDI: Explainable Data Drift Monitoring using Influence
    Nikolaos Myrtakis* (University of Crete), Andrea Castellani (Honda Research Institute Europe GmbH), Ioannis Tsamardinos (University of Crete), Vassilis Christophides (ENSEA, CY Cergy Paris University, CNRS)
  • Analysis of Candidate Keys in Relational Databases
    Zihui Yang (University of Auckland), Yuqian Ma (University of Auckland), Sebastian Link* (University of Auckland)
  • Representative Functional Dependencies
    Qiongqiong Lin (Zhejiang University), Jingyan Sai (Alibaba Group), Jiazheng Song (Zhejiang University), Jinfei Liu* (Zhejiang University), Kui Ren (Zhejiang University), Tianzhen Wang (Alibaba Group), Yanbei Pang (Alibaba Group), Feifei Li (Alibaba Group)
[R-5] Blockchain Protocols, Storage and Smart Contracts
Time: Tuesday, May 5, 10:00 - 12:00
Location: Rue Crescent
Track: Distributed Ledgers and Blockchains
Session Chair: [To Be Announced]
  • RoarChain: A Robust Sharding Blockchain System for Enterprise Consortium
    Yuan Sui (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Yujie Zhang (Northeastern University), Lina Wang (Wuhan University)
  • Banknote-Chain: Achieving User-Incentivized Parallelism in Blockchain via a Banknote-Inspired Transaction Model
    Zhiyu Ma (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Xiaofeng Li (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), He Zhao (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Tong Zhou (Hefei Institutes of Physical Science, Chinese Academy of Sciences; Anhui ZhongKeJingGe Technology Co., Ltd), Nianzu Sheng (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Haotian Cheng (University of Science and Technology of China; Hefei Institutes of Physical Science, Chinese Academy of Sciences)
  • Chubby: Robust Smart Contract Execution Against Dependency Over-declaration
    Junyu Wei* (East China Normal University), Xiaodong Qi (Nanyang Technological University), Qifeng Que (East China Normal University), Zhao Zhang (East China Normal University), Yanqin Yang (East China Normal University), Cheqing Jin (East China Normal University)
  • COLE+: Towards Practical Column-based Learned Storage for Blockchain Systems
    Ce Zhang (Hong Kong Baptist University), Cheng Xu (Hong Kong Baptist University), Haibo Hu (Hong Kong Polytechnic University), Jianliang Xu* (Hong Kong Baptist University)
  • SpendableStore: A UTXO-based Decentralized Data Store
    YINAN ZHOU* (UNIVERSITY OF CALIFORNIA), Faisal Nawab (UNIVERSITY OF CALIFORNIA, Irvine)
  • Geco: A Confidentiality-Preserving and High-Performance Permissioned Blockchain Framework for General Smart Contracts
    Songxiao Guo (The University of Hong Kong), Rongxin Guan (The University of Hong Kong), Ji Qi* (Institute of Software Chinese Academy of Sciences), Zongyuan Zhang (The University of Hong Kong), Tianyang Duan (The University of Hong Kong), Sen Wang (Huawei Technologies), Yanjun Wu (Institute of Software Chinese Academy of Sciences), Heming Cui (The University of Hong Kong)
  • HYDRA: Breaking the Global Ordering Barrier in Multi-BFT Consensus
    Hanzheng Lyu* (University of British Columbia), Shaokang Xie (University of California, Davis), Jianyu Niu (City University of Hong Kong), Mohammad Sadoghi (University of California, Davis), Yinqian Zhang (Southern University of Science and Technology), Cong Wang (City University of Hong Kong), Ivan Beschastnikh (University of British Columbia), Chen Feng (University of British Columbia)
[R-6] LLMs for Database Optimization and Administration
Time: Tuesday, May 5, 13:30 - 15:00
Location: Av. Duluth
Track: AI for Data Management
Session Chair: [To Be Announced]
  • MVGPT: Generative Materialized View Forecasting
    Yue Han (Tsinghua University), Guoliang Li* (Tsinghua University), Wenchun Xu (Alibaba Group), Xianglei Ran (Alibaba Group), ZeYa Gong (Alibaba Group), Wei Guo (Alibaba), Guang Qiu (Alibaba Group), Bo Zheng (Alibaba Group)
  • MINT: Multi-Vector Search Index Tuning
    Jiongli Zhu, Yue Wang*, Bailu Ding, Philip Bernstein, Vivek Narasayya, Surajit Chaudhuri
  • LLM4Hint: Leveraging Large Language Models for Hint Recommendation in Offline Query Optimization
    Suchen Liu (Peking University), Yang Lin (ZTE Corporation), Yinjun Han (ZTE Corporation), Jun Gao* (Peking University)
  • LLMSQLMUTATOR: LLM-Powered Test Case Generation for Database Using Bug Reports
    Chenglin Tian* (Beijing University of Posts and Telecommunications), Chaofan Li (Beijing University of Posts and Telecommunications), Yawen Li (Beijing University of Posts and Telecommunications), Yingxia Shao (Beijing University of Posts and Telecommunications)
  • LLMIA: An Out-of-the-Box Index Advisor via In-Context Learning with LLMs
    Xinxin Zhao* (Renmin University of China), Xinmei Huang (Renmin University of China), Haoyang Li (Renmin University of China), Jing Zhang (Renmin University of China), Shuai Wang (ByteDance), Tieying Zhang (ByteDance), Jianjun Chen (ByteDance), Rui Shi (ByteDance), Cuiping Li (Renmin University of China), Hong Chen (Renmin University of China)
[R-7] Approximate, Vector and Pub-Sub Query Processing
Time: Tuesday, May 5, 13:30 - 15:00
Location: Rue McGill
Track: Query Processing, Indexing, and Optimizatio
Session Chair: [To Be Announced]
  • Semantic Publish/Subscribe over Evolving Topics
    Yiming Yao* (University of Electronic Science and Technology of China), Lisi Chen (University of Electronic Science and Technology of China), Shuo Shang (University of Electronic Science and Technology of China)
  • UTune: Towards Uncertainty-Aware Online Index Tuning
    Chenning Wu (Fudan University), Sifan Chen (Fudan University), Wentao Wu (Microsoft Research), Yinan Jing* (Fudan University), Zhengying He (Fudan University), Kai Zhang (Fudan University), X.Sean Wang (Fudan University)
  • A-Scan: Efficient Scale-up Analytics via Throughput-Guided Data Movement
    Hamish Nicholson* (EPFL), Aunn Raza (Oracle), Viktor Sanca (Oracle), Anastasia Ailamaki (EPFL)
  • SINDI: An Efficient Index for Sparse Vector Approximate Maximum Inner Product Search
    Ruoxuan Li (ECNU), Xiaoyao Zhong (Ant Group), Jiabao Jin (Ant Group), Peng Cheng* (Tongji University), Wangze Ni (Zhejiang University), Zhitao Shen (Ant Group), Wei Jia (Ant Group), Xiangyu Wang (Ant Group), Heng Tao Shen (Tongji University), Jingkuan Song (Tongji University)
  • Systematic Evaluation of Plan-based Adaptive Query Processing [Experiment, Analysis, and Benchmark]
    Pei Mu* (University of Edinburgh), Anderson Chaves Carniel (Huawei Technologies Research & Development (UK) Limited), Antonio Barbalace (University of Edinburgh), Amir Shaikhha (University of Edinburgh)
[R-8] Table Integration, Schema Matching and Data Markets
Time: Tuesday, May 5, 13:30 - 15:00
Location: Rue Sherbrooke
Track: Information Integration and Data Quality
Session Chair: [To Be Announced]
  • A Unified Framework for Compressed and Encrypted Text Direct Processing
    Yani Liu (Renmin University of China), Feng Zhang* (Renmin University of China), Yu Zhang (Renmin University of China), Siqi Ma (University of New South Wales), Elisa Bertino (Purdue University), Xiaoyong Du (Renmin University of China)
  • Revisiting Single-Table Retrieval: An Open Problem Under 360° Stress Tests [Experiment, Analysis, and Benchmark]
    Chenyu Yang (HKUST(GZ)), Junhao Li (HKUST(GZ)), Ziyu Jiang (HKUST(GZ)), Yuyu Luo (HKUST(GZ)), Ju Fan (Renmin University of China), Nan Tang* (HKUST(GZ))
  • Novel Table Search
    Besat Kassaie* (University of Waterloo), Renee J. Miller (University of Waterloo)
  • Label-Constrained Column Annotation with Language Models and Graph Neural Networks
    Duo Yang (KU Leuven), Ioannis Dasoulas (KU Leuven), Anastasia Dimou* (KU Leuven)
  • Information Leakage from Prices in Query-based Data Markets
    Teng Tu (Zhejiang University), Huanhuan Peng (Zhejiang University), Xiaoye Miao* (Zhejiang University), Guanjie Cheng (Zhejiang University), Shuiguang Deng (Zhejiang University), JIanwei Yin (Zhejiang University)
[R-9] Edge Computing, IoT and Streaming Applications
Time: Tuesday, May 5, 13:30 - 15:00
Location: Rue Mansfield
Track: Data Stream Systems and Edge Computing
Session Chair: [To Be Announced]
  • FLASH Viterbi: Fast and Adaptive Viterbi Decoding for Modern Data Systems
    Ziheng Deng (Northeastern University), Xue Liu (Northeastern University), Jiantong Jiang (The University of Western Australia), Yankai Li (Northeastern University), Qingxu Deng* (Northeastern University), Xiaochun Yang (Northeastern University)
  • Deferred Flushing for Out-of-Order Arrivals in Apache IoTDB
    Xiaojian Zhang (Tsinghua University), Zhiheng Liu (Tsinghua University), Shaoxu Song* (Tsinghua University), Xiangdong Huang (Tsinghua University), Chen Wang (Tsinghua University), Jianmin Wang (Tsinghua University)
  • EC-RAG: Towards Efficient Edge-Cloud Retrieval-Augmented Generation Systems
    Liang Wang* (Huazhong University of Science and Technology), Kai Wang (Huazhong University of Science and Technology), Ranjun Jia (Huazhong University of Science and Technology), Kai Lu (Huazhong University of Science and Technology), Jiguang Wan (Huazhong University of Science and Technology), Hao Huo (PingCAP), Yulong Zhai (PingCAP), Zhiyuan Liang (PingCAP), Di Wang (PingCAP)
  • ShareFlow: An Efficient Framework for Multi-Query Continuous Subgraph Matching
    Peiqi Yuan (Southern University of Science and Technology), Zhaohang Feng (Southern University of Science and Technology), Ruiqi Xu (Beijing Institute of Technology, Zhuhai), Keming Li (University of California, Irvine), Rui Mao (Shenzhen University), Bo Tang* (Southern University of Science and Technology)
  • Fast and Accurate Element-Level Streaming CP Decomposition for Higher-Order Tensors
    Jeongyoung Lee* (Seoul National University), SeungJoo Lee (Seoul National University), U Kang (Seoul National University)
[R-10] Memory-Efficient Storage and In-Memory Data Systems
Time: Tuesday, May 5, 13:30 - 15:00
Location: Rue Crescent
Track: Modern Hardware and In-Memory Database Systems
Session Chair: [To Be Announced]
  • Reconfiguring Scalable Hashing with Persistent CPU Caches
    Zhenyu Yu (Huazhong University of Science and Technology), Bolong Zheng* (Huazhong University of Science and Technology), Ling Xu (Shuyi Technology), Qianlu Wu (Huazhong University of Science and Technology), Qiang Chen (Huazhong University of Science and Technology), Ziyang Yue (Huazhong University of Science and Technology)
  • SHMemora: Protective Key-Value Store on Distributed Shared Memory
    Jiajun Luo* (Tsinghua University), Siyu Lin (Tsinghua University), Yunpeng Xu (Tsinghua University), Shengwei Liu (Cornell University), Jin Xia (Shenzhen Longsys Electronics Co., Ltd.), Dong Liu (Shenzhen Longsys Electronics Co., Ltd.), Zheng Liu (Alibaba Group), Huanchen Zhang (Tsinghua University), Teng Ma (Alibaba Group), Shuwen Deng (Tsinghua University)
  • Enabling Homomorphic Analytical Operations on Compressed Scientific Data with Multi-stage Decompression
    Xuan Wu (Oregon State University), Sheng Di (Argonne National Laboratory), Tripti Agarwal (University of Utah), Kai Zhao (Florida State University), Xin Liang* (Oregon State University), Franck Cappello (Argonne National Laboratory)
  • Mirror Asymmetry Perfect Hashing: A Memory-Efficient and Load-Intensive-Optimized Hashing Index on Hybrid DRAM-PMem Architecture
    Jingcheng Ju* (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences), Zirui Liu (Peking University), Kaicheng Yang (Peking University), Tong Yang (Peking University), Yikai Zhao (Peking University), Feng Liu (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences), Guodong Yang (Huawei), Xingchun Wang (Huawei), Duohe Ma (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences)
  • CSD-CoKV: Host-CSD Collaborative Offloading for High-Performance LSM-tree based KV Stores
    Zhining Cao* (Shandong University), Kai Zhang (Inspur (Jinan) Data Technology Co., Ltd), Jinrun Yang (Shandong University), Hui Li (Inspur (Jinan) Data Technology Co., Ltd), Nan Su (Inspur (Jinan) Data Technology Co., Ltd), Qian Wei (Shandong University), Shikun Ma (Shandong University), Zehao Chen (Shandong University), Junbo Yin (Inspur (Jinan) Data Technology Co., Ltd), Haijun Zhang (Inspur (Jinan) Data Technology Co., Ltd), Zhaoyan Shen (Shandong University)
[R-11] Graph Indexing and Optimization
Time: Tuesday, May 5, 15:30 - 17:00
Location: Rue McGill
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • Efficient Meta-path Constrained Reachability Query on Heterogeneous Information Networks
    Chao Ni (NUAA), Zi Chen* (Wuhan University of Technology), Long Yuan (Wuhan University of Technology), Bolong Zheng (Huazhong University of Science and Technology), Lu Qin (University of Technology Sydney)
  • Lightweight 2-Hop Labels for Reachability Queries on Large-Scale Graphs
    Yishu Wang* (Northeastern University), Jinlong Chu (Northeastern University), Ye Yuan (Beijing Institute of Technology), Yu Gu (Northeastern University), Lianpeng Qiao (Beijing Institute of Technology)
  • HistCore: Efficient k-Core Decomposition on GPUs with Locality-Aware Computation
    Chen Zhao (Wuhan University), Guojia Wan* (Wuhan University), Ting Yu (Zhejiang Lab), Jiawei Jiang (Wuhan University), Bo Du (Wuhan University)
  • GoCache: Accelerating Out-of-Core Graph Queries with Pattern-Driven Caching
    Zheng Yang* (University of Science and Technology of China), Yicheng Zhang (University of Science and Technology of China), Lixiao Cui (Nankai University), Luofan Chen (University of Science and Technology of China), Chongzhuo Yang (University of Science and Technology of China), Xiaojian Luo (Alibaba Group), Sijie Shen (Alibaba Group), Wenyuan Yu (Alibaba Group), Jingren Zhou (Alibaba Group), Cheng Li (University of Science and Technology of China)
  • C2graph: A Compression-Collaboration Algorithm for CPU-GPU Hybrid Weighted Graph Traversals
    Ning Wang (Guangzhou University), Huaibei Li (Ocean University of China), Shen Su (Guangzhou University), Yu Gu (Northeastern University), Ge Yu (Northeastern University), Zhigang Wang* (Guangzhou University), Dawei Zhao (Qilu University of Technology), Hui Lu (Guangzhou University), Zhihong Tian (Guangzhou University)
[R-12] Query Optimization and Rewriting
Time: Tuesday, May 5, 15:30 - 17:00
Location: Rue Sherbrooke
Track: Query Processing, Indexing, and Optimization
Session Chair: [To Be Announced]
  • MICRO: A Lightweight Middleware for Optimizing Cross-store Cross-model Graph-Relation Joins
    Xiuwen Zheng (University of California, San Diego)*, Arun Kumar (University of California, San Diego), Amarnath Gupta (University of California, San Diego)
  • SSC-Join: an Efficient Syntactic-Semantic Collaboration based Set Semantic Similarity Join Algorithm
    Lianyin Jia (Faculty of Information Engineering and Automation, Kunming University of Science & Technology), Chengchen Zeng (Kunming University of Science and Technology), Mengjuan Li (Yunnan Normal University), Suprio Ray (University of New Brunswick), Yinong Chen (Arizona State University), Jiaman Ding (Kunming University of Science and Technology)*, Xiuxing Li (Beijing Institute of Technology)
  • From Single to Multiple Attributes: Experimental Insights on Sampling-Based Distinct Combination Estimation in GROUP-BY Queries [Experiment, Analysis, and Benchmark]
    Yujie Zhang* (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Yuan Sui (Northeastern University)
  • A Set-Theoretic Approach to Detecting Logic Bugs in DBMS Inner Join Optimizations
    Ce Lyu (East China Normal University), Changzheng Wei (Ant Group), Yanhao Wang (East China Normal University), Jie Liang (Beihang University), Li Lin (Ant Group), Hanghang Wu (Ant Group), Minghao Zhao* (East China Normal University), Ying Yan (Ant Group), Aoying Zhou (East China Normal University)
  • Efficient Query Rewrite Rule Discovery via Standardized Enumeration and Learning-to-Rank
    Yuan Zhang (Shenzhen Institute of Computing Science, Shenzhen University), Yuxing Chen* (Tencent Inc.), Yuekun Yu (Shenzhen Institute of Computing Science, Shenzhen University), Jinbin Huang (Shenzhen Institute of Computing Science, Shenzhen University), Rui Mao (Shenzhen Institute of Computing Science, Shenzhen University), Anqun Pan (Tencent Inc.), Lixiong Zheng (Tencent Inc.), Jianbin Qin (Shenzhen Institute of Computing Science, Shenzhen University)
[R-13] Federated and Distributed Learning Systems
Time: Tuesday, May 5, 15:30 - 17:00
Location: Rue Mansfield
Track: Distributed, Parallel and P2P Data Management
Session Chair: [To Be Announced]
  • Towards the Distributed Large-scale k-NN Graph Construction by Graph Merge
    Cheng Zhang (Xiamen University), Wan-Lei Zhao (Xiamen University)*, Shihai Xiao (Huawei Technologies Ltd.), Jiajie Yao (Huawei Technologies Ltd.), Xuecang Zhang (Huawei Technologies Ltd.)
  • TopFGL: A Topology-Aware and Distribution-Agnostic Federated Learning Framework Tackling Topological Heterogeneity on Graph Data
    Junyang Wang (University of Science and Technology of China)*, Lan Zhang (University of Science and Technology of China), Yihang Cheng (University of Science and Technology of China), Mu Yuan (The Chinese University of Hong Kong), Tianfu Wang (University of Science and Technology of China), Zhihui Fu (Shanghai Jiao Tong University), Jun Wang (University of Luxembourg)
  • AdaFedRec: Adaptive Heterogeneous Federated Recommender Systems across Multi-Device Users
    Zhenkai Li (East China Normal University)*, Ming Hu (Singapore Management University), Chentao Jia (East China Normal University), Yining Sun (East China Normal University), Zhufeng Lu (East China Normal University), Mingyang Yu (East China Normal University), Yanxing Yang (East China Normal University), Xiaofei Xie (Singapore Management University), Mingsong Chen (East China Normal University)
  • SSFusion: Tensor Fusion with Selective Sparsification for Efficient Distributed DNN Training
    Zhangqiang Ming (Huazhong University of Science and Technology)*, Rui Wang (Huazhong University of Science and Technology), Yuchong Hu (Huazhong University of Science and Technology), Yuanhao Shu (Innovation Research Institute of Cethik Group Co.Ltd), Wenxiang Zhou (Huazhong University of Science and Technology), Xinjue Zheng (Huazhong University of Science and Technology), Dan Feng (Huazhong University of Science and Technology)
  • TS3D: A Temporal Multimodal Dataset for Distributed Database System Analysis [Experiment, Analysis, and Benchmark]
    Yuanyuan Yao (Zhejiang University), Yuhan Shi (Zhejiang Universit), Yian Wei (Zhejiang University), Lu Chen* (Zhejiang University), Mourad Khayati (University of Fribourg), Cheng Long (Nanyang Technological University), Tianyi Li (Aalborg University)
[R-14] Stream Processing Engines and Architectures
Time: Tuesday, May 5, 15:30 - 17:00
Location: Rue Crescent
Track: Data Stream Systems and Edge Computing
Session Chair: [To Be Announced]
  • Astraea: Efficient Pipelined Micro-batch Stream Processing with Non-hash Differentiated Partitioning
    Sijie Wu (Huazhong University of Science and Technology), Hanhua Chen (Huazhong University of Science and Technology)*, Hai Jin (Huazhong University of Science and Technology), Haoran Cai (Huawei Technologies Co., Ltd)
  • NebulaStream: An Adaptive and Efficient Multi-query Stream Processing Engine
    Nils Schubert* (Technische Universität Berlin), Lukas Schwerdtfeger (Technische Universität Berlin), Sara Schnaterbeck (Technische Universität Berlin), Philipp Grulich (Observe Inc.), Bonaventura Del Monte (Observe Inc.), Steffen Zeuch (Technische Universität Berlin), Volker Markl (Technische Universität Berlin)
  • When Complex Event Recognition Meets Cloud-Native Architectures
    Shizhe Liu* (Nanjing University), Haipeng Dai (Nanjing University), Meng Li (Nanjing University), Yuemeng Zhang (Nanjing University), Shaoxu Song (Tsinghua University), Zhifeng Bao (The University of Queensland), Hancheng Wang (Nanjing University), Xiaofeng Gao (Shanghai Jiao Tong University), Guihai Chen (Nanjing University)
  • Process Faster, Pay Less: Functional Isolation for Stream Processing
    Eleni Zapridou* (EPFL), Michael Koepf (TU Wien), Panagiotis Sioulas (Oracle), Ioannis Mytilinis (Oracle), Anastasia Ailamaki (EPFL)
  • Low-Latency Stateful Stream Processing through Timely and Accurate Prefetching
    Eleni Zapridou* (EPFL), Anastasia Ailamaki (EPFL)
[R-15] NL2SQL: Methods and Architectures
Time: Wednesday, May 6, 10:00 - 12:00
Location: Av. Duluth
Track: AI for Data Management
Session Chair: [To Be Announced]
  • OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
    Zhuoyue WAN (The Hong Kong Polytechnic University), Wentao Hu (The Hong Kong Polytechnic University), Chen Jason Zhang (The Hong Kong Polytechnic University), Yuanfeng Song (ByteDance)*, Shuaimin Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Ruiqiang Xiao (The Hong Kong University of Science and Technology), Xiao-Yong Wei (The Hong Kong Polytechnic University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology)
  • Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation
    Zheng Yuan (The Hong Kong Polytechnic University (PolyU))*, Hao Chen (City University of Macau), Zijin Hong (The Hong Kong Polytechnic University), Qinggang Zhang (The Hong Kong Polytechnic University), Feiran Huang (Jinan University), Qing Li (The Hong Kong Polytechnic University), Xiao Huang (The Hong Kong Polytechnic University)
  • CYANSQL: Unlock the Power of NL2SQL via Clustering-based Test-Time Scaling
    Haoyu Qin (Fudan University), Tonghui Ren (Tencent Cloud), Zhenying He (Fudan University)*, X.Sean Wang (Fudan University), Jiashu Xing (Tencent Cloud), Yanghuan Ye (Tencent Cloud), Shifei Huang (Tencent Cloud), Jinbao Li (Qilu University of Technology)
  • Text2VectorSQL: Towards a Unified Interface for Vector Search and SQL Queries
    Zhengren Wang (Peking University), Dongwen Yao (Shanghai Jiao Tong University), Bozhou Li (Peking University), Dongsheng Ma (Peking University), Bo Li (Peking University), Zhiyu Li (Institute for Advanced Algorithms Research, Shanghai), Feiyu Xiong (Institute for Advanced Algorithms Research, Shanghai), Bin Cui (Peking University), Linpeng Tang (OriginHub Technology), Wentao Zhang* (Peking University)
  • Boosting Small Language Models for Text-to-SQL with Fine-Grained Execution Feedback and Cost-Efficient Rewards
    Thanh Dat Hoang (Griffith University), Thanh Trung Huynh (VinUniversity), Matthias Weidlich (Humboldt University of Berlin), Thanh Tam Nguyen (Griffith University), Tong Chen (The University of Queensland), Hongzhi Yin (The University of Queensland), Quoc Viet Hung Nguyen* (Griffith University)
  • LEAF-SQL: Level-wise Exploration with Adaptive Fine-graining for Text-to-SQL Skeleton Prediction
    Zhao Tan* (Jiangxi University of Finance and Economics), Xiping Liu (Jiangxi University of Finance and Economics), Qing Shu (Jiangxi University of Finance and Economics), Qizhi Wan (Jiangxi University of Finance and Economics), Dexi Liu (Jiangxi University of Finance and Economics), Changxuan Wan (Jiangxi University of Finance and Economics)
  • Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL
    Qifeng Cai* (Peking University), Hao Liang (Peking University), Chang Xu (Peking University), Tao Xie (Peking University), Wentao Zhang (Peking University), Bin Cui (Peking University)
[R-16] Spatial Queries, Road Networks and Indexing
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue Crescent
Track: Spatial Databases and Temporal Databases
Session Chair: [To Be Announced]
  • SOLAR: Scalable Distributed Spatial Joins through Learning-based Optimization
    Yongyi Liu* (University of California, Riverside), Ahmed Abdelmaguid (University of California, Riverside), Ahmed Mohamood (Google LLC), Amr Magdy (University of California, Riverside), Minyao Zhu (Google LLC)
  • PC-PS: A Multi-Dimensional Point-Cloud Data Publish/Subscribe System
    Yuanchi Fan* (UESTC), Lisi Chen (UESTC), Shuo Shang (UESTC), Christian Jensen (Aalborg University)
  • PLAN: Fast and Approximate Gaussian Kernel Density Visualization in Road Networks
    Tsz Nam Chan* (Shenzhen University), Hongwei Ye (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Leong Hou U (University of Macau), Dingming Wu (Shenzhen University), Ruisheng Wang (Shenzhen University), Joshua Zhexue Huang (Shenzhen University)
  • A Robust and Globally Accurate Hierarchical Hub Labeling Index for SP-Distance Queries in Dynamic Road Networks
    Wei Liu (Yantai University), Ziqiang Yu* (Yantai University), Xiaohui Yu (York University), Yang Liu (Wilfrid Laurier University), Simu Liu (Yantai University)
  • iKSP: A Path Enumeration Index in Road Networks
    Zihan Luo* (The Hong Kong University of Science and Technology), Lei Li (The Hong Kong University of Science and Technology), Mengxuan Zhang (The Australian National University), Xinjie Zhou (The Hong Kong University of Science and Technology), Zizhuo Xu (The Hong Kong University of Science and Technology), Xiaofang Zhou (The Hong Kong University of Science and Technology)
  • Robust Spatial-Temporal Similar Trajectory Search via Structure-Enhanced Domain-Invariant Learning
    Xiaolin Han* (Northwestern Polytechnical University), Yonghao Zhou (Northwestern Polytechnical University), Chenhao Ma (Chinese University of Hong Kong, Shenzhen), Lingyun Song (Northwestern Polytechnical University), Xinbiao Gan (National University of Defense Technology), Xuequn Shang (Northwestern Polytechnical University)
  • SOLAR: Efficient Spatial Queries on Real-time LSM-based Storage
    Jingyi Yang* (Nanyang Technological University), Jiachen Shi (Nanyang Technological University), Jian Chen (Nanyang Technological University), Gao Cong (Nanyang Technological University)
[R-17] Learned Models for Query Optimization and Cost Estimation
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue McGill
Track: AI-based DB Tuning, Benchmarks and Performances
Session Chair: [To Be Announced]
  • TemplateQO: Template-aware and Scalable Query Optimization with Data-efficient Learning
    Pengfei Zheng (Huazhong University of Science and Technology), Guoneng Li (Huazhong University of Science and Technology), Ling Xu (Huazhong University of Science and Technology), Rong Zhu (Alibaba), Yan Li (Wuhan University of Technology), Bolong Zheng* (Huazhong University of Science and Technology)
  • CoLSE: A Lightweight and Robust Hybrid Learned Model for Single-Table Cardinality Estimation using Joint CDF
    Lankadinee Rathuwadu* (University of Melbourne), Christopher Leckie (University of Melbourne), Guanli Liu (University of Melbourne), Renata Borovica-Gajic (University of Melbourne)
  • LAMP: A Dual-Mode Framework for Database Workload Memory Prediction
    Guoze Xue (Zhejiang University), Lu Chen* (Zhejiang University), Ziquan Fang (Zhejiang University), Tianyi Li (Aalborg University), Yushuai Li (Aalborg University), Torben Bach Pedersen (Aalborg University)
  • Robust Index Benefit Estimation via Hierarchical and Two-dimensional Feature Representation
    Tao Li* (State Cloud, China Telecom), Feng Liang (Shenzhen MSU-BIT University), Jinqi Quan (State Cloud, China Telecom), Zihang Yang (State Cloud, China Telecom), Teng Wang (State Cloud, China Telecom), Runhuai Huang (State Cloud, China Telecom), Xiping Hu (Shenzhen MSU-BIT University), Meng Li (Nanjing University), Haipeng Dai (Nanjing University)
  • Telescope: A Learned What-If Call for Column Store Selection in HTAP Databases
    Yidong Zhang (Renmin University of China), Chao Zhang* (Renmin University of China), Zhengkun Wu (Renmin University of China), Ju Fan (Renmin University Of China), Xinyi Zhang (Renmin University Of China), Hong Chen (Renmin University of China), Yuxing Chen (Tencent Inc.), Anqun Pan (Tencent Inc.)
  • SkyNet: Solving Skyline Queries with Neural Networks
    Jinfei Liu* (Zhejiang University), Jiayao Zhang (Zhejiang University), Pengyun Zhu (Tianjing University), Li Xiong (Emory University), Jian Pei (Duke University)
  • Lequa: A Learning-Based Query-Aware Framework for Selective Query Optimization
    Guoneng Li (Huazhong University of Science and Technology), Pengfei Zheng (Huazhong University of Science and Technology), Ling Xu (Shuyi Tech.), Yan Li (Wuhan University of Technology), Bolong Zheng* (Huazhong University of Science and Technology)
[R-18] Log Analytics, Anomaly Detection and Tensor Methods
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue Sherbrooke
Track: Data Mining and Knowledge Discovery
Session Chair: [To Be Announced]
  • AutoHFormer: Efficient Hierarchical Autoregressive Transformer for Time Series Prediction
    Qianru Zhang* (The University of Hong Kong), HongGang Wen (The University of Hong Kong), Ming Li (Zhejiang Normal University), Dong Huang (National University of Singapore), Siu-Ming Yiu (The University of Hong Kong), Christian S Jensen (Aalborg University), Pietro Liò (Cambridge University)
  • Efficient Zero-shot and Label-free Log Anomaly Detection for Resource-constrained Systems
    Zuohan Wu* (the Hong Kong Unversity of Science and Technology (Guangzhou)), Jiachuan Wang (the Hong Kong Unversity of Science and Technology), Libin Zheng (Sun Yat-sen University), Yongqi Zhang (the Hong Kong Unversity of Science and Technology (Guangzhou)), Shuangyin Li (South China Normal University), Lei Chen (the Hong Kong Unversity of Science and Technology and the Hong Kong Unversity of Science and Technology (Guangzhou))
  • An Encode-then-Decompose Approach to Unsupervised Time Series Anomaly Detection on Contaminated Training Data
    Buang Zhang* (ECNU), Tung Kieu (Aalborg University), Xiangfei Qiu (East China Normal University), Chenjuan Guo (East China Normal University), Jilin Hu (East China Normal University), Aoying Zhou (East China Normal University), Christian S. Jensen (Aalborg University), Bin Yang (East China Normal University)
  • Krone: Hierarchical and Modular Log Anomaly Detection
    Lei Ma* (WPI), Jinyang Liu (Bytedance), Tieying Zhang (Bytedance), Peter VanNostrand (WPI), Dennis Hofmann (WPI), Lei Cao (Arizona University), Elke Rundensteiner (WPI), Jianjun Chen (Bytedance)
  • SLGParser: Practical and Efficient Label-Free Log Parsing Using Large Language Models
    Yibing Hu* (Institute of Information Engineering, CAS), Cong Wang (Institute of Information Engineering, CAS), Lixin Zhao (Institute of Information Engineering, CAS), Aimin Yu (Institute of Information Engineering, CAS)
  • Toward scalable Tucker decomposition: skew-aware multi-level partitioning with GPU–storage co-processing
    Seung Hyeon Song (KOREATECH), Jihye Lee (ETRI), Chanki Kim (Jeonbuk National University), Kang-Wook Chon* (KOREATECH)
  • QPAD: Quantile-Preserving Approximate Dimension Reduction for Nearest Neighbors Preservation in High-Dimensional Vector Search
    Jiuzhou Fu* (University of Washington), Dongfang Zhao (University of Washington)
[R-19] Graph Neural Networks, Knowledge Graph Learning and Reasoning
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue Mansfield
Track: Graph Learning and Mining
Session Chair: [To Be Announced]
  • On Graph Rewiring with Motifs: a Find-and-Replace Approach
    Qihao Wang* (University of Illinois at Urbana Champaign), Hongtai Cao (University of Illinois at Urbana Champaign), Xiaodong Li (The University of Hong Kong), Matin Najafi (The University of Hong Kong), Kevin Chen-Chuan Chang (University of Illinois at Urbana Champaign), Reynold Cheng (The University of Hong Kong)
  • SNI-GNN: SmartNIC-Assisted Full-Graph GNN Training with In-Network Embedding Prediction
    Guofan Yu* (Hong Kong Baptist University), Sitian Chen (Hong Kong Baptist University), Zhenheng Tang (Hong Kong University of Science and Technology (Guangzhou)), Xiaowen Chu (Hong Kong University of Science and Technology (Guangzhou)), Amelie Chi Zhou (Hong Kong Baptist University)
  • DIFFCOM: Conditional Discrete Diffusion Model for Community Search
    Ling Li* (Shanxi University), Liang Bai (Shanxi University), Siqiang Luo (Nanyang Technological University), Yejiang Wang (Northeastern University), Yuhai Zhao (Northeastern University)
  • Incremental GNN Embedding Computation on Streaming Graphs
    Qiange Wang* (National University of Singapore), Haoran Lv (Northeastern University), Yanfeng Zhang (Northeastern University), Weng-Fai Wong (National University of Singapore), Bingsheng He (National University of Singapore)
  • Unveiling Semantically Cohesive Structures: Maximal Meta-Path Clique Enumeration in Heterogeneous Graphs
    Tao Yu* (Fudan University), Wen Deng (Fudan University), Weiguo Zheng (Fudan University), Jeffrey Xu Yu (The Hong Kong University of Science and Technology (Guangzhou))
  • FlashEKGR: Fast Embedding-Based Knowledge Graph Reasoning Models Training
    Wentai Zhang (Beijing University of Post and Telecommunication), Teng Xu (Beijing University of Posts and Telecommunications), Weiguang Wang (Beijing University of Posts and Telecommunications), Junxing Li (Beijing University of Posts and Telecommunications), Jun Zhang (Beijing University of Posts and Telecommunications), Yifan Zhu (Beijing University of Posts and Telecommunications), Haihong E* (Beijing University of Posts and Telecommunications)
  • OMNIA: Closing the Loop by Leveraging LLMs for Knowledge Graphs Completion
    Frédéric IENG (Université Paris Cité), Massinissa Hammaz (Université Paris Cité), Soror Sahri (Université Paris Cité), Mourad OUZZANI* (Qatar Computing Research Institute, HBKU), Salima Benbernou (Université Paris Cité), Hanieh Khorashadizadeh (Universität zu Lübeck), Sven Groppe (Universität zu Lübeck), Farah Benamara (IRIT)
[R-20] Dense Subgraph and Core Decomposition
Time: Wednesday, May 6, 15:30 - 17:00
Location: Rue McGill
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • Querying Historical k-Dense Subgraphs On Temporal graphs
    Qi Zhang (University of Science and Technology Beijing), Yalong Zhang (Beijing Institute of Technology), Rong-Hua Li* (Beijing Institute of Technology), Xu-Cheng Yin (University of Science and Technology Beijing), Guoren Wang (Beijing Institute of Technology)
  • Density Decomposition of Multilayer Graphs
    Jiaqi Jiang (Beijing Institute of Technology), Rong-Hua Li* (Beijing Institute of Technology), Yalong Zhang (Beijing Institute of Technology)
  • SQAC: Scalable Querying of Attribute-Constrained (α, β)-Cores over Large Bipartite Graphs
    Xin Deng (Hunan University), Peng Peng (Hunan University), Baoqing Sun (Hunan University), Shuo Dai (Hunan University), Zheng Qin* (Hunan University), Lijun Chang (The University of Sydney)
  • Listing Minimal Cores in Large Real-World Graphs
    Yukai Sun (Harbin Institute of Technology, Shenzhen), Kaiqiang Yu (Nanjing University), Shengxin Liu* (Harbin Institute of Technology, Shenzhen), Cheng Long (Nanyang Technological University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Xun Zhou (Harbin Institute of Technology, Shenzhen), Min Zhang (Harbin Institute of Technology, Shenzhen)
  • BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs [Experiment, Analysis, and Benchmark]
    Xiangju Zhu* (The University of Hong Kong), Mohammad Matin Najafi (Huawei Hong Kong Research Center), Chrysanthi Kosyfaki (The Hong Kong University of Science and Technology), Xiaodong Li (Xiamen University), Reynold Cheng (The University of Hong Kong), Laks Lakshmanan (University of British Columbia)
[R-21] LLM-Assisted and AI-Augmented Query Processing
Time: Wednesday, May 6, 15:30 - 17:00
Location: Rue Sherbrooke
Track: Query Processing, Indexing, and Optimization
Session Chair: [To Be Announced]
  • Query-Driven Data Exploration with Heterogeneous Treatment Effects
    Antonis Mandamadiotis* (Athena Research Center), Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes), Georgia Koutrika (Athena Research Center)
  • BOND: A Co-Designed Framework for LLM-Powered Analytics Over Relational Data
    Lixiang Chen* (East China Normal University), Qin Zheng, (East China Normal University), Zhicheng Pan (East China Normal University), Chengcheng Yang (East China Normal University), Rong Zhang (East China Normal University), Xuan Zhou (East China Normal University)
  • Batcher: Learning to Construct Cost-Efficient Batches of Small Queries in Big Data Processing Platforms
    Yeonsu Park (Kangwon National University), Taesung Lee (POSTECH), Byungchul Tak (Kyungpook National University), Wook-Shin Han* (POSTECH)
  • CactusDB: Unlock Co-Optimization Opportunities for SQL Queries and AI/ML Model Inferences
    Lixi Zhou (Arizona State University), Kanchan Chowdhury (Arizona State University), Lulu Xie (Arizona State University), Jaykumar Tandel (Arizona State University), Hong Guan (Arizona State University), Zhiwei Fan (Meta), Xinwei Fu (Amazon), Jia Zou* (Arizona State University)
  • APEX: Adaptive Variable-wise Parallel Execution for Worst-Case Optimal Joins on Graph Queries
    Yipeng Liu (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Yuming Lin* (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Zhicheng Pan (East China Normal University), Chengcheng Yang (East China Normal University), You Li (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Aoying Zhou (East China Normal University)
[R-22] Time Series Cleaning and Imputation
Time: Wednesday, May 6, 15:30 - 17:00
Location: Rue Mansfield
Track: Information Integration and Data Quality
Session Chair: [To Be Announced]
  • MINOR: Multivariate Time Series Iterative Cleaning Algorithm
    Aoqian Zhang* (Beijing Institute of Technology), Yinru Sun (Beijing Institute of Technology), Pengxiang Hao (Beijing Institute of Technology), Yifeng Gong (Beijing Institute of Technology), Boyang Li (Beijing Institute of Technology), Jing Geng (Beijing Institute of Technology, Zheng Wang (Shanghai Jiao Tong University), Lianpeng Qiao (Beijing Institute of Technology)
  • RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms
    Mohamed Ahmed Abdelmaksoud Mohamed* (TU Berlin & BIFOLD), Sheng Ding (University of Stuttgart), Andrey Morozov (University of Stuttgart), Ziawasch Abedjan (TU Berlin & BIFOLD)
  • Time-Frequency Conditioned Diffusion for Multivariate Time Series Imputation
    Yumeng Liu (Shenzhen Technology University), Zheng Wang (Shenzhen Technology University), Jikui Liu (Shenzhen Polytechnic Uiversity), Kaisa Zhang (Beijing University of Poss and Telecommunications), Weidong Gao (Beijing University of Poss and Telecommunications), Xiaomao Fan* (Shenzhen Technology University)
  • EDITOR: Multi-Resolution Cleaning of Multivariate Time Series via Detect-Localize-Repair
    Chenyang Li* (Renmin University of China), Chaohong Ma (Hebei Normal University), Xiaohui Yu (York University), Cailong Li (Northeastern University), Xiaofeng Meng (Renmin University of China)
  • Improving Data Imputation through a Tuned Strategy for Dependency Discovery
    Bernardo Breve (University of Naples Federico II), Loredana Caruccio (University of Salerno), Tullio Pizzuti* (University of Salerno), Giuseppe Polese (University of Salerno)
[R-23] Differential Privacy and Local Privacy Mechanisms
Time: Wednesday, May 6, 15:30 - 17:00
Location: Rue Crescent
Track: Database Security and Privacy
Session Chair: [To Be Announced]
  • Fine-grained Manipulation Attacks to Local Differential Privacy Protocols for Range Query
    Xinyu Li (Xi'an Jiaotong University), Wenda Chen (Xi'an Jiaotong University), Xuebin Ren* (Xi'an Jiaotong University)
  • ABC: Numerical Data Collection under Local Differential Privacy without Prior Knowledge
    Incheol Baek* (Korea University), Hyungbin Kim (Korea University), Yon Dohn Chung (Korea University)
  • Answering Federated Range Queries with Local Differential Privacy
    Yuemin Zhang (The Hong Kong Polytechnic University), Qingqing Ye* (The Hong Kong Polytechnic University), Junxu Liu (The Hong Kong Polytechnic University), Wei Dong (Nanyang Technological University)
  • Robust Single-message Shuffle Differential Privacy Protocol for Accurate Distribution Estimation
    Xiaoguang Li* (Xidian University), Hanyi Wang (China Mobile (Suzhou) Software Technology Co., Ltd), Yaowei Huang (Guangzhou University), Jungang Yang (Shanghai University), Qingqing Ye (The Hong Kong Polytechnic University), Haonan Yan (Xidian University), Ke Pan (Xidian University), Zhe Sun (Guangzhou University), Hui Li (Xidian University)
  • Revisiting Locally Differentially Private Protocols: Towards Better Trade-offs in Privacy, Utility, and Attack Resistance [Experiment, Analysis, and Benchmark]
    ZHéber H. Arcolezi* (Inria), Sébastien Gambs (UQAM)
[R-24] Vector Databases, Embeddings and ML Data Infrastructure
Time: Thursday, May 7, 10:00 - 12:00
Location: Av. Duluth
Track: Data Management for AI
Session Chair: [To Be Announced]
  • Exqutor: Extended Query Optimizer for Vector-augmented Analytical Queries
    Hyunjoon Kim (Yonsei University), Chaerim Lim (Yonsei University), Hyeonjun An (Yonsei University), Rathijit Sen (Microsoft), Kwanghyun Park* (Yonsei University)
  • Federated Retrieval over Embedding-Heterogeneous Vector Databases
    Yuxiang Wang (Beihang University), Yongxin Tong* (Beihang University), Zimu Zhou (City University of Hong Kong), Ziyuan He (Beihang University), Ruixi Hu (Beihang University), Ke Xu (Beihang University)
  • Trading Vector Data in Vector Databases
    Jin Cheng* (The Chinese University of Hong Kong, Shenzhen), Xiangxiang Dai (The Chinese University of Hong Kong), Ningning Ding (Hong Kong University of Science and Technology (Guangzhou)), John C.S. Lui (The Chinese University of Hong Kong), Jianwei Huang (The Chinese University of Hong Kong, Shenzhen)
  • MojoFrame: Dataframe Library in Mojo Language
    Shengya Huang* (University of Illinois at Urbana-Champaign), Zhaoheng Li* (University of Illinois at Urbana-Champaign), Derek Werner (University of Illinois at Urbana-Champaign), Yongjoo Park (University of Illinois at Urbana-Champaign)
  • Approximate Diverse k-nearest Neighbor Search in Vector Database
    Jiachen ZHAO* (Chinese University of Hong Kong), Xiao Yan (Wuhan University), Eric Lo (Chinese University of Hong Kong)
  • SQLVec: SQL-Based Vector Similarity Search
    Zequn Zhang* (School of Cyber Science and Engineering, Wuhan University), Yuanyuan Zhu (School of Computer Science, Wuhan University), Hao Zhang (The Chinese University of Hong kong), Jeffrey Xu Yu (Hong Kong University of Sciences and Technology (Guangzhou))
  • MISFEAT: Feature Selection for Subgroups with Mutual Information Estimation
    Bar Genossar* (Technion -- Israel Institute of Technology), Thinh On (New Jersey Institute of Technology), Md Mouinul Islam (PayPal), Ben Eliav (Technion -- Israel Institute of Technology), Senjuti Basu Roy (New Jersey Institute of Technology), Avigdor Gal (Technion -- Israel Institute of Technology)
[R-25] Community Search on Diverse Graph Types
Time: Thursday, May 7, 10:00 - 12:00
Location: Rue McGill
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • MOCHI: Motif-based Community Search over Large Heterogeneous Information Networks
    Yuhan Zhou (Zhejiang University), Qing Liu (Zhejiang University), Xin Huang (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University), Yunjun Gao* (Zhejiang University)
  • More Than Pivot for Maximal Clique Enumeration
    Zhaoyi Zhong (Swinburne University of Technology), Rui Zhou* (Swinburne University of Technology), Lu Chen (Swinburne University of Technology), Xiaofan Li (Nanyang Technological University), Chengfei Liu (Swinburne University of Technology)
  • Beyond Homophily: Community Search on Heterophilic Graphs
    Qing Sima (University of New South Wales), Xiaoyang Wang* (University of New South Wales), Wenjie Zhang (University of New South Wales)
  • Efficient Community Search on Attributed Public-Private Graphs
    Yuqi Chen* (Hong Kong Baptist University), Weihan Zhang, (Sun Yat-sen University), Xin Huang (Hong Kong Baptist University)
  • Maximum Balanced Clique Search on Large Directed Graphs
    Jianhua Wang* (Inner Mongolia University), Jianye Yang (Guangzhou University), Zhaoquan Gu (Harbin Institute of Technology (Shenzhen)), Dian Ouyang (Guangzhou University), Ziyi Ma (Hebei University of Technology), Ying Zhang (Zhejiang Gongshang University)
  • Prompt-Guided Community Search under Extreme Few-Shot Supervision
    Wenxin Yang (Beijing Institute of Technology), Kaiyu Feng* (Beijing Institute of Technology), Lanting Fang (Beijing Institute of Technology), Kangfei Zhao (Beijing Institute of Technology), Xia Wu (Beijing Institute of Technology)
  • Efficient Size Constraint Community Search over Heterogeneous Information Networks
    Xinjian Zhang (Swinburne University of Technology), Chengfei Liu* (Swinburne University of Technology), Lu Chen (Swinburne University of Technology), Rui Zhou (Swinburne University of Technology), Bo Ning (Dalian Maritime University)
[R-26] Tabular Data, Community Search and Knowledge Discovery
Time: Thursday, May 7, 10:00 - 12:00
Location: Rue Sherbrooke
Track: Data Mining and Knowledge Discovery
Session Chair: [To Be Announced]
  • Fast Discovery of Functional Dependencies via Bayesian Network Learning
    Siyi Yang (National University of Defense Technology), Shenglin Chen (National University of Defense Technology), Xi Wang (National University of Defense Technology), Yuhua Tang (National University of Defense Technology), Ruochun Jin* (National University of Defense Technology)
  • VisPoison: An Effective Backdoor Attack Framework for Tabular Data Visualization Models
    Shuaimin Li (The Hong Kong Polytechnic University), Chen Jason Zhang (The Hong Kong Polytechnic University), Xuanang Chen (Institute of Software, Chinese Academy of Sciences), Anni Peng (PetroChina Digital Intelligence Research Institute Co., Ltd.), Zhuoyue Wan (The Hong Kong Polytechnic University), Yuanfeng Song* (ByteDance), Shiwen Ni (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Min Yang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Fei Hao (The Hong Kong Polytechnic University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology)
  • Keyword-Aware Skyline Community Search on Semantics and Structure
    Chuanhou Sun* (Northeastern University), Yuhai Zhao (Northeastern University), Ling Li (Shanxi University), Yuan Li (North China University of Technology)
  • TabLoft: Tabular Data Generation Based on LLM with Ordered Features
    Luyu Chen* (Fudan University), Changhao Wu (Fudan University), Jingyi Li (Fudan University), Sen Liu (Fudan University), Guangnan Ye (Fudan University), Hongfeng Chai (Fudan University)
  • Beyond Traditional Diagnostics: Transforming Patient-Side Information into Predictive Insights with Knowledge Graphs and Prototypes
    Yibowen Zhao* (Shandong University), Yinan Zhang (Nanyang Technological University), Zhixiang Su (Nanyang Technological University), Lizhen Cui (Shandong University), Chunyan Miao (Nanyang Technological University)
  • C²TC: A Training-Free Framework for Efficient Tabular Data Condensation
    Sijia Xu (University of New South Wales), Fan Li (University of New South Wales), Xiaoyang Wang* (University of New South Wales), Zhengyi Yang (University of New South Wales), Xuemin Lin (Shanghai Jiao Tong University)
  • Beyond Imputation: A Semantic Unification Framework for Data and Its Missingness in Multimodal Healthcare Analytics
    Chaohe Zhang* (Peking University), Liantao Ma (Peking University), Shiwei Lyu (Ant Group), Xin Gao (Peking University), Junfeng Zhao (Peking University), Yasha Wang (Peking University), Xu Chu (Peking University)
[R-27] Sketches, Approximate Queries and Data Streams
Time: Thursday, May 7, 10:00 - 12:00
Location: Rue Mansfield
Track: Uncertain Databases, Graphs and Streaming
Session Chair: [To Be Announced]
  • GeminiSketch: An Accurate and Efficient Sketch for Summarizing Temporal Graph Streams with Rolling-out Elimination
    Xuyang Jing (Xidian University), Chenhao Zhang (Xidian University), Zheng Yan* (Xidian University), Qingze Jiang (Xidian University), Witold Pedrycz (University of Alberta), Mingjun Wang (Xidian University), Cong Wang (Northwestern Polytechnical University)
  • Approximate Butterfly Counting in Sublinear Time
    Chi Luo (Shanghai Jiao Tong University), Jiaxin Song (University of Illinois Urbana-Champaign), Yuhao Zhang (Shanghai Jiao Tong University), Kai Wang* (Shanghai Jiao Tong University), Zhixing He (Shanghai Jiao Tong University), Kuan Yang (Shanghai Jiao Tong University)
  • Evolving Sketch: Time-Decaying Frequency Estimation for Evolving Streams
    Ge Gao* (Soochow University), Yang Du (Soochow University), He Huang (Soochow University), Yu-E Sun (Soochow University), Jianzhi Tang (Soochow University)
  • Spatiotemporal Sketch Disaggregation: Streaming Analytics with Heterogeneous Resources
    Jonatan Langlet* (KTH Royal Institute of Technology), Peiqing Chen (University of Maryland, College Park), Michael Mitzenmacher (Harvard University), Zaoxing Liu (University of Maryland, College Park), Ran Ben Basat (University College London), Gianni Antichi (Politecnico di Milano)
  • ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data
    Aryak Sen (Univ. Grenoble Alpes), Silviu Maniu (Univ. Grenoble Alpes), Pierre Senellart* (ENS, PSL University)
  • AlignSketch: A Framework for Aligning Theoretical and Practical Estimation Errors
    Ce Zheng* (School of Cyber Science and Technology, Beihang University), Hanyue Zheng (School of Computer Science, Peking University), Jingwei Shi (School of Information Management & Engineering, Shanghai University of Finance and Economics), Xinye Xu (School of Computer Science, Peking University), Wei Zhou (Viterbi School of Engineering, University of Southern California), Tong Yang (School of Computer Science, Peking University), Zhenyu Guan (School of Cyber Science and Technology, Beihang University), Yong Cui (Department of Computer Science and Technology, Tsinghua University)
  • Query-Guided Analysis and Mitigation of Data Verification Errors
    Ran Schreiber* (Bar-Ilan University), Yael Amsterdamer (Bar-Ilan University)
[R-28] Transaction Management, Distributed Storage and Serverless Systems
Time: Thursday, May 7, 10:00 - 12:00
Location: Rue Crescent
Track: Cloud Data Management
Session Chair: [To Be Announced]
  • Contemp: Instance Caching Based on Container Temperature in Serverless Environment
    Pengwei Wang* (Donghua University), Nuo Chen (Donghua University), Haoquan Qi (Donghua University), Yichen Zhong (Donghua University), Shun Song (Ant Group)
  • MTC: Scalable Transaction Commit for Multi-Master Cloud Databases
    Kecheng Luo* (East China Normal University), Xiaoxian Wei (East China Normal University), Wenxin Liu (East China Normal University), Peng Cai (East China Normal University), Aoying Zhou (East China Normal University), Hui Li (Guizhou University), Le Cai (ByteDance Inc)
  • Efficient Cloud-edge Collaborative Approaches to SPARQL Queries over Large RDF graphs
    Shidan Ma* (Hunan University), Peng Peng (Hunan University), Xu Zhou (Hunan University), M. Tamer Özsu (University of Waterloo), Lei Zou (Peking University), Guo Chen (Hunan University)
  • ImmortalChopper: Real-Time and Resilient Distributed Transactions in the Edge-Cloud
    Juncheng Fang* (University of California, Irvine), Farzad Habibi (University of California, Irvine), Binbin Gu (University of California, Irvine), Faisal Nawab (University of California, Irvine)
  • REMON: Remote External Memory Over the Network
    Shiquan Zhang* (University of Toronto), Michail Bachras (University of Toronto), Yuqiu Zhang (University of Toronto), Yunhao Mao (University of Toronto), Hans-Arno Jacobsen (University of Toronto)
  • PAT: Towards Transaction Routing with Page Affinity in Shared-Cache Databases
    Zhongqin Tan* (Northeastern University), Haoyuan Zhang (Northeastern University), Yanfeng Zhang (Northeastern University), Zeshun Peng (Northeastern University), Weixing Zhou (Northeastern University), Jinyu Zhang (Huawei Company), Yang Ren (Huawei Company), Guoliang Li (Tsinghua University), Ge Yu (Northeastern University)
  • Accelerating Metadata Management of DFS via Speculative Permission Checking
    Yiduo Wang* (China Telecom Cloud Computing Research Institute), Linghang Meng (China Telecom Cloud Computing Research Institute), Liang Li (China Telecom Cloud Computing Research Institute), Jie Wu (China Telecom Cloud Computing Research Institute)
[R-29] GPU-Accelerated and Hardware-Optimized Query Processing
Time: Thursday, May 7, 13:30 - 15:00
Location: Av. Duluth
Track: Modern Hardware and In-Memory Database Systems
Session Chair: [To Be Announced]
  • FaScalSQL: A Fast and Scalable GPU-Accelerated SQL Query Engine for Out-of-Memory Tables
    Chaemin Lim (Yonsei University), Suhyun Lee (Yonsei University), Jinwoo Choi (Yonsei University), Kwanghyun Park (Yonsei University), Jinho Lee (Seoul National University), Joonsung Kim (Sungkyunkwan University), Youngsok Kim* (Yonsei University)
  • cuFHEDB: GPU-Accelerated Fully Homomorphic Encryption Database
    Shijie Gao* (Renmin University of China), Feng Zhang (Renmin University of China), Qian Xu (Renmin University of China), Yang Li (Lenovo research), XueFeng Liu (Lenovo research), Chao Jiang (Lenovo research), Limin Xiao (Lenovo research), Siqi Ma (University of New South Wales), Elisa Bertino (Purdue University), Xiaoyong Du (Renmin University of China)
  • Improving GPU Tensor Query Processing for Resource-Constrained Environments
    Qian Xu* (Renmin University of China), Feng Zhang (Renmin University of China), Shijie Gao (Renmin University of China), Kun Chen (Individual Researcher), Jianhua Wang (China Electronics Technology Kingbase (Beijing) Technologies Inc), Zheng Chen (Tsinghua University), Xiaoyong Du (Renmin University of China)
  • F5: A Robust SIMD-Accelerated MSD Radix Sort
    Arif Arman (Texas A&M University), Dmitri Loguinov* (Texas A&M University)
  • GLIDE: GPU-Accelerated ANN Graph Index Construction via Data Locality
    Fuhao Ruan (Huazhong University of Science and Technology), Ziyang Yue (Huazhong University of Science and Technology), Ling Xu (Shuyi Tech.), Dawei Liu (Huazhong University of Science and Technology), Bolong Zheng* (Huazhong University of Science and Technology)
[R-30] NL2SQL: Systems, Evaluation and Benchmarking
Time: Thursday, May 7, 13:30 - 15:00
Location: Rue McGill
Track: AI for Data Management
Session Chair: [To Be Announced]
  • Hexgen-Flow: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL
    You Peng (Hong Kong University of Science and Technology (HKUST)), Youhe Jiang (Hong Kong University of Science and Technology (HKUST)), Wenqi Jiang (ETH Zurich), Chen Wang (Tsinghua University), Binhang Yuan* (Hong Kong University of Science and Technology (HKUST))
  • SQLMorph: Query Mutation and Fine-Grained Metrics for Text-to-SQL Evaluation [Experiment, Analysis, and Benchmark]
    Mohammadhossein Malekpour (Polytechnique Montréal), Mohamed Riahi (Polytechnique Montréal), Maxime Lamothe (Polytechnique Montréal), Amine Mhedhbi* (Polytechnique Montréal)
  • An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data
    Khanh Trinh Pham (Griffith University), Thanh Tam Nguyen (Griffith University), Viet Huynh (Edith Cowan University), Hongzhi Yin (The University of Queensland), Quoc Viet Hung Nguyen* (Griffith University)
  • Elena: An Explainability-aided Online Query Optimization Framework
    Yuan Dong (Zhejiang University), Yuanyuan Yao (Zhejiang University), Yangyang Wu* (Zhejiang University), Lu Chen (Zhejiang University), Rong Zhu (Alibaba Group)
  • MM2SQL: A Benchmark and Method for Visually-Grounded SQL Generation
    Shengze Shi* (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Tao Ren (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Li Qi (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Tingrui Yang (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Wei Xiong (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Jun Hu (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences)
[R-31] Graph Pattern Matching, Hypergraphs and Subgraph Queries
Time: Thursday, May 7, 13:30 - 15:00
Location: Rue Sherbrooke
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • L4G: Two-hop Label Management for Group Steiner Tree Search on Graphs
    Xiaoyao Feng (Renmin University of China), Yahui Sun* (Renmin University of China), Zhuoran Wang (Renmin University of China), Junlin Li (Renmin University of China), Sijia Luo (Renmin University of China), Rong-Hua Li (Beijing Institute of Technology)
  • Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint
    Siwoo Song (Seoul National University), Wonseok Shin (Standigm Inc), Kunsoo Park* (Seoul National University), Giuseppe Italiano (LUISS University), Zhengyi Yang (University of New South Wales), Wenjie Zhang (University of New South Wales)
  • HL-index: Fast Reachability Query in Hypergraphs
    Peiting Xie* (The University of New South Wales), Xiangjun Zai (University of New South Wales), Yanping Wu (University of Technology Sydney), Xiaoyang Wang (The University of New South Wales), Wenjie Zhang (The University of New South Wales), Lu Qin (University of Technology Sydney)
  • Subtree Mode and Applications
    Jialong Zhou (King's College London), Ben Bals (CWI), Matei Tinca (Vrije Universiteit), Ai Guan (King's College London), Panagiots Charalampopoulos (King's College London), Grigorios Loukides* (King's College London), Solon Pissis (CWI)
  • Efficient Graph Matching with Pattern Reduction
    Pingpeng Yuan (Huazhong University of Science & Technology), Yujiang Wang (Huazhong University of Science & Technology), Jiangji Peng (Huazhong University of Science & Technology), Tianyu Ma (Huazhong University of Science & Technology), Siyuan He (Huazhong University of Science & Technology), Ling Liu* (Georgia Institute of Technology)
[R-32] Graph-Based RAG, Queries and Entity Tasks
Time: Thursday, May 7, 13:30 - 15:00
Location: Rue Mansfield
Track: Graph Queries, Entity Alignment and Learning
Session Chair: [To Be Announced]
  • PROGQL: A Provenance Graph Query System for Cyber Attack Investigation
    Fei Shao* (Case Western Reserve University), Jia Zou (Arizona State University), Zhichao Cao (Arizona State University), Xusheng Xiao (Arizona State University)
  • Effective Fairest Community Search over Heterogeneous Information Networks
    Taige Zhao (Deakin University), Jianxin Li* (Edith Cowan University), Man Li (Victoria University), Wei Luo (Deakin University), Jingxian Cheng (Chang'an University), Yuan Miao (Victoria University), Hua Wang (Victoria University)
  • AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMs
    Yubo Wang* (HKUST), Haoyang Li (The Hong Kong Polytechnic University), Fei Teng (HKUST), Lei Chen (HKUST & HKUST(GZ))
  • Clue-RAG: Towards Accurate and Cost-Efficient Graph-based RAG via Multi-Partite Graph-based Index
    Yaodong Su (CUHKSZ), Yixiang Fang* (CUHKSZ), Yingli Zhou (CUHKSZ), Chuanhui Yang (OceanBase, Ant Group)
  • ZTab: Domain-based Zero-shot Annotation for Table Columns
    Ehsan Hoseinzade* (Simon Fraser University), Ke Wang (Simon Fraser university)
[R-33] Explainability, Fairness and Trust in Data Systems
Time: Thursday, May 7, 13:30 - 15:00
Location: Rue Crescent
Track: Explainability, Fairness, and Trust in Data Systems and Analysis
Session Chair: [To Be Announced]
  • CausalPre: Scalable and Effective Data Pre-processing for Causal Fairness
    Ying Zheng* (National University of Singapore), Yangfan Jiang (National University of Singapore), Kian-Lee Tan (National University of Singapore)
  • Promoting Fairness in Information Access within Social Networks
    Changan Liu* (Fudan University), Xiaotian Zhou (Fudan University), Ahad N. Zehmakan (Australian National University), Zhongzhi Zhang (Fudan University)
  • Interpreting Graph Inference with Skyline Explanations
    Dazhuo Qiu* (Aalborg University), Haolai Che (Case Western Reserve University), Arijit Khan (Aalborg University), Yinghui Wu (Case Western Reserve University)
  • Explaining GNN Negatives Globally and Locally
    Kehan Pang (Beihang University), Wenfei Fan (University of Edinburgh), Min Xie* (Shenzhen Institute of Computing Sciences), Dandan Lin (Shenzhen Institute of Computing Sciences)
  • Mitigating GenAI-powered Evidence Pollution for Out-of-Context Misinformation Detection
    Zehong Yan* (National University of Singapore), Peng Qi (National University of Singapore), Wynne Hsu (National University of Singapore), Mong Li Lee (National University of Singapore)
[R-34] Dynamic Graphs and Temporal Graph Processing
Time: Thursday, May 7, 15:30 - 17:00
Location: Rue McGill
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • GRACE: Alleviating Reconstruction Cost in Dynamic Graph Processing Systems
    Hongru Gao (Huazhong University of Science and Technology), Shuhao Zhang* (Huazhong University of Science and Technology), Xiaofei Liao (Huazhong University of Science and Technology), Hai Jin (Huazhong University of Science and Technology)
  • IIT-Tree: An efficient index to support interval-based query on large temporal graphs
    Faming Li* (Northeastern University), Shengli Qiu (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Hengzhao Ma (Northeastern University)
  • TRADER: Real-Time Arbitrage Detection via Negative Cycles on Dynamic Graphs
    Bingqiao Luo* (National University of Singapore), Yuhang Chen (National University of Singapore), Jiaxin Jiang (National University of Singapore), Yuheng Cong (Shanghai Jiao Tong University), Ziyu He (Shanghai Jiao Tong University), Shixuan Sun (Shanghai Jiao Tong University), Bingsheng He (National University of Singapore), Wee Howe Ang (Tokka Labs)
  • Unifying Graph Traversals and Time Series Joins in Hybrid Graphs
    Gianluca Rossi* (Lyon 1 University), Angela Bonifati (Lyon 1 University), Riccardo Tommasini (INSA Lyon)
  • PRIME: Efficient Algorithm for Token Graph Routing Problem
    Haotian Xu (Hong Kong University of Science and Technology (Guangzhou)), Yuqing Zhu (Nanyang Technological University), Yuming Huang (National University of Singapore), Jing Tang* (The Hong Kong University of Science and Technology (Guangzhou))
[R-35] Learned Data Systems and AI-Driven Analysis
Time: Thursday, May 7, 15:30 - 17:00
Location: Rue Sherbrooke
Track: AI for Data Management
Session Chair: [To Be Announced]
  • Conflict Resolution for Improving ML Accuracy
    Wenfei Fan (Shenzhen Institute of Computing Sciences), Xiaoyu Han (Fudan University), Hufsa Khan (Shenzhen Institute of Computing Sciences), Weilong Ren* (Shenzhen Institute of Computing Sciences), Yaoshu Wang (Shenzhen Institute of Computing Sciences), Min Xie (Shenzhen Institute of Computing Sciences), Zihuan Xu (Shenzhen Institute of Computing Sciences)
  • LUCID: an Updatable and Concurrent Learned Index for Larger-than-Memory Data Management
    Chaohong Ma* (Hebei Normal University), Xiaohui Yu (York University), Yifan Li (York University), Aishan Maoliniyazi (Renmin University of China), Xiaofeng Meng (Renmin University of China)
  • Rethinking Flexible Graph Similarity Computation: One-step Alignment with Global Guidance
    Zhouyang Liu* (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Yixin Chen (National University of Defense Technology), Jiezhong He (National University of Defense Technology), Shuai Ma (Beihang University), Dongsheng Li (National University of Defense Technology)
  • Generalizable Address-aware Semantic Prefetching for Scalable Transactional and Analytical Workloads
    Farzaneh Zirak* (The University of Melbourne), Farhana Choudhury (The University of Melbourne), Renata Borovica-Gajic (The University of Melbourne)
  • An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs
    Waleed Afandi* (Concordia University), Hussein Abdallah (Concordia University), Ashraf Aboulnaga (The University of Texas at Arlington), Essam Mansour (Concordia University)
[R-36] Time Series and Temporal Data Analysis
Time: Thursday, May 7, 15:30 - 17:00
Location: Rue Mansfield
Track: Spatial Databases and Temporal Databases
Session Chair: [To Be Announced]
  • Compressing High-Frequency Time Series Through Multiple Models and Stealing from Residuals
    Abduvoris Abduvakhobov (Aalborg University), Soeren Kejser Jensen* (Aalborg University), Christian Thomsen (Aalborg University), Torben Bach Pedersen (Aalborg University)
  • Sorting Compressed Time Series
    Zhiheng Liu (Tsinghua University), Xingyu Liu (Tsinghua University), Shaoxu Song* (Tsinghua University), Jianmin Wang (Tsinghua University)
  • FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis
    Da Zhang (Northwestern Polytechnical University), Bingyu Li (University of Science and Technology of China), Zhiyuan Zhao (Institute of Artificial Intelligence (TeleAI), China Telecom), Feiping Nie (Northwestern Polytechnical University), Junyu Gao* (Northwestern Polytechnical University, Center for OPTical IMagery Analysis and Learning), Xuelong Li (Institute of Artificial Intelligence (TeleAI), China Telecom)
  • Scaling Subsequence Similarity Join Based on Dynamic Time Warping
    Zemin Chao (Harbin institute of technology), Qiaoyi Zheng (Harbin institute of technology), Xingxing Xiao (Harbin institute of technology), Boyu Xiao (Harbin institute of technology), Zhixin Qi (Harbin institute of technology), Hongzhi Wang* (Harbin institute of technology)
  • Time-varying Vector Field Compression with Preserved Critical Point Trajectories
    Mingze Xia (Oregon State University), Yuxiao Li (The Ohio State University), Pu Jiao (University of Kentucky), Bei Wang (University of Utah), Xin Liang* (Oregon State University), Hanqi Guo (The Ohio State University)
[R-37] Secure Query Processing and Access Control
Time: Thursday, May 7, 15:30 - 17:00
Location: Rue Crescent
Track: Database Security & Privacy
Session Chair: [To Be Announced]
  • Secure Query Processing with Linear Online Cost
    Qiyao Luo* (OceanBase, Ant Group), Yilei Wang (Alibaba Cloud), Wei Dong (Nanyang Technological University), Ke Yi (Hong Kong Univ. of Science and Technology)
  • Zero-Knowledge Verifiable Graph Query Evaluation via Expansion-Centric Operator Decomposition
    Hao Wu (East China Normal University), Changzheng Wei (Ant Group), Yanhao Wang (East China Normal University), Li Lin (Ant Group), Yilong Leng (East China Normal University), Shiyu He (East China Normal University), Minghao Zhao* (East China Normal University), Hanghang Wu (Ant Group), Ying Yan (Ant Group), Aoying Zhou (East China Normal University)
  • Data Guard: A Fine-grained Purpose-based Access Control System for Large Data Warehouses
    Khai Tran* (LinkedIn), Sudarshan Vasudevan (LinkedIn), Pratham Desai (LinkedIn), Alex Gorelik (LinkedIn), Mayank Mahuja (LinkedIn), Athrey Venkateshababu (LinkedIn), Mohit Verma (LinkedIn), Dichao Hu (LinkedIn), Walaa Moustafa (LinkedIn), Vasanth Rajamani (OpenAI), Ankit Gupta (Anthropic), Issac Buenrostro (LinkedIn), Kalinda Raina (LinkedIn), Yanwen Lin (LinkedIn)
  • CFDGraph: Privacy-Preserving Graph Processing for Large-Scale Collaborative Fraud Detection
    Qiulin Wu (Shenzhen University, Hong Kong Baptist University), Amelie Chi Zhou* (Hong Kong Baptist University), Tristan Allard (Univ. Rennes, CNRS, IRISA), Shadi Ibrahim (Inria), Yuhong Feng (Shenzhen University), Lichun Li (Ant Group), Amr Abbadi (UC Santa Barbara)
  • RISK: Efficiently processing rich spatial-keyword queries on encrypted geo-textual data
    Zhen Lv (Xidian University), Cong Cao (Xidian University), Hongwei Huo (Xidian University), Jiangtao Cui (Xidian University), Yanguo Peng* (Xidian University), Hui Li (Xidian University), Yingfan Liu (Xidian University)
[R-38] Storage Management and LSM-tree Systems
Time: Friday, May 8, 10:00 - 12:00
Location: Av. Duluth
Track: Query Processing, Indexing, and Optimization
Session Chair: [To Be Announced]
  • Contextual Pattern Mining and Counting
    Ling Li (King's College London), Daniel Gibney (University of Texas at Dallas), Sharma Thankachan (North Carolina State University), Solon Pissis (CWI), Grigorios Loukides* (King's College London)
  • LogDelta: Differential Encoding for Log Data
    Songze Li (Tsinghua University), Shaoxu Song* (Tsinghua University), Zhitao Shen (Ant Group)
  • MatKV: Trading Compute for Flash Storage in LLM Inference
    Kun-Woo Shin (Seoul National University), Jay H. Park (Samsung Electronics), Moonwook Oh (Samsung Electronics), Yohan Jo (Seoul National University), Jaeyoung Do (Seoul National University), Sang-Won Lee* (Seoul National University)
  • AOEH: An Efficient Extendable Hashing to Reduce Read/Write Amplification for Persistent Memory
    SHIHAO ZHANG* (Shanghai Jiao Tong University), CHI ZHANG (Shanghai Jiao Tong University), YUNFEI GU (Shanghai Jiao Tong University), CHENTAO WU (Shanghai Jiao Tong University), JIE LI (Shanghai Jiao Tong University), JUNZHE LV (Shanxi Taihang Laboratory Co., Ltd)
  • Query-Driven LSM Compactions
    Shubham Kaushik* (Brandeis University), Manos Athanassoulis (Boston University), Subhadeep Sarkar (Brandeis University)
  • Resystance: Unleashing Hidden Performance of Compaction in LSM-trees via eBPF
    Hongsu Byun (Sogang University), Seungjae Lee (Sogang University), Honghyoen Yoo (Sogang Univerisy), MyoungJoon Kim (Sogang University), Sungyong Park* (Sogang Universiry)
  • Doux: Decoupling Values from Keys for Real-Time Analytics
    Shiming Yang* (Renmin University of China), Yu Luo (Renmin University of China), Shuang Liu (Renmin University of China), Wei Lu (Renmin University of China), Kuien Liu (Institute of Software Chinese Academy of Sciences), Yuxing Chen (Tencent Inc.), Anqun Pan (Tencent Inc.), Lixiong Zheng (Tencent Inc.), Xiaoyong Du (Renmin University of China)
[R-39] Trajectory, POI Recommendation and Spatial Crowdsourcing
Time: Friday, May 8, 10:00 - 12:00
Location: Rue McGill
Track: Spatial Databases and Temporal Databases
Session Chair: [To Be Announced]
  • High-Fidelity Task Assignment in Spatial Crowdsourcing via Implicit Human Feedback
    Qingshun Wu (Zhengzhou University), Yafei Li* (Zhengzhou University), Lei Gao (Zhengzhou University), Guanglei Zhu (Zhengzhou University), Lei Chen (Beijing Institute of Technology), Mingliang Xu (Zhengzhou University)
  • Efficient Model-Agnostic Continual Learning for Next POI Recommendation
    Chenhao Wang* (UESTC), Shanshan Feng (Wuhan University), Lisi Chen (UESTC), Fan Li (The Hong Kong Polytechnic University), Shuo Shang (UESTC)
  • PORCA: Root Cause Analysis with Partially Observed Data
    Chang Gong (Institute of Computing Technology, Chinese Academy of Sciences), Di Yao* (Institute of Computing Technology, Chinese Academy of Sciences), Jin Wang (Megagon Labs), Wenbin Li (Institute of Computing Technology, Chinese Academy of Sciences), Lanting Fang (Beijing Institute of Technology), Yongtao Xie (Southeast University), Kaiyu Feng (Beijing Institute of Technology), Peng Han (University of Electronic Science andTechnology of China), Jingping Bi (Institute of Computing Technology, Chinese Academy of Sciences)
  • Trajectory–User Linking via Heterogeneous Preference Graph and Dual-Encoder Mutual Distillation
    ZEMING TIAN (Sun Yat-sen University), Zixin Qin (Sun Yat-sen University), Huaijie Zhu* (Sun Yat-sen University), Ningning Cui (Chang'an University), Wei Liu (Sun Yat-sen University), Jianxing Yu (Sun Yat-sen University), Jian Yin (Sun Yat-sen University)
  • FedCurrMM: A Federated Map Matching Framework with Curriculum-aware Client Selection
    Minxiao Chen* (Beijing University of Posts and Telecommunications), Haitao Yuan (Nanyang Technological University), Haoning Wang (National University of Singapore), Nan Jiang (Nanyang Technological University), Zhihan Zheng (Beijing University of Posts and Telecommunications), Ao Zhou (Beijing University of Posts and Telecommunications), Shangguang Wang (State Key Laboratory of Networking and Switching Technology)
  • Geography-Aware Large Language Model for Next POI Recommendation
    Wei Liu* (Sun Yat-Sen University), Zhao Liu (Sun Yat-sen University), Muzu Xie (Sun Yat-sen University), Huaijie Zhu (Sun Yat-sen University), Jianxing Yu (Sun Yat-sen University), Jian Yin (Sun Yat-sen University), Wang-Chien Lee (The Pennsylvania State University)
  • Balancing Competition for Fairness-aware Task Assignment in Spatial Crowdsourcing
    Jinwen Chen* (University of Electronic Science and Technology of China), Hao Miao (The Hong Kong Polytechnic University), Lei Jia (University of Electronic Science and Technology of China), Guangqiang Yin (University of Electronic Science and Technology of China), Yan Zhao (Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China), Kai Zheng (University of Electronic Science and Technology of China)
[R-40] Spatiotemporal Forecasting, Urban Analytics and Recommendations
Time: Friday, May 8, 10:00 - 12:00
Location: Rue Sherbrooke
Track: Data Mining and Knowledge Discovery
Session Chair: [To Be Announced]
  • Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution
    Kaiqi Wu (Sun Yat-Sen University), Weiyang Kong (Sun Yat-Sen University), Sen Zhang (Sun Yat-Sen University), Zitong Chen (Sun Yat-Sen University), Yubao Liu* (Sun Yat-Sen University)
  • TransLGX: A Self-contained Model to Predict the Entire Lifecycle and complete state of Logistics Package Trajectories
    Yichen Song (Zhejiang Universitiy), Jianfeng Zhou (Bytedance), Jian-Ya Ding* (Bytedance), Renhao Cao (Bytedance)
  • Community-level Personalized Recommendation by Exploiting Evolving User-Item Micro-clusters
    Xinyu Liu* (University of Electronic Science and Technology of China), Jinxia Guo (University of Electronic Science and Technology of China), Qirui Hao (University of Electronic Science and Technology of China), Zhongjing Yu (Peking University), Qinli Yang (University of Electronic Science and Technology of China), Junming Shao (University of Electronic Science and Technology of China)
  • DNA: A Distribution-and-Aggregation Solution for Spatiotemporal K-function-based Analysis
    Tsz Nam Chan* (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Dingming Wu (Shenzhen University), Renchi Yang (Hong Kong Baptist University), Ruisheng Wang (Shenzhen University)
  • Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
    Taehyung Kwon (KAIST), Yeonje Choi (KAIST), Yeongho Kim (KAIST), Kijung Shin* (KAIST)
  • CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation
    Jinfeng Xu* (The University of Hong Kong), Zheyu Chen (Beijing Institute of Technology), Shuo Yang (The University of Hong Kong), Jinze Li (The University of Hong Kong), Hewei Wang (Carnegie Mellon University), Yijie Li (Carnegie Mellon University), Jianheng Tang (Peking University), Yunhuai Liu (Peking University), Edith Ngai (The University of Hong Kong)
  • Fast k-means via Data-Aware Grouping and Gap-Optimized Lower Bound
    Xiaogang Huang* (Fujian Normal University), Dan Zhuang (Fujian Normal University), Jianbao Chen (Fujian Normal University), Tiefeng Ma (Southwestern University of Finance and Economics), Shuangzhe Liu (University of Canberra)
[R-41] Table Question Answering and Discovery
Time: Friday, May 8, 10:00 - 12:00
Location: Rue Mansfield
Track: AI for Data Management
Session Chair: [To Be Announced]
  • Accurate Table Question Answering with Accessible LLMs
    Yangfan Jiang* (National University of Singapore), Fei Wei (Alibaba Group), Ergute Bao (Mohamed bin Zayed University of Artificial Intelligence), Yaliang Li (Alibaba Group), Bolin Ding (Alibaba Group), Yin Yang (Hamad Bin Khalifa University), Xiaokui Xiao (National University of Singapore)
  • Efficient and Scalable Search for Statistics
    Antoine Gauquier* (DI ENS, ENS, CNRS, PSL University & Inria), Simon Ebel (Inria), Helena Galhardas (INESC-ID & IST, Universidade Lisboa), Théo Galizzi (Inria), Ioana Manolescu (Inria), Aurélien Peden (Inria), Pierre Senellart (DI ENS, ENS, CNRS, PSL University & Inria)
  • Decomposition-Driven Multi-Table Retrieval and Reasoning for Numerical Question Answering
    Feng Luo (RMIT University), Hai Lan (The University of Queensland), Hui Luo (University of Wollongong), Zhifeng Bao* (The University of Queensland), Xiaoli Wang (Xiamen University), J.Shane Culpepper (The University of Queensland), Shazia Sadiq (The University of Queensland)
  • SPARQ: A Cost-Efficient Framework for Offline Table Question Answering via Adaptive Routing
    Yang Liu* (Beihang University), Mengyi Yan (Shandong University), Jiao Xue (Inspur Cloud Information Technology Co., Ltd.), Weilong Ren (Shenzhen Institute of Computing Sciences), Yutong Ye (Beihang University), Haoyi Zhou (Beihang University), Zhumin Chen (Shandong University), Jianxin Li (Beihang University)
  • L³C: Leaf-Centric Continuous Codes for Natural Language-Driven Table Discovery
    Qiyuan Zhang* (National University of Defense Technology), Ruochun Jin (National University of Defense Technology), Jixin Zhang (National University of Defense Technology), Yuhua Tang (National University of Defense Technology), Xiang Zhao (National University of Defense Technology), Shixuan Liu (National University of Defense Technology)
[R-42] AI-Powered Querying and RAG Systems
Time: Friday, May 8, 13:30 - 15:00
Location: Av. Duluth
Track: AI for Data Management
Session Chair: [To Be Announced]
  • HaS: Accelerating RAG through Homology-Aware Speculative Retrieval
    Peng Peng (South China University of Technology), Weiwei Lin (South China University of Technology), Wentai Wu* (Jinan University), Xinyang Wang (Beijing Forestry University), Yongheng Liu (Pengcheng Laboratory)
  • SaCal: An Efficient Saliency-Guided Causal Framework for Interpretable Healthcare Analytics
    Feixuan Lin* (Beijing Institute of Technology), Chenyu You (Beijing Institute of Technology), Zhongle Xie (Zhejiang University), Zhaojing Luo (Beijing Institute of Technology), Meihui Zhang (Beijing Institute of Technology)
  • XRAG: eXamining the Core - Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation [Experiment, Analysis, and Benchmark]
    Qili Zhang (Beihang University), Qianren Mao* (Zhongguancun Laboratory), Yangyifei Luo (Beihang University), Yashuo Luo (Beihang University), Hanwen Hao (Beihang University), Zhilong Cao (Beihang University), Weifeng Jiang (Nanyang Technological University), Zhijun Chen (Beihang University), Junnan Liu (Beihang University), Feng Yan (Beihang University), Xiaolong Wang (Beihang University), Jinlong Zhang (Beihang University), Zhenting Huang (Beihang University), Zhixing Tan (Zhongguancun Laboratory), Jie Sun (Zhongguancun Laboratory), Bo Li (Beihang University), Jianxin Li (Beihang University), Philip Yu (University of Illinois Chicago)
  • CARROT: A Learned Cost-Constrained Retrieval Optimization System for RAG
    Ziting Wang (Nanyang Technological University), Haitao Yuan* (Nanyang Technological University), Wei Dong (Nanyang Technological University), Gao Cong (Nanyang Technological University), Feifei Li (Alibaba Group)
  • ARCADE: A Real-Time Data System for Hybrid and Continuous Query Processing across Diverse Data Modalities
    Jingyi Yang* (Nanyang Technological University), Songsong Mo (Nanyang Technological University), Jiachen Shi (Nanyang Technological University), Zihao Yu (Nanyang Technological University), Kunhao Shi (Nanyang Technological University), Xuchen Ding (Nanyang Technological University), Gao Cong (Nanyang Technological University)
[R-43] Spatial Graphs and Road Network Analytics
Time: Friday, May 8, 13:30 - 15:00
Location: Rue McGill
Track: Graph Structure Analytics
Session Chair: [To Be Announced]
  • Reverse k Nearest Neighbor Query in Large Road Networks: A Tree Decomposition based Approach
    Dian Ouyang (Guangzhou University), Boyu Zhang (Guangzhou University), Jianye Yang* (Guangzhou University), Shiyu Yang (Guangzhou University), Chonghua Wang (China Industrial Control Systems Cyber Emergency Response Team), Xuemin Lin (Shanghai Jiao Tong University)
  • Overcoming the Sync-Compute Dilemma in Parallel Graph-Based Vector Retrieval
    Qiji Mo* (Nankai University), Zhiyuan Hua (Nankai University), Zebin Yao (Nankai University), Lixiao Cui (Nankai University), Gang Wang (Nankai University), Xiaoguang Liu (Nankai University), Zijing Wei (Alibaba Group Holding Limited), Xinyu Liu (Alibaba Group Holding Limited), Tianxiao Tang (Alibaba Group Holding Limited), Shaozhi Liu (Alibaba Group Holding Limited), Lin Qu (Alibaba Group Holding Limited)
  • Efficient Top-k Nearest Neighbors Search in Dynamic Road Networks
    Junhua Zhang (University of New South Wales), Yamei Song (University of New South Wales), Wentao Li* (University of Leicester), Lu Qin (University of Technology Sydney)
  • An Efficient and Scalable Approach for Path Queries on Public Transportation Networks
    Junhua Zhang* (Northeastern University), Wentao Li (University of Leicester), Wenjie Zhang (University of New South Wales), Lu Qin (University of Technology Sydney), Xiaochun Yang (Northeastern University)
  • BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search
    Huiling Li* (Hong Kong Baptist University), Xin Huang (Hong Kong Baptist University), Byron Choi (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University)
[R-44] Distributed Storage, Consensus and Infrastructure
Time: Friday, May 8, 13:30 - 15:00
Location: Rue Sherbrooke
Track: Distributed, Parallel and P2P Data Management
Session Chair: [To Be Announced]
  • SwitchDelta: Asynchronous Metadata Updating for Distributed Storage with In-Network Data Visibility
    Junru Li* (Tsinghua), Qing Wang (Tsinghua), Zhe Yang (Tsinghua), Shuo Liu (Huawei Technologies Co., Ltd.), Jiwu Shu (Tsinghua), Youyou Lu (Tsinghua)
  • GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph
    Feng Yao* (Northeastern University), Xiaokang Yang (Northeastern University), Shufeng Gong (Northeastern University), Song Yu (Northeastern University), Yanfeng Zhang (Northeastern University), Ge Yu (Northeastern University)
  • RL-Paxos: Relieving the Leader's Burden with Efficient Task Offloading in Distributed Consensus
    Chenhao Zhang* (Beihang university), Jinquan Wang (Beihang university), Meng Han (Tsinghua university), Bing Wei (Hainan university), Xiaojian Liao (Beihang university), Limin Xiao (Beihang university), Shanchen Pang (China University of Petroleum (East China))
  • DistVec: Efficient Distributed Machine Learning in Parallel Database Systems
    Xinyi Zhang* (Renmin University of China), Liangzu Liu (Peking University), Xupeng Miao (Purdue University), Yinjun Wu (Peking University), Zhen Chen (Tsinghua University), Wei Lu (Renmin University of China), Xiaoyong Du (Renmin University of China), Bin Cui (Peking University)
  • Nezha: A Key-Value Separated Distributed Store with Optimized Raft Integration
    Yangyang Wang* (Nanchang University), Yucong Dong (Nanchang University), Ziqian Cheng (Nanchang University), Zichen Xu (Nanchang University)
[R-45] Knowledge Graph Completion and Reasoning
Time: Friday, May 8, 13:30 - 15:00
Location: Rue Mansfield
Track: Graph Queries, Entity Alignment and Learning
Session Chair: [To Be Announced]
  • Chase Anonymisation: Privacy-Preserving Knowledge Graphs with Logical Reasoning
    Luigi Bellomarini (Bank of Italy), Costanza Catalano* (Bank of Italy), Andrea Coletta (Bank of Italy), Michela Iezzi (Bank of Italy), Pierangela Samarati (Università degli Studi di Milano)
  • Reconstructing TensorLog for Scalable End-to-end Rule Learning
    Kunxun Qi* (The Hong Kong University of Science and Technology (Guangzhou)), Jianfeng Du (Guangdong University of Foreign Studies), Hai Wan (Sun Yat-sen University), Wei Wang (The Hong Kong University of Science and Technology (Guangzhou))
  • An End-to-End Re-Evaluation of Table Entity-Linking Systems
    Martin Christensen* (Aalborg University), Matteo Lissandrini (University of Verona), Katja Hose (Technische Universität Wien)
  • RaSE-KGC: A Relation-Aware Segment Encoding Approach for Knowledge Graph Completion
    Chenxiao Lin (Xiamen University), Ye Luo* (Xiamen University), Kunhong Liu (Xiamen University), Qingqiang Wu (Xiamen University)
  • Semantic Compression for Sound and Complete Query Answering over Knowledge Graphs
    Junhua Ma* (Sun Yat-sen University), Jianfeng Du (Guangdong University of Foreign Studies), Hai Wan (Sun Yat-sen University), Yue Yu (Peng Cheng Laboratory), Qunxun Qi (Sun Yat-sen University), Weilin Luo (Sun Yat-sen University), Yanan Liu (Sun Yat-sen University)

Industry & Applications (I&A) Papers

[I&A-1] LLMs and AI for Database Operations
Time: Tuesday, May 5, 10:00 - 12:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • Graph Query Generation with Constraint-guided Large Language Agents
    Mengying Wang* (Case Western Reserve University), Nicolaas Jedema (Amazon), Rahul Pandey (Amazon), RaviKiran Krishnan (Meta), Jens Lehmann (Amazon), Yinghui Wu (Case Western Reserve University)
  • D2SQA: An Edge–Cloud Collaborative Slow Query Analysis Framework Deployed at DBAPPSecurity
    Ziquan Fang* (Zhejiang university), Xiangheng Wang (Zhejiang university), Zijun Jia (Beihang University), Bo Liu (DBAPPSecurity Co., Ltd.), Yuan Fan (DBAPPSecurity Co., Ltd.), Haichuan Zhang (DBAPPSecurity Co., Ltd.)
  • RedParrot: Accelerating NL-to-DSL for Business Analytics via Query Semantic Caching
    Tong Wang* (Zhejiang University), Yongqin Xu (Zhejiang University), Jianfeng Zhang (Zhejiang University), Lingxi Cui (Zhejiang University), Wenqing Wei (Xiaohongshu), Suzhou Chen (Xiaohongshu), Huan Li (Zhejiang University), Ke Chen (Zhejiang University), Lidan Shou (Zhejiang University)
  • DM-RAG: Enhancing User Support in Dameng Databases with Retrieval-Augmented Generation
    Qiang Huang* (Wuhan University), Ke Liu (Wuhan Dameng Database Co., Ltd), Liang Deng (Wuhan Dameng Database Co., Ltd), Sijing Zhang (Wuhan Dameng Database Co., Ltd), Chuang Hu (Wuhan University), Tieyun Qian (Wuhan University), Xiao Yan (Wuhan University), Jiawei Jiang (Wuhan University)
  • GalaxyRAG: Graph Retrieval-Augmented Generation for Enterprise Knowledge Systems
    Bing Tong* (The Hong Kong University of Science and Technology (Guangzhou)), Yan Zhou (Zhejiang Chuanglin Technology Co., Ltd.), Chen Zhang (Zhejiang Chuanglin Technology Co., Ltd.), Zhaojie Yin (Zhejiang Chuanglin Technology Co., Ltd.), Jia Li (The Hong Kong University of Science and Technology (Guangzhou))
  • Democratizing Tabular Data Access with an Open-Source Synthetic-Data SDK
    Ivona Krchova (MostlyAI), Mariana vargas vieyra* (MostlyAI), Mario Scriminaci (MostlyAI), Andrey Sidorenko (MostlyAI)
  • High-Fidelity and Complex Test Data Generation for Google SQL Code Generation Services
    Yeounoh Chung* (Google), Shivasankari Kannan (Google), Amita Gondi (Google), Tristan Swadell (Google), Fatma Ozcan (Google)
[I&A-2] Recommendation Systems and CTR Prediction
Time: Tuesday, May 5, 13:30 - 15:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems
    Haibo Xing* (Alibaba), Hao Deng (Alibaba), Yucheng Mao (Alibaba), Lingyu Mu (Alibaba), Jinxin Hu (Alibaba), Yi Xu (Alibaba), Hao Zhang (Alibaba), Jiahao Wang (Alibaba), Shizhun Wang (Alibaba), Yu Zhang (Alibaba), Xiaoyi Zeng (Alibaba), Jing Zhang (Wuhan University)
  • GALA: Generative Aligned Learning for Adaptive Multimodal Representation in the Eleme Recommender System
    JiPing Liu* (Alibaba Group), Zhongmin Zhang (Alibaba Group), Zisen Sang (Alibaba Group), Zhijia Fang (Alibaba Group), Tao Ouyang (Central South University), Ma Jiang (Alibaba Group), shaopeng liang (Alibaba Group), Zeyang Hou (Alibaba Group), Guodong Cao (Alibaba Group), Jia Jia (Alibaba Group)
  • Cascading Relevance-driven Recommendation Network for CTR Prediction in Trigger-Introduced Recommendation
    Kaixuan Chen* (Taobao & Tmall Group of Alibaba), Wenwen Wang (Taobao & Tmall Group of Alibaba), Xing Fang (Taobao & Tmall Group of Alibaba), Yang Huang (Taobao & Tmall Group of Alibaba), Jing Wang (Taobao & Tmall Group of Alibaba)
  • Decoupled Multimodal Fusion for User Interest Modeling in Click-Through Rate Prediction
    Alin Fan* (Alibaba International Digital Commerce Group), Hanqing Li (Alibaba International Digital Commerce Group), Sihan Lu (Renmin University of China), Jingsong Yuan (Alibaba International Digital Commerce Group), Jiandong Zhang (Alibaba International Digital Commerce Group)
  • OEPO: Online Experience-based Preference Optimization for CTR Prediction
    Zhichao Liao* (University of Electronic Science and Technology of China), Ziheng Ni (JD.com), Congcong Liu (JD.com), Zhiwei Fang (JD.com), Changping Peng (JD.com)
[I&A-3] E-commerce, Search and Feature Engineering
Time: Tuesday, May 5, 15:30 - 17:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization
    Yiwen Tang* (Shanghai AI Laboratory, Alibaba), Qiuyu Zhao (Alibaba), Zenghui Sun (Alibaba), Jinsong Lan (Alibaba), Xiaoyong Zhu (Alibaba), Bo Zheng (Alibaba)
  • Relevance Matters: A Multi-Task and Multi-Stage Large Language Model Approach for E-commerce Query Rewriting
    Aijun Dai* (Jingdong), Jixiang Zhang (Tsinghua University)
  • FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
    Kun Ouyang* (LIGHTSPEED STUDIOS, Tencent), Haoyu Wang (Tsinghua University), Dong Fang (LIGHTSPEED STUDIOS)
  • JITPrune: An Efficient Online Feature Pruning Framework for Embedding-based DLRM Training
    Hongzheng Li* (Beijing University of Posts and Telecommunications), Yucheng Wu (Peking University), Junjie Zhai (Tencent), Anan Liu (Tencent), Yuekui Yang (Tencent), Yingxia Shao (Beijing University of Posts and Telecommunications)
  • CoLIBRi: Supporting quotation through multi-modal retrieval and conversational search on manufacturing drawings
    Jacob Pollack* (Leipzig University), Lucas Peter (Leipzig University), Matthias Täschner (Leipzig University), Carmen Ahnert (CPT Präzisionstechnik GmbH)
[I&A-4] Scalable Data Systems and Infrastructure
Time: Wednesday, May 6, 10:00 - 12:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • OceanBase Mercury: Building a Distributed Real-time Analytical Processing Database System
    Quanqing Xu* (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group), Ruijie Li (OceanBase, Ant Group), Dongdong Xie (OceanBase, Ant Group), Hui Cao (OceanBase, Ant Group), Yi Xiao (OceanBase, Ant Group), Junquan Chen (OceanBase, Ant Group), Yanzuo Wang (OceanBase, Ant Group), Saitong Zhao (OceanBase, Ant Group), Fusheng Han (OceanBase, Ant Group)
  • OceanBase CDC: A Log-Based Distributed CDC System for High Availability and Scalability
    Quanqing Xu* (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group), Sen Wang (OceanBase, Ant Group), Fusheng Han (OceanBase, Ant Group)
  • Bala-Join: An Adaptive Hash Join for Balancing Communication and Computation in Geo-Distributed SQL Databases
    Wenlong Song (Xidian University), Hui Li* (Xidian University), Bingying Zhai (Xidian University), Jinxing Yang (Xidian University), Pinghui Wang (Xi’an Jiaotong University), Jiangtao Cui (Xidian University), Luming Sun (Yunxi Technology Company Ltd.), Ming Li (Shandong Inspur Database Technology Company Ltd.)
  • Automatic Parameter Tuning for Compaction in LSM-Tree based Databases
    Pinshan Cao (East China Normal University), Peng Cai* (East China Normal University), Xuan Zhou (East China Normal University), Jun-Peng Zhu (East China Normal University), Kecheng Luo (East China Normal University), Sijia Li (East China Normal University), Quanqing Xu (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group)
  • DBdoctor: A Fine-grained and Non-intrusive Performance Diagnosis Platform for Databases
    Xinyue Shi* (Renmin University of China), Quanqi Xin (Juhaokan Technology, Hisense), Zhengjin Wang (Renmin University of China), Xinyi Zhang (Renmin University of China), Haoqiong Bian (Renmin University of China), Wei Lu (Renmin University of China), Qiyu Zhuang (Renmin University of China), Shuang Liu (Renmin University of China), Jikuan Zhang (Juhaokan Technology, Hisense), Xiang Zheng (Juhaokan Technology, Hisense), Yunpeng Chai (Renmin University of China), Xiaoyong Du (Renmin University of China)
  • On Efficient Materialization in Data Lakes
    Andrew Harn (Google Inc), Herald Kllapi* (Google Inc), Zhepeng Yan (Google Inc)
  • StreamShield: A Production-Proven Resiliency Solution for Apache Flink at ByteDance
    Yong Fang (ByteDance), Yuxing Han* (ByteDance), Meng Wang (ByteDance), Yifan Zhang (ByteDance), Yue Ma (ByteDance), Chi Zhang (ByteDance)
[I&A-5] Time Series Analysis and Forecasting
Time: Wednesday, May 6, 15:30 - 17:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • TAT: Temporal-Aligned Transformer for Multi-Horizon Peak Demand Forecasting
    Zhiyuan Zhao* (Georgia Institute of Technology), Sitan Yang (Keystone AI), Stan Vitebsky (Amazon), B. Aditya Prakash (Georgia Institute of Technology), Dmitry Efimov (Amazon)
  • Hierarchical Industrial Demand Forecasting with Temporal and Uncertainty Explanations
    Harshavardhan Kamarthi (Georgia Institute of Technology), Shangqing Xu* (Georgia Institute of Technology), Xinjie Tong (Aspen Technology), Xingyu Zhou (The Dow Chemical Company), James Peters (The Dow Chemical Company), Joseph Czyzyk (The Dow Chemical Company), B. Aditya Prakash (Georgia Institute of Technology)
  • Accurate and Efficient Multi-channel Time Series Forecasting via Sparse Attention Mechanism
    Hengda Bao (SF Express), Jingfei Fang (SF Express), guangzheng wu* (Zhejiang University of Technology), Weihua Zhou (Zhejiang University)
  • From Benchmarks to Production: Transferring Time Series Anomaly Detection Methods for Electricity Production Monitoring
    Nicolas Vautier* (EDF Lab Paris Saclay), Paul Caron (EDF DOAAT), Nardi Xhepi (EDF DOAAT), Félicie Bizeul (EDF Lab Paris Saclay), Manel Boumghar (EDF Lab Paris Saclay), Christophe Degouy (EDF DOAAT), Paul Boniol (INRIA)
  • User-Adaptive Meta-Learning for Cold-Start Medication Recommendation with Uncertainty Filtering
    Arya Hadizadeh Moghaddam (University of Kansas), Mohsen Nayebi Kerdabadi (University of Kansas), Dongjie Wang (University of Kansas), Mei Liu (UF Health), Zijun Yao* (University of Kansas)
[I&A-6] Large-Scale ML Systems and Data Science Applications
Time: Thursday, May 7, 13:30 - 15:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • DLRover-LM: LLM Pre-Training Framework with Thousands of Accelerators in AntGroup
    Ziling Huang* (Sichuan University), Zhengmao Ye (Sichuan University), Qingsong Cai (Sichuan University), Zelong Huang (Sichuan University), Bo Sang (Ant Group), Haitao Zhang (Ant Group), Jian Sha (Ant Group), Tingfeng Lan (University of Virginia), Hui Lu (The University of Texas at Arlington), Yuanchun Zhou (Chinese Academy of Science), Mingjie Tang (Sichuan University)
  • Tackling Workload Forecasting Challenges with an Offline-Online Dynamic Framework
    Jian Jiang* (Ant Group), Yu Liu (Ant Group), Jia Li (Ant Group), Lu Han (Nanjing University), Wei Lu (Ant Group), Qiwen Deng (Ant Group), Zhibo Zhu (Ant Group), Xingyu Lu (Ant Group), Lintao Ma (Ant Group)
  • Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance
    Jihang Li* (Hong Kong University of Science and Technology (Guangzhou)), Qing Liu (Alibaba Group), Zulong Chen (Alibaba Group), Jing Wang (Alibaba Group), Wei Wang (Alibaba Group), Chuanfei Xu (Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen)), Zeyi Wen (Hong Kong University of Science and Technology (Guangzhou))
  • Building and Benchmarking Large Language Models for Machine Translation in Social Network Services
    Hongcheng Guo (Fudan University), Fei Zhao (Xiaohongshu Inc.), Shaosheng Cao* (Xiaohongshu Inc.), Xinze Lyu (Xiaohongshu Inc.), Zijie Meng (Zhejiang University), Yue Wang (Nanjing University), Yao Hu (Xiaohongshu Inc.), Zhoujun Li (Xiaohongshu Inc.), Zuozhu Liu (Zhejiang University)
  • Billion-scale Fintech Analytics: Scalable Data Management and Anomaly Detection at NPCI
    Bharadwaj Dasari (National Payments Corporation of India), Turaga Sai Dhiraj (National Payments Corporation of India), Ganesh Jambhrunkar (National Payments Corporation of India), Thirumalai Kailasam (National Payments Corporation of India), Charu Vikram (National Payments Corporation of India), Saurav Singla (National Payments Corporation of India), Pranjal Naman (Indian Institute of Science (IISc), Bangalore), Yogesh Simmhan* (Indian Institute of Science (IISc), Bangalore)
[I&A-7] Hardware-Accelerated Search, Compression and Data Integration
Time: Thursday, May 7, 15:30 - 17:00
Location: Av. Van-Horne
Track: Industry & Application
Session Chair: [To Be Announced]
  • CCD–Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs
    Yuchen Huang (East China Normal University), Baiteng Ma (East China Normal University), Yiping Sun (Xiaohongshu Inc), Yang Shi (Xiaohongshu Inc), Xiao Chen (Xiaohongshu Inc), Xiaocheng Zhong (Xiaohongshu Inc), Zhiyong Wang (Xiaohongshu Inc), Yao Hu (Xiaohongshu Inc), Chuliang Weng* (East China Normal University)
  • KScaNN: Scalable Approximate Nearest Neighbor Search on Kunpeng
    Oleg Senkevich (Huawei Technologies Ltd), Siyang Xu (Huawei Technologies Ltd.), Tianyi Jiang (Huawei Technologies Ltd.), Alexander Radionov (Huawei Technologies Ltd.), Jan Tabaszewski (Huawei Technologies Ltd.), Dmitriy Malyshev (Higher School of Economics), Zijian Li* (Huawei Technologies Ltd.), Daihao Xue (Huawei Technologies Ltd.), Licheng Yu (Huawei Technologies Ltd.), Weidi Zeng (Huawei Technologies Ltd.), Meiling Wang (Huawei Technologies Ltd.), Xin Yao (Huawei Technologies Ltd.), Siyu Huang (Huawei Technologies Ltd.), Gleb Neshchetkin (Huawei Technologies Ltd.), Qiuling Pan (Huawei Technologies Ltd.), Yaoyao Fu (Huawei Technologies Ltd.)
  • Efficient Data Processing using On-the-Fly Host-PIM Interactions in a Commodity PIM System
    Hyojune Kim (Hanyang University), Jeonghyeon Joo (Hanyang University), TaeHyeong Park (Yonsei University), Yongjun Park (Yonsei University), Hyuck Han (FuriosaAI), Sooyong Kang* (Hanyang University)
  • OpenZL: Using Graphs to Compress Smaller and Faster
    Yann Collet (Meta), Nick Terrell (Meta), Winston Felix Handte (Meta Platforms), Danielle Rozenblit (Meta), Victor Zhang* (Meta), Kevin Zhang (Meta), Yaelle Goldschlag (Meta), Jennifer Lee (Meta), Elliot Gorokhovsky (Meta), Yonatan Komornik (Meta), Daniel Riegel (Meta), Stan Angelov (Meta), Nadav Rotem (Meta)
  • GaV: Guess and Verification of Column Semantics
    Davide Di Stefano (TU Wien & Unlimidata Ltd), Jinsong Guo* (Unlimidata Ltd), Yang Hu (University of Leicester), Matteo Capalbo (University of Calabria), Davide Mario Longo (University of Calabria), Georg Gottlob (University of Calabria)

Lightning Talks

[Lightning Talks] Lightning Talks Session
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue Notre-Dame
Track: Lightning Talks
Session Chair: [To Be Announced]
  • MDSD: Multi-turn Diverse Synthetic Dialog Generation for Domain Specific Incomplete Requests Understanding
    Xi Li* (Apple), Xiaoxu Wu (Apple), Lijuan Xiao (Apple), Tao Liu (Apple), Ping Huang (Apple), Jiulong Shan (Apple)
  • Model Slicing: a Data Engineering Perspective
    Parke Godfrey (York University), Lukasz Golab* (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Jarek Szlichta (York University)
  • Responsible Entity Resolution over Streaming Data
    Kostas Stefanidis* (Tampere University), Vasilis Efthymiou (Harokopio University of Athens), Tiago Brasileiro Araújo (Tampere University)
  • Tuning IBM Db2 with Explainable AI
    Andrew Chai* (York University and IBM CAS), Alexander Bianchi (IBM Canada Ltd.), Vincent Corvinelli (IBM Canada Ltd.), Parke Godfrey (York University and IBM CAS), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University and IBM CAS), Calisto Zuzarte (IBM Canada Ltd.)
  • From Polystores to PolyDBMS: The Polypheny Experience
    Marco Vogt* (University of Basel), David Lengweiler (University of Basel), Martin Vahlensieck (University of Basel), Yiming Wu (University of Basel), Heiko Schuldt (University of Basel)
  • Recovering Structure in Unstructured LLM Outputs
    Joel Rorseth* (University of Waterloo), Parke Godfrey (York University), Lukasz Gola (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Szlichta Szlichta (York University)
  • Rethinking BFT Consensus via Transaction-Level Protocol Construction for Optimal Performance
    BINGXU ZHU* (Concordia University), Gengrui Zhang (Concordia University)
  • SENTINEL: Evaluating Pipeline Robustness to Distributional Shifts
    Jahid Hasan* (Purdue University), Jingya Wang (Purdue University), Romila Pradhan (Purdue University)
  • SPLICE: An Efficient Framework for Selecting Sources for Machine Learning Tasks
    Ambarish Singh* (Purdue University), Romila Pradhan (Purdue University)
  • When Text-to-SQL Evaluation Misleads: Rethinking Benchmarking Practices
    Oktie Hassanzadeh* (IBM Research), Nhan Pham (IBM Research), Timothy Dinger (IBM Research), Tanvi Kaple (IBM Research), Long Vu (IBM Research), Michael Glass (IBM Research), Shankar Subramanian (IBM Research)
  • Are Dedicated Vector Databases Necessary? Benchmarking Vector Search in Relational and Analytical Systems for Enterprise Workloads
    Archan Dutta* (Aisera), Thais Poi (San Joaquin Delta College)
  • Is Quantum Computing Ready for Real-Time Database Optimization?
    Hanwen Liu* (University of Southern California), Ibrahim Sabek (University of Southern California)
  • On Breaking the Scalability Barrier in Data Cleaning
    El Kindi Rezig* (University of Utah)
  • GNN Explainers 2.0: A Paradigm for User-Oriented, Data-Guided Explanations
    Arijit Khan* (Bowling Green State University)
  • SalesforceDB: Built on one LSM to rule them all !
    Vaibhav Arora* (Salesforce), Peter Desnoyers (Salesforce)

Demos

[Demo A] Accepted Demo Papers - Group A
Time: Tuesday, May 5, 10:00 - 12:00 & 15:30 - 17:00
Location: Av. Laurier
Track: Demonstrations
Session Chair: [To Be Announced]
  • BClean+: A Bayesian Data Cleaning System with Automated Prior Generation
    Ziyan Han* (Shenzhen University), Jing Zhu (Shenzhen University), Jinbin Huang (Shenzhen University), Rui Mao (Shenzhen University; Shenzhen Institute of Computing Sciences), Jianbin Qin (Shenzhen University; Shenzhen Institute of Computing Sciences)
  • LazyVLM: Neuro-Symbolic Approach to Video Analytics
    Xiangru Jian* (University of Waterloo), Wei Pang (University of Waterloo), Zhengyuan Dong (University of Waterloo), Chao Zhang (University of Waterloo), Tamer Özsu (University of Waterloo)
  • Jazero: A Semantic Table Search System
    Martin Christensen* (Aalborg University), Matteo Lissandrini (University of Verona), Katja Hose (Technische Universität Wien)
  • GeX: Guiding tuning of Db2 with eXplainable AI
    Andrew Chai* (York University and IBM CAS), Alexander Bianchi (IBM Canada Ltd.), Vincent Corvinelli (IBM Canada Ltd.), Parke Godfrey (York University and IBM CAS), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University and IBM CAS), Calisto Zuzarte (IBM Canada Ltd.)
  • A Fast, Versatile, and User-friendly Plugin for Kernel Density Analysis
    Tsz Nam Chan* (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Leong Hou U (University of Macau), Dingming Wu (Shenzhen University), Wei Tu (Shenzhen University), Jianliang Xu (Hong Kong Baptist University)
  • Schema-GraphRAG: Bridging Hybrid Search and Graph Traversal for Complex Retrieval Tasks
    Bastian Lipka* (IBM Research), Venkata Vamsikrishna Meduri (IBM Research), Berthold Reinwald (IBM Research), Nasrullah Sheikh (IBM Research)
  • TAPE: A Temporal Graph-based Memory System for Personal LLM Agents
    Chengyang Luo (Zhejiang University), Qing Liu (Zhejiang University), Wenjie Zhang (The University of New South Wales), Yunjun Gao* (Zhejiang University)
  • Scoper: Streamline Linkable Schemas for Matching
    Leonard Traeger* (University of Maryland, Baltimore County), Andreas Behrend (Technical University Cologne), George Karabatis (University of Maryland, Baltimore County)
  • CORAL: COncept-based Explanations for RAG LLMs
    Katherine Ling* (York University), Joel Rorseth (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University)
  • DeepSketch 2.0: Discovering Temporal Relationships in Large Time Series Datasets
    Zheng Zhang (Northwestern University), Runzhe Jiang (Northwestern University), Andrew Crotty* (Northwestern University)
[Demo B] Accepted Demo Papers - Group B
Time: Tuesday, May 5, 13:30 - 15:00 & Wednesday, May 6, 10:00 - 12:00
Location: Av. Laurier
Track: Demonstrations
Session Chair: [To Be Announced]
  • DJGen: Data & Conjunctive Join Plan Generator
    Vasilis Sarris* (University of Pittsburgh), Brian Nixon (University of Pittsburgh), Panos Chrysanthis (University of Pittsburgh)
  • CASE: A Comprehensive and Interactive Influence Analysis System for Social Networks
    Xueqin Chang (Zhejiang University), Chuanyu Liu (Zhejiang University), Qing Liu (Zhejiang University), Baihua Zheng (Singapore Management University), Yunjun Gao* (Zhejiang University)
  • HEX: OLAP-Enabled Hierarchical Explanations
    Kathryn Carbone* (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University), Robin Cohen (University of Waterloo)
  • RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems
    Joel Rorseth* (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Jarek Szlichta (York University)
  • SENTINEL: Evaluating Pipeline Robustness to Distributional Shifts
    Jahid Hasan* (Purdue University), Jingya Wang (Purdue University), Romila Pradhan (Purdue University)
  • SqlRewriter: Harnessing Community Knowledge to Rewrite SQL Queries
    Qiushi Bai* (UC Irvine), Yihong Yu (UC Irvine), Colin Harrison (UC Irvine), Jiatong Liu (UC Irvine), James Liu (UC Irvine), Jessie He (UC Irvine), Hartley Tran (UC Irvine), Jun Xia (UC Irvine), Chen Li (UC Irvine)
  • LOADS: Adaptive Cloud-Edge-Device Database Management System Optimizer
    Chunyu Zhao (Harbin Institute of Technology), Yihan Zhang (Harbin Institute of Technology), Shuangshuang Cui (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology)
  • Multi-Model Geospatial Data Management and Exploration
    David Lengweiler* (University of Basel), Marco Vogt (Polypheny GmbH), Heiko Schuldt (University of Basel)
  • PiPer - Leveraging Pipeline Perspectives for Effective Data Pipeline Exploration
    Melanie Herschel* (Nanyang Technological University), Ridhwan Hakim Bin Kusni (NTU)
  • Pathfinder: Context Engineering and Knowledge Management for Domain-Specific Horizontal Reasoning
    Joohyun Lee* (Seoul National University), Ghita Benboubker (Seoul National University), JungKwan Han (Seoul National University), Jisoo Jang (Seoul National University), Wen-Syan Li (Seoul National University)

TKDE Posters

[TKDE Poster A] Accepted TKDE Posters - Group A
Time: Tuesday, May 5, 10:00 - 12:00 & 15:30 - 17:00
Location: Av. Viger
Track: TKDE Posters
Session Chair: [To Be Announced]
  • Minimum k-Vertex Connected Graph Search
    Yang Liu (Harbin Institute of Technology, Shenzhen), Hejiao Huang (Harbin Institute of Technology, Shenzhen), Kaiqiang Yu (Nanjing University), Shengxin Liu* (Harbin Institute of Technology, Shenzhen), Cheng Long (Nanyang Technological University)
  • PiTruss Community Search for Multilayer Graphs
    Run-An Wang (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Dandan Liu (Harbin Institute of Technology), Xudong Liu (Harbin Institute of Technology)
  • Graph2Region: Efficient Graph Similarity Learning with Structure and Scale Restoration
    Zhouyang Liu* (National University of Defense Technology), Yixin Chen (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Jiezhong He (National University of Defense Technology), Dongsheng Li (National University of Defense Technology)
  • Structural Clustering of Multi-layer Graphs
    Xudong Liu (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Run-An Wang (Harbin Institute of Technology), Dandan Liu (Harbin Institute of Technology)
  • Jet-BGC: Joint Latent Embedding and Structural Fusion Bipartite Graph Clustering
    Liang Li* (National University of Defense Technology), Yuangang Pan (Agency for Science, Technology and Research (A∗STAR)), Junpu Zhang (National University of Defense Technology), Pei Zhang (National University of Defense Technology), Jie Liu (National University of Defense Technology), Xinwang Liu (National University of Defense Technology), Kenli Li (Hunan University), Ivor W. Tsang (Agency for Science, Technology and Research (A∗STAR)), Keqin Li (State University of New York)
  • ConvD: Attention Enhanced Dynamic Convolutional Embeddings for Knowledge Graph Completion
    Wenbin Guo (Tianjin University), zhao li (Tianjin University), Xin Wang* (Tianjin University), Zirui Chen (Tianjin University), Jun Zhao (Ningxia University), Jianxin Li (Edith Cowan University), Yuan Ye (Beijing Institute of Technology)
  • HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding
    Zhao Li* (Tianjin University), Xin Wang (Tianjin University), Jun Zhao (Ningxia University), Wenbin Guo (Tianjin University), Jianxin Li (Edith Cowan University)
  • An Amortized O(1) Lower Bound for Dynamic Time Warping in Motif Discovery
    Zemin Chao (Harbin institute of technology), Hong Gao (Zhejiang Normal University), Dongjing Miao (Harbin institute of technology), Jianzhong Li (Harbin institute of technology), Hongzhi Wang* (Harbin institute of technology)
  • Discovery of Temporal Network Motifs
    Hanqing Chen (Beihang University), Shuai Ma* (Beihang University), Junfeng Liu (Beihang University), Lizhen Cui (Shandong University)
[TKDE Poster B] Accepted TKDE Posters - Group B
Time: Tuesday, May 5, 13:30 - 15:00 & Wednesday, May 6, 10:00 - 12:00
Location: Av. Viger
Track: TKDE Posters
Session Chair: [To Be Announced]
  • Generalized Local Prominence for Source Detection in Real-World Rumor Networks
    Syed Shafat Ali* (University of Kashmir), Ajay Rastogi (Amity University), Tarique Anwar (RMIT University), Syed Afzal Murtaza Rizvi (Jamia Millia Islamia), Jian Yang (Macquarie University), Jia Wu (Macquarie University), Quan Z. Sheng (Macquarie University)
  • Orthogonal Keys: High Precision and Recall for Mining Meaningful Database Keys from Inconsistent and Incomplete Relations
    Henning Koehler (Massey University), Sebastian Link* (University of Auckland)
  • Hierarchy-Aware Neural Subgraph Matching with Enhanced Similarity Measure
    Zhouyang Liu* (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Yixin Chen (National University of Defense Technology), Jiezhong He (National University of Defense Technology), Menghan Jia (National University of Defense Technology), Dongsheng Li (National University of Defense Technology)
  • Can Uncertainty Quantification Improve Learned Index Benefit Estimation?
    Tao Yu (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Hao Xiong (Harbin Institute of Technology)
  • Intent Propagation Contrastive Collaborative Filtering
    Haojie Li* (Qingdao University of Science and Technology), Junwei Du (Qingdao University of Science and Technology), Guanfeng Liu (Macquarie University), Feng Jiang (Qingdao University of Science and Technology), Yan Wang (Macquarie University), Xiaofang Zhou (Hong Kong University of Science and Technology)
  • Maximizing Influence Query over Indoor Trajectories
    Jian Chen* (Harbin Institute of Technology), Hong Gao (Zhejiang Normal University), Yuhong Shi (Harbin Institute of Technology), Junle Chen (Harbin Institute of Technology), Donghua Yang (Harbin Institute of Technology), Jianzhong Li (Chinese Academy of Sciences)
  • LIOF: Make the Learned Index Learn Faster With Higher Accuracy
    Tao Ji* (School of Information, Renmin University of China), Kai Zhong (School of Information, Renmin University of China), Luming Sun (Yunxi Technology Company Ltd.), Yiyan Li (School of Information, Renmin University of China), Cuiping Li (School of Information, Renmin University of China), Hong Chen (School of Information, Renmin University of China)
  • FedDict: Towards Practical Federated Dictionary-Based Time Series Classification
    Zhiyu Liang (Harbin Institute of Technology), Zheng Liang (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology), Bo Zheng (CnosDB Inc.)
  • Snoopy: Effective and Efficient Semantic Join Discovery via Proxy Columns
    Yuxiang Guo (Zhejiang University), Yuren Mao (Zhejiang University), Zhonghao Hu (Zhejiang University), Lu Chen (Zhejiang University), Yunjun Gao* (Zhejiang University)

DEFT

[DEFT] Data Engineering Future Technologies
Time: Thursday, May 7, 10:00 - 12:00
Location: Av. Van-Horne
Track: Data Engineering Future Technologies (DEFT)
Session Chair: [To Be Announced]
  • Efficient Neural-Symbolic Data System via Multi-Agent Collaboration
    Ye Yuan (Beijing Institute of Technology), Bo Tang (Southern University of Science and Technology), Zhaojing Luo (Beijing Institute of Technology), Boyang Li (Beijing Institute of Technology), Renjie Liu (Southern University of Science and Technology), Zhilang Wei (Beijing Institute of Technology)
  • VPS: Rethinking OLTP Database Performance Evaluation through Transactional Value
    Jianbin Qin (Shenzhen University), Yibin Lin (Shenzhen University), Wendi Hua (Shenzhen University), Rui Mao (Shenzhen University), Chuan Xiao (Osaka University)
  • Living Databases: A Unified Model for Continuous Schema Evolution, Versioning, and Transformations
    Amol Deshpande (University of Maryland at College Park)
  • Green or Greedy? An Ecological Analysis of Datacenter GPU Replacements
    Marc Baeuerle (Hasso Plattner Institute, University of Potsdam), Ole Becker (Hasso Plattner Institute, University of Potsdam), Nikolas Hoellerl (Hasso Plattner Institute, University of Potsdam), Ricardo Salazar Díaz (Hasso Plattner Institute, University of Potsdam), Ilin Tolovski (Hasso Plattner Institute, University of Potsdam), Tilmann Rabl (Hasso Plattner Institute, University of Potsdam)
  • GenIE: Simulator-Driven Iterative Data Exploration for Scientific Discovery
    Ashwin Colaco (University of California, Irvine), Martin Boissier (Hasso Plattner Institute), Sriram Rao (University of California, Irvine), Shubharoop Ghosh (ImageCat), Sharad Mehrotra (University of California, Irvine), Tilmann Rabl (Hasso Plattner Institute)
  • Towards a Hybrid Quantum-Classical Computing Framework for Database Optimization Problems in Real Time Setup
    Hanwen Liu (University of Southern California), Ibrahim Sabek (University of Southern California)

Tutorials

[Tutorial-1] Evolution of LSM-Tree Key-Value Stores: A Tutorial on State-of-the-Art and Future Directions
Time: Tuesday, May 5, 10:00 - 12:00
Location: Rue Saint-Denis
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Yina Lv (Xiamen University, China)
    Qiao Li (MBZUAI, UAE)
    Quanqing Xu (OceanBase, Ant Group, China)
    Chun Jason Xue (MBZUAI, UAE)
[Tutorial-2] Large Language Models for Spatial Analysis Queries
Time: Wednesday, May 6, 10:00 - 12:00
Location: Rue Saint-Denis
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Mohamed Hemdan (University of Minnesota, Minnesota, USA)
    Youssef Hussein (University of Minnesota, Minnesota, USA)
    Mohamed F. Mokbel (University of Minnesota, Minnesota, USA)
[Tutorial-3] Data Discovery in Data Lakes: Operations, Indexes, Systems
Time: Tuesday, May 5, 13:30 - 15:00 (Part I) & 15:30 - 17:00 (Part II)
Location: Rue Saint-Denis
Track: Tutorials
Length: 3 Hours
  • Presenters:
    Ziawasch Abedjan (TU Berlin & BIFOLD Berlin, Germany)
    Mahdi Esmailoghli (Humboldt-Universitat zu Berlin, Berlin, Germany)
    Sainyam Galhotra (Cornell University, Ithaca, NY, USA)
[Tutorial-4] The Virtuous Cycle: AI-Powered Vector Search and Vector Search-Augmented AI
Time: Wednesday, May 6, 15:30 - 17:00
Location: Rue Saint-Denise
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Jiuqi Wei (Oceanbase, Ant Group, Beijing, China)
    Quanqing Xu (Oceanbase, Ant Group, Hangzhou, China)
    Chuanhui Yang (Oceanbase, Ant Group, Beijing, China)
[Tutorial-5] Query Rewrite in the Learning Age: From Rules to ML-Based and LLM-Driven Techniques
Time: Thursday, May 7, 10:00 - 12:00
Location: Rue Saint-Denis
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Shengchen Liu (University of Ottawa, Ottawa, ON, Canada)
    Verena Kantere (University of Ottawa, Ottawa, ON, Canada)
    Nicholas Ostan (IBM Canada Ltd., Toronto, ON, Canada)
    Farhana Haider (IBM Canada Ltd., Toronto, ON, Canada)
    Calisto Zuzarte (IBM Canada Ltd., Toronto, ON, Canada)
[Tutorial-6] Data-Centric Foundations of Agentic AI
Time: Thursday, May 7, 13:30 - 15:00
Location: Rue Saint-Denis
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Yuxin Jin (University of Technology Sydney, Sydney, Australia)
    Ying Zhang (Zhejiang Gongshang University, Hangzhou, China)
    Hanchen Wang (University of Technology Sydney, Sydney, Australia)
    Wenjie Zhang (University of New South Wales, Sydney, Australia)
[Tutorial-7] Model Slicing: A Data Engineering Perspective
Time: Thursday, May 7, 15:30 - 17:00
Location: Rue Saint-Denis
Track: Tutorials
Length: 1.5 Hours
  • Presenters:
    Parke Godfrey (York University, Toronto, ON, Canada)
    Lukasz Golab (University of Waterloo, Waterloo, ON, Canada)
    Divesh Srivastava (AT&T Chief Data Office, Bedminster, NJ, USA)
    Jarek Szlichta (York University, Toronto, ON, Canada)