PRO-HNSW: Proactive Repair and Optimization for High-Performance Dynamic HNSW Indexes
Huijun Jin (Yonsei University), Jieun Lee (Yonsei University), Shengmin Piao (Yonsei University), Sangmin Seo (Yonsei University), Sanghyun Park (Yonsei University)
BS-tree: A gapped data-parallel B-tree
Dimitrios Tsitsigkos (Athena RC), Achilleas Michalopoulos (University of Ioannina), Nikos Mamoulis (University of Ioannina), Manolis Terrovitis (Athena RC)
Updatable Balanced Index for Fast On-device Search with Auto-selection Model
Yushuai Ji (Wuhan University), Sheng Wang (Wuhan University), Zhiyu Chen (Amazon), Yuan Sun (La Trobe University), Zhiyong Peng (Wuhan University)
Mitigating Dual Load Imbalance via Dynamic Cooperative Scheduling in Distributed Key-Value Stores
Jiakun Zhang (University of Science and Technology of China), Patrick P. C. Lee (The Chinese University of Hong Kong), Wenzhe Zhu (University of Science and Technology of China), Yongkun Li (University of Science and Technology of China), Shuyi Zhang (University of Science and Technology of China), Yinlong Xu (University of Science and Technology of China)
Fast Content-Aware Influence Maximization Query Answering by Labeling Index
Xingliang Lv (Zhejiang University), Qihao Shi* (Zhejiang University), Can Wang (Zhejiang University), Mingli Song (Zhejiang University), Wenliang Du (Zhejiang University), Wujian Yang (Hangzhou City University), Guanlin Chen (Hangzhou City University)
One Size Does NOT Fit All: On the Importance of Physical Representations for Datalog Evaluation [Experiment, Analysis, and Benchmark]
Nick Rassau (Johannes Gutenberg University Mainz), Felix Schuhknecht* (Johannes Gutenberg University Mainz)
[R-2] Spatiotemporal Prediction and Urban Computing
Time: Tuesday, May 5, 10:00 - 12:00 Location: Rue McGill Track: Spatial Databases and Temporal Databases Session Chair: [To Be Announced]
Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision
Yuyang Xia (University of Electronic Science and Technology of China), Zibo Liang (University of Electronic Science and Technology of China), Liwei Deng (University of Electronic Science and Technology of China), Yan Zhao (University of Electronic Science and Technology of China), Han Su (University of Electronic Science and Technology of China), Kai Zheng (University of Electronic Science and Technology of China)
SaSPartitioner: A Self-adaptive Streaming Partitioner using Deep Reinforcement Learning
Shenghao Gong (Zhejiang University), Liu Liu (Zhejiang University), Ziquan Fang (Zhejiang University), Yunjun Gao (Zhejiang University), Yaofeng Tu (ZTE Corporation)
Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction
Rui AN (The Hong Kong Polytechnic University), Yifeng Zhang (The Hong Kong Polytechnic University), Ziran Liang (The Hong Kong Polytechnic University), Wenqi Fan (The Hong Kong Polytechnic University), Yuxuan Liang (The Hong Kong University of Science and Technology (Guangzhou)), Xuequn Shang (Northwestern Polytechnical University), Qing Li (The Hong Kong Polytechnic University)
Online Multi-Modal Spatio-Temporal Prediction: A Reinforcement Learning and Dynamic Contrastive Framework
Ziquan Fang* (Zhejiang University), Tinghui Luo (Zhejiang University), Xiaole Pan (Zhejiang University), Lu Chen (Zhejiang University), Surun Ji (iQIYI Inc), Mingfan Lu (iQIYI Inc)
City-wide Origin-destination Matrix Generation via Cascaded Graph Denoising Diffusion
Can Rong* (Singapore-MIT Alliance for Research and Technology (SMART)), Jingtao Ding (Tsinghua University), Zhicheng Liu (Alibaba Group), Peng Lu (PKU-Wuhan Institute for Artificial Intelligence), Yong Li (Tsinghua University)
VisiFold: Long-Term Traffic Forecasting via Temporal Folding Graph and Node Visibility
Zhiwei Zhang* (Beijing Jiaotong University), Xinyi Du (Beijing Normal University), Weihao Wang (Beijing Jiaotong University), Xuanchi Guo (Beijing Jiaotong University), Wenjuan Han (Beijing Jiaotong University)
Data-Segmentation Prompt based Continual Learning Framework for Online Spatio-Temporal Prediction
Banglie Yang (Sichuan University), Liwei Deng* (Aalborg University), Cheng Dai (Sichuan University), Kai Zheng (University of Electronic Science and Technology of China)
[R-3] Benchmarking, Testing and Evaluation of DB Systems
Time: Tuesday, May 5, 10:00 - 12:00 Location: Rue Sherbrooke Track: AI-based DB Tuning, Benchmarks and Performances Session Chair: [To Be Announced]
Benchmarking RL-Enhanced Spatial Indices Against Traditional, Advanced, and Learned Counterparts [Experiment, Analysis, and Benchmark]
Guanli Liu (The University of Melbourne), Renata Borovica-Gajic (The University of Melbourne), Hai Lan (RMIT University), Zhifeng Bao (RMIT University)
GPU-Accelerated OLTP: An In-Depth Analysis of Concurrency Control Schemes [Experiment, Analysis, and Benchmark]
Zihan Sun (Tsinghua University), Yuyu Luo (HKUST(GZ)), Yong Zhang (Tsinghua University), Chao Li (Tsinghua University), Chunxiao Xing (Tsinghua University)
Tetris: Lightweight Hyperparameter Auto-Tuning for Mitigating Performance Spikes in LSM-KVS
Yina Lv* (Xiamen University), Wenhao Zhu (Xiamen University), Qiao Li (Mohamed bin Zayed University of Artificial Intelligence), Quanqing Xu (OceanBase, Ant Group), Congming Gao (Xiamen University), Chuanhui Yang (OceanBase, Ant Group), Xiaoli Wang (Xiamen University), Chun Jason Xue (Mohamed bin Zayed University of Artificial Intelligence)
Distance Comparison Operations Are Not Silver Bullets in Vector Similarity Search: A Benchmark Study on Their Merits and Limits [Experiment, Analysis, and Benchmark]
Zhuanglin Zheng (Beihang University), Yuxiang Zeng (Beihang University), Chenchen Liu (Beihang University), Yunzhen Chi (Beihang University), Binhan Yang (Beihang University), Yongxin Tong* (Beihang University)
WikiDBGraph: A Data Management Benchmark Suite for Collaborative Learning over Database Silos [Experiment, Analysis, and Benchmark]
Zhaomin Wu* (National University of Singapore), Ziyang Wang (National University of Singapore), Bingsheng He (National University of Singapore)
Vireo: Human-in-the-Loop DBMS Fuzzing with Visualization and LLM Support
Jie Liang* (Beihang University), Zhiyong Wu (Tsinghua University), Jingzhou Fu (Tsinghua University), Chi Zhang (Tsinghua University), Runpei Miao (Beihang University), Zhuo Su (Beihang University), Yu Jiang (Tsinghua University), Shuai Ma (Beihang University)
HCT-QA: A Benchmark for Question Answering on Human-Centric Tables [Experiment, Analysis, and Benchmark]
Mohammad Shahmeer Ahmad* (QCRI, HBKU), Zan Naeem (QCRI, HBKU), Michael Aupetit (QCRI, HBKU), Ahmed Elmagarmid (QCRI, HBKU), Mohamed Eltabakh (QCRI, HBKU), Xiaosong Ma (MBZUAI), Mourad Ouzzani (QCRI, HBKU), Chaoyi Ruan (NUS), Hani Al-Sayeh (QCRI, HBKU)
[R-4] Data Quality, Repair and Outlier Detection
Time: Tuesday, May 5, 10:00 - 12:00 Location: Rue Mansfield Track: Information Integration and Data Quality Session Chair: [To Be Announced]
PROCore: Robust Core-set Selection via Pareto Multi-dimensional Optimization from Noisy Data
Xiaoou Ding (Harbin Institute of Technology), Hongbin Hu (Harbin Institute of Technology), Songnan Jiang (Harbin Institute of Technology), Muyun Zhou (Harbin Institute of Technology), Chen Wang (Tsinghua University), Jingru Yang (National Key Laboratory of Data Space Technology and System), Hongzhi Wang* (Harbin Institute of Technology)
Truth ≠ Frequency: Leveraging Dependencies for Subset Repair
RFOD: Random Forest-based Outlier Detection for Mixed-Type Tabular Data
Yihao Ang (National University of Singapore), Peicheng Yao (National University of Singapore), Yifan Bao (National University of Singapore), Yushuo Feng (Huazhong University of Science and Technology), Qiang Huang* (Harbin Institute of Technology (Shenzhen)), Anthony K. H. Tung (National University of Singapore), Zhiyong Huang (National University of Singapore)
TORepair: Diffusion-based Task-Oriented Error Repair via Differentiable Bi-Level Optimization
Wei Ni (Zhejiang University; City University of Hong Kong), Xiaoye Miao* (Zhejiang University), Xiangyu Zhao (City University of Hong Kong), Yangyang Wu (Zhejiang University), Jianwei Yin
EDDI: Explainable Data Drift Monitoring using Influence
Nikolaos Myrtakis* (University of Crete), Andrea Castellani (Honda Research Institute Europe GmbH), Ioannis Tsamardinos (University of Crete), Vassilis Christophides (ENSEA, CY Cergy Paris University, CNRS)
Analysis of Candidate Keys in Relational Databases
Zihui Yang (University of Auckland), Yuqian Ma (University of Auckland), Sebastian Link* (University of Auckland)
Representative Functional Dependencies
Qiongqiong Lin (Zhejiang University), Jingyan Sai (Alibaba Group), Jiazheng Song (Zhejiang University), Jinfei Liu* (Zhejiang University), Kui Ren (Zhejiang University), Tianzhen Wang (Alibaba Group), Yanbei Pang (Alibaba Group), Feifei Li (Alibaba Group)
[R-5] Blockchain Protocols, Storage and Smart Contracts
Time: Tuesday, May 5, 10:00 - 12:00 Location: Rue Crescent Track: Distributed Ledgers and Blockchains Session Chair: [To Be Announced]
RoarChain: A Robust Sharding Blockchain System for Enterprise Consortium
Yuan Sui (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Yujie Zhang (Northeastern University), Lina Wang (Wuhan University)
Banknote-Chain: Achieving User-Incentivized Parallelism in Blockchain via a Banknote-Inspired Transaction Model
Zhiyu Ma (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Xiaofeng Li (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), He Zhao (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Tong Zhou (Hefei Institutes of Physical Science, Chinese Academy of Sciences; Anhui ZhongKeJingGe Technology Co., Ltd), Nianzu Sheng (Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China), Haotian Cheng (University of Science and Technology of China; Hefei Institutes of Physical Science, Chinese Academy of Sciences)
Chubby: Robust Smart Contract Execution Against Dependency Over-declaration
Junyu Wei* (East China Normal University), Xiaodong Qi (Nanyang Technological University), Qifeng Que (East China Normal University), Zhao Zhang (East China Normal University), Yanqin Yang (East China Normal University), Cheqing Jin (East China Normal University)
COLE+: Towards Practical Column-based Learned Storage for Blockchain Systems
Ce Zhang (Hong Kong Baptist University), Cheng Xu (Hong Kong Baptist University), Haibo Hu (Hong Kong Polytechnic University), Jianliang Xu* (Hong Kong Baptist University)
SpendableStore: A UTXO-based Decentralized Data Store
Yinan Zhou* (University of California), Faisal Nawab (University of California, Irvine)
Geco: A Confidentiality-Preserving and High-Performance Permissioned Blockchain Framework for General Smart Contracts
Songxiao Guo (The University of Hong Kong), Rongxin Guan (The University of Hong Kong), Ji Qi* (Institute of Software Chinese Academy of Sciences), Zongyuan Zhang (The University of Hong Kong), Tianyang Duan (The University of Hong Kong), Sen Wang (Huawei Technologies), Yanjun Wu (Institute of Software Chinese Academy of Sciences), Heming Cui (The University of Hong Kong)
HYDRA: Breaking the Global Ordering Barrier in Multi-BFT Consensus
Hanzheng Lyu* (University of British Columbia), Shaokang Xie (University of California, Davis), Jianyu Niu (City University of Hong Kong), Mohammad Sadoghi (University of California, Davis), Yinqian Zhang (Southern University of Science and Technology), Cong Wang (City University of Hong Kong), Ivan Beschastnikh (University of British Columbia), Chen Feng (University of British Columbia)
[R-6] LLMs for Database Optimization and Administration
Time: Tuesday, May 5, 13:30 - 15:00 Location: Av. Duluth Track: AI for Data Management Session Chair: [To Be Announced]
MVGPT: Generative Materialized View Forecasting
Yue Han (Tsinghua University), Guoliang Li* (Tsinghua University), Wenchun Xu (Alibaba Group), Xianglei Ran (Alibaba Group), ZeYa Gong (Alibaba Group), Wei Guo (Alibaba), Guang Qiu (Alibaba Group), Bo Zheng (Alibaba Group)
LLM4Hint: Leveraging Large Language Models for Hint Recommendation in Offline Query Optimization
Suchen Liu (Peking University), Yang Lin (ZTE Corporation), Yinjun Han (ZTE Corporation), Jun Gao* (Peking University)
LLMSQLMUTATOR: LLM-Powered Test Case Generation for Database Using Bug Reports
Chenglin Tian* (Beijing University of Posts and Telecommunications), Chaofan Li (Beijing University of Posts and Telecommunications), Yawen Li (Beijing University of Posts and Telecommunications), Yingxia Shao (Beijing University of Posts and Telecommunications)
LLMIA: An Out-of-the-Box Index Advisor via In-Context Learning with LLMs
Xinxin Zhao* (Renmin University of China), Xinmei Huang (Renmin University of China), Haoyang Li (Renmin University of China), Jing Zhang (Renmin University of China), Shuai Wang (ByteDance), Tieying Zhang (ByteDance), Jianjun Chen (ByteDance), Rui Shi (ByteDance), Cuiping Li (Renmin University of China), Hong Chen (Renmin University of China)
[R-7] Approximate, Vector and Pub-Sub Query Processing
Time: Tuesday, May 5, 13:30 - 15:00 Location: Rue McGill Track: Query Processing, Indexing, and Optimization Session Chair: [To Be Announced]
Semantic Publish/Subscribe over Evolving Topics
Yiming Yao* (University of Electronic Science and Technology of China), Lisi Chen (University of Electronic Science and Technology of China), Shuo Shang (University of Electronic Science and Technology of China)
UTune: Towards Uncertainty-Aware Online Index Tuning
Chenning Wu (Fudan University), Sifan Chen (Fudan University), Wentao Wu (Microsoft Research), Yinan Jing* (Fudan University), Zhengying He (Fudan University), Kai Zhang (Fudan University), X. Sean Wang (Fudan University)
A-Scan: Efficient Scale-up Analytics via Throughput-Guided Data Movement
SINDI: An Efficient Index for Sparse Vector Approximate Maximum Inner Product Search
Ruoxuan Li (ECNU), Xiaoyao Zhong (Ant Group), Jiabao Jin (Ant Group), Peng Cheng* (Tongji University), Wangze Ni (Zhejiang University), Zhitao Shen (Ant Group), Wei Jia (Ant Group), Xiangyu Wang (Ant Group), Heng Tao Shen (Tongji University), Jingkuan Song (Tongji University)
Systematic Evaluation of Plan-based Adaptive Query Processing [Experiment, Analysis, and Benchmark]
Pei Mu* (University of Edinburgh), Anderson Chaves Carniel (Huawei Technologies Research & Development (UK) Limited), Antonio Barbalace (University of Edinburgh), Amir Shaikhha (University of Edinburgh)
[R-8] Table Integration, Schema Matching and Data Markets
Time: Tuesday, May 5, 13:30 - 15:00 Location: Rue Sherbrooke Track: Information Integration and Data Quality Session Chair: [To Be Announced]
A Unified Framework for Compressed and Encrypted Text Direct Processing
Yani Liu (Renmin University of China), Feng Zhang* (Renmin University of China), Yu Zhang (Renmin University of China), Siqi Ma (University of New South Wales), Elisa Bertino (Purdue University), Xiaoyong Du (Renmin University of China)
Revisiting Single-Table Retrieval: An Open Problem Under 360° Stress Tests [Experiment, Analysis, and Benchmark]
Chenyu Yang (HKUST(GZ)), Junhao Li (HKUST(GZ)), Ziyu Jiang (HKUST(GZ)), Yuyu Luo (HKUST(GZ)), Ju Fan (Renmin University of China), Nan Tang* (HKUST(GZ))
Novel Table Search
Besat Kassaie* (University of Waterloo), Renee J. Miller (University of Waterloo)
Label-Constrained Column Annotation with Language Models and Graph Neural Networks
Duo Yang (KU Leuven), Ioannis Dasoulas (KU Leuven), Anastasia Dimou* (KU Leuven)
Information Leakage from Prices in Query-based Data Markets
Teng Tu (Zhejiang University), Huanhuan Peng (Zhejiang University), Xiaoye Miao* (Zhejiang University), Guanjie Cheng (Zhejiang University), Shuiguang Deng (Zhejiang University), Jianwei Yin (Zhejiang University)
[R-9] Edge Computing, IoT and Streaming Applications
Time: Tuesday, May 5, 13:30 - 15:00 Location: Rue Mansfield Track: Data Stream Systems and Edge Computing Session Chair: [To Be Announced]
FLASH Viterbi: Fast and Adaptive Viterbi Decoding for Modern Data Systems
Ziheng Deng (Northeastern University), Xue Liu (Northeastern University), Jiantong Jiang (The University of Western Australia), Yankai Li (Northeastern University), Qingxu Deng* (Northeastern University), Xiaochun Yang (Northeastern University)
Deferred Flushing for Out-of-Order Arrivals in Apache IoTDB
Xiaojian Zhang (Tsinghua University), Zhiheng Liu (Tsinghua University), Shaoxu Song* (Tsinghua University), Xiangdong Huang (Tsinghua University), Chen Wang (Tsinghua University), Jianmin Wang (Tsinghua University)
EC-RAG: Towards Efficient Edge-Cloud Retrieval-Augmented Generation Systems
Liang Wang* (Huazhong University of Science and Technology), Kai Wang (Huazhong University of Science and Technology), Ranjun Jia (Huazhong University of Science and Technology), Kai Lu (Huazhong University of Science and Technology), Jiguang Wan (Huazhong University of Science and Technology), Hao Huo (PingCAP), Yulong Zhai (PingCAP), Zhiyuan Liang (PingCAP), Di Wang (PingCAP)
ShareFlow: An Efficient Framework for Multi-Query Continuous Subgraph Matching
Peiqi Yuan (Southern University of Science and Technology), Zhaohang Feng (Southern University of Science and Technology), Ruiqi Xu (Beijing Institute of Technology, Zhuhai), Keming Li (University of California, Irvine), Rui Mao (Shenzhen University), Bo Tang* (Southern University of Science and Technology)
Fast and Accurate Element-Level Streaming CP Decomposition for Higher-Order Tensors
Jeongyoung Lee* (Seoul National University), SeungJoo Lee (Seoul National University), U Kang (Seoul National University)
[R-10] Memory-Efficient Storage and In-Memory Data Systems
Time: Tuesday, May 5, 13:30 - 15:00 Location: Rue Crescent Track: Modern Hardware and In-Memory Database Systems Session Chair: [To Be Announced]
Reconfiguring Scalable Hashing with Persistent CPU Caches
Zhenyu Yu (Huazhong University of Science and Technology), Bolong Zheng* (Huazhong University of Science and Technology), Ling Xu (Shuyi Technology), Qianlu Wu (Huazhong University of Science and Technology), Qiang Chen (Huazhong University of Science and Technology), Ziyang Yue (Huazhong University of Science and Technology)
SHMemora: Protective Key-Value Store on Distributed Shared Memory
Jiajun Luo* (Tsinghua University), Siyu Lin (Tsinghua University), Yunpeng Xu (Tsinghua University), Shengwei Liu (Cornell University), Jin Xia (Shenzhen Longsys Electronics Co., Ltd.), Dong Liu (Shenzhen Longsys Electronics Co., Ltd.), Zheng Liu (Alibaba Group), Huanchen Zhang (Tsinghua University), Teng Ma (Alibaba Group), Shuwen Deng (Tsinghua University)
Enabling Homomorphic Analytical Operations on Compressed Scientific Data with Multi-stage Decompression
Xuan Wu (Oregon State University), Sheng Di (Argonne National Laboratory), Tripti Agarwal (University of Utah), Kai Zhao (Florida State University), Xin Liang* (Oregon State University), Franck Cappello (Argonne National Laboratory)
Mirror Asymmetry Perfect Hashing: A Memory-Efficient and Load-Intensive-Optimized Hashing Index on Hybrid DRAM-PMem Architecture
Jingcheng Ju* (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences), Zirui Liu (Peking University), Kaicheng Yang (Peking University), Tong Yang (Peking University), Yikai Zhao (Peking University), Feng Liu (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences), Guodong Yang (Huawei), Xingchun Wang (Huawei), Duohe Ma (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences)
CSD-CoKV: Host-CSD Collaborative Offloading for High-Performance LSM-tree based KV Stores
Zhining Cao* (Shandong University), Kai Zhang (Inspur (Jinan) Data Technology Co., Ltd), Jinrun Yang (Shandong University), Hui Li (Inspur (Jinan) Data Technology Co., Ltd), Nan Su (Inspur (Jinan) Data Technology Co., Ltd), Qian Wei (Shandong University), Shikun Ma (Shandong University), Zehao Chen (Shandong University), Junbo Yin (Inspur (Jinan) Data Technology Co., Ltd), Haijun Zhang (Inspur (Jinan) Data Technology Co., Ltd), Zhaoyan Shen (Shandong University)
[R-11] Graph Indexing and Optimization
Time: Tuesday, May 5, 15:30 - 17:00 Location: Rue McGill Track: Graph Structure Analytics Session Chair: [To Be Announced]
Efficient Meta-path Constrained Reachability Query on Heterogeneous Information Networks
Chao Ni (NUAA), Zi Chen* (Wuhan University of Technology), Long Yuan (Wuhan University of Technology), Bolong Zheng (Huazhong University of Science and Technology), Lu Qin (University of Technology Sydney)
Lightweight 2-Hop Labels for Reachability Queries on Large-Scale Graphs
Yishu Wang* (Northeastern University), Jinlong Chu (Northeastern University), Ye Yuan (Beijing Institute of Technology), Yu Gu (Northeastern University), Lianpeng Qiao (Beijing Institute of Technology)
HistCore: Efficient k-Core Decomposition on GPUs with Locality-Aware Computation
Chen Zhao (Wuhan University), Guojia Wan* (Wuhan University), Ting Yu (Zhejiang Lab), Jiawei Jiang (Wuhan University), Bo Du (Wuhan University)
GoCache: Accelerating Out-of-Core Graph Queries with Pattern-Driven Caching
Zheng Yang* (University of Science and Technology of China), Yicheng Zhang (University of Science and Technology of China), Lixiao Cui (Nankai University), Luofan Chen (University of Science and Technology of China), Chongzhuo Yang (University of Science and Technology of China), Xiaojian Luo (Alibaba Group), Sijie Shen (Alibaba Group), Wenyuan Yu (Alibaba Group), Jingren Zhou (Alibaba Group), Cheng Li (University of Science and Technology of China)
C2graph: A Compression-Collaboration Algorithm for CPU-GPU Hybrid Weighted Graph Traversals
Ning Wang (Guangzhou University), Huaibei Li (Ocean University of China), Shen Su (Guangzhou University), Yu Gu (Northeastern University), Ge Yu (Northeastern University), Zhigang Wang* (Guangzhou University), Dawei Zhao (Qilu University of Technology), Hui Lu (Guangzhou University), Zhihong Tian (Guangzhou University)
[R-12] Query Optimization and Rewriting
Time: Tuesday, May 5, 15:30 - 17:00 Location: Rue Sherbrooke Track: Query Processing, Indexing, and Optimization Session Chair: [To Be Announced]
MICRO: A Lightweight Middleware for Optimizing Cross-store Cross-model Graph-Relation Joins
Xiuwen Zheng* (University of California, San Diego), Arun Kumar (University of California, San Diego), Amarnath Gupta (University of California, San Diego)
SSC-Join: an Efficient Syntactic-Semantic Collaboration based Set Semantic Similarity Join Algorithm
Lianyin Jia (Faculty of Information Engineering and Automation, Kunming University of Science & Technology), Chengchen Zeng (Kunming University of Science and Technology), Mengjuan Li (Yunnan Normal University), Suprio Ray (University of New Brunswick), Yinong Chen (Arizona State University), Jiaman Ding* (Kunming University of Science and Technology), Xiuxing Li (Beijing Institute of Technology)
From Single to Multiple Attributes: Experimental Insights on Sampling-Based Distinct Combination Estimation in GROUP-BY Queries [Experiment, Analysis, and Benchmark]
Yujie Zhang* (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Yuan Sui (Northeastern University)
A Set-Theoretic Approach to Detecting Logic Bugs in DBMS Inner Join Optimizations
Ce Lyu (East China Normal University), Changzheng Wei (Ant Group), Yanhao Wang (East China Normal University), Jie Liang (Beihang University), Li Lin (Ant Group), Hanghang Wu (Ant Group), Minghao Zhao* (East China Normal University), Ying Yan (Ant Group), Aoying Zhou (East China Normal University)
Efficient Query Rewrite Rule Discovery via Standardized Enumeration and Learning-to-Rank
Yuan Zhang (Shenzhen Institute of Computing Science, Shenzhen University), Yuxing Chen* (Tencent Inc.), Yuekun Yu (Shenzhen Institute of Computing Science, Shenzhen University), Jinbin Huang (Shenzhen Institute of Computing Science, Shenzhen University), Rui Mao (Shenzhen Institute of Computing Science, Shenzhen University), Anqun Pan (Tencent Inc.), Lixiong Zheng (Tencent Inc.), Jianbin Qin (Shenzhen Institute of Computing Science, Shenzhen University)
[R-13] Federated and Distributed Learning Systems
Time: Tuesday, May 5, 15:30 - 17:00 Location: Rue Mansfield Track: Distributed, Parallel and P2P Data Management Session Chair: [To Be Announced]
Towards the Distributed Large-scale k-NN Graph Construction by Graph Merge
TopFGL: A Topology-Aware and Distribution-Agnostic Federated Learning Framework Tackling Topological Heterogeneity on Graph Data
Junyang Wang* (University of Science and Technology of China), Lan Zhang (University of Science and Technology of China), Yihang Cheng (University of Science and Technology of China), Mu Yuan (The Chinese University of Hong Kong), Tianfu Wang (University of Science and Technology of China), Zhihui Fu (Shanghai Jiao Tong University), Jun Wang (University of Luxembourg)
AdaFedRec: Adaptive Heterogeneous Federated Recommender Systems across Multi-Device Users
Zhenkai Li* (East China Normal University), Ming Hu (Singapore Management University), Chentao Jia (East China Normal University), Yining Sun (East China Normal University), Zhufeng Lu (East China Normal University), Mingyang Yu (East China Normal University), Yanxing Yang (East China Normal University), Xiaofei Xie (Singapore Management University), Mingsong Chen (East China Normal University)
SSFusion: Tensor Fusion with Selective Sparsification for Efficient Distributed DNN Training
Zhangqiang Ming* (Huazhong University of Science and Technology), Rui Wang (Huazhong University of Science and Technology), Yuchong Hu (Huazhong University of Science and Technology), Yuanhao Shu (Innovation Research Institute of Cethik Group Co.Ltd), Wenxiang Zhou (Huazhong University of Science and Technology), Xinjue Zheng (Huazhong University of Science and Technology), Dan Feng (Huazhong University of Science and Technology)
TS3D: A Temporal Multimodal Dataset for Distributed Database System Analysis [Experiment, Analysis, and Benchmark]
Yuanyuan Yao (Zhejiang University), Yuhan Shi (Zhejiang University), Yian Wei (Zhejiang University), Lu Chen* (Zhejiang University), Mourad Khayati (University of Fribourg), Cheng Long (Nanyang Technological University), Tianyi Li (Aalborg University)
[R-14] Stream Processing Engines and Architectures
Time: Tuesday, May 5, 15:30 - 17:00 Location: Rue Crescent Track: Data Stream Systems and Edge Computing Session Chair: [To Be Announced]
Astraea: Efficient Pipelined Micro-batch Stream Processing with Non-hash Differentiated Partitioning
Sijie Wu (Huazhong University of Science and Technology), Hanhua Chen* (Huazhong University of Science and Technology), Hai Jin (Huazhong University of Science and Technology), Haoran Cai (Huawei Technologies Co., Ltd)
NebulaStream: An Adaptive and Efficient Multi-query Stream Processing Engine
Nils Schubert* (Technische Universität Berlin), Lukas Schwerdtfeger (Technische Universität Berlin), Sara Schnaterbeck (Technische Universität Berlin), Philipp Grulich (Observe Inc.), Bonaventura Del Monte (Observe Inc.), Steffen Zeuch (Technische Universität Berlin), Volker Markl (Technische Universität Berlin)
When Complex Event Recognition Meets Cloud-Native Architectures
Shizhe Liu* (Nanjing University), Haipeng Dai (Nanjing University), Meng Li (Nanjing University), Yuemeng Zhang (Nanjing University), Shaoxu Song (Tsinghua University), Zhifeng Bao (The University of Queensland), Hancheng Wang (Nanjing University), Xiaofeng Gao (Shanghai Jiao Tong University), Guihai Chen (Nanjing University)
Process Faster, Pay Less: Functional Isolation for Stream Processing
Eleni Zapridou* (EPFL), Michael Koepf (TU Wien), Panagiotis Sioulas (Oracle), Ioannis Mytilinis (Oracle), Anastasia Ailamaki (EPFL)
Low-Latency Stateful Stream Processing through Timely and Accurate Prefetching
Eleni Zapridou* (EPFL), Anastasia Ailamaki (EPFL)
[R-15] NL2SQL: Methods and Architectures
Time: Wednesday, May 6, 10:00 - 12:00 Location: Av. Duluth Track: AI for Data Management Session Chair: [To Be Announced]
OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
Zhuoyue WAN (The Hong Kong Polytechnic University), Wentao Hu (The Hong Kong Polytechnic University), Chen Jason Zhang (The Hong Kong Polytechnic University), Yuanfeng Song* (ByteDance), Shuaimin Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Ruiqiang Xiao (The Hong Kong University of Science and Technology), Xiao-Yong Wei (The Hong Kong Polytechnic University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology)
Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation
Zheng Yuan* (The Hong Kong Polytechnic University (PolyU)), Hao Chen (City University of Macau), Zijin Hong (The Hong Kong Polytechnic University), Qinggang Zhang (The Hong Kong Polytechnic University), Feiran Huang (Jinan University), Qing Li (The Hong Kong Polytechnic University), Xiao Huang (The Hong Kong Polytechnic University)
CYANSQL: Unlock the Power of NL2SQL via Clustering-based Test-Time Scaling
Haoyu Qin (Fudan University), Tonghui Ren (Tencent Cloud), Zhenying He* (Fudan University), X. Sean Wang (Fudan University), Jiashu Xing (Tencent Cloud), Yanghuan Ye (Tencent Cloud), Shifei Huang (Tencent Cloud), Jinbao Li (Qilu University of Technology)
Text2VectorSQL: Towards a Unified Interface for Vector Search and SQL Queries
Zhengren Wang (Peking University), Dongwen Yao (Shanghai Jiao Tong University), Bozhou Li (Peking University), Dongsheng Ma (Peking University), Bo Li (Peking University), Zhiyu Li (Institute for Advanced Algorithms Research, Shanghai), Feiyu Xiong (Institute for Advanced Algorithms Research, Shanghai), Bin Cui (Peking University), Linpeng Tang (OriginHub Technology), Wentao Zhang* (Peking University)
Boosting Small Language Models for Text-to-SQL with Fine-Grained Execution Feedback and Cost-Efficient Rewards
Thanh Dat Hoang (Griffith University), Thanh Trung Huynh (VinUniversity), Matthias Weidlich (Humboldt University of Berlin), Thanh Tam Nguyen (Griffith University), Tong Chen (The University of Queensland), Hongzhi Yin (The University of Queensland), Quoc Viet Hung Nguyen* (Griffith University)
LEAF-SQL: Level-wise Exploration with Adaptive Fine-graining for Text-to-SQL Skeleton Prediction
Zhao Tan* (Jiangxi University of Finance and Economics), Xiping Liu (Jiangxi University of Finance and Economics), Qing Shu (Jiangxi University of Finance and Economics), Qizhi Wan (Jiangxi University of Finance and Economics), Dexi Liu (Jiangxi University of Finance and Economics), Changxuan Wan (Jiangxi University of Finance and Economics)
Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL
Qifeng Cai* (Peking University), Hao Liang (Peking University), Chang Xu (Peking University), Tao Xie (Peking University), Wentao Zhang (Peking University), Bin Cui (Peking University)
[R-16] Spatial Queries, Road Networks and Indexing
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue Crescent Track: Spatial Databases and Temporal Databases Session Chair: [To Be Announced]
SOLAR: Scalable Distributed Spatial Joins through Learning-based Optimization
Yongyi Liu* (University of California, Riverside), Ahmed Abdelmaguid (University of California, Riverside), Ahmed Mohamood (Google LLC), Amr Magdy (University of California, Riverside), Minyao Zhu (Google LLC)
PC-PS: A Multi-Dimensional Point-Cloud Data Publish/Subscribe System
PLAN: Fast and Approximate Gaussian Kernel Density Visualization in Road Networks
Tsz Nam Chan* (Shenzhen University), Hongwei Ye (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Leong Hou U (University of Macau), Dingming Wu (Shenzhen University), Ruisheng Wang (Shenzhen University), Joshua Zhexue Huang (Shenzhen University)
A Robust and Globally Accurate Hierarchical Hub Labeling Index for SP-Distance Queries in Dynamic Road Networks
Wei Liu (Yantai University), Ziqiang Yu* (Yantai University), Xiaohui Yu (York University), Yang Liu (Wilfrid Laurier University), Simu Liu (Yantai University)
iKSP: A Path Enumeration Index in Road Networks
Zihan Luo* (The Hong Kong University of Science and Technology), Lei Li (The Hong Kong University of Science and Technology), Mengxuan Zhang (The Australian National University), Xinjie Zhou (The Hong Kong University of Science and Technology), Zizhuo Xu (The Hong Kong University of Science and Technology), Xiaofang Zhou (The Hong Kong University of Science and Technology)
Robust Spatial-Temporal Similar Trajectory Search via Structure-Enhanced Domain-Invariant Learning
Xiaolin Han* (Northwestern Polytechnical University), Yonghao Zhou (Northwestern Polytechnical University), Chenhao Ma (Chinese University of Hong Kong, Shenzhen), Lingyun Song (Northwestern Polytechnical University), Xinbiao Gan (National University of Defense Technology), Xuequn Shang (Northwestern Polytechnical University)
SOLAR: Efficient Spatial Queries on Real-time LSM-based Storage
[R-17] Learned Models for Query Optimization and Cost Estimation
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue McGill Track: AI-based DB Tuning, Benchmarks and Performances Session Chair: [To Be Announced]
TemplateQO: Template-aware and Scalable Query Optimization with Data-efficient Learning
Pengfei Zheng (Huazhong University of Science and Technology), Guoneng Li (Huazhong University of Science and Technology), Ling Xu (Huazhong University of Science and Technology), Rong Zhu (Alibaba), Yan Li (Wuhan University of Technology), Bolong Zheng* (Huazhong University of Science and Technology)
CoLSE: A Lightweight and Robust Hybrid Learned Model for Single-Table Cardinality Estimation using Joint CDF
Lankadinee Rathuwadu* (University of Melbourne), Christopher Leckie (University of Melbourne), Guanli Liu (University of Melbourne), Renata Borovica-Gajic (University of Melbourne)
LAMP: A Dual-Mode Framework for Database Workload Memory Prediction
Guoze Xue (Zhejiang University), Lu Chen* (Zhejiang University), Ziquan Fang (Zhejiang University), Tianyi Li (Aalborg University), Yushuai Li (Aalborg University), Torben Bach Pedersen (Aalborg University)
Robust Index Benefit Estimation via Hierarchical and Two-dimensional Feature Representation
Tao Li* (State Cloud, China Telecom), Feng Liang (Shenzhen MSU-BIT University), Jinqi Quan (State Cloud, China Telecom), Zihang Yang (State Cloud, China Telecom), Teng Wang (State Cloud, China Telecom), Runhuai Huang (State Cloud, China Telecom), Xiping Hu (Shenzhen MSU-BIT University), Meng Li (Nanjing University), Haipeng Dai (Nanjing University)
Telescope: A Learned What-If Call for Column Store Selection in HTAP Databases
Yidong Zhang (Renmin University of China), Chao Zhang* (Renmin University of China), Zhengkun Wu (Renmin University of China), Ju Fan (Renmin University of China), Xinyi Zhang (Renmin University of China), Hong Chen (Renmin University of China), Yuxing Chen (Tencent Inc.), Anqun Pan (Tencent Inc.)
SkyNet: Solving Skyline Queries with Neural Networks
Lequa: A Learning-Based Query-Aware Framework for Selective Query Optimization
Guoneng Li (Huazhong University of Science and Technology), Pengfei Zheng (Huazhong University of Science and Technology), Ling Xu (Shuyi Tech.), Yan Li (Wuhan University of Technology), Bolong Zheng* (Huazhong University of Science and Technology)
[R-18] Log Analytics, Anomaly Detection and Tensor Methods
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue Sherbrooke Track: Data Mining and Knowledge Discovery Session Chair: [To Be Announced]
AutoHFormer: Efficient Hierarchical Autoregressive Transformer for Time Series Prediction
Qianru Zhang* (The University of Hong Kong), HongGang Wen (The University of Hong Kong), Ming Li (Zhejiang Normal University), Dong Huang (National University of Singapore), Siu-Ming Yiu (The University of Hong Kong), Christian S. Jensen (Aalborg University), Pietro Liò (Cambridge University)
Efficient Zero-shot and Label-free Log Anomaly Detection for Resource-constrained Systems
Zuohan Wu* (The Hong Kong University of Science and Technology (Guangzhou)), Jiachuan Wang (The Hong Kong University of Science and Technology), Libin Zheng (Sun Yat-sen University), Yongqi Zhang (The Hong Kong University of Science and Technology (Guangzhou)), Shuangyin Li (South China Normal University), Lei Chen (The Hong Kong University of Science and Technology and The Hong Kong University of Science and Technology (Guangzhou))
An Encode-then-Decompose Approach to Unsupervised Time Series Anomaly Detection on Contaminated Training Data
Buang Zhang* (ECNU), Tung Kieu (Aalborg University), Xiangfei Qiu (East China Normal University), Chenjuan Guo (East China Normal University), Jilin Hu (East China Normal University), Aoying Zhou (East China Normal University), Christian S. Jensen (Aalborg University), Bin Yang (East China Normal University)
Krone: Hierarchical and Modular Log Anomaly Detection
Lei Ma* (WPI), Jinyang Liu (Bytedance), Tieying Zhang (Bytedance), Peter VanNostrand (WPI), Dennis Hofmann (WPI), Lei Cao (Arizona University), Elke Rundensteiner (WPI), Jianjun Chen (Bytedance)
SLGParser: Practical and Efficient Label-Free Log Parsing Using Large Language Models
Yibing Hu* (Institute of Information Engineering, CAS), Cong Wang (Institute of Information Engineering, CAS), Lixin Zhao (Institute of Information Engineering, CAS), Aimin Yu (Institute of Information Engineering, CAS)
Toward scalable Tucker decomposition: skew-aware multi-level partitioning with GPU–storage co-processing
Seung Hyeon Song (KOREATECH), Jihye Lee (ETRI), Chanki Kim (Jeonbuk National University), Kang-Wook Chon* (KOREATECH)
QPAD: Quantile-Preserving Approximate Dimension Reduction for Nearest Neighbors Preservation in High-Dimensional Vector Search
Jiuzhou Fu* (University of Washington), Dongfang Zhao (University of Washington)
[R-19] Graph Neural Networks, Knowledge Graph Learning and Reasoning
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue Mansfield Track: Graph Learning and Mining Session Chair: [To Be Announced]
On Graph Rewiring with Motifs: a Find-and-Replace Approach
Qihao Wang* (University of Illinois at Urbana Champaign), Hongtai Cao (University of Illinois at Urbana Champaign), Xiaodong Li (The University of Hong Kong), Matin Najafi (The University of Hong Kong), Kevin Chen-Chuan Chang (University of Illinois at Urbana Champaign), Reynold Cheng (The University of Hong Kong)
SNI-GNN: SmartNIC-Assisted Full-Graph GNN Training with In-Network Embedding Prediction
Guofan Yu* (Hong Kong Baptist University), Sitian Chen (Hong Kong Baptist University), Zhenheng Tang (Hong Kong University of Science and Technology (Guangzhou)), Xiaowen Chu (Hong Kong University of Science and Technology (Guangzhou)), Amelie Chi Zhou (Hong Kong Baptist University)
DIFFCOM: Conditional Discrete Diffusion Model for Community Search
Ling Li* (Shanxi University), Liang Bai (Shanxi University), Siqiang Luo (Nanyang Technological University), Yejiang Wang (Northeastern University), Yuhai Zhao (Northeastern University)
Incremental GNN Embedding Computation on Streaming Graphs
Qiange Wang* (National University of Singapore), Haoran Lv (Northeastern University), Yanfeng Zhang (Northeastern University), Weng-Fai Wong (National University of Singapore), Bingsheng He (National University of Singapore)
Tao Yu* (Fudan University), Wen Deng (Fudan University), Weiguo Zheng (Fudan University), Jeffrey Xu Yu (The Hong Kong University of Science and Technology (Guangzhou))
FlashEKGR: Fast Embedding-Based Knowledge Graph Reasoning Models Training
Wentai Zhang (Beijing University of Posts and Telecommunications), Teng Xu (Beijing University of Posts and Telecommunications), Weiguang Wang (Beijing University of Posts and Telecommunications), Junxing Li (Beijing University of Posts and Telecommunications), Jun Zhang (Beijing University of Posts and Telecommunications), Yifan Zhu (Beijing University of Posts and Telecommunications), Haihong E* (Beijing University of Posts and Telecommunications)
OMNIA: Closing the Loop by Leveraging LLMs for Knowledge Graphs Completion
Frédéric IENG (Université Paris Cité), Massinissa Hammaz (Université Paris Cité), Soror Sahri (Université Paris Cité), Mourad OUZZANI* (Qatar Computing Research Institute, HBKU), Salima Benbernou (Université Paris Cité), Hanieh Khorashadizadeh (Universität zu Lübeck), Sven Groppe (Universität zu Lübeck), Farah Benamara (IRIT)
[R-20] Dense Subgraph and Core Decomposition
Time: Wednesday, May 6, 15:30 - 17:00 Location: Rue McGill Track: Graph Structure Analytics Session Chair: [To Be Announced]
Querying Historical k-Dense Subgraphs on Temporal Graphs
Qi Zhang (University of Science and Technology Beijing), Yalong Zhang (Beijing Institute of Technology), Rong-Hua Li* (Beijing Institute of Technology), Xu-Cheng Yin (University of Science and Technology Beijing), Guoren Wang (Beijing Institute of Technology)
Density Decomposition of Multilayer Graphs
Jiaqi Jiang (Beijing Institute of Technology), Rong-Hua Li* (Beijing Institute of Technology), Yalong Zhang (Beijing Institute of Technology)
SQAC: Scalable Querying of Attribute-Constrained (α, β)-Cores over Large Bipartite Graphs
Xin Deng (Hunan University), Peng Peng (Hunan University), Baoqing Sun (Hunan University), Shuo Dai (Hunan University), Zheng Qin* (Hunan University), Lijun Chang (The University of Sydney)
Listing Minimal Cores in Large Real-World Graphs
Yukai Sun (Harbin Institute of Technology, Shenzhen), Kaiqiang Yu (Nanjing University), Shengxin Liu* (Harbin Institute of Technology, Shenzhen), Cheng Long (Nanyang Technological University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Xun Zhou (Harbin Institute of Technology, Shenzhen), Min Zhang (Harbin Institute of Technology, Shenzhen)
BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs [Experiment, Analysis, and Benchmark]
Xiangju Zhu* (The University of Hong Kong), Mohammad Matin Najafi (Huawei Hong Kong Research Center), Chrysanthi Kosyfaki (The Hong Kong University of Science and Technology), Xiaodong Li (Xiamen University), Reynold Cheng (The University of Hong Kong), Laks Lakshmanan (University of British Columbia)
[R-21] LLM-Assisted and AI-Augmented Query Processing
Time: Wednesday, May 6, 15:30 - 17:00 Location: Rue Sherbrooke Track: Query Processing, Indexing, and Optimization Session Chair: [To Be Announced]
Query-Driven Data Exploration with Heterogeneous Treatment Effects
Antonis Mandamadiotis* (Athena Research Center), Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes), Georgia Koutrika (Athena Research Center)
BOND: A Co-Designed Framework for LLM-Powered Analytics Over Relational Data
Lixiang Chen* (East China Normal University), Qin Zheng (East China Normal University), Zhicheng Pan (East China Normal University), Chengcheng Yang (East China Normal University), Rong Zhang (East China Normal University), Xuan Zhou (East China Normal University)
Batcher: Learning to Construct Cost-Efficient Batches of Small Queries in Big Data Processing Platforms
Yeonsu Park (Kangwon National University), Taesung Lee (POSTECH), Byungchul Tak (Kyungpook National University), Wook-Shin Han* (POSTECH)
CactusDB: Unlock Co-Optimization Opportunities for SQL Queries and AI/ML Model Inferences
Lixi Zhou (Arizona State University), Kanchan Chowdhury (Arizona State University), Lulu Xie (Arizona State University), Jaykumar Tandel (Arizona State University), Hong Guan (Arizona State University), Zhiwei Fan (Meta), Xinwei Fu (Amazon), Jia Zou* (Arizona State University)
APEX: Adaptive Variable-wise Parallel Execution for Worst-Case Optimal Joins on Graph Queries
Yipeng Liu (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Yuming Lin* (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Zhicheng Pan (East China Normal University), Chengcheng Yang (East China Normal University), You Li (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology), Aoying Zhou (East China Normal University)
[R-22] Time Series Cleaning and Imputation
Time: Wednesday, May 6, 15:30 - 17:00 Location: Rue Mansfield Track: Information Integration and Data Quality Session Chair: [To Be Announced]
MINOR: Multivariate Time Series Iterative Cleaning Algorithm
Aoqian Zhang* (Beijing Institute of Technology), Yinru Sun (Beijing Institute of Technology), Pengxiang Hao (Beijing Institute of Technology), Yifeng Gong (Beijing Institute of Technology), Boyang Li (Beijing Institute of Technology), Jing Geng (Beijing Institute of Technology), Zheng Wang (Shanghai Jiao Tong University), Lianpeng Qiao (Beijing Institute of Technology)
RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms
Mohamed Ahmed Abdelmaksoud Mohamed* (TU Berlin & BIFOLD), Sheng Ding (University of Stuttgart), Andrey Morozov (University of Stuttgart), Ziawasch Abedjan (TU Berlin & BIFOLD)
Time-Frequency Conditioned Diffusion for Multivariate Time Series Imputation
Yumeng Liu (Shenzhen Technology University), Zheng Wang (Shenzhen Technology University), Jikui Liu (Shenzhen Polytechnic University), Kaisa Zhang (Beijing University of Posts and Telecommunications), Weidong Gao (Beijing University of Posts and Telecommunications), Xiaomao Fan* (Shenzhen Technology University)
EDITOR: Multi-Resolution Cleaning of Multivariate Time Series via Detect-Localize-Repair
Chenyang Li* (Renmin University of China), Chaohong Ma (Hebei Normal University), Xiaohui Yu (York University), Cailong Li (Northeastern University), Xiaofeng Meng (Renmin University of China)
Improving Data Imputation through a Tuned Strategy for Dependency Discovery
Bernardo Breve (University of Naples Federico II), Loredana Caruccio (University of Salerno), Tullio Pizzuti* (University of Salerno), Giuseppe Polese (University of Salerno)
[R-23] Differential Privacy and Local Privacy Mechanisms
Time: Wednesday, May 6, 15:30 - 17:00 Location: Rue Crescent Track: Database Security and Privacy Session Chair: [To Be Announced]
Fine-grained Manipulation Attacks to Local Differential Privacy Protocols for Range Query
ABC: Numerical Data Collection under Local Differential Privacy without Prior Knowledge
Incheol Baek* (Korea University), Hyungbin Kim (Korea University), Yon Dohn Chung (Korea University)
Answering Federated Range Queries with Local Differential Privacy
Yuemin Zhang (The Hong Kong Polytechnic University), Qingqing Ye* (The Hong Kong Polytechnic University), Junxu Liu (The Hong Kong Polytechnic University), Wei Dong (Nanyang Technological University)
Robust Single-message Shuffle Differential Privacy Protocol for Accurate Distribution Estimation
Xiaoguang Li* (Xidian University), Hanyi Wang (China Mobile (Suzhou) Software Technology Co., Ltd), Yaowei Huang (Guangzhou University), Jungang Yang (Shanghai University), Qingqing Ye (The Hong Kong Polytechnic University), Haonan Yan (Xidian University), Ke Pan (Xidian University), Zhe Sun (Guangzhou University), Hui Li (Xidian University)
Revisiting Locally Differentially Private Protocols: Towards Better Trade-offs in Privacy, Utility, and Attack Resistance [Experiment, Analysis, and Benchmark]
Héber H. Arcolezi* (Inria), Sébastien Gambs (UQAM)
[R-24] Vector Databases, Embeddings and ML Data Infrastructure
Time: Thursday, May 7, 10:00 - 12:00 Location: Av. Duluth Track: Data Management for AI Session Chair: [To Be Announced]
Exqutor: Extended Query Optimizer for Vector-augmented Analytical Queries
Hyunjoon Kim (Yonsei University), Chaerim Lim (Yonsei University), Hyeonjun An (Yonsei University), Rathijit Sen (Microsoft), Kwanghyun Park* (Yonsei University)
Federated Retrieval over Embedding-Heterogeneous Vector Databases
Yuxiang Wang (Beihang University), Yongxin Tong* (Beihang University), Zimu Zhou (City University of Hong Kong), Ziyuan He (Beihang University), Ruixi Hu (Beihang University), Ke Xu (Beihang University)
Trading Vector Data in Vector Databases
Jin Cheng* (The Chinese University of Hong Kong, Shenzhen), Xiangxiang Dai (The Chinese University of Hong Kong), Ningning Ding (Hong Kong University of Science and Technology (Guangzhou)), John C.S. Lui (The Chinese University of Hong Kong), Jianwei Huang (The Chinese University of Hong Kong, Shenzhen)
MojoFrame: Dataframe Library in Mojo Language
Shengya Huang* (University of Illinois at Urbana-Champaign), Zhaoheng Li* (University of Illinois at Urbana-Champaign), Derek Werner (University of Illinois at Urbana-Champaign), Yongjoo Park (University of Illinois at Urbana-Champaign)
Approximate Diverse k-nearest Neighbor Search in Vector Database
Jiachen ZHAO* (Chinese University of Hong Kong), Xiao Yan (Wuhan University), Eric Lo (Chinese University of Hong Kong)
SQLVec: SQL-Based Vector Similarity Search
Zequn Zhang* (School of Cyber Science and Engineering, Wuhan University), Yuanyuan Zhu (School of Computer Science, Wuhan University), Hao Zhang (The Chinese University of Hong Kong), Jeffrey Xu Yu (Hong Kong University of Science and Technology (Guangzhou))
MISFEAT: Feature Selection for Subgroups with Mutual Information Estimation
Bar Genossar* (Technion -- Israel Institute of Technology), Thinh On (New Jersey Institute of Technology), Md Mouinul Islam (PayPal), Ben Eliav (Technion -- Israel Institute of Technology), Senjuti Basu Roy (New Jersey Institute of Technology), Avigdor Gal (Technion -- Israel Institute of Technology)
[R-25] Community Search on Diverse Graph Types
Time: Thursday, May 7, 10:00 - 12:00 Location: Rue McGill Track: Graph Structure Analytics Session Chair: [To Be Announced]
MOCHI: Motif-based Community Search over Large Heterogeneous Information Networks
Yuhan Zhou (Zhejiang University), Qing Liu (Zhejiang University), Xin Huang (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University), Yunjun Gao* (Zhejiang University)
More Than Pivot for Maximal Clique Enumeration
Zhaoyi Zhong (Swinburne University of Technology), Rui Zhou* (Swinburne University of Technology), Lu Chen (Swinburne University of Technology), Xiaofan Li (Nanyang Technological University), Chengfei Liu (Swinburne University of Technology)
Beyond Homophily: Community Search on Heterophilic Graphs
Qing Sima (University of New South Wales), Xiaoyang Wang* (University of New South Wales), Wenjie Zhang (University of New South Wales)
Efficient Community Search on Attributed Public-Private Graphs
Yuqi Chen* (Hong Kong Baptist University), Weihan Zhang (Sun Yat-sen University), Xin Huang (Hong Kong Baptist University)
Maximum Balanced Clique Search on Large Directed Graphs
Jianhua Wang* (Inner Mongolia University), Jianye Yang (Guangzhou University), Zhaoquan Gu (Harbin Institute of Technology (Shenzhen)), Dian Ouyang (Guangzhou University), Ziyi Ma (Hebei University of Technology), Ying Zhang (Zhejiang Gongshang University)
Prompt-Guided Community Search under Extreme Few-Shot Supervision
Wenxin Yang (Beijing Institute of Technology), Kaiyu Feng* (Beijing Institute of Technology), Lanting Fang (Beijing Institute of Technology), Kangfei Zhao (Beijing Institute of Technology), Xia Wu (Beijing Institute of Technology)
Efficient Size Constraint Community Search over Heterogeneous Information Networks
Xinjian Zhang (Swinburne University of Technology), Chengfei Liu* (Swinburne University of Technology), Lu Chen (Swinburne University of Technology), Rui Zhou (Swinburne University of Technology), Bo Ning (Dalian Maritime University)
[R-26] Tabular Data, Community Search and Knowledge Discovery
Time: Thursday, May 7, 10:00 - 12:00 Location: Rue Sherbrooke Track: Data Mining and Knowledge Discovery Session Chair: [To Be Announced]
Fast Discovery of Functional Dependencies via Bayesian Network Learning
Siyi Yang (National University of Defense Technology), Shenglin Chen (National University of Defense Technology), Xi Wang (National University of Defense Technology), Yuhua Tang (National University of Defense Technology), Ruochun Jin* (National University of Defense Technology)
VisPoison: An Effective Backdoor Attack Framework for Tabular Data Visualization Models
Shuaimin Li (The Hong Kong Polytechnic University), Chen Jason Zhang (The Hong Kong Polytechnic University), Xuanang Chen (Institute of Software, Chinese Academy of Sciences), Anni Peng (PetroChina Digital Intelligence Research Institute Co., Ltd.), Zhuoyue Wan (The Hong Kong Polytechnic University), Yuanfeng Song* (ByteDance), Shiwen Ni (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Min Yang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Fei Hao (The Hong Kong Polytechnic University), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology)
Keyword-Aware Skyline Community Search on Semantics and Structure
Chuanhou Sun* (Northeastern University), Yuhai Zhao (Northeastern University), Ling Li (Shanxi University), Yuan Li (North China University of Technology)
TabLoft: Tabular Data Generation Based on LLM with Ordered Features
Luyu Chen* (Fudan University), Changhao Wu (Fudan University), Jingyi Li (Fudan University), Sen Liu (Fudan University), Guangnan Ye (Fudan University), Hongfeng Chai (Fudan University)
Beyond Traditional Diagnostics: Transforming Patient-Side Information into Predictive Insights with Knowledge Graphs and Prototypes
C²TC: A Training-Free Framework for Efficient Tabular Data Condensation
Sijia Xu (University of New South Wales), Fan Li (University of New South Wales), Xiaoyang Wang* (University of New South Wales), Zhengyi Yang (University of New South Wales), Xuemin Lin (Shanghai Jiao Tong University)
Beyond Imputation: A Semantic Unification Framework for Data and Its Missingness in Multimodal Healthcare Analytics
[R-27] Sketches, Approximate Queries and Data Streams
Time: Thursday, May 7, 10:00 - 12:00 Location: Rue Mansfield Track: Uncertain Databases, Graphs and Streaming Session Chair: [To Be Announced]
GeminiSketch: An Accurate and Efficient Sketch for Summarizing Temporal Graph Streams with Rolling-out Elimination
Xuyang Jing (Xidian University), Chenhao Zhang (Xidian University), Zheng Yan* (Xidian University), Qingze Jiang (Xidian University), Witold Pedrycz (University of Alberta), Mingjun Wang (Xidian University), Cong Wang (Northwestern Polytechnical University)
Approximate Butterfly Counting in Sublinear Time
Chi Luo (Shanghai Jiao Tong University), Jiaxin Song (University of Illinois Urbana-Champaign), Yuhao Zhang (Shanghai Jiao Tong University), Kai Wang* (Shanghai Jiao Tong University), Zhixing He (Shanghai Jiao Tong University), Kuan Yang (Shanghai Jiao Tong University)
Evolving Sketch: Time-Decaying Frequency Estimation for Evolving Streams
Ge Gao* (Soochow University), Yang Du (Soochow University), He Huang (Soochow University), Yu-E Sun (Soochow University), Jianzhi Tang (Soochow University)
Spatiotemporal Sketch Disaggregation: Streaming Analytics with Heterogeneous Resources
Jonatan Langlet* (KTH Royal Institute of Technology), Peiqing Chen (University of Maryland, College Park), Michael Mitzenmacher (Harvard University), Zaoxing Liu (University of Maryland, College Park), Ran Ben Basat (University College London), Gianni Antichi (Politecnico di Milano)
ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data
Aryak Sen (Univ. Grenoble Alpes), Silviu Maniu (Univ. Grenoble Alpes), Pierre Senellart* (ENS, PSL University)
AlignSketch: A Framework for Aligning Theoretical and Practical Estimation Errors
Ce Zheng* (School of Cyber Science and Technology, Beihang University), Hanyue Zheng (School of Computer Science, Peking University), Jingwei Shi (School of Information Management & Engineering, Shanghai University of Finance and Economics), Xinye Xu (School of Computer Science, Peking University), Wei Zhou (Viterbi School of Engineering, University of Southern California), Tong Yang (School of Computer Science, Peking University), Zhenyu Guan (School of Cyber Science and Technology, Beihang University), Yong Cui (Department of Computer Science and Technology, Tsinghua University)
Query-Guided Analysis and Mitigation of Data Verification Errors
Ran Schreiber* (Bar-Ilan University), Yael Amsterdamer (Bar-Ilan University)
[R-28] Transaction Management, Distributed Storage and Serverless Systems
Time: Thursday, May 7, 10:00 - 12:00 Location: Rue Crescent Track: Cloud Data Management Session Chair: [To Be Announced]
Contemp: Instance Caching Based on Container Temperature in Serverless Environment
Pengwei Wang* (Donghua University), Nuo Chen (Donghua University), Haoquan Qi (Donghua University), Yichen Zhong (Donghua University), Shun Song (Ant Group)
MTC: Scalable Transaction Commit for Multi-Master Cloud Databases
Kecheng Luo* (East China Normal University), Xiaoxian Wei (East China Normal University), Wenxin Liu (East China Normal University), Peng Cai (East China Normal University), Aoying Zhou (East China Normal University), Hui Li (Guizhou University), Le Cai (ByteDance Inc)
Efficient Cloud-edge Collaborative Approaches to SPARQL Queries over Large RDF graphs
Shidan Ma* (Hunan University), Peng Peng (Hunan University), Xu Zhou (Hunan University), M. Tamer Özsu (University of Waterloo), Lei Zou (Peking University), Guo Chen (Hunan University)
ImmortalChopper: Real-Time and Resilient Distributed Transactions in the Edge-Cloud
Juncheng Fang* (University of California, Irvine), Farzad Habibi (University of California, Irvine), Binbin Gu (University of California, Irvine), Faisal Nawab (University of California, Irvine)
REMON: Remote External Memory Over the Network
Shiquan Zhang* (University of Toronto), Michail Bachras (University of Toronto), Yuqiu Zhang (University of Toronto), Yunhao Mao (University of Toronto), Hans-Arno Jacobsen (University of Toronto)
PAT: Towards Transaction Routing with Page Affinity in Shared-Cache Databases
Shijie Gao* (Renmin University of China), Feng Zhang (Renmin University of China), Qian Xu (Renmin University of China), Yang Li (Lenovo Research), XueFeng Liu (Lenovo Research), Chao Jiang (Lenovo Research), Limin Xiao (Lenovo Research), Siqi Ma (University of New South Wales), Elisa Bertino (Purdue University), Xiaoyong Du (Renmin University of China)
Improving GPU Tensor Query Processing for Resource-Constrained Environments
Qian Xu* (Renmin University of China), Feng Zhang (Renmin University of China), Shijie Gao (Renmin University of China), Kun Chen (Individual Researcher), Jianhua Wang (China Electronics Technology Kingbase (Beijing) Technologies Inc), Zheng Chen (Tsinghua University), Xiaoyong Du (Renmin University of China)
F5: A Robust SIMD-Accelerated MSD Radix Sort
Arif Arman (Texas A&M University), Dmitri Loguinov* (Texas A&M University)
GLIDE: GPU-Accelerated ANN Graph Index Construction via Data Locality
Fuhao Ruan (Huazhong University of Science and Technology), Ziyang Yue (Huazhong University of Science and Technology), Ling Xu (Shuyi Tech.), Dawei Liu (Huazhong University of Science and Technology), Bolong Zheng* (Huazhong University of Science and Technology)
[R-30] NL2SQL: Systems, Evaluation and Benchmarking
Time: Thursday, May 7, 13:30 - 15:00 Location: Rue McGill Track: AI for Data Management Session Chair: [To Be Announced]
Hexgen-Flow: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL
You Peng (Hong Kong University of Science and Technology (HKUST)), Youhe Jiang (Hong Kong University of Science and Technology (HKUST)), Wenqi Jiang (ETH Zurich), Chen Wang (Tsinghua University), Binhang Yuan* (Hong Kong University of Science and Technology (HKUST))
SQLMorph: Query Mutation and Fine-Grained Metrics for Text-to-SQL Evaluation [Experiment, Analysis, and Benchmark]
An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data
Khanh Trinh Pham (Griffith University), Thanh Tam Nguyen (Griffith University), Viet Huynh (Edith Cowan University), Hongzhi Yin (The University of Queensland), Quoc Viet Hung Nguyen* (Griffith University)
Elena: An Explainability-aided Online Query Optimization Framework
Yuan Dong (Zhejiang University), Yuanyuan Yao (Zhejiang University), Yangyang Wu* (Zhejiang University), Lu Chen (Zhejiang University), Rong Zhu (Alibaba Group)
MM2SQL: A Benchmark and Method for Visually-Grounded SQL Generation
Shengze Shi* (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Tao Ren (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Li Qi (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Tingrui Yang (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Wei Xiong (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences), Jun Hu (Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences)
[R-31] Graph Pattern Matching, Hypergraphs and Subgraph Queries
Time: Thursday, May 7, 13:30 - 15:00 Location: Rue Sherbrooke Track: Graph Structure Analytics Session Chair: [To Be Announced]
L4G: Two-hop Label Management for Group Steiner Tree Search on Graphs
Xiaoyao Feng (Renmin University of China), Yahui Sun* (Renmin University of China), Zhuoran Wang (Renmin University of China), Junlin Li (Renmin University of China), Sijia Luo (Renmin University of China), Rong-Hua Li (Beijing Institute of Technology)
Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint
Siwoo Song (Seoul National University), Wonseok Shin (Standigm Inc), Kunsoo Park* (Seoul National University), Giuseppe Italiano (LUISS University), Zhengyi Yang (University of New South Wales), Wenjie Zhang (University of New South Wales)
HL-index: Fast Reachability Query in Hypergraphs
Peiting Xie* (The University of New South Wales), Xiangjun Zai (University of New South Wales), Yanping Wu (University of Technology Sydney), Xiaoyang Wang (The University of New South Wales), Wenjie Zhang (The University of New South Wales), Lu Qin (University of Technology Sydney)
Subtree Mode and Applications
Jialong Zhou (King's College London), Ben Bals (CWI), Matei Tinca (Vrije Universiteit), Ai Guan (King's College London), Panagiotis Charalampopoulos (King's College London), Grigorios Loukides* (King's College London), Solon Pissis (CWI)
Efficient Graph Matching with Pattern Reduction
Pingpeng Yuan (Huazhong University of Science & Technology), Yujiang Wang (Huazhong University of Science & Technology), Jiangji Peng (Huazhong University of Science & Technology), Tianyu Ma (Huazhong University of Science & Technology), Siyuan He (Huazhong University of Science & Technology), Ling Liu* (Georgia Institute of Technology)
[R-32] Graph-Based RAG, Queries and Entity Tasks
Time: Thursday, May 7, 13:30 - 15:00 Location: Rue Mansfield Track: Graph Queries, Entity Alignment and Learning Session Chair: [To Be Announced]
PROGQL: A Provenance Graph Query System for Cyber Attack Investigation
Fei Shao* (Case Western Reserve University), Jia Zou (Arizona State University), Zhichao Cao (Arizona State University), Xusheng Xiao (Arizona State University)
Effective Fairest Community Search over Heterogeneous Information Networks
Taige Zhao (Deakin University), Jianxin Li* (Edith Cowan University), Man Li (Victoria University), Wei Luo (Deakin University), Jingxian Cheng (Chang'an University), Yuan Miao (Victoria University), Hua Wang (Victoria University)
AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMs
Yubo Wang* (HKUST), Haoyang Li (The Hong Kong Polytechnic University), Fei Teng (HKUST), Lei Chen (HKUST & HKUST(GZ))
Clue-RAG: Towards Accurate and Cost-Efficient Graph-based RAG via Multi-Partite Graph-based Index
Yaodong Su (CUHKSZ), Yixiang Fang* (CUHKSZ), Yingli Zhou (CUHKSZ), Chuanhui Yang (OceanBase, Ant Group)
ZTab: Domain-based Zero-shot Annotation for Table Columns
Ehsan Hoseinzade* (Simon Fraser University), Ke Wang (Simon Fraser University)
[R-33] Explainability, Fairness and Trust in Data Systems
Time: Thursday, May 7, 13:30 - 15:00 Location: Rue Crescent Track: Explainability, Fairness, and Trust in Data Systems and Analysis Session Chair: [To Be Announced]
CausalPre: Scalable and Effective Data Pre-processing for Causal Fairness
Ying Zheng* (National University of Singapore), Yangfan Jiang (National University of Singapore), Kian-Lee Tan (National University of Singapore)
Promoting Fairness in Information Access within Social Networks
Changan Liu* (Fudan University), Xiaotian Zhou (Fudan University), Ahad N. Zehmakan (Australian National University), Zhongzhi Zhang (Fudan University)
Interpreting Graph Inference with Skyline Explanations
Dazhuo Qiu* (Aalborg University), Haolai Che (Case Western Reserve University), Arijit Khan (Aalborg University), Yinghui Wu (Case Western Reserve University)
Explaining GNN Negatives Globally and Locally
Kehan Pang (Beihang University), Wenfei Fan (University of Edinburgh), Min Xie* (Shenzhen Institute of Computing Sciences), Dandan Lin (Shenzhen Institute of Computing Sciences)
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Misinformation Detection
Zehong Yan* (National University of Singapore), Peng Qi (National University of Singapore), Wynne Hsu (National University of Singapore), Mong Li Lee (National University of Singapore)
[R-34] Dynamic Graphs and Temporal Graph Processing
Time: Thursday, May 7, 15:30 - 17:00 Location: Rue McGill Track: Graph Structure Analytics Session Chair: [To Be Announced]
GRACE: Alleviating Reconstruction Cost in Dynamic Graph Processing Systems
Hongru Gao (Huazhong University of Science and Technology), Shuhao Zhang* (Huazhong University of Science and Technology), Xiaofei Liao (Huazhong University of Science and Technology), Hai Jin (Huazhong University of Science and Technology)
IIT-Tree: An efficient index to support interval-based query on large temporal graphs
Faming Li* (Northeastern University), Shengli Qiu (Northeastern University), Xiaochun Yang (Northeastern University), Bin Wang (Northeastern University), Hengzhao Ma (Northeastern University)
TRADER: Real-Time Arbitrage Detection via Negative Cycles on Dynamic Graphs
Bingqiao Luo* (National University of Singapore), Yuhang Chen (National University of Singapore), Jiaxin Jiang (National University of Singapore), Yuheng Cong (Shanghai Jiao Tong University), Ziyu He (Shanghai Jiao Tong University), Shixuan Sun (Shanghai Jiao Tong University), Bingsheng He (National University of Singapore), Wee Howe Ang (Tokka Labs)
Unifying Graph Traversals and Time Series Joins in Hybrid Graphs
PRIME: Efficient Algorithm for Token Graph Routing Problem
Haotian Xu (Hong Kong University of Science and Technology (Guangzhou)), Yuqing Zhu (Nanyang Technological University), Yuming Huang (National University of Singapore), Jing Tang* (The Hong Kong University of Science and Technology (Guangzhou))
[R-35] Learned Data Systems and AI-Driven Analysis
Time: Thursday, May 7, 15:30 - 17:00 Location: Rue Sherbrooke Track: AI for Data Management Session Chair: [To Be Announced]
Conflict Resolution for Improving ML Accuracy
Wenfei Fan (Shenzhen Institute of Computing Sciences), Xiaoyu Han (Fudan University), Hufsa Khan (Shenzhen Institute of Computing Sciences), Weilong Ren* (Shenzhen Institute of Computing Sciences), Yaoshu Wang (Shenzhen Institute of Computing Sciences), Min Xie (Shenzhen Institute of Computing Sciences), Zihuan Xu (Shenzhen Institute of Computing Sciences)
LUCID: an Updatable and Concurrent Learned Index for Larger-than-Memory Data Management
Chaohong Ma* (Hebei Normal University), Xiaohui Yu (York University), Yifan Li (York University), Aishan Maoliniyazi (Renmin University of China), Xiaofeng Meng (Renmin University of China)
Rethinking Flexible Graph Similarity Computation: One-step Alignment with Global Guidance
Zhouyang Liu* (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Yixin Chen (National University of Defense Technology), Jiezhong He (National University of Defense Technology), Shuai Ma (Beihang University), Dongsheng Li (National University of Defense Technology)
Generalizable Address-aware Semantic Prefetching for Scalable Transactional and Analytical Workloads
Farzaneh Zirak* (The University of Melbourne), Farhana Choudhury (The University of Melbourne), Renata Borovica-Gajic (The University of Melbourne)
An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs
Waleed Afandi* (Concordia University), Hussein Abdallah (Concordia University), Ashraf Aboulnaga (The University of Texas at Arlington), Essam Mansour (Concordia University)
[R-36] Time Series and Temporal Data Analysis
Time: Thursday, May 7, 15:30 - 17:00 Location: Rue Mansfield Track: Spatial Databases and Temporal Databases Session Chair: [To Be Announced]
Compressing High-Frequency Time Series Through Multiple Models and Stealing from Residuals
Zhiheng Liu (Tsinghua University), Xingyu Liu (Tsinghua University), Shaoxu Song* (Tsinghua University), Jianmin Wang (Tsinghua University)
FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis
Da Zhang (Northwestern Polytechnical University), Bingyu Li (University of Science and Technology of China), Zhiyuan Zhao (Institute of Artificial Intelligence (TeleAI), China Telecom), Feiping Nie (Northwestern Polytechnical University), Junyu Gao* (Northwestern Polytechnical University, Center for OPTical IMagery Analysis and Learning), Xuelong Li (Institute of Artificial Intelligence (TeleAI), China Telecom)
Scaling Subsequence Similarity Join Based on Dynamic Time Warping
Zemin Chao (Harbin Institute of Technology), Qiaoyi Zheng (Harbin Institute of Technology), Xingxing Xiao (Harbin Institute of Technology), Boyu Xiao (Harbin Institute of Technology), Zhixin Qi (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology)
Time-varying Vector Field Compression with Preserved Critical Point Trajectories
Mingze Xia (Oregon State University), Yuxiao Li (The Ohio State University), Pu Jiao (University of Kentucky), Bei Wang (University of Utah), Xin Liang* (Oregon State University), Hanqi Guo (The Ohio State University)
[R-37] Secure Query Processing and Access Control
Time: Thursday, May 7, 15:30 - 17:00 Location: Rue Crescent Track: Database Security & Privacy Session Chair: [To Be Announced]
Secure Query Processing with Linear Online Cost
Qiyao Luo* (OceanBase, Ant Group), Yilei Wang (Alibaba Cloud), Wei Dong (Nanyang Technological University), Ke Yi (Hong Kong Univ. of Science and Technology)
Zero-Knowledge Verifiable Graph Query Evaluation via Expansion-Centric Operator Decomposition
Hao Wu (East China Normal University), Changzheng Wei (Ant Group), Yanhao Wang (East China Normal University), Li Lin (Ant Group), Yilong Leng (East China Normal University), Shiyu He (East China Normal University), Minghao Zhao* (East China Normal University), Hanghang Wu (Ant Group), Ying Yan (Ant Group), Aoying Zhou (East China Normal University)
Data Guard: A Fine-grained Purpose-based Access Control System for Large Data Warehouses
CFDGraph: Privacy-Preserving Graph Processing for Large-Scale Collaborative Fraud Detection
Qiulin Wu (Shenzhen University, Hong Kong Baptist University), Amelie Chi Zhou* (Hong Kong Baptist University), Tristan Allard (Univ. Rennes, CNRS, IRISA), Shadi Ibrahim (Inria), Yuhong Feng (Shenzhen University), Lichun Li (Ant Group), Amr Abbadi (UC Santa Barbara)
RISK: Efficiently processing rich spatial-keyword queries on encrypted geo-textual data
Zhen Lv (Xidian University), Cong Cao (Xidian University), Hongwei Huo (Xidian University), Jiangtao Cui (Xidian University), Yanguo Peng* (Xidian University), Hui Li (Xidian University), Yingfan Liu (Xidian University)
[R-38] Storage Management and LSM-tree Systems
Time: Friday, May 8, 10:00 - 12:00 Location: Av. Duluth Track: Query Processing, Indexing, and Optimization Session Chair: [To Be Announced]
Contextual Pattern Mining and Counting
Ling Li (King's College London), Daniel Gibney (University of Texas at Dallas), Sharma Thankachan (North Carolina State University), Solon Pissis (CWI), Grigorios Loukides* (King's College London)
MatKV: Trading Compute for Flash Storage in LLM Inference
Kun-Woo Shin (Seoul National University), Jay H. Park (Samsung Electronics), Moonwook Oh (Samsung Electronics), Yohan Jo (Seoul National University), Jaeyoung Do (Seoul National University), Sang-Won Lee* (Seoul National University)
AOEH: An Efficient Extendable Hashing to Reduce Read/Write Amplification for Persistent Memory
Resystance: Unleashing Hidden Performance of Compaction in LSM-trees via eBPF
Hongsu Byun (Sogang University), Seungjae Lee (Sogang University), Honghyoen Yoo (Sogang University), MyoungJoon Kim (Sogang University), Sungyong Park* (Sogang University)
Doux: Decoupling Values from Keys for Real-Time Analytics
Shiming Yang* (Renmin University of China), Yu Luo (Renmin University of China), Shuang Liu (Renmin University of China), Wei Lu (Renmin University of China), Kuien Liu (Institute of Software Chinese Academy of Sciences), Yuxing Chen (Tencent Inc.), Anqun Pan (Tencent Inc.), Lixiong Zheng (Tencent Inc.), Xiaoyong Du (Renmin University of China)
[R-39] Trajectory, POI Recommendation and Spatial Crowdsourcing
Time: Friday, May 8, 10:00 - 12:00 Location: Rue McGill Track: Spatial Databases and Temporal Databases Session Chair: [To Be Announced]
High-Fidelity Task Assignment in Spatial Crowdsourcing via Implicit Human Feedback
Qingshun Wu (Zhengzhou University), Yafei Li* (Zhengzhou University), Lei Gao (Zhengzhou University), Guanglei Zhu (Zhengzhou University), Lei Chen (Beijing Institute of Technology), Mingliang Xu (Zhengzhou University)
Efficient Model-Agnostic Continual Learning for Next POI Recommendation
Chenhao Wang* (UESTC), Shanshan Feng (Wuhan University), Lisi Chen (UESTC), Fan Li (The Hong Kong Polytechnic University), Shuo Shang (UESTC)
PORCA: Root Cause Analysis with Partially Observed Data
Chang Gong (Institute of Computing Technology, Chinese Academy of Sciences), Di Yao* (Institute of Computing Technology, Chinese Academy of Sciences), Jin Wang (Megagon Labs), Wenbin Li (Institute of Computing Technology, Chinese Academy of Sciences), Lanting Fang (Beijing Institute of Technology), Yongtao Xie (Southeast University), Kaiyu Feng (Beijing Institute of Technology), Peng Han (University of Electronic Science and Technology of China), Jingping Bi (Institute of Computing Technology, Chinese Academy of Sciences)
Trajectory–User Linking via Heterogeneous Preference Graph and Dual-Encoder Mutual Distillation
FedCurrMM: A Federated Map Matching Framework with Curriculum-aware Client Selection
Minxiao Chen* (Beijing University of Posts and Telecommunications), Haitao Yuan (Nanyang Technological University), Haoning Wang (National University of Singapore), Nan Jiang (Nanyang Technological University), Zhihan Zheng (Beijing University of Posts and Telecommunications), Ao Zhou (Beijing University of Posts and Telecommunications), Shangguang Wang (State Key Laboratory of Networking and Switching Technology)
Geography-Aware Large Language Model for Next POI Recommendation
Wei Liu* (Sun Yat-Sen University), Zhao Liu (Sun Yat-sen University), Muzu Xie (Sun Yat-sen University), Huaijie Zhu (Sun Yat-sen University), Jianxing Yu (Sun Yat-sen University), Jian Yin (Sun Yat-sen University), Wang-Chien Lee (The Pennsylvania State University)
Balancing Competition for Fairness-aware Task Assignment in Spatial Crowdsourcing
Jinwen Chen* (University of Electronic Science and Technology of China), Hao Miao (The Hong Kong Polytechnic University), Lei Jia (University of Electronic Science and Technology of China), Guangqiang Yin (University of Electronic Science and Technology of China), Yan Zhao (Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China), Kai Zheng (University of Electronic Science and Technology of China)
[R-40] Spatiotemporal Forecasting, Urban Analytics and Recommendations
Time: Friday, May 8, 10:00 - 12:00 Location: Rue Sherbrooke Track: Data Mining and Knowledge Discovery Session Chair: [To Be Announced]
Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution
TransLGX: A Self-contained Model to Predict the Entire Lifecycle and complete state of Logistics Package Trajectories
Yichen Song (Zhejiang University), Jianfeng Zhou (Bytedance), Jian-Ya Ding* (Bytedance), Renhao Cao (Bytedance)
Community-level Personalized Recommendation by Exploiting Evolving User-Item Micro-clusters
Xinyu Liu* (University of Electronic Science and Technology of China), Jinxia Guo (University of Electronic Science and Technology of China), Qirui Hao (University of Electronic Science and Technology of China), Zhongjing Yu (Peking University), Qinli Yang (University of Electronic Science and Technology of China), Junming Shao (University of Electronic Science and Technology of China)
DNA: A Distribution-and-Aggregation Solution for Spatiotemporal K-function-based Analysis
Tsz Nam Chan* (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Dingming Wu (Shenzhen University), Renchi Yang (Hong Kong Baptist University), Ruisheng Wang (Shenzhen University)
Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation
Jinfeng Xu* (The University of Hong Kong), Zheyu Chen (Beijing Institute of Technology), Shuo Yang (The University of Hong Kong), Jinze Li (The University of Hong Kong), Hewei Wang (Carnegie Mellon University), Yijie Li (Carnegie Mellon University), Jianheng Tang (Peking University), Yunhuai Liu (Peking University), Edith Ngai (The University of Hong Kong)
Fast k-means via Data-Aware Grouping and Gap-Optimized Lower Bound
Xiaogang Huang* (Fujian Normal University), Dan Zhuang (Fujian Normal University), Jianbao Chen (Fujian Normal University), Tiefeng Ma (Southwestern University of Finance and Economics), Shuangzhe Liu (University of Canberra)
[R-41] Table Question Answering and Discovery
Time: Friday, May 8, 10:00 - 12:00 Location: Rue Mansfield Track: AI for Data Management Session Chair: [To Be Announced]
Accurate Table Question Answering with Accessible LLMs
Yangfan Jiang* (National University of Singapore), Fei Wei (Alibaba Group), Ergute Bao (Mohamed bin Zayed University of Artificial Intelligence), Yaliang Li (Alibaba Group), Bolin Ding (Alibaba Group), Yin Yang (Hamad Bin Khalifa University), Xiaokui Xiao (National University of Singapore)
Efficient and Scalable Search for Statistics
Antoine Gauquier* (DI ENS, ENS, CNRS, PSL University & Inria), Simon Ebel (Inria), Helena Galhardas (INESC-ID & IST, Universidade Lisboa), Théo Galizzi (Inria), Ioana Manolescu (Inria), Aurélien Peden (Inria), Pierre Senellart (DI ENS, ENS, CNRS, PSL University & Inria)
Decomposition-Driven Multi-Table Retrieval and Reasoning for Numerical Question Answering
Feng Luo (RMIT University), Hai Lan (The University of Queensland), Hui Luo (University of Wollongong), Zhifeng Bao* (The University of Queensland), Xiaoli Wang (Xiamen University), J. Shane Culpepper (The University of Queensland), Shazia Sadiq (The University of Queensland)
SPARQ: A Cost-Efficient Framework for Offline Table Question Answering via Adaptive Routing
Yang Liu* (Beihang University), Mengyi Yan (Shandong University), Jiao Xue (Inspur Cloud Information Technology Co., Ltd.), Weilong Ren (Shenzhen Institute of Computing Sciences), Yutong Ye (Beihang University), Haoyi Zhou (Beihang University), Zhumin Chen (Shandong University), Jianxin Li (Beihang University)
L³C: Leaf-Centric Continuous Codes for Natural Language-Driven Table Discovery
Qiyuan Zhang* (National University of Defense Technology), Ruochun Jin (National University of Defense Technology), Jixin Zhang (National University of Defense Technology), Yuhua Tang (National University of Defense Technology), Xiang Zhao (National University of Defense Technology), Shixuan Liu (National University of Defense Technology)
[R-42] AI-Powered Querying and RAG Systems
Time: Friday, May 8, 13:30 - 15:00 Location: Av. Duluth Track: AI for Data Management Session Chair: [To Be Announced]
HaS: Accelerating RAG through Homology-Aware Speculative Retrieval
Peng Peng (South China University of Technology), Weiwei Lin (South China University of Technology), Wentai Wu* (Jinan University), Xinyang Wang (Beijing Forestry University), Yongheng Liu (Pengcheng Laboratory)
SaCal: An Efficient Saliency-Guided Causal Framework for Interpretable Healthcare Analytics
Feixuan Lin* (Beijing Institute of Technology), Chenyu You (Beijing Institute of Technology), Zhongle Xie (Zhejiang University), Zhaojing Luo (Beijing Institute of Technology), Meihui Zhang (Beijing Institute of Technology)
XRAG: eXamining the Core - Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation [Experiment, Analysis, and Benchmark]
Qili Zhang (Beihang University), Qianren Mao* (Zhongguancun Laboratory), Yangyifei Luo (Beihang University), Yashuo Luo (Beihang University), Hanwen Hao (Beihang University), Zhilong Cao (Beihang University), Weifeng Jiang (Nanyang Technological University), Zhijun Chen (Beihang University), Junnan Liu (Beihang University), Feng Yan (Beihang University), Xiaolong Wang (Beihang University), Jinlong Zhang (Beihang University), Zhenting Huang (Beihang University), Zhixing Tan (Zhongguancun Laboratory), Jie Sun (Zhongguancun Laboratory), Bo Li (Beihang University), Jianxin Li (Beihang University), Philip Yu (University of Illinois Chicago)
CARROT: A Learned Cost-Constrained Retrieval Optimization System for RAG
Time: Friday, May 8, 13:30 - 15:00 Location: Rue McGill Track: Graph Structure Analytics Session Chair: [To Be Announced]
Reverse k Nearest Neighbor Query in Large Road Networks: A Tree Decomposition based Approach
Dian Ouyang (Guangzhou University), Boyu Zhang (Guangzhou University), Jianye Yang* (Guangzhou University), Shiyu Yang (Guangzhou University), Chonghua Wang (China Industrial Control Systems Cyber Emergency Response Team), Xuemin Lin (Shanghai Jiao Tong University)
Overcoming the Sync-Compute Dilemma in Parallel Graph-Based Vector Retrieval
Qiji Mo* (Nankai University), Zhiyuan Hua (Nankai University), Zebin Yao (Nankai University), Lixiao Cui (Nankai University), Gang Wang (Nankai University), Xiaoguang Liu (Nankai University), Zijing Wei (Alibaba Group Holding Limited), Xinyu Liu (Alibaba Group Holding Limited), Tianxiao Tang (Alibaba Group Holding Limited), Shaozhi Liu (Alibaba Group Holding Limited), Lin Qu (Alibaba Group Holding Limited)
Efficient Top-k Nearest Neighbors Search in Dynamic Road Networks
Junhua Zhang (University of New South Wales), Yamei Song (University of New South Wales), Wentao Li* (University of Leicester), Lu Qin (University of Technology Sydney)
An Efficient and Scalable Approach for Path Queries on Public Transportation Networks
Junhua Zhang* (Northeastern University), Wentao Li (University of Leicester), Wenjie Zhang (University of New South Wales), Lu Qin (University of Technology Sydney), Xiaochun Yang (Northeastern University)
BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search
Huiling Li* (Hong Kong Baptist University), Xin Huang (Hong Kong Baptist University), Byron Choi (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University)
[R-44] Distributed Storage, Consensus and Infrastructure
Time: Friday, May 8, 13:30 - 15:00 Location: Rue Sherbrooke Track: Distributed, Parallel and P2P Data Management Session Chair: [To Be Announced]
SwitchDelta: Asynchronous Metadata Updating for Distributed Storage with In-Network Data Visibility
Junru Li* (Tsinghua), Qing Wang (Tsinghua), Zhe Yang (Tsinghua), Shuo Liu (Huawei Technologies Co., Ltd.), Jiwu Shu (Tsinghua), Youyou Lu (Tsinghua)
GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph
Feng Yao* (Northeastern University), Xiaokang Yang (Northeastern University), Shufeng Gong (Northeastern University), Song Yu (Northeastern University), Yanfeng Zhang (Northeastern University), Ge Yu (Northeastern University)
RL-Paxos: Relieving the Leader's Burden with Efficient Task Offloading in Distributed Consensus
Chenhao Zhang* (Beihang University), Jinquan Wang (Beihang University), Meng Han (Tsinghua University), Bing Wei (Hainan University), Xiaojian Liao (Beihang University), Limin Xiao (Beihang University), Shanchen Pang (China University of Petroleum (East China))
DistVec: Efficient Distributed Machine Learning in Parallel Database Systems
Xinyi Zhang* (Renmin University of China), Liangzu Liu (Peking University), Xupeng Miao (Purdue University), Yinjun Wu (Peking University), Zhen Chen (Tsinghua University), Wei Lu (Renmin University of China), Xiaoyong Du (Renmin University of China), Bin Cui (Peking University)
Nezha: A Key-Value Separated Distributed Store with Optimized Raft Integration
Time: Friday, May 8, 13:30 - 15:00 Location: Rue Mansfield Track: Graph Queries, Entity Alignment and Learning Session Chair: [To Be Announced]
Chase Anonymisation: Privacy-Preserving Knowledge Graphs with Logical Reasoning
Luigi Bellomarini (Bank of Italy), Costanza Catalano* (Bank of Italy), Andrea Coletta (Bank of Italy), Michela Iezzi (Bank of Italy), Pierangela Samarati (Università degli Studi di Milano)
Reconstructing TensorLog for Scalable End-to-end Rule Learning
Kunxun Qi* (The Hong Kong University of Science and Technology (Guangzhou)), Jianfeng Du (Guangdong University of Foreign Studies), Hai Wan (Sun Yat-sen University), Wei Wang (The Hong Kong University of Science and Technology (Guangzhou))
An End-to-End Re-Evaluation of Table Entity-Linking Systems
Martin Christensen* (Aalborg University), Matteo Lissandrini (University of Verona), Katja Hose (Technische Universität Wien)
RaSE-KGC: A Relation-Aware Segment Encoding Approach for Knowledge Graph Completion
Chenxiao Lin (Xiamen University), Ye Luo* (Xiamen University), Kunhong Liu (Xiamen University), Qingqiang Wu (Xiamen University)
Semantic Compression for Sound and Complete Query Answering over Knowledge Graphs
Junhua Ma* (Sun Yat-sen University), Jianfeng Du (Guangdong University of Foreign Studies), Hai Wan (Sun Yat-sen University), Yue Yu (Peng Cheng Laboratory), Qunxun Qi (Sun Yat-sen University), Weilin Luo (Sun Yat-sen University), Yanan Liu (Sun Yat-sen University)
Industry & Applications (I&A) Papers
[I&A-1] LLMs and AI for Database Operations
Time: Tuesday, May 5, 10:00 - 12:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
Graph Query Generation with Constraint-guided Large Language Agents
DM-RAG: Enhancing User Support in Dameng Databases with Retrieval-Augmented Generation
Qiang Huang* (Wuhan University), Ke Liu (Wuhan Dameng Database Co., Ltd), Liang Deng (Wuhan Dameng Database Co., Ltd), Sijing Zhang (Wuhan Dameng Database Co., Ltd), Chuang Hu (Wuhan University), Tieyun Qian (Wuhan University), Xiao Yan (Wuhan University), Jiawei Jiang (Wuhan University)
GalaxyRAG: Graph Retrieval-Augmented Generation for Enterprise Knowledge Systems
Bing Tong* (The Hong Kong University of Science and Technology (Guangzhou)), Yan Zhou (Zhejiang Chuanglin Technology Co., Ltd.), Chen Zhang (Zhejiang Chuanglin Technology Co., Ltd.), Zhaojie Yin (Zhejiang Chuanglin Technology Co., Ltd.), Jia Li (The Hong Kong University of Science and Technology (Guangzhou))
Democratizing Tabular Data Access with an Open-Source Synthetic-Data SDK
Time: Tuesday, May 5, 13:30 - 15:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems
Haibo Xing* (Alibaba), Hao Deng (Alibaba), Yucheng Mao (Alibaba), Lingyu Mu (Alibaba), Jinxin Hu (Alibaba), Yi Xu (Alibaba), Hao Zhang (Alibaba), Jiahao Wang (Alibaba), Shizhun Wang (Alibaba), Yu Zhang (Alibaba), Xiaoyi Zeng (Alibaba), Jing Zhang (Wuhan University)
GALA: Generative Aligned Learning for Adaptive Multimodal Representation in the Eleme Recommender System
JiPing Liu* (Alibaba Group), Zhongmin Zhang (Alibaba Group), Zisen Sang (Alibaba Group), Zhijia Fang (Alibaba Group), Tao Ouyang (Central South University), Ma Jiang (Alibaba Group), Shaopeng Liang (Alibaba Group), Zeyang Hou (Alibaba Group), Guodong Cao (Alibaba Group), Jia Jia (Alibaba Group)
Cascading Relevance-driven Recommendation Network for CTR Prediction in Trigger-Introduced Recommendation
Kaixuan Chen* (Taobao & Tmall Group of Alibaba), Wenwen Wang (Taobao & Tmall Group of Alibaba), Xing Fang (Taobao & Tmall Group of Alibaba), Yang Huang (Taobao & Tmall Group of Alibaba), Jing Wang (Taobao & Tmall Group of Alibaba)
Decoupled Multimodal Fusion for User Interest Modeling in Click-Through Rate Prediction
Alin Fan* (Alibaba International Digital Commerce Group), Hanqing Li (Alibaba International Digital Commerce Group), Sihan Lu (Renmin University of China), Jingsong Yuan (Alibaba International Digital Commerce Group), Jiandong Zhang (Alibaba International Digital Commerce Group)
OEPO: Online Experience-based Preference Optimization for CTR Prediction
Zhichao Liao* (University of Electronic Science and Technology of China), Ziheng Ni (JD.com), Congcong Liu (JD.com), Zhiwei Fang (JD.com), Changping Peng (JD.com)
[I&A-3] E-commerce, Search and Feature Engineering
Time: Tuesday, May 5, 15:30 - 17:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization
Yiwen Tang* (Shanghai AI Laboratory, Alibaba), Qiuyu Zhao (Alibaba), Zenghui Sun (Alibaba), Jinsong Lan (Alibaba), Xiaoyong Zhu (Alibaba), Bo Zheng (Alibaba)
Relevance Matters: A Multi-Task and Multi-Stage Large Language Model Approach for E-commerce Query Rewriting
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun Ouyang* (LIGHTSPEED STUDIOS, Tencent), Haoyu Wang (Tsinghua University), Dong Fang (LIGHTSPEED STUDIOS)
JITPrune: An Efficient Online Feature Pruning Framework for Embedding-based DLRM Training
Hongzheng Li* (Beijing University of Posts and Telecommunications), Yucheng Wu (Peking University), Junjie Zhai (Tencent), Anan Liu (Tencent), Yuekui Yang (Tencent), Yingxia Shao (Beijing University of Posts and Telecommunications)
CoLIBRi: Supporting quotation through multi-modal retrieval and conversational search on manufacturing drawings
Jacob Pollack* (Leipzig University), Lucas Peter (Leipzig University), Matthias Täschner (Leipzig University), Carmen Ahnert (CPT Präzisionstechnik GmbH)
[I&A-4] Scalable Data Systems and Infrastructure
Time: Wednesday, May 6, 10:00 - 12:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
OceanBase Mercury: Building a Distributed Real-time Analytical Processing Database System
Quanqing Xu* (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group), Ruijie Li (OceanBase, Ant Group), Dongdong Xie (OceanBase, Ant Group), Hui Cao (OceanBase, Ant Group), Yi Xiao (OceanBase, Ant Group), Junquan Chen (OceanBase, Ant Group), Yanzuo Wang (OceanBase, Ant Group), Saitong Zhao (OceanBase, Ant Group), Fusheng Han (OceanBase, Ant Group)
OceanBase CDC: A Log-Based Distributed CDC System for High Availability and Scalability
Quanqing Xu* (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group), Sen Wang (OceanBase, Ant Group), Fusheng Han (OceanBase, Ant Group)
Bala-Join: An Adaptive Hash Join for Balancing Communication and Computation in Geo-Distributed SQL Databases
Wenlong Song (Xidian University), Hui Li* (Xidian University), Bingying Zhai (Xidian University), Jinxing Yang (Xidian University), Pinghui Wang (Xi’an Jiaotong University), Jiangtao Cui (Xidian University), Luming Sun (Yunxi Technology Company Ltd.), Ming Li (Shandong Inspur Database Technology Company Ltd.)
Automatic Parameter Tuning for Compaction in LSM-Tree based Databases
Pinshan Cao (East China Normal University), Peng Cai* (East China Normal University), Xuan Zhou (East China Normal University), Jun-Peng Zhu (East China Normal University), Kecheng Luo (East China Normal University), Sijia Li (East China Normal University), Quanqing Xu (OceanBase, Ant Group), Chuanhui Yang (OceanBase, Ant Group)
DBdoctor: A Fine-grained and Non-intrusive Performance Diagnosis Platform for Databases
Xinyue Shi* (Renmin University of China), Quanqi Xin (Juhaokan Technology, Hisense), Zhengjin Wang (Renmin University of China), Xinyi Zhang (Renmin University of China), Haoqiong Bian (Renmin University of China), Wei Lu (Renmin University of China), Qiyu Zhuang (Renmin University of China), Shuang Liu (Renmin University of China), Jikuan Zhang (Juhaokan Technology, Hisense), Xiang Zheng (Juhaokan Technology, Hisense), Yunpeng Chai (Renmin University of China), Xiaoyong Du (Renmin University of China)
On Efficient Materialization in Data Lakes
Andrew Harn (Google Inc), Herald Kllapi* (Google Inc), Zhepeng Yan (Google Inc)
StreamShield: A Production-Proven Resiliency Solution for Apache Flink at ByteDance
Yong Fang (ByteDance), Yuxing Han* (ByteDance), Meng Wang (ByteDance), Yifan Zhang (ByteDance), Yue Ma (ByteDance), Chi Zhang (ByteDance)
[I&A-5] Time Series Analysis and Forecasting
Time: Wednesday, May 6, 15:30 - 17:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
TAT: Temporal-Aligned Transformer for Multi-Horizon Peak Demand Forecasting
Zhiyuan Zhao* (Georgia Institute of Technology), Sitan Yang (Keystone AI), Stan Vitebsky (Amazon), B. Aditya Prakash (Georgia Institute of Technology), Dmitry Efimov (Amazon)
Hierarchical Industrial Demand Forecasting with Temporal and Uncertainty Explanations
Harshavardhan Kamarthi (Georgia Institute of Technology), Shangqing Xu* (Georgia Institute of Technology), Xinjie Tong (Aspen Technology), Xingyu Zhou (The Dow Chemical Company), James Peters (The Dow Chemical Company), Joseph Czyzyk (The Dow Chemical Company), B. Aditya Prakash (Georgia Institute of Technology)
Accurate and Efficient Multi-channel Time Series Forecasting via Sparse Attention Mechanism
Hengda Bao (SF Express), Jingfei Fang (SF Express), Guangzheng Wu* (Zhejiang University of Technology), Weihua Zhou (Zhejiang University)
From Benchmarks to Production: Transferring Time Series Anomaly Detection Methods for Electricity Production Monitoring
Nicolas Vautier* (EDF Lab Paris Saclay), Paul Caron (EDF DOAAT), Nardi Xhepi (EDF DOAAT), Félicie Bizeul (EDF Lab Paris Saclay), Manel Boumghar (EDF Lab Paris Saclay), Christophe Degouy (EDF DOAAT), Paul Boniol (INRIA)
User-Adaptive Meta-Learning for Cold-Start Medication Recommendation with Uncertainty Filtering
Arya Hadizadeh Moghaddam (University of Kansas), Mohsen Nayebi Kerdabadi (University of Kansas), Dongjie Wang (University of Kansas), Mei Liu (UF Health), Zijun Yao* (University of Kansas)
[I&A-6] Large-Scale ML Systems and Data Science Applications
Time: Thursday, May 7, 13:30 - 15:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
DLRover-LM: LLM Pre-Training Framework with Thousands of Accelerators in AntGroup
Ziling Huang* (Sichuan University), Zhengmao Ye (Sichuan University), Qingsong Cai (Sichuan University), Zelong Huang (Sichuan University), Bo Sang (Ant Group), Haitao Zhang (Ant Group), Jian Sha (Ant Group), Tingfeng Lan (University of Virginia), Hui Lu (The University of Texas at Arlington), Yuanchun Zhou (Chinese Academy of Sciences), Mingjie Tang (Sichuan University)
Tackling Workload Forecasting Challenges with an Offline-Online Dynamic Framework
Jian Jiang* (Ant Group), Yu Liu (Ant Group), Jia Li (Ant Group), Lu Han (Nanjing University), Wei Lu (Ant Group), Qiwen Deng (Ant Group), Zhibo Zhu (Ant Group), Xingyu Lu (Ant Group), Lintao Ma (Ant Group)
Jihang Li* (Hong Kong University of Science and Technology (Guangzhou)), Qing Liu (Alibaba Group), Zulong Chen (Alibaba Group), Jing Wang (Alibaba Group), Wei Wang (Alibaba Group), Chuanfei Xu (Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen)), Zeyi Wen (Hong Kong University of Science and Technology (Guangzhou))
Building and Benchmarking Large Language Models for Machine Translation in Social Network Services
Hongcheng Guo (Fudan University), Fei Zhao (Xiaohongshu Inc.), Shaosheng Cao* (Xiaohongshu Inc.), Xinze Lyu (Xiaohongshu Inc.), Zijie Meng (Zhejiang University), Yue Wang (Nanjing University), Yao Hu (Xiaohongshu Inc.), Zhoujun Li (Xiaohongshu Inc.), Zuozhu Liu (Zhejiang University)
Billion-scale Fintech Analytics: Scalable Data Management and Anomaly Detection at NPCI
Bharadwaj Dasari (National Payments Corporation of India), Turaga Sai Dhiraj (National Payments Corporation of India), Ganesh Jambhrunkar (National Payments Corporation of India), Thirumalai Kailasam (National Payments Corporation of India), Charu Vikram (National Payments Corporation of India), Saurav Singla (National Payments Corporation of India), Pranjal Naman (Indian Institute of Science (IISc), Bangalore), Yogesh Simmhan* (Indian Institute of Science (IISc), Bangalore)
[I&A-7] Hardware-Accelerated Search, Compression and Data Integration
Time: Thursday, May 7, 15:30 - 17:00 Location: Av. Van-Horne Track: Industry & Application Session Chair: [To Be Announced]
CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs
Yuchen Huang (East China Normal University), Baiteng Ma (East China Normal University), Yiping Sun (Xiaohongshu Inc), Yang Shi (Xiaohongshu Inc), Xiao Chen (Xiaohongshu Inc), Xiaocheng Zhong (Xiaohongshu Inc), Zhiyong Wang (Xiaohongshu Inc), Yao Hu (Xiaohongshu Inc), Chuliang Weng* (East China Normal University)
KScaNN: Scalable Approximate Nearest Neighbor Search on Kunpeng
Oleg Senkevich (Huawei Technologies Ltd), Siyang Xu (Huawei Technologies Ltd.), Tianyi Jiang (Huawei Technologies Ltd.), Alexander Radionov (Huawei Technologies Ltd.), Jan Tabaszewski (Huawei Technologies Ltd.), Dmitriy Malyshev (Higher School of Economics), Zijian Li* (Huawei Technologies Ltd.), Daihao Xue (Huawei Technologies Ltd.), Licheng Yu (Huawei Technologies Ltd.), Weidi Zeng (Huawei Technologies Ltd.), Meiling Wang (Huawei Technologies Ltd.), Xin Yao (Huawei Technologies Ltd.), Siyu Huang (Huawei Technologies Ltd.), Gleb Neshchetkin (Huawei Technologies Ltd.), Qiuling Pan (Huawei Technologies Ltd.), Yaoyao Fu (Huawei Technologies Ltd.)
Efficient Data Processing using On-the-Fly Host-PIM Interactions in a Commodity PIM System
Hyojune Kim (Hanyang University), Jeonghyeon Joo (Hanyang University), TaeHyeong Park (Yonsei University), Yongjun Park (Yonsei University), Hyuck Han (FuriosaAI), Sooyong Kang* (Hanyang University)
OpenZL: Using Graphs to Compress Smaller and Faster
Yann Collet (Meta), Nick Terrell (Meta), Winston Felix Handte (Meta Platforms), Danielle Rozenblit (Meta), Victor Zhang* (Meta), Kevin Zhang (Meta), Yaelle Goldschlag (Meta), Jennifer Lee (Meta), Elliot Gorokhovsky (Meta), Yonatan Komornik (Meta), Daniel Riegel (Meta), Stan Angelov (Meta), Nadav Rotem (Meta)
GaV: Guess and Verification of Column Semantics
Davide Di Stefano (TU Wien & Unlimidata Ltd), Jinsong Guo* (Unlimidata Ltd), Yang Hu (University of Leicester), Matteo Capalbo (University of Calabria), Davide Mario Longo (University of Calabria), Georg Gottlob (University of Calabria)
Lightning Talks
[Lightning Talks] Lightning Talks Session
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue Notre-Dame Track: Lightning Talks Session Chair: [To Be Announced]
MDSD: Multi-turn Diverse Synthetic Dialog Generation for Domain Specific Incomplete Requests Understanding
Xi Li* (Apple), Xiaoxu Wu (Apple), Lijuan Xiao (Apple), Tao Liu (Apple), Ping Huang (Apple), Jiulong Shan (Apple)
Model Slicing: a Data Engineering Perspective
Parke Godfrey (York University), Lukasz Golab* (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Jarek Szlichta (York University)
Responsible Entity Resolution over Streaming Data
Kostas Stefanidis* (Tampere University), Vasilis Efthymiou (Harokopio University of Athens), Tiago Brasileiro Araújo (Tampere University)
Tuning IBM Db2 with Explainable AI
Andrew Chai* (York University and IBM CAS), Alexander Bianchi (IBM Canada Ltd.), Vincent Corvinelli (IBM Canada Ltd.), Parke Godfrey (York University and IBM CAS), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University and IBM CAS), Calisto Zuzarte (IBM Canada Ltd.)
From Polystores to PolyDBMS: The Polypheny Experience
Marco Vogt* (University of Basel), David Lengweiler (University of Basel), Martin Vahlensieck (University of Basel), Yiming Wu (University of Basel), Heiko Schuldt (University of Basel)
Recovering Structure in Unstructured LLM Outputs
Joel Rorseth* (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Jarek Szlichta (York University)
Rethinking BFT Consensus via Transaction-Level Protocol Construction for Optimal Performance
When Text-to-SQL Evaluation Misleads: Rethinking Benchmarking Practices
Oktie Hassanzadeh* (IBM Research), Nhan Pham (IBM Research), Timothy Dinger (IBM Research), Tanvi Kaple (IBM Research), Long Vu (IBM Research), Michael Glass (IBM Research), Shankar Subramanian (IBM Research)
Are Dedicated Vector Databases Necessary? Benchmarking Vector Search in Relational and Analytical Systems for Enterprise Workloads
Archan Dutta* (Aisera), Thais Poi (San Joaquin Delta College)
Is Quantum Computing Ready for Real-Time Database Optimization?
Hanwen Liu* (University of Southern California), Ibrahim Sabek (University of Southern California)
On Breaking the Scalability Barrier in Data Cleaning
El Kindi Rezig* (University of Utah)
GNN Explainers 2.0: A Paradigm for User-Oriented, Data-Guided Explanations
Arijit Khan* (Bowling Green State University)
SalesforceDB: Built on one LSM to rule them all!
Vaibhav Arora* (Salesforce), Peter Desnoyers (Salesforce)
Demos
[Demo A] Accepted Demo Papers - Group A
Time: Tuesday, May 5, 10:00 - 12:00 & 15:30 - 17:00 Location: Av. Laurier Track: Demonstrations Session Chair: [To Be Announced]
BClean+: A Bayesian Data Cleaning System with Automated Prior Generation
Ziyan Han* (Shenzhen University), Jing Zhu (Shenzhen University), Jinbin Huang (Shenzhen University), Rui Mao (Shenzhen University; Shenzhen Institute of Computing Sciences), Jianbin Qin (Shenzhen University; Shenzhen Institute of Computing Sciences)
LazyVLM: Neuro-Symbolic Approach to Video Analytics
Xiangru Jian* (University of Waterloo), Wei Pang (University of Waterloo), Zhengyuan Dong (University of Waterloo), Chao Zhang (University of Waterloo), Tamer Özsu (University of Waterloo)
Jazero: A Semantic Table Search System
Martin Christensen* (Aalborg University), Matteo Lissandrini (University of Verona), Katja Hose (Technische Universität Wien)
GeX: Guiding tuning of Db2 with eXplainable AI
Andrew Chai* (York University and IBM CAS), Alexander Bianchi (IBM Canada Ltd.), Vincent Corvinelli (IBM Canada Ltd.), Parke Godfrey (York University and IBM CAS), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University and IBM CAS), Calisto Zuzarte (IBM Canada Ltd.)
A Fast, Versatile, and User-friendly Plugin for Kernel Density Analysis
Tsz Nam Chan* (Shenzhen University), Bojian Zhu (Hong Kong Baptist University), Leong Hou U (University of Macau), Dingming Wu (Shenzhen University), Wei Tu (Shenzhen University), Jianliang Xu (Hong Kong Baptist University)
Schema-GraphRAG: Bridging Hybrid Search and Graph Traversal for Complex Retrieval Tasks
TAPE: A Temporal Graph-based Memory System for Personal LLM Agents
Chengyang Luo (Zhejiang University), Qing Liu (Zhejiang University), Wenjie Zhang (The University of New South Wales), Yunjun Gao* (Zhejiang University)
Scoper: Streamline Linkable Schemas for Matching
Leonard Traeger* (University of Maryland, Baltimore County), Andreas Behrend (Technical University Cologne), George Karabatis (University of Maryland, Baltimore County)
CORAL: COncept-based Explanations for RAG LLMs
Katherine Ling* (York University), Joel Rorseth (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University)
DeepSketch 2.0: Discovering Temporal Relationships in Large Time Series Datasets
Kathryn Carbone* (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Jarek Szlichta (York University), Robin Cohen (University of Waterloo)
RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems
Joel Rorseth* (University of Waterloo), Parke Godfrey (York University), Lukasz Golab (University of Waterloo), Divesh Srivastava (AT&T Chief Data Office), Jarek Szlichta (York University)
SENTINEL: Evaluating Pipeline Robustness to Distributional Shifts
SqlRewriter: Harnessing Community Knowledge to Rewrite SQL Queries
Qiushi Bai* (UC Irvine), Yihong Yu (UC Irvine), Colin Harrison (UC Irvine), Jiatong Liu (UC Irvine), James Liu (UC Irvine), Jessie He (UC Irvine), Hartley Tran (UC Irvine), Jun Xia (UC Irvine), Chen Li (UC Irvine)
LOADS: Adaptive Cloud-Edge-Device Database Management System Optimizer
Chunyu Zhao (Harbin Institute of Technology), Yihan Zhang (Harbin Institute of Technology), Shuangshuang Cui (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology)
Multi-Model Geospatial Data Management and Exploration
David Lengweiler* (University of Basel), Marco Vogt (Polypheny GmbH), Heiko Schuldt (University of Basel)
PiPer - Leveraging Pipeline Perspectives for Effective Data Pipeline Exploration
Melanie Herschel* (Nanyang Technological University), Ridhwan Hakim Bin Kusni (NTU)
Pathfinder: Context Engineering and Knowledge Management for Domain-Specific Horizontal Reasoning
Joohyun Lee* (Seoul National University), Ghita Benboubker (Seoul National University), JungKwan Han (Seoul National University), Jisoo Jang (Seoul National University), Wen-Syan Li (Seoul National University)
TKDE Posters
[TKDE Poster A] Accepted TKDE Posters - Group A
Time: Tuesday, May 5, 10:00 - 12:00 & 15:30 - 17:00 Location: Av. Viger Track: TKDE Posters Session Chair: [To Be Announced]
Minimum k-Vertex Connected Graph Search
Yang Liu (Harbin Institute of Technology, Shenzhen), Hejiao Huang (Harbin Institute of Technology, Shenzhen), Kaiqiang Yu (Nanjing University), Shengxin Liu* (Harbin Institute of Technology, Shenzhen), Cheng Long (Nanyang Technological University)
PiTruss Community Search for Multilayer Graphs
Run-An Wang (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Dandan Liu (Harbin Institute of Technology), Xudong Liu (Harbin Institute of Technology)
Graph2Region: Efficient Graph Similarity Learning with Structure and Scale Restoration
Zhouyang Liu* (National University of Defense Technology), Yixin Chen (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Jiezhong He (National University of Defense Technology), Dongsheng Li (National University of Defense Technology)
Structural Clustering of Multi-layer Graphs
Xudong Liu (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Run-An Wang (Harbin Institute of Technology), Dandan Liu (Harbin Institute of Technology)
Jet-BGC: Joint Latent Embedding and Structural Fusion Bipartite Graph Clustering
Liang Li* (National University of Defense Technology), Yuangang Pan (Agency for Science, Technology and Research (A*STAR)), Junpu Zhang (National University of Defense Technology), Pei Zhang (National University of Defense Technology), Jie Liu (National University of Defense Technology), Xinwang Liu (National University of Defense Technology), Kenli Li (Hunan University), Ivor W. Tsang (Agency for Science, Technology and Research (A*STAR)), Keqin Li (State University of New York)
ConvD: Attention Enhanced Dynamic Convolutional Embeddings for Knowledge Graph Completion
Wenbin Guo (Tianjin University), Zhao Li (Tianjin University), Xin Wang* (Tianjin University), Zirui Chen (Tianjin University), Jun Zhao (Ningxia University), Jianxin Li (Edith Cowan University), Yuan Ye (Beijing Institute of Technology)
HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding
Zhao Li* (Tianjin University), Xin Wang (Tianjin University), Jun Zhao (Ningxia University), Wenbin Guo (Tianjin University), Jianxin Li (Edith Cowan University)
An Amortized O(1) Lower Bound for Dynamic Time Warping in Motif Discovery
Zemin Chao (Harbin Institute of Technology), Hong Gao (Zhejiang Normal University), Dongjing Miao (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology)
Discovery of Temporal Network Motifs
Hanqing Chen (Beihang University), Shuai Ma* (Beihang University), Junfeng Liu (Beihang University), Lizhen Cui (Shandong University)
[TKDE Poster B] Accepted TKDE Posters - Group B
Time: Tuesday, May 5, 13:30 - 15:00 & Wednesday, May 6, 10:00 - 12:00 Location: Av. Viger Track: TKDE Posters Session Chair: [To Be Announced]
Generalized Local Prominence for Source Detection in Real-World Rumor Networks
Syed Shafat Ali* (University of Kashmir), Ajay Rastogi (Amity University), Tarique Anwar (RMIT University), Syed Afzal Murtaza Rizvi (Jamia Millia Islamia), Jian Yang (Macquarie University), Jia Wu (Macquarie University), Quan Z. Sheng (Macquarie University)
Orthogonal Keys: High Precision and Recall for Mining Meaningful Database Keys from Inconsistent and Incomplete Relations
Henning Koehler (Massey University), Sebastian Link* (University of Auckland)
Hierarchy-Aware Neural Subgraph Matching with Enhanced Similarity Measure
Zhouyang Liu* (National University of Defense Technology), Ning Liu (Information Support Force Engineering University), Yixin Chen (National University of Defense Technology), Jiezhong He (National University of Defense Technology), Menghan Jia (National University of Defense Technology), Dongsheng Li (National University of Defense Technology)
Can Uncertainty Quantification Improve Learned Index Benefit Estimation?
Tao Yu (Harbin Institute of Technology), Zhaonian Zou* (Harbin Institute of Technology), Hao Xiong (Harbin Institute of Technology)
Haojie Li* (Qingdao University of Science and Technology), Junwei Du (Qingdao University of Science and Technology), Guanfeng Liu (Macquarie University), Feng Jiang (Qingdao University of Science and Technology), Yan Wang (Macquarie University), Xiaofang Zhou (Hong Kong University of Science and Technology)
Maximizing Influence Query over Indoor Trajectories
Jian Chen* (Harbin Institute of Technology), Hong Gao (Zhejiang Normal University), Yuhong Shi (Harbin Institute of Technology), Junle Chen (Harbin Institute of Technology), Donghua Yang (Harbin Institute of Technology), Jianzhong Li (Chinese Academy of Sciences)
LIOF: Make the Learned Index Learn Faster With Higher Accuracy
Tao Ji* (School of Information, Renmin University of China), Kai Zhong (School of Information, Renmin University of China), Luming Sun (Yunxi Technology Company Ltd.), Yiyan Li (School of Information, Renmin University of China), Cuiping Li (School of Information, Renmin University of China), Hong Chen (School of Information, Renmin University of China)
FedDict: Towards Practical Federated Dictionary-Based Time Series Classification
Zhiyu Liang (Harbin Institute of Technology), Zheng Liang (Harbin Institute of Technology), Hongzhi Wang* (Harbin Institute of Technology), Bo Zheng (CnosDB Inc.)
Snoopy: Effective and Efficient Semantic Join Discovery via Proxy Columns
Yuxiang Guo (Zhejiang University), Yuren Mao (Zhejiang University), Zhonghao Hu (Zhejiang University), Lu Chen (Zhejiang University), Yunjun Gao* (Zhejiang University)
DEFT
[DEFT] Data Engineering Future Technologies
Time: Thursday, May 7, 10:00 - 12:00 Location: Av. Van-Horne Track: Data Engineering Future Technologies (DEFT) Session Chair: [To Be Announced]
Efficient Neural-Symbolic Data System via Multi-Agent Collaboration
Ye Yuan (Beijing Institute of Technology), Bo Tang (Southern University of Science and Technology), Zhaojing Luo (Beijing Institute of Technology), Boyang Li (Beijing Institute of Technology), Renjie Liu (Southern University of Science and Technology), Zhilang Wei (Beijing Institute of Technology)
VPS: Rethinking OLTP Database Performance Evaluation through Transactional Value
Jianbin Qin (Shenzhen University), Yibin Lin (Shenzhen University), Wendi Hua (Shenzhen University), Rui Mao (Shenzhen University), Chuan Xiao (Osaka University)
Living Databases: A Unified Model for Continuous Schema Evolution, Versioning, and Transformations
Amol Deshpande (University of Maryland at College Park)
Green or Greedy? An Ecological Analysis of Datacenter GPU Replacements
Marc Baeuerle (Hasso Plattner Institute, University of Potsdam), Ole Becker (Hasso Plattner Institute, University of Potsdam), Nikolas Hoellerl (Hasso Plattner Institute, University of Potsdam), Ricardo Salazar Díaz (Hasso Plattner Institute, University of Potsdam), Ilin Tolovski (Hasso Plattner Institute, University of Potsdam), Tilmann Rabl (Hasso Plattner Institute, University of Potsdam)
GenIE: Simulator-Driven Iterative Data Exploration for Scientific Discovery
Ashwin Colaco (University of California, Irvine), Martin Boissier (Hasso Plattner Institute), Sriram Rao (University of California, Irvine), Shubharoop Ghosh (ImageCat), Sharad Mehrotra (University of California, Irvine), Tilmann Rabl (Hasso Plattner Institute)
Towards a Hybrid Quantum-Classical Computing Framework for Database Optimization Problems in Real Time Setup
Hanwen Liu (University of Southern California), Ibrahim Sabek (University of Southern California)
Tutorials
[Tutorial-1] Evolution of LSM-Tree Key-Value Stores: A Tutorial on State-of-the-Art and Future Directions
Time: Tuesday, May 5, 10:00 - 12:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Yina Lv (Xiamen University, China)
Qiao Li (MBZUAI, UAE)
Quanqing Xu (OceanBase, Ant Group, China)
Chun Jason Xue (MBZUAI, UAE)
[Tutorial-2] Large Language Models for Spatial Analysis Queries
Time: Wednesday, May 6, 10:00 - 12:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Mohamed Hemdan (University of Minnesota, Minnesota, USA)
Youssef Hussein (University of Minnesota, Minnesota, USA)
Mohamed F. Mokbel (University of Minnesota, Minnesota, USA)
[Tutorial-3] Data Discovery in Data Lakes: Operations, Indexes, Systems
Presenters:
Ziawasch Abedjan (TU Berlin & BIFOLD Berlin, Germany)
Mahdi Esmailoghli (Humboldt-Universität zu Berlin, Berlin, Germany)
Sainyam Galhotra (Cornell University, Ithaca, NY, USA)
[Tutorial-4] The Virtuous Cycle: AI-Powered Vector Search and Vector Search-Augmented AI
Time: Wednesday, May 6, 15:30 - 17:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Jiuqi Wei (OceanBase, Ant Group, Beijing, China)
Quanqing Xu (OceanBase, Ant Group, Hangzhou, China)
Chuanhui Yang (OceanBase, Ant Group, Beijing, China)
[Tutorial-5] Query Rewrite in the Learning Age: From Rules to ML-Based and LLM-Driven Techniques
Time: Thursday, May 7, 10:00 - 12:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Shengchen Liu (University of Ottawa, Ottawa, ON, Canada)
Verena Kantere (University of Ottawa, Ottawa, ON, Canada)
Nicholas Ostan (IBM Canada Ltd., Toronto, ON, Canada)
Farhana Haider (IBM Canada Ltd., Toronto, ON, Canada)
Calisto Zuzarte (IBM Canada Ltd., Toronto, ON, Canada)
[Tutorial-6] Data-Centric Foundations of Agentic AI
Time: Thursday, May 7, 13:30 - 15:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Yuxin Jin (University of Technology Sydney, Sydney, Australia)
Ying Zhang (Zhejiang Gongshang University, Hangzhou, China)
Hanchen Wang (University of Technology Sydney, Sydney, Australia)
Wenjie Zhang (University of New South Wales, Sydney, Australia)
[Tutorial-7] Model Slicing: A Data Engineering Perspective
Time: Thursday, May 7, 15:30 - 17:00 Location: Rue Saint-Denis Track: Tutorials Length: 1.5 Hours
Presenters:
Parke Godfrey (York University, Toronto, ON, Canada)
Lukasz Golab (University of Waterloo, Waterloo, ON, Canada)
Divesh Srivastava (AT&T Chief Data Office, Bedminster, NJ, USA)
Jarek Szlichta (York University, Toronto, ON, Canada)