January 19-22, 2025 Amsterdam, The Netherlands
Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing
Chunwei Liu (MIT), Matthew D Russo (MIT), Michael Cafarella (MIT CSAIL), Lei Cao (University of Arizona), Peter Baile Chen (Massachusetts Institute of Technology), Zui Chen (Tsinghua University), Michael Franklin (University of Chicago), Tim Kraska (MIT), Samuel Madden (MIT), Rana Shahout (Harvard), Gerardo Vitagliano (MIT CSAIL)
Adaptive data transformations for QaaS
Dimitrios Koutsoukos (ETHZ), Renato Marroquín (Oracle Labs), Ingo Müller (Google), Ana Klimovic (ETH Zurich)
OSDB: Exposing the Operating System’s Inner Database
Robert Soulé (Yale University), George V Neville-Neil (Yale University), Stelios Kasouridis (Yale University), Alex Yuan (Yale University), Avi Silberschatz (Yale University), Peter Alvaro (UC Santa Cruz)
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Asim Biswal (University of California, Berkeley), Siddharth Jha (UC Berkeley), Carlos Guestrin (Stanford University), Matei Zaharia (Berkeley and Databricks), Joseph E Gonzalez (UC Berkeley), Amog Kamsetty (UC Berkeley), Shu Liu (University of California Berkeley), Liana Patel (Stanford University)
Bullion: A Column Store for Machine Learning
Gang Liao (University of Maryland), Ye Liu (Bytedance Inc.), Jianjun Chen (Bytedance), Daniel Abadi (University of Maryland, College Park)
Runtime-Extensible Parsers
Hannes Mühleisen (DuckDB Labs), Mark Raasveldt (DuckDB Labs)
Efficient Approximate Query Processing with Block Sampling
Yuxuan Zhu (University of Illinois Urbana-Champaign), Daniel Kang (UIUC)
The Five-Minute Rule for the Cloud: Caching in Analytics Systems
Kira Duwe (EPFL), Angelos-Christos Anadiotis (Oracle Zürich), Andrew Lamb (InfluxData), Lucas Lersch (Amazon), Boaz Leskes (MotherDuck), Daniel Ritter (SAP), Pinar Tozun (IT University of Copenhagen)*
Towards Foundation Database Models
Johannes Wehrstein (TU Darmstadt, Carsten Binnig (TU Darmstadt), Fatma Ozcan (Google), Shobha Vasudevan (Google), Yu Gan (Google), Yawen Wang (Google)
Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Platform: Can One QO Rule Them All?
Yuanyuan Tian (Microsoft Gray Systems Lab, Rana Alotaibi (Saudi Authority for Data and AI (SDAIA) - NCAI), Jesús Camacho-Rodríguez (Microsoft), Carlo Curino (Microsoft), Cesar A Galindo-Legaria (Microsoft), Ashit Gosalia (Microsoft), Brian Kroth (Microsoft), Sergiy Matusevych (Microsoft Gray Systems Lab), Nicolas Bruno (Microsoft), Ashvin Agrawal (Microsoft - GSL), Stefan Grafberger (University of Amsterdam), Beysim Sezgin (Microsoft), Milan Potocnik (Microsoft), Mahesh Kumar Behera (Microsoft), Milind Joshi (Microsoft), Xiaoyu Li (Microsoft)
Frequency-Store: Scaling Image AI by A Column-Store for Images
Utku Sirin (Harvard University, Victoria Kauffman (Harvard University), Aadit Saluja (DASLab), Florian Klein (Technical University of Munich), Jeremy E Hsu (Harvard University), Stratos Idreos (Harvard)
Pasha: An Efficient and Scalable Database Architecture on a CXL Pod
Yibo Huang (The University of Texas at Austin), Newton Ni (University of Texas at Austin), Vijay Chidambaram (UT Austin), Dixin Tang (The University of Texas, Austin), Emmett Witchel (The University of Texas at Austin)
Hybrid Querying Over Relational Databases and Large Language
Fuheng Zhao (UCSB), Divy Agrawal (University of California, Santa Barbara), Amr El Abbadi (UC Santa Barbara)
Towards Functional Decomposition of Storage Formats
Martin Prammer (University of Wisconsin – Madison, Xinyu Zeng (Tsinghua University), Ruijun Meng (Tsinghua University), Wes McKinney (Posit PBC), Huanchen Zhang (Tsinghua University), Andrew Pavlo (Carnegie Mellon University), Jignesh Patel (Carnegie Mellon University)
Trampoline-Style Queries for SQL
Louisa Lambrecht (University of Tübingen), Torsten Grust (Universität Tübingen), Altan Birler (TUM), Thomas Neumann (TU Munich)
Flow with FlorDB: Incremental Context Maintenance for the Machine Learning Lifecycle
Rolando Garcia (UC Berkeley, Pragya Kallanagoudar (UC Berkeley), Chithra Anand (UC Berkeley), Sarah Chasins (UC Berkeley), Joseph M Hellerstein (UC Berkeley), Aditya Parameswaran (University of California, Berkeley)
OLTP Through the Looking Glass 16 Years Later: Communication is the New Bottleneck
Xinjing Zhou (Massachusetts Institute of Technology), Viktor Leis (Technische Universität München), Xiangyao Yu (University of Wisconsin-Madison), Michael Stonebraker (MIT)
The Design of an LLM-powered Unstructured Analytics System
Eric Anderson (Aryn Inc.), Jonathan Fritz (Aryn), Austin Lee (Aryn Inc.), Bohou Li (Aryn Inc.), Mark Lindblad (Aryn Inc.), Henry Lindeman (Aryn Inc.), Alex Meyer (Aryn Inc.), Parth Parmar (Aryn Inc.), Tanvi Ranade (Aryn Inc.), Mehul Shah (Aryn Inc.), Benjamin Sowell (Aryn Inc), Dan Tecuci (Aryn Inc.), Vinayak Thapliyal (Aryn Inc.), Matt Welsh (Aryn Inc.)
DPDPU: Data Processing with DPUs
Jason Hu (University of Toronto), Philip A Bernstein (Microsoft Research), Jialin Li (NUS), Qizhen Zhang (University of Toronto)
Cephalopod – Virtual Data Model Composition through Partial Query Translation
David Loughlin (Imperial College), Holger Pirk (Imperial College)
NeurDB: On the Design and Implementation of an AI-powered Autonomous Database
Zhanhao Zhao (National University of Singapore), Shaofeng Cai (National University of Singapore), Haotian Gao (National University of Singapore), Hexiang Pan (National University of Singapore), Siqi Xiang (National University of Singapore), Naili Xing (National University of Singapore), Gang Chen (Zhejiang University), Beng Chin Ooi (NUS), Yanyan Shen (Shanghai Jiao Tong University), Yuncheng Wu (Renmin University of China), Meihui Zhang (Beijing Institute of Technology)
Generic Version Control: Configurable Versioning for Application-Specific Requirements
Gunce S Yilmaz (Saarland University), Jens Dittrich (Saarland University, Saarland Informatics Campus)
AOP: Automated and Interactive LLM Pipeline Orchestration for Answering Complex Queries
Jiayi Wang (Tsinghua University), Guoliang Li (Tsinghua University)
Resource-Adaptive Query Execution with Paged Memory Management
Riki Otaki (University of Chicago, Jun Hyuk Chang (University of Chicago), Charles Benello (University of Chicago), Aaron J Elmore (University of Chicago), Goetz Graefe (Google)
A Case for Ecological Efficiency in Database Server Lifecycles
Thomas Bodner (Hasso Plattner Institute, University of Potsdam), Martin Boissier (Hasso Plattner Institute), Tilmann Rabl (HPI, University of Potsdam), Ricardo Salazar-Díaz (Hasso Plattner Institute, University of Potsdam), Florian Schmeller (Hasso Plattner Institute), Nils Straßenburg (Hasso Plattner Institut), Ilin Tolovski (Hasso Plattner Institute), Marcel Weisgut (HPI, University of Potsdam), wang yue (Hasso Plattner Institute, University of Potsdam)
Rethinking MIMD-SIMD Interplay for Analytical Query Processing in In-Memory Database Engines
Lennart Schmidt (Technische Universität Dresden), Johannes Pietrzyk (TU Dresden), Juliana Hildebrandt (TU Dresden), Alexander Krause (TU Dresden), Dirk Habich (TU Dresden), Wolfgang Lehner (TU Dresden)
Mind the Data Gap: Bridging Large Language Models (LLMs) to Enterprise Data Integration
Moe Kayali (University of Washington), Fabian Wenz (TUM), Nesime Tatbul (Intel Labs and MIT), Cagatay Demiralp (MIT CSAIL)
Transactional Cloud Applications Go with the (Data)Flow
Kyriakos Psarakis (TU Delft), George Christodoulou (TU Delft), Marios Fragkoulis (TU Delft), Asterios Katsifodimos (TU Delft)
Adaptive Factorization Using Linear-Chained Hash Tables
Paul Gross (Centrum Wiskunde & Informatica), Daniel ten Wolde (Centrum Wiskunde & Informatica), Peter Boncz (Centrum Wiskunde & Informatica)
Linear Elastic Caching via Ski Rental
Ravi Kumar (Google), Todd Lipcon (Google), Manish Purohit (Google), Tamas Sarlos (Google Research)
VectraFlow: Integrating Vectors into Stream Processing
Duo Lu (Brown University), Siming Feng (Brown University), Jonathan D Zhou (Brown University), Franco Solleza (Brown University), Malte Schwarzkopf (Brown University), Ugur Cetintemel (Brown University)
Beyond Relations: A Case for Elevating to the Entity-Relationship Abstraction
Amol Deshpande (University of Maryland at College Park)
Databases in the Era of Memory-Centric Computing
Yannis Chronis (Google), Anastasia Ailamaki (EPFL), Lawrence Benson (TUM), Helena Caminal (Google), Jana Giceva (TU Munich), Dave Patterson (Google), Eric Sedlar (Oracle), Lisa Wu Wills (Duke University)
GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise
Karime Maamari (Distyl AI), Connor Landy (Distyl AI), Amine Mhedhbi (Polytechnique Montréal)