Amsterdam

Menu:

Mentorship Sessions

CIDR has been organizing mentorship sessions in the recent years. Attendees who are interested in being a mentor will be paired with a mentee (or a few mentees depending on the demand) for a ~30min session. The mentors decide when this session takes place depending on their availability. Interest in being a mentor or mentee is polled during the registration process.

Sunday

3:00pm

Hotel check-in and conference registration

6:00pm

Dinner

6:00 – 7:30

7:30pm

Social Event Reception hour

Monday

8:45am

Opening and Welcome

Carlo Curino (Microsoft), Gustavo Alonso (ETH Zürich), Sam Madden (MIT)

9:00am

10:00am

Break

10:30am

Break

Session 1: ARE QUERIES STILL A THING?

Chair: TBA
10:30 – 10:50 Trampoline-Style Queries for SQL Louisa Lambrecht (University of Tübingen), Torsten Grust (Universität Tübingen), Altan Birler (TUM), Thomas Neumann (TU Munich)
10:50 – 11:10 Resource-Adaptive Query Execution with Paged Memory Management Riki Otaki (University of Chicago, Jun Hyuk Chang (University of Chicago), Charles Benello (University of Chicago), Aaron J Elmore (University of Chicago), Goetz Graefe (Google)
11:10 – 11:30 Efficient Approximate Query Processing with Block Sampling Yuxuan Zhu (University of Illinois Urbana-Champaign), Daniel Kang (UIUC)
11:30 – 11:50 The Five-Minute Rule for the Cloud: Caching in Analytics Systems Kira Duwe (EPFL), Angelos-Christos Anadiotis (Oracle Zürich), Andrew Lamb (InfluxData), Lucas Lersch (Amazon), Boaz Leskes (MotherDuck), Daniel Ritter (SAP), Pinar Tozun (IT University of Copenhagen)
11:50 – 12:10 Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Platform: Can One QO Rule Them All? Yuanyuan Tian (Microsoft Gray Systems Lab, Rana Alotaibi (Saudi Authority for Data and AI (SDAIA) - NCAI), Jesús Camacho-Rodríguez (Microsoft), Carlo Curino (Microsoft), Cesar A Galindo-Legaria (Microsoft), Ashit Gosalia (Microsoft), Brian Kroth (Microsoft), Sergiy Matusevych (Microsoft Gray Systems Lab), Nicolas Bruno (Microsoft), Ashvin Agrawal (Microsoft - GSL), Stefan Grafberger (University of Amsterdam), Beysim Sezgin (Microsoft), Milan Potocnik (Microsoft), Mahesh Kumar Behera (Microsoft), Milind Joshi (Microsoft), Xiaoyu Li (Microsoft)

12:10pm

Spare time

12:30pm

Lunch

12:30 – 1:30

1:30pm

Session 2: HARDWARE MATTERS

Chair: Pinar Tözun
1:30 – 1:50 Databases in the Era of Memory-Centric Computing Yannis Chronis (Google), Anastasia Ailamaki (EPFL), Lawrence Benson (TUM), Helena Caminal (Google), Jana Giceva (TU Munich), Dave Patterson (Google), Eric Sedlar (Oracle), Lisa Wu Wills (Duke University)
1:50 – 2:10 Rethinking MIMD-SIMD Interplay for Analytical Query Processing in In- Memory Database Engines Lennart Schmidt (Technische Universität Dresden), Johannes Pietrzyk (TU Dresden), Juliana Hildebrandt (TU Dresden), Alexander Krause (TU Dresden), Dirk Habich (TU Dresden), Wolfgang Lehner (TU Dresden)
2:10 – 2:30 Pasha: An Efficient and Scalable Database Architecture on a CXL Pod Yibo Huang (The University of Texas at Austin), Newton Ni (University of Texas at Austin), Vijay Chidambaram (UT Austin), Dixin Tang (The University of Texas, Austin), Emmett Witchel (The University of Texas at Austin)
2:30 – 2:50 DPDPU: Data Processing with DPUs Jason Hu (University of Toronto), Philip A Bernstein (Microsoft Research), Jialin Li (NUS), Qizhen Zhang (University of Toronto)

2:50pm

Test of Time Award

Chair: TBA
Awards Award Talks

3:30pm

Break

4:15pm

Session 3: LLMs ARE THE NEW NO-SQL

Chair: Fatma Özcan (Google) 4:15 – 4:35 Hybrid Querying Over Relational Databases and Large Language Fuheng Zhao (UCSB), Divy Agrawal (University of California, Santa Barbara), Amr El Abbadi (UC Santa Barbara)
4:35 – 4:55 Text2SQL is Not Enough: Unifying AI and Databases with TAG Asim Biswal (University of California, Berkeley), Siddharth Jha (UC Berkeley), Carlos Guestrin (Stanford University), Matei Zaharia (Berkeley and Databricks), Joseph E Gonzalez (UC Berkeley), Amog Kamsetty (UC Berkeley), Shu Liu (University of California Berkeley), Liana Patel (Stanford University)
4:55 – 5:15 Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing Chunwei Liu (MIT), Matthew D Russo (MIT), Michael Cafarella (MIT CSAIL), Lei Cao (University of Arizona), Peter Baile Chen (Massachusetts Institute of Technology), Zui Chen (Tsinghua University), Michael Franklin (University of Chicago), Tim Kraska (MIT), Samuel Madden (MIT), Rana Shahout (Harvard), Gerardo Vitagliano (MIT CSAIL)
5:15 – 5:35 The Design of an LLM-powered Unstructured Analytics System Eric Anderson (Aryn Inc.), Jonathan Fritz (Aryn), Austin Lee (Aryn Inc.), Bohou Li (Aryn Inc.), Mark Lindblad (Aryn Inc.), Henry Lindeman (Aryn Inc.), Alex Meyer (Aryn Inc.), Parth Parmar (Aryn Inc.), Tanvi Ranade (Aryn Inc.), Mehul Shah (Aryn Inc.), Benjamin Sowell (Aryn Inc), Dan Tecuci (Aryn Inc.), Vinayak Thapliyal (Aryn Inc.), Matt Welsh (Aryn Inc.)
5:35 – 5:50 Sponsor Talk 1: SAP

5:50pm

Spare time

6:00pm

Dinner

6:00 – 7:30

7:30pm

Gong Show and Database Quiz

Chair: Andy Pavlo 7:30 – 9:00

9:00pm

Social hour




Tuesday


09:00am

Session 4: RETHINKING DATABASE ARCHITECTURE

Chair: Nesime Tatbul (Intel) 09:00 – 09:20 A Case for Ecological Efficiency in Database Server Lifecycles Thomas Bodner (Hasso Plattner Institute, University of Potsdam), Martin Boissier (Hasso Plattner Institute), Tilmann Rabl (HPI, University of Potsdam), Ricardo Salazar-Díaz (Hasso Plattner Institute, University of Potsdam), Florian Schmeller (Hasso Plattner Institute), Nils Straßenburg (Hasso Plattner Institut), Ilin Tolovski (Hasso Plattner Institute), Marcel Weisgut (HPI, University of Potsdam), Wang Yue (Hasso Plattner Institute, University of Potsdam)
09:20 – 09:40 Beyond Relations: A Case for Elevating to the Entity-Relationship Abstraction Amol Deshpande (University of Maryland at College Park)
09:40 – 10:00 Cephalopod – Virtual Data Model Composition through Partial Query Translation David Loughlin (Imperial College), Holger Pirk (Imperial College)
10:00 – 10:20 OLTP Through the Looking Glass 16 Years Later: Communication is the New Bottleneck Xinjing Zhou (Massachusetts Institute of Technology), Viktor Leis (Technische Universität München), Xiangyao Yu (University of Wisconsin-Madison), Michael Stonebraker (MIT)

10:20pm

Break

10:50am

Session 5: BETTER DATA PROCESSING

Chair: Jana Gičeva (TU Munich) 10:50 – 11:10 Runtime-Extensible Parsers Hannes Mühleisen (DuckDB Labs), Mark Raasveldt (DuckDB Labs)
11:10 – 11:30 Towards Functional Decomposition of Storage Formats Martin Prammer (University of Wisconsin – Madison), Xinyu Zeng (Tsinghua University), Ruijun Meng (Tsinghua University), Wes McKinney (Posit PBC), Huanchen Zhang (Tsinghua University), Andrew Pavlo (Carnegie Mellon University), Jignesh Patel (Carnegie Mellon University)
11:30 – 11:50 Adaptive data transformations for QaaS Dimitrios Koutsoukos (ETHZ), Renato Marroquín (Oracle Labs), Ingo Müller (Google), Ana Klimovic (ETH Zurich)
11:50 – 12:10 Adaptive Factorization Using Linear-Chained Hash Tables Paul Gross (Centrum Wiskunde & Informatica), Daniel ten Wolde (Centrum Wiskunde & Informatica), Peter Boncz (Centrum Wiskunde & Informatica)

12:10pm

Spare time

12:30pm

Lunch

12:30 – 1:30

1:30pm

Spare time

2:30pm

Session 6: CLOUD AND MORE

Chair: Ana Klimovic (ETH) 2:30 – 2:50 Linear Elastic Caching via Ski Rental Ravi Kumar (Google), Todd Lipcon (Google), Manish Purohit (Google), Tamas Sarlos (Google Research)
2:50 – 3:10 VectraFlow: Integrating Vectors into Stream Processing Duo Lu (Brown University), Siming Feng (Brown University), Jonathan D Zhou (Brown University), Franco Solleza (Brown University), Malte Schwarzkopf (Brown University), Ugur Cetintemel (Brown University)
3:10 – 3:30 Generic Version Control: Configurable Versioning for Application- Specific Requirements Gunce S Yilmaz (Saarland University), Jens Dittrich (Saarland University, Saarland Informatics Campus)
3:30 – 3:50 Transactional Cloud Applications Go with the (Data)Flow Kyriakos Psarakis (TU Delft), George Christodoulou (TU Delft), Marios Fragkoulis (TU Delft), Asterios Katsifodimos (TU Delft)

3:50pm

Break

4:15pm

Session 7: DATABASES AND ML

Chair: Madelon Hulsebos (CWI) 4:15 – 4:35 Bullion: A Column Store for Machine Learning Gang Liao (University of Maryland), Ye Liu (Bytedance Inc.), Jianjun Chen (Bytedance), Daniel Abadi (University of Maryland, College Park)
4:35 – 4:55 Frequency-Store: Scaling Image AI by A Column-Store for Images Utku Sirin (Harvard University, Victoria Kauffman (Harvard University), Aadit Saluja (DASLab), Florian Klein (Technical University of Munich), Jeremy E Hsu (Harvard University), Stratos Idreos (Harvard)
4:55 – 5:15 GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise Karime Maamari (Distyl AI), Connor Landy (Distyl AI), Amine Mhedhbi (Polytechnique Montréal)
5:15 – 5:35 NeurDB: On the Design and Implementation of an AI-powered Autonomous Database Zhanhao Zhao (National University of Singapore), Shaofeng Cai (National University of Singapore), Haotian Gao (National University of Singapore), Hexiang Pan (National University of Singapore), Siqi Xiang (National University of Singapore), Naili Xing (National University of Singapore), Gang Chen (Zhejiang University), Beng Chin Ooi (NUS), Yanyan Shen (Shanghai Jiao Tong University), Yuncheng Wu (Renmin University of China), Meihui Zhang (Beijing Institute of Technology)
5:35 – 5:50 Sponsor Talk 2: Where is the Lakehouse heading? Mate Zaharia, Databricks

5:50pm

Spare time

6:00pm

Dinner

6:00 – 7:30

7:30pm

Invited Talk

Open Science: A New Paradigm for the Research Lifecycle and the Role of Computing

Yannis Ioannidis
ACM President, University of Athens, Athena Research Center
Session Chair: TBA

Industry Talks

Chair: Carlo Curino
8:00 – 8:15 Sponsor Talk 3: The Fine Art of Work Skipping Ismael Oukid, Snowflake
8:15 – 8:30 Sponsor Talk 4: AWS
8:30 – 8:45 Sponsor Talk 5: Database Engineering at Salesforce Caetano Sauer, Salesforce
8:45 – 9:00 Sponsor Talk 6: Title TBA

9:00pm

Social hour




Wednesday


9:00am

Keynote 2

Title TBA

9:00 – 10:00 Christos Kozyrakis, Stanford Session Chair: Sam Madden

10:00am

Break

10:30am

Session 8: BEYOND DATABASES

Session Chair: TBA 10:30 – 10:50 OSDB: Exposing the Operating System’s Inner Database Robert Soulé (Yale University), George V Neville-Neil (Yale University), Stelios Kasouridis (Yale University), Alex Yuan (Yale University), Avi Silberschatz (Yale University), Peter Alvaro (UC Santa Cruz)
10:50 – 11:10 Towards Foundation Database Models Johannes Wehrstein (TU Darmstadt), Carsten Binnig (TU Darmstadt), Fatma Ozcan (Google), Shobha Vasudevan (Google), Yu Gan (Google), Yawen Wang (Google)
11:10 – 11:30 AOP: Automated and Interactive LLM Pipeline Orchestration for Answering Complex Queries Jiayi Wang (Tsinghua University), Guoliang Li (Tsinghua University)
11:30 – 11:50 Flow with FlorDB: Incremental Context Maintenance for the Machine Learning Lifecycle Rolando Garcia (UC Berkeley, Pragya Kallanagoudar (UC Berkeley), Chithra Anand (UC Berkeley), Sarah Chasins (UC Berkeley), Joseph M Hellerstein (UC Berkeley), Aditya Parameswaran (University of California, Berkeley)
11:50 – 12:10 Mind the Data Gap: Bridging Large Language Models (LLMs) to Enterprise Data Integration Moe Kayali (University of Washington), Fabian Wenz (TUM), Nesime Tatbul (Intel Labs and MIT), Cagatay Demiralp (MIT CSAIL)

12:10pm

Lunch / End Of CIDR

12:10 – 1:30