Last update:
Mon Nov 10 17:02:05 MST 2025
Mark D. Hill Is transactional memory an oxymoron? . . 1--1
Justin Zobel Databases and the silification of health 2--2
Stephen Blott and
Roger Weber What's wrong with high-dimensional
similarity search? . . . . . . . . . . . 3--3
Nicolas Bruno and
Surajit Chaudhuri Constrained physical design tuning . . . 4--15
Anastasios Kementsietsidis and
Frank Neven and
Dieter Van de Craen and
Stijn Vansummeren Scalable multi-query optimization for
exploratory queries over federated
scientific databases . . . . . . . . . . 16--27
David J. DeWitt and
Erik Paulson and
Eric Robinson and
Jeffrey Naughton and
Joshua Royalty and
Srinath Shankar and
Andrew Krioukov Clustera: an integrated computation and
data management system . . . . . . . . . 28--41
Alvin Cheung and
Samuel Madden Performance profiling with EndoScope, an
acquisitional software monitoring
framework . . . . . . . . . . . . . . . 42--53
Ziv Bar-Yossef and
Maxim Gurevich Mining search engine query logs via
suggestion sampling . . . . . . . . . . 54--65
Mert Akdere and
U\ugur Çetintemel and
Nesime Tatbul Plan-based complex event detection
across distributed sources . . . . . . . 66--77
Alexander Lachmann and
Mirek Riedewald Finding relevant patterns in bursty
sequences . . . . . . . . . . . . . . . 78--89
Hao Cheng and
Kien A. Hua and
Khanh Vu Constrained locally weighted clustering 90--101
Michael Hay and
Gerome Miklau and
David Jensen and
Don Towsley and
Philipp Weis Resisting structural re-identification
in anonymized social networks . . . . . 102--114
Manolis Terrovitis and
Nikos Mamoulis and
Panos Kalnis Privacy-preserving anonymization of
set-valued data . . . . . . . . . . . . 115--125
HweeHwa Pang and
Kyriakos Mouratidis Authenticating the query results of text
search engines . . . . . . . . . . . . . 126--137
Ashish Kundu and
Elisa Bertino Structural signatures for tree data
structures . . . . . . . . . . . . . . . 138--150
Haggai Roitman and
David Carmel and
Elad Yom-Tov Maintaining dynamic channel profiles on
the Web . . . . . . . . . . . . . . . . 151--162
Fan Yang and
Nitin Gupta and
Chavdar Botev and
Elizabeth F. Churchill and
George Levchenko and
Jayavel Shanmugasundaram WYSIWYG development of data driven Web
applications . . . . . . . . . . . . . . 163--175
Eda Baykan and
Monika Henzinger and
Ingmar Weber Web page language identification based
on URLs . . . . . . . . . . . . . . . . 176--187
Wook-Shin Han and
Wooseong Kwak and
Jinsoo Lee and
Guy M. Lohman and
Volker Markl Parallelizing query optimization . . . . 188--200
Marios Hadjieleftheriou and
Xiaohui Yu and
Nick Koudas and
Divesh Srivastava Hashed samples: selectivity estimators
for set similarity selection queries . . 201--212
Edith Cohen and
Haim Kaplan Tighter estimation using bottom $k$
sketches . . . . . . . . . . . . . . . . 213--229
Bogdan Alexe and
Wang-Chiew Tan and
Yannis Velegrakis STBenchmark: towards a benchmark for
mapping systems . . . . . . . . . . . . 230--244
Yannis Katsis and
Alin Deutsch and
Yannis Papakonstantinou Interactive source registration in
community-oriented information
integration . . . . . . . . . . . . . . 245--259
Mauricio A. Hernández and
Paolo Papotti and
Wang-Chiew Tan Data exchange with data-metadata
translations . . . . . . . . . . . . . . 260--273
Jin Li and
Kristin Tufte and
Vladislav Shkapenyuk and
Vassilis Papadimos and
Theodore Johnson and
David Maier Out-of-order processing: a new
architecture for high-performance stream
systems . . . . . . . . . . . . . . . . 274--288
Wook-Shin Han and
Haifeng Jiang and
Howard Ho and
Quanzhong Li StreamTX: extracting tuples from
streaming XML data . . . . . . . . . . . 289--300
Cheqing Jin and
Ke Yi and
Lei Chen and
Jeffrey Xu Yu and
Xuemin Lin Sliding-window top-$k$ queries on
uncertain streams . . . . . . . . . . . 301--312
Christoph Koch and
Dan Olteanu Conditioning probabilistic databases . . 313--325
George Beskales and
Mohamed A. Soliman and
Ihab F. Ilyas Efficient search for the top-$k$
probable nearest neighbors in uncertain
databases . . . . . . . . . . . . . . . 326--339
Daisy Zhe Wang and
Eirinaios Michelakis and
Minos Garofalakis and
Joseph M. Hellerstein BayesStore: managing large, uncertain
data repositories with probabilistic
graphical models . . . . . . . . . . . . 340--351
Daniel Deutch and
Tova Milo Type inference and type checking for
queries on execution traces . . . . . . 352--363
Haichuan Shang and
Ying Zhang and
Xuemin Lin and
Jeffrey Xu Yu Taming verification hardness: an
efficient algorithm for testing subgraph
isomorphism . . . . . . . . . . . . . . 364--375
Lukasz Golab and
Howard Karloff and
Flip Korn and
Divesh Srivastava and
Bei Yu On generating near-optimal tableaux for
conditional functional dependencies . . 376--390
Wenfei Fan and
Shuai Ma and
Yanli Hu and
Jie Liu and
Yinghui Wu Propagating functional dependencies with
conditions . . . . . . . . . . . . . . . 391--407
Ioannis Antonellis and
Hector Garcia Molina and
Chi Chao Chang Simrank++: query rewriting through link
analysis of the click graph . . . . . . 408--421
Dmitry Lizorkin and
Pavel Velikhov and
Maxim Grinev and
Denis Turdakov Accuracy estimate and optimization
techniques for SimRank computation . . . 422--433
Badrish Chandramouli and
Jun Yang End-to-end support for joins in
large-scale publish/subscribe systems 434--450
Ashwin Machanavajjhala and
Erik Vee and
Minos Garofalakis and
Jayavel Shanmugasundaram Scalable ranked publish/subscribe . . . 451--462
Jens Teubner and
Torsten Grust and
Sebastian Maneth and
Sherif Sakr Dependable cardinality forecasts for
XQuery . . . . . . . . . . . . . . . . . 463--477
Hongzhi Wang and
Jianzhong Li and
Jizhou Luo and
Hong Gao Hash-base subgraph query processing
method for graph-structured XML
documents . . . . . . . . . . . . . . . 478--489
Sara Cohen Generating XML structure using examples
and constraints . . . . . . . . . . . . 490--501
Allison L. Holloway and
David J. DeWitt Read-optimized databases, in depth . . . 502--513
Ioannis Koltsidas and
Stratis D. Viglas Flashing up the storage layer . . . . . 514--525
Russell Sears and
Mark Callaghan and
Eric Brewer Rose: compressed, log-structured
replication . . . . . . . . . . . . . . 526--537
Michael J. Cafarella and
Alon Halevy and
Daisy Zhe Wang and
Eugene Wu and
Yang Zhang WebTables: exploring the power of tables
on the Web . . . . . . . . . . . . . . . 538--549
Charles Garrod and
Amit Manjhi and
Anastasia Ailamaki and
Bruce Maggs and
Todd Mowry and
Christopher Olston and
Anthony Tomasic Scalable query result caching for Web
applications . . . . . . . . . . . . . . 550--561
Daniele Braga and
Stefano Ceri and
Florian Daniel and
Davide Martinenghi Optimization of multi-domain queries on
the Web . . . . . . . . . . . . . . . . 562--573
YongChul Kwon and
Magdalena Balazinska and
Albert Greenberg Fault-tolerant stream processing using a
distributed, replicated file system . . 574--585
Mi-Yen Yeh and
Kun-Lung Wu and
Philip S. Yu and
Ming-Syan Chen LeeWave: level-wise distribution of
wavelet coefficients for processing $k$
NN queries over distributed streams . . 586--597
Marcos K. Aguilera and
Wojciech Golab and
Mehul A. Shah A practical scalable distributed B-tree 598--609
Lin Qiao and
Vijayshankar Raman and
Frederick Reiss and
Peter J. Haas and
Guy M. Lohman Main-memory scan sharing for multi-core
CPUs . . . . . . . . . . . . . . . . . . 610--621
Ryan Johnson and
Vijayshankar Raman and
Richard Sidle and
Garret Swart Row-wise parallel predicate evaluation 622--634
Gokul Soundararajan and
Jin Chen and
Mohamed A. Sharaf and
Cristiana Amza Dynamic partitioning of the cache
hierarchy in shared data centers . . . . 635--646
Thomas Neumann and
Gerhard Weikum RDF-3X: a RISC-style engine for RDF . . 647--659
Alkis Simitsis and
Akanksha Baid and
Yannis Sismanis and
Berthold Reinwald Multidimensional content eXploration . . 660--671
Marcus Fontoura and
Vanja Josifovski and
Ravi Kumar and
Christopher Olston and
Andrew Tomkins and
Sergei Vassilvitskii Relaxation in text search using
taxonomies . . . . . . . . . . . . . . . 672--683
Hoa Nguyen and
Thanh Nguyen and
Juliana Freire Learning to extract form labels . . . . 684--694
Magesh Jayapandian and
H. V. Jagadish Automated creation of a forms-based
database query interface . . . . . . . . 695--709
Sihem Amer Yahia and
Michael Benedikt and
Laks V. S. Lakshmanan and
Julia Stoyanovich Efficient network aware search in
collaborative tagging sites . . . . . . 710--721
Reynold Cheng and
Jinchuan Chen and
Xike Xie Cleaning uncertain data with quality
guarantees . . . . . . . . . . . . . . . 722--735
Jiansheng Huang and
Ting Chen and
AnHai Doan and
Jeffrey F. Naughton On the provenance of non-answers to
queries over extracted data . . . . . . 736--747
Shenghuo Zhu and
Tao Li and
Zhiyuan Chen and
Dingding Wang and
Yihong Gong Dynamic active probing of helpdesk
databases . . . . . . . . . . . . . . . 748--760
Carlo A. Curino and
Hyun J. Moon and
Carlo Zaniolo Graceful database schema evolution: the
PRISM workbench . . . . . . . . . . . . 761--772
Xiaoyong Chai and
Mayssam Sayyadian and
AnHai Doan and
Arnon Rosenthal and
Len Seligman Analyzing and revising data integration
schemas to improve their matchability 773--784
Partha Pratim Talukdar and
Marie Jacob and
Muhammad Salman Mehmood and
Koby Crammer and
Zachary G. Ives and
Fernando Pereira and
Sudipto Guha Learning to create data-integrating
queries . . . . . . . . . . . . . . . . 785--796
Christopher Ré and
Dan Suciu Approximate lineage for probabilistic
databases . . . . . . . . . . . . . . . 797--808
Prithviraj Sen and
Amol Deshpande and
Lise Getoor Exploiting shared correlations in
probabilistic databases . . . . . . . . 809--820
Vibhor Rastogi and
Dan Suciu and
Evan Welbourne Access control over uncertain data . . . 821--832
Graham Cormode and
Divesh Srivastava and
Ting Yu and
Qing Zhang Anonymizing bipartite graph data using
safe groupings . . . . . . . . . . . . . 833--844
Yingyi Bu and
Ada Wai Chee Fu and
Raymond Chi Wing Wong and
Lei Chen and
Jiuyong Li Privacy preserving serial data
publishing by role composition . . . . . 845--856
Xiaokui Xiao and
Yufei Tao Output perturbation with query
relaxation . . . . . . . . . . . . . . . 857--869
David Lomet and
Mingsheng Hong and
Rimma Nehme and
Rui Zhang Transaction time indexing with version
compression . . . . . . . . . . . . . . 870--881
Hyun J. Moon and
Carlo A. Curino and
Alin Deutsch and
Chien-Yi Hou and
Carlo Zaniolo Managing and querying transaction-time
databases under schema evolution . . . . 882--895
Reza Sherkat and
Davood Rafiei On efficiently searching trajectories
and archival data for historical
similarities . . . . . . . . . . . . . . 896--908
Ken Q. Pu and
Xiaohui Yu Keyword query cleaning . . . . . . . . . 909--920
Ziyang Liu and
Yi Cher Reasoning and identifying relevant
matches for XML keyword search . . . . . 921--932
Chuan Xiao and
Wei Wang and
Xuemin Lin Ed-Join: an efficient algorithm for
similarity joins with edit distance
constraints . . . . . . . . . . . . . . 933--944
Sanjay Agrawal and
Kaushik Chakrabarti and
Surajit Chaudhuri and
Venkatesh Ganti Scalable ad-hoc entity extraction from
text collections . . . . . . . . . . . . 945--957
Parag Agrawal and
Daniel Kifer and
Christopher Olston Scheduling shared scans of large data
files . . . . . . . . . . . . . . . . . 958--969
Suman Nath and
Phillip B. Gibbons Online maintenance of very large random
samples on flash storage . . . . . . . . 970--983
Tingjian Ge and
Stan Zdonik A skip-list approach for efficiently
processing forecasting queries . . . . . 984--995
Thomas Phan and
Wen-Syan Li A request-routing framework for
SOA-based enterprise computing . . . . . 996--1007
Cathrin Weiss and
Panagiotis Karras and
Abraham Bernstein Hexastore: sextuple indexing for
Semantic Web data management . . . . . . 1008--1019
Cyrus Shahabi and
Lu-An Tang and
Songhua Xing Indexing land surface for efficient kNN
query . . . . . . . . . . . . . . . . . 1020--1031
Raymond Chi-Wing Wong and
Ada Wai-Chee Fu and
Jian Pei and
Yip Sing Ho and
Tai Wong and
Yubao Liu Efficient skyline querying with variable
user preferences on nominal attributes 1032--1043
Lin Guo and
Sihem Amer Yahia and
Raghu Ramakrishnan and
Jayavel Shanmugasundaram and
Utkarsh Srivastava and
Erik Vee Efficient top-$k$ processing over
query-dependent functions . . . . . . . 1044--1055
Wei Wu and
Fei Yang and
Chee-Yong Chan and
Kian-Lee Tan FINCH: evaluating reverse
$k$-Nearest-Neighbor queries on location
data . . . . . . . . . . . . . . . . . . 1056--1067
Hoyoung Jeung and
Man Lung Yiu and
Xiaofang Zhou and
Christian S. Jensen and
Heng Tao Shen Discovery of convoys in trajectory
databases . . . . . . . . . . . . . . . 1068--1080
Jae-Gil Lee and
Jiawei Han and
Xiaolei Li and
Hector Gonzalez TraClass: trajectory classification
using hierarchical region-based and
trajectory-based clustering . . . . . . 1081--1094
Sarana Nutanong and
Rui Zhang and
Egemen Tanin and
Lars Kulik The V*-Diagram: a query-dependent
approach to moving KNN queries . . . . . 1095--1106
Ravindra Guravannavar and
S. Sudarshan Rewriting procedures for batched
bindings . . . . . . . . . . . . . . . . 1107--1123
Harish D. and
Pooja N. Darera and
Jayant R. Haritsa Identifying robust plans through plan
diagram reduction . . . . . . . . . . . 1124--1140
Surajit Chaudhuri and
Vivek Narasayya and
Ravi Ramamurthy A pay-as-you-go framework for query
execution feedback . . . . . . . . . . . 1141--1152
Tyson Condie and
David Chu and
Joseph M. Hellerstein and
Petros Maniatis Evita raced: metacompilation for
declarative networks . . . . . . . . . . 1153--1165
Fei Chiang and
Renée J. Miller Discovering data quality rules . . . . . 1166--1177
Xiang Zhang and
Feng Pan and
Wei Wang and
Andrew Nobel Mining non-redundant high order
correlations in binary data . . . . . . 1178--1188
Bhavana Bharat Dalvi and
Meghana Kshirsagar and
S. Sudarshan Keyword search on external memory data
graphs . . . . . . . . . . . . . . . . . 1189--1204
Ioannis Koltsidas and
Heiko Müller and
Stratis D. Viglas Sorting hierarchical data in external
memory for archiving . . . . . . . . . . 1205--1216
Ahmed Metwally and
Fatih Emekçi and
Divyakant Agrawal and
Amr El Abbadi SLEUTH: Single-pubLisher attack
dEtection Using correlaTion Hunting . . 1217--1228
Meikel Poess and
Raghunath Othayoth Nambiar Energy cost, the key challenge of
today's data centers: a power
consumption analysis of TPC-C results 1229--1240
Jayant Madhavan and
David Ko and
Lucja Kot and
Vignesh Ganapathy and
Alex Rasmussen and
Alon Halevy Google's Deep Web crawl . . . . . . . . 1241--1252
Melanie Weis and
Felix Naumann and
Ulrich Jehle and
Jens Lufter and
Holger Schuster Industry-scale duplicate detection . . . 1253--1264
Ronnie Chaiken and
Bob Jenkins and
Per-Åke Larson and
Bill Ramsey and
Darren Shakib and
Simon Weaver and
Jingren Zhou SCOPE: easy and efficient parallel
processing of massive data sets . . . . 1265--1276
Brian F. Cooper and
Raghu Ramakrishnan and
Utkarsh Srivastava and
Adam Silberstein and
Philip Bohannon and
Hans-Arno Jacobsen and
Nick Puz and
Daniel Weaver and
Ramana Yerneni PNUTS: Yahoo!'s hosted data serving
platform . . . . . . . . . . . . . . . . 1277--1288
Srini Acharya and
Peter Carlin and
Cesar Galindo-Legaria and
Krzysztof Kozielczyk and
Pawel Terlecki and
Peter Zabback Relational support for flexible schema
scenarios . . . . . . . . . . . . . . . 1289--1300
Niloy Mukherjee and
Bharath Aleti and
Amit Ganesh and
Krishna Kunchithapadam and
Scott Lynn and
Sujatha Muthulingam and
Kam Shergill and
Shaoyu Wang and
Wei Zhang Oracle SecureFiles System . . . . . . . 1301--1312
Jatin Chhugani and
Anthony D. Nguyen and
Victor W. Lee and
William Macy and
Mostafa Hagog and
Yen-Kuang Chen and
Akram Baransi and
Sanjeev Kumar and
Pradeep Dubey Efficient implementation of sorting on
multi-core SIMD CPU architecture . . . . 1313--1324
Atreyee Dey and
Sourjya Bhaumik and
Harish D. and
Jayant R. Haritsa Efficiently approximating query
optimizer plan diagrams . . . . . . . . 1325--1336
Dominik \'Sl\kezak and
Jakub Wróblewski and
Victoria Eastwood and
Piotr Synak Brighthouse: an analytic data warehouse
for ad-hoc queries . . . . . . . . . . . 1337--1345
Mohamed Ziauddin and
Dinesh Das and
Hong Su and
Yali Zhu and
Khaled Yagoub Optimizer plan change management:
improved stability and performance in
Oracle 11g . . . . . . . . . . . . . . . 1346--1355
Zhen Hua Liu and
Sivasankaran Chandrasekar and
Thomas Baby and
Hui J. Chang Towards a physical XML independent
XQuery/SQL/XML engine . . . . . . . . . 1356--1367
Allison W. Lee and
Mohamed Zait Closing the query processing loop in
Oracle 11g . . . . . . . . . . . . . . . 1368--1378
Namit Jain and
Shailendra Mishra and
Anand Srinivasan and
Johannes Gehrke and
Jennifer Widom and
Hari Balakrishnan and
U\ugur Çetintemel and
Mitch Cherniack and
Richard Tibbetts and
Stan Zdonik Towards a streaming SQL standard . . . . 1379--1390
Yu Huang and
Ziyang Liu and
Yi Chen eXtract: a snippet generation system for
XML search . . . . . . . . . . . . . . . 1392--1395
James F. Terwilliger and
Sergey Melnik and
Philip A. Bernstein Language-integrated querying of XML data
in SQL server . . . . . . . . . . . . . 1396--1399
Christian Mathis and
Andreas M. Weiner and
Theo Härder and
Caesar Ralf Franz Hoppen XTCcmp: XQuery compilation on XTC . . . 1400--1403
Yuanyuan Tian and
Jignesh M. Patel and
Viji Nair and
Sebastian Martini and
Matthias Kretzler Periscope/GQ: a graph querying toolkit 1404--1407
Andrey Balmin and
Latha Colby and
Emiran Curtmola and
Quanzhong Li and
Fatma Özcan and
Sharath Srinivas and
Zografoula Vagena SEDA: a system for search, exploration,
discovery, and analysis of XML Data . . 1408--1411
Hamid Motahari and
Boualem Benatallah and
Regis Saint-Paul and
Fabio Casati and
Periklis Andritsos Process spaceship: discovering and
exploring process views from event logs
in data spaces . . . . . . . . . . . . . 1412--1415
Mihai Lupu and
Y. C. Tay P 3 N: profiling the potential of a
peer-based data management system . . . 1416--1419
Mounir Tlili and
W. Kokou Dedzoe and
Esther Pacitti and
Patrick Valduriez and
Reza Akbarinia and
Pascal Molli and
Gérôme Canals and
Stéphane Lauri\`ere P2P logging and timestamping for
reconciliation . . . . . . . . . . . . . 1420--1423
Toan Luu and
Gleb Skobeltsyn and
Fabius Klemm and
Maroje Puh and
Ivana Podnar Zarko and
Martin Rajman and
Karl Aberer AlvisP2P: scalable peer-to-peer text
retrieval in a structured P2P network 1424--1427
S. Abiteboul and
T. Allard and
P. Chatalic and
G. Gardarin and
A. Ghitescu and
F. Goasdoué and
I. Manolescu and
B. Nguyen and
M. Ouazara and
A. Somani and
N. Travers and
G. Vasile and
S. Zoupanos WebContent: efficient P2P Warehousing of
Web data . . . . . . . . . . . . . . . . 1428--1431
Pawel Jurczyk and
Li Xiong DObjects: enabling distributed data
services for metacomputing platforms . . 1432--1435
Qihong Shao and
Yi Chen and
Shu Tao and
Xifeng Yan and
Nikos Anerousis EasyTicket: a ticket routing
recommendation engine for enterprise
problem resolution . . . . . . . . . . . 1436--1439
Cristian Duda and
Gianni Frey and
Donald Kossmann and
Chong Zhou AJAXSearch: crawling, indexing and
searching Web 2.0 applications . . . . . 1440--1443
Kun Liu and
Evimaria Terzi and
Tyrone Grandison ManyAspects: a system for highlighting
diverse concepts in documents . . . . . 1444--1447
Emiran Curtmola and
Alin Deutsch and
Dionysios Logothetis and
K. K. Ramakrishnan and
Divesh Srivastava and
Kenneth Yocum XTreeNet: democratic community search 1448--1451
Guoliang Li and
Jianhua Feng and
Jianyong Wang and
Lizhu Zhou An effective and versatile keyword
search engine on heterogeneous data
sources . . . . . . . . . . . . . . . . 1452--1455
Akanksha Baid and
Andrey Balmin and
Heasoo Hwang and
Erik Nijkamp and
Jun Rao and
Berthold Reinwald and
Alkis Simitsis and
Yannis Sismanis and
Frank van Ham DBPubs: multidimensional exploration of
database publications . . . . . . . . . 1456--1459
Wenfei Fan and
Floris Geerts and
Xibei Jia Semandaq: a data quality system based on
conditional functional dependencies . . 1460--1463
Yannis Katsis and
Alin Deutsch and
Yannis Papakonstantinou and
Keliang Zhao RIDE: a tool for interactive source
registration in community-oriented
information integration . . . . . . . . 1464--1467
Bogdan Alexe and
Wang-Chiew Tan and
Yannis Velegrakis Comparing and evaluating mapping systems
with STBenchmark . . . . . . . . . . . . 1468--1471
Dionysios Logothetis and
Kenneth Yocum Ad-hoc data processing in the cloud . . 1472--1475
Felix Weigel and
Biswanath Panda and
Mirek Riedewald and
Johannes Gehrke and
Manuel Calimlim Large-scale collaborative analysis and
extraction of Web data . . . . . . . . . 1476--1479
Tom Crecelius and
Mouna Kacimi and
Sebastian Michel and
Thomas Neumann and
Josiane Xavier Parreira and
Ralf Schenkel and
Gerhard Weikum Making SENSE: socially enhanced search
and exploration . . . . . . . . . . . . 1480--1483
Wentian Lu and
Gerome Miklau AuditGuard: a system for database
auditing under retention restrictions 1484--1487
Ling Hu and
Kenneth A. Ross and
Yuan-Chi Chang and
Christian A. Lang and
Donghui Zhang QueryScope: visualizing queries for
repeatable database tuning . . . . . . . 1488--1491
Katja Hose and
Daniel Klan and
Matthias Marx and
Kai-Uwe Sattler When is it time to rethink the aggregate
configuration of your OLAP server? . . . 1492--1495
Robert Kallman and
Hideaki Kimura and
Jonathan Natkins and
Andrew Pavlo and
Alexander Rasin and
Stanley Zdonik and
Evan P. C. Jones and
Samuel Madden and
Michael Stonebraker and
Yang Zhang and
John Hugg and
Daniel J. Abadi H-store: a high-performance, distributed
main memory transaction processing
system . . . . . . . . . . . . . . . . . 1496--1499
Eric Perlman and
Randal Burns and
Michael Kazhdan Organizing and indexing non-convex
regions . . . . . . . . . . . . . . . . 1500--1503
Eric Paquet and
Herna L. Viktor Capri/MR: exploring protein databases
from a structural and physicochemical
point of view . . . . . . . . . . . . . 1504--1507
Fan Guo and
Lei Li and
Christos Faloutsos and
Eric P. Xing C-DEM: a multi-modal query system for
Drosophila Embryo databases . . . . . . 1508--1511
Tova Milo and
Daniel Deutch Querying and monitoring distributed
business processes . . . . . . . . . . . 1512--1515
Michael Franklin and
Alon Halevy and
David Maier A first tutorial on dataspaces . . . . . 1516--1517
Enrico Franconi Ontologies and databases: myths and
challenges . . . . . . . . . . . . . . . 1518--1519
Magdalena Balazinska and
Christopher Ré and
Dan Suciu Systems aspects of probabilistic data
management . . . . . . . . . . . . . . . 1520--1521
Wenfei Fan and
Floris Geerts and
Xibei Jia A revival of integrity constraints for
data cleaning . . . . . . . . . . . . . 1522--1523
Mirella M. Moro and
Zografoula Vagena and
Vassilis J. Tsotras XML Structural Summaries . . . . . . . . 1524--1525
Mohamed A. Sharaf and
Alexandros Labrinidis and
Panos K. Chrysanthis Scheduling continuous queries in data
stream management systems . . . . . . . 1526--1527
Hans-Peter Kriegel and
Peer Kröger and
Arthur Zimek Detecting clusters in moderate-to-high
dimensional data: subspace clustering,
pattern-based clustering, and
correlation clustering . . . . . . . . . 1528--1529
Graham Cormode and
Marios Hadjieleftheriou Finding frequent items in data streams 1530--1541
Hui Ding and
Goce Trajcevski and
Peter Scheuermann and
Xiaoyue Wang and
Eamonn Keogh Querying and mining of time series data:
experimental comparison of
representations and distance measures 1542--1552
Lefteris Sidirourgos and
Romulo Goncalves and
Martin Kersten and
Niels Nes and
Stefan Manegold Column-store support for RDF data
management: not all swans are white . . 1553--1563
Virginie Sans and
Dominique Laurent Prefix based numbering schemes for XML:
techniques, applications and
performances . . . . . . . . . . . . . . 1564--1573
Su Chen and
Christian S. Jensen and
Dan Lin A benchmark for evaluating moving object
indexes . . . . . . . . . . . . . . . . 1574--1585
Jens Dittrich and
Lukas Blunschi and
Marcos Antonio Vaz Salles Dwarfs in the rearview mirror: how big
are they really? . . . . . . . . . . . . 1586--1597
Jie Shao and
Heng Tao Shen and
Xiaofang Zhou Challenges and techniques for effective
and efficient similarity search in large
video databases . . . . . . . . . . . . 1598--1603
Frank Hopfgartner Studying interaction methodologies in
video retrieval . . . . . . . . . . . . 1604--1608
David Lo and
Siau-Cheng Khoo Mining patterns and rules for software
specification discovery . . . . . . . . 1609--1616
Laurynas Biveinis and
Simonas Saltenis Towards efficient main-memory use for
optimum tree index update . . . . . . . 1617--1622
Alexander Holupirek and
Marc H. Scholl Implementing filesystems by tree-aware
DBMSs . . . . . . . . . . . . . . . . . 1623--1630
Artin Avanes and
Johann-Christoph Freytag Adaptive workflow scheduling under
resource allocation constraints and
network dynamics . . . . . . . . . . . . 1631--1637
Sergej Zerr and
Wolfgang Nejdl Privacy preserving document indexing
infrastructure for a distributed
environment . . . . . . . . . . . . . . 1638--1643
Jiajia Miao GS-TMS: a global stream-based threat
monitor system . . . . . . . . . . . . . 1644--1651
Evgeny Kharlamov and
Werner Nutt Incompleteness in information
integration . . . . . . . . . . . . . . 1652--1658
Daniel Deutch and
Tova Milo Querying Web-based applications under
models of uncertainty . . . . . . . . . 1659--1665
Panu Silvasti and
Seppo Sippu and
Eljas Soisalon-Soininen XML-document-filtering automaton . . . . 1666--1671
Tobias Scholl and
Alfons Kemper Community-driven data grids . . . . . . 1672--1677
Wolfgang Gatterbauer and
Magdalena Balazinska and
Nodira Khoussainova and
Dan Suciu Believe it or not: adding belief
annotations to databases . . . . . . . . 1--12
Zhenjie Zhang and
Beng Chin Ooi and
Srinivasan Parthasarathy and
Anthony K. H. Tung Similarity search on Bregman divergence:
towards non-metric indexing . . . . . . 13--24
Zhiping Zeng and
Anthony K. H. Tung and
Jianyong Wang and
Jianhua Feng and
Lizhu Zhou Comparing stars: on approximating graph
edit distance . . . . . . . . . . . . . 25--36
Steven Euijong Whang and
Hector Garcia-Molina and
Chad Brower and
Jayavel Shanmugasundaram and
Sergei Vassilvitskii and
Erik Vee and
Ramana Yerneni Indexing Boolean expressions . . . . . . 37--48
Yongluan Zhou and
Ali Salehi and
Karl Aberer Scalable delivery of stream query result 49--60
Michael Benedikt and
James Cheney Schema-based independence analysis for
XML updates . . . . . . . . . . . . . . 61--72
Rimma V. Nehme and
Elke A. Rundensteiner and
Elisa Bertino Tagging stream data for rich real-time
services . . . . . . . . . . . . . . . . 73--84
Atish Das Sarma and
Ashwin Lall and
Danupon Nanongkai and
Jun Xu Randomized multi-pass streaming skyline
algorithms . . . . . . . . . . . . . . . 85--96
Galen Reeves and
Jie Liu and
Suman Nath and
Feng Zhao Managing massive time series streams
with multi-scale compressed trickles . . 97--108
Tianyi Wu and
Dong Xin and
Qiaozhu Mei and
Jiawei Han Promotion analysis in multi-dimensional
space . . . . . . . . . . . . . . . . . 109--120
Nikos Sarkas and
Nilesh Bansal and
Gautam Das and
Nick Koudas Measure-driven keyword-query expansion 121--132
Bin Liu and
H. V. Jagadish Using trees to depict a forest . . . . . 133--144
Hazem Elmeleegy and
Ahmed K. Elmagarmid and
Emmanuel Cecchet and
Walid G. Aref and
Willy Zwaenepoel Online piece-wise linear approximation
of numerical streams with precision
guarantees . . . . . . . . . . . . . . . 145--156
Mirco Stern and
Erik Buchmann and
Klemens Böhm A wavelet transform for efficient
consolidation of sensor relations with
quality guarantees . . . . . . . . . . . 157--168
Liu Yu and
Jianzhong Li and
Hong Gao and
Xiaolin Fang Enabling $ \epsilon $-approximate
querying in sensor networks . . . . . . 169--180
Arnab Nandi and
Philip A. Bernstein HAMSTER: using search clicklogs for
schema and taxonomy matching . . . . . . 181--192
Lucja Kot and
Christoph Koch Cooperative update exchange in the
Youtopia system . . . . . . . . . . . . 193--204
Panagiotis Papapetrou and
Vassilis Athitsos and
George Kollios and
Dimitrios Gunopulos Reference-based alignment in large
sequence databases . . . . . . . . . . . 205--216
Sudipto Das and
Shyam Antony and
Divyakant Agrawal and
Amr El Abbadi Thread cooperation in multicore
architectures for frequency counting
over multiple data streams . . . . . . . 217--228
Rene Mueller and
Jens Teubner and
Gustavo Alonso Streams on wires: a query compiler for
FPGAs . . . . . . . . . . . . . . . . . 229--240
Badrish Chandramouli and
Jonathan Goldstein and
David Maier On-the-fly progress detection in
iterative stream queries . . . . . . . . 241--252
Tim Kraska and
Martin Hentschel and
Gustavo Alonso and
Donald Kossmann Consistency rationing in the cloud: pay
only when it matters . . . . . . . . . . 253--264
David Lomet and
Mohamed F. Mokbel Locking key ranges with unbundled
transaction services . . . . . . . . . . 265--276
George Candea and
Neoklis Polyzotis and
Radek Vingralek A scalable, predictable join operator
for highly concurrent data warehouses 277--288
Rahul Gupta and
Sunita Sarawagi Answering table augmentation queries
from unstructured lists on the Web . . . 289--300
Bogdan Cautis and
Alin Deutsch and
Nicola Onose and
Vasilis Vassalos Efficient rewriting of XPath queries
using Query Set Specifications . . . . . 301--312
Ziyang Liu and
Peng Sun and
Yi Chen Structured search result differentiation 313--324
Eduard C. Dragut and
Thomas Kabisch and
Clement Yu and
Ulf Leser A hierarchical approach to model Web
query interfaces for Web source
integration . . . . . . . . . . . . . . 325--336
Gao Cong and
Christian S. Jensen and
Dingming Wu Efficient retrieval of the top-$k$ most
relevant spatial Web objects . . . . . . 337--348
Eduard Dragut and
Fang Fang and
Prasad Sistla and
Clement Yu and
Weiyi Meng Stop word and related problems in Web
interface integration . . . . . . . . . 349--360
Devesh Agrawal and
Deepak Ganesan and
Ramesh Sitaraman and
Yanlei Diao and
Shashi Singh Lazy-Adaptive Tree: an optimized index
structure for flash devices . . . . . . 361--372
Rubao Lee and
Xiaoning Ding and
Feng Chen and
Qingda Lu and
Xiaodong Zhang MCC-DB: minimizing cache conflicts in
multi-core processors for databases . . 373--384
Thomas Willhalm and
Nicolae Popovici and
Yazan Boshmaf and
Hasso Plattner and
Alexander Zeier and
Jan Schaffner SIMD-scan: ultra fast in-memory table
scan using on-chip vector processing
units . . . . . . . . . . . . . . . . . 385--394
Surajit Chaudhuri and
Venkatesh Ganti and
Dong Xin Mining document collections to
facilitate accurate approximate entity
matching . . . . . . . . . . . . . . . . 395--406
Wenfei Fan and
Xibei Jia and
Jianzhong Li and
Shuai Ma Reasoning about record matching rules 407--418
Alin Dobra and
Chris Jermaine and
Florin Rusu and
Fei Xu Turbo-charging estimate convergence in
DBO . . . . . . . . . . . . . . . . . . 419--430
Edith Cohen and
Nick Duffield and
Haim Kaplan and
Carsten Lund and
Mikkel Thorup Composable, scalable, and accurate
weight summarization of unaggregated
data sets . . . . . . . . . . . . . . . 431--442
Sai Wu and
Shouxu Jiang and
Beng Chin Ooi and
Kian-Lee Tan Distributed online aggregations . . . . 443--454
Georgia Koloniari and
Evaggelia Pitoura A recall-based cluster formation game in
peer-to-peer systems . . . . . . . . . . 455--466
Alan Fekete and
Shirley N. Goldrei and
Jorge Pérez Asenjo Quantifying isolation anomalies . . . . 467--478
Ryan Johnson and
Ippokratis Pandis and
Anastasia Ailamaki Improving OLTP scalability using
speculative lock inheritance . . . . . . 479--489
Russell Sears and
Eric Brewer Segment-based recovery: write-ahead
logging revisited . . . . . . . . . . . 490--501
Jian Li and
Barna Saha and
Amol Deshpande A unified approach to ranking in
probabilistic databases . . . . . . . . 502--513
Arvind Arasu and
Surajit Chaudhuri and
Raghav Kaushik Learning string transformations from
examples . . . . . . . . . . . . . . . . 514--525
Graham Cormode and
Antonios Deligiannakis and
Minos Garofalakis and
Andrew McGregor Probabilistic histograms for
probabilistic data . . . . . . . . . . . 526--537
Ohad Greenshpan and
Tova Milo and
Neoklis Polyzotis Autocompletion for mashups . . . . . . . 538--549
Xin Luna Dong and
Laure Berti-Equille and
Divesh Srivastava Integrating conflicting data: the role
of source dependence . . . . . . . . . . 550--561
Xin Luna Dong and
Laure Berti-Equille and
Divesh Srivastava Truth discovery and copying detection in
a dynamic world . . . . . . . . . . . . 562--573
Lukasz Golab and
Howard Karloff and
Flip Korn and
Avishek Saha and
Divesh Srivastava Sequential dependencies . . . . . . . . 574--585
Dimitar Denev and
Arturas Mazeika and
Marc Spaniol and
Gerhard Weikum SHARC: framework for quality-conscious
Web archiving . . . . . . . . . . . . . 586--597
George Beskales and
Mohamed A. Soliman and
Ihab F. Ilyas and
Shai Ben-David Modeling and querying possible repairs
in duplicate detection . . . . . . . . . 598--609
Denis Mindolin and
Jan Chomicki Discovering relative importance of
skyline attributes . . . . . . . . . . . 610--621
Min-Soo Kim and
Jiawei Han A particle-and-density based
evolutionary clustering method for
dynamic networks . . . . . . . . . . . . 622--633
Xiaoyan Yang and
Cecilia M. Procopiuc and
Divesh Srivastava Summarizing relational databases . . . . 634--645
Edith Cohen and
Haim Kaplan and
Subhabrata Sen Coordinated weighted sampling for
estimating aggregates over multiple
weight assignments . . . . . . . . . . . 646--657
Hongrae Lee and
Raymond T. Ng and
Kyuseok Shim Power-law based estimation of set
similarity join size . . . . . . . . . . 658--669
Panagiotis Karras Optimality and scalability in lattice
histogram construction . . . . . . . . . 670--681
Ymir Vigfusson and
Adam Silberstein and
Brian F. Cooper and
Rodrigo Fonseca Adaptively parallelizing distributed
range queries . . . . . . . . . . . . . 682--693
Shirish Tatikonda and
Srinivasan Parthasarathy Mining tree-structured data on multicore
systems . . . . . . . . . . . . . . . . 694--705
P. Unterbrunner and
G. Giannikis and
G. Alonso and
D. Fauser and
D. Kossmann Predictable performance for
unpredictable workloads . . . . . . . . 706--717
Yang Zhou and
Hong Cheng and
Jeffrey Xu Yu Graph clustering based on
structural/attribute similarities . . . 718--729
Mohammad Al Hasan and
Mohammed J. Zaki Output space sampling for graph patterns 730--741
Chen Chen and
Cindy X. Lin and
Matt Fredrikson and
Mihai Christodorescu and
Xifeng Yan and
Jiawei Han Mining graph patterns efficiently via
randomized summaries . . . . . . . . . . 742--753
Sihem Amer-Yahia and
Senjuti Basu Roy and
Ashish Chawlat and
Gautam Das and
Cong Yu Group recommendation: semantics and
efficiency . . . . . . . . . . . . . . . 754--765
Smriti Bhagat and
Graham Cormode and
Balachander Krishnamurthy and
Divesh Srivastava Class-based graph anonymization for
social network data . . . . . . . . . . 766--777
Nikos Sarkas and
Gautam Das and
Nick Koudas Improved search for socially annotated
data . . . . . . . . . . . . . . . . . . 778--789
Ashwin Machanavajjhala and
Johannes Gehrke and
Michaela Götz Data publishing against realistic
adversaries . . . . . . . . . . . . . . 790--801
HweeHwa Pang and
Jilian Zhang and
Kyriakos Mouratidis Scalable verification for outsourced
dynamic databases . . . . . . . . . . . 802--813
Xiaokui Xiao and
Yufei Tao and
Minghua Chen Optimal random perturbation at multiple
privacy levels . . . . . . . . . . . . . 814--825
Ira Assent and
Marc Wichterich and
Ralph Krieger and
Hardy Kremer and
Thomas Seidl Anticipatory DTW for efficient
similarity search in time series
databases . . . . . . . . . . . . . . . 826--837
Dimitris Tsirogiannis and
Sudipto Guha and
Nick Koudas Improving the performance of list
intersection . . . . . . . . . . . . . . 838--849
Raghav Kaushik and
Dan Suciu Consistent histograms in the presence of
distinct value counts . . . . . . . . . 850--861
Charu Aggarwal and
Yan Xie and
Philip S. Yu GConnect: a connectivity index for
massive disk-resident graphs . . . . . . 862--873
Di Yang and
Elke A. Rundensteiner and
Matthew O. Ward A shared execution strategy for multiple
pattern mining requests over streaming
data . . . . . . . . . . . . . . . . . . 874--885
Lei Zou and
Lei Chen and
M. Tamer Özsu Distance-join: pattern match query in a
large graph database . . . . . . . . . . 886--897
Qian Wan and
Raymond Chi-Wing Wong and
Ihab F. Ilyas and
M. Tamer Özsu and
Yu Peng Creating competitive products . . . . . 898--909
Rene Mueller and
Jens Teubner and
Gustavo Alonso Data processing on FPGAs . . . . . . . . 910--921
Azza Abouzeid and
Kamil Bajda-Pawlikowski and
Daniel Abadi and
Avi Silberschatz and
Alexander Rasin HadoopDB: an architectural hybrid of
MapReduce and DBMS technologies for
analytical workloads . . . . . . . . . . 922--933
Yeye He and
Jeffrey F. Naughton Anonymization of set-valued data via
top-down, local generalization . . . . . 934--945
Lei Zou and
Lei Chen and
M. Tamer Özsu $k$-automorphism: a general framework
for privacy preserving network
publication . . . . . . . . . . . . . . 946--957
Nick Koudas and
Divesh Srivastava and
Ting Yu and
Qing Zhang Distribution based microdata
anonymization . . . . . . . . . . . . . 958--969
Michael Meier and
Michael Schmidt and
Georg Lausen On chase termination beyond
stratification . . . . . . . . . . . . . 970--981
Guido Moerkotte and
Thomas Neumann and
Gabriele Steidl Preventing bad plans by bounding the
impact of cardinality estimation errors 982--993
Surajit Chaudhuri and
Vivek Narasayya and
Ravi Ramamurthy Exact cardinality query optimization for
optimizer testing . . . . . . . . . . . 994--1005
Balder ten Cate and
Laura Chiticariu and
Phokion Kolaitis and
Wang-Chiew Tan Laconic schema mappings: computing the
core with SQL queries . . . . . . . . . 1006--1017
Marcelo Arenas and
Jorge Pérez and
Juan Reutter and
Cristian Riveros Inverting schema mappings: bridging the
gap between theory and practice . . . . 1018--1029
James F. Terwilliger and
Philip A. Bernstein and
Sergey Melnik Full-fidelity flexible object-oriented
XML access . . . . . . . . . . . . . . . 1030--1041
Ting Wang and
Ling Liu Privacy-aware mobile services over road
networks . . . . . . . . . . . . . . . . 1042--1053
Leong Hou U. and
Nikos Mamoulis and
Kyriakos Mouratidis A fair assignment algorithm for multiple
preference queries . . . . . . . . . . . 1054--1065
Takeshi Mishima and
Hiroshi Nakamura Pangea: an eager database replication
middleware guaranteeing snapshot
isolation without modification of
database servers . . . . . . . . . . . . 1066--1077
Hazem Elmeleegy and
Jayant Madhavan and
Alon Halevy Harvesting relational tables from lists
on the web . . . . . . . . . . . . . . . 1078--1089
Michael J. Cafarella and
Alon Halevy and
Nodira Khoussainova Data integration for the relational web 1090--1101
Georg Gottlob and
Reinhard Pichler and
Vadim Savenkov Normalization and optimization of schema
mappings . . . . . . . . . . . . . . . . 1102--1113
Songhua Xing and
Cyrus Shahabi and
Bei Pan Continuous monitoring of nearest
neighbors on land surface . . . . . . . 1114--1125
Raymond Chi-Wing Wong and
M. Tamer Özsu and
Philip S. Yu and
Ada Wai-Chee Fu and
Lian Liu Efficient method for maximizing
bichromatic reverse nearest neighbor . . 1126--1137
Muhammad Aamir Cheema and
Xuemin Lin and
Ying Zhang and
Wei Wang and
Wenjie Zhang Lazy updates: an efficient technique to
continuously monitoring reverse kNN . . 1138--1149
Ling Chen and
Sourav S. Bhowmick and
Wolfgang Nejdl NEAR-Miner: mining evolution
associations of Web site directories for
efficient maintenance of Web archives 1150--1161
W. K. Wong and
David W. Cheung and
Edward Hung and
Ben Kao and
Nikos Mamoulis An audit environment for outsourcing of
frequent itemset mining . . . . . . . . 1162--1173
Barzan Mozafari and
Carlo Zaniolo Publishing naive Bayesian classifiers:
privacy without accuracy loss . . . . . 1174--1185
Kostas Tzoumas and
Man Lung Yiu and
Christian S. Jensen Workload-aware indexing of continuously
moving objects . . . . . . . . . . . . . 1186--1197
Meihui Zhang and
Su Chen and
Christian S. Jensen and
Beng Chin Ooi and
Zhenjie Zhang Effectively indexing uncertain moving
objects for predictive queries . . . . . 1198--1209
Jagan Sankaranarayanan and
Hanan Samet and
Houman Alborzi Path oracles for spatial networks . . . 1210--1221
Hideaki Kimura and
George Huo and
Alexander Rasin and
Samuel Madden and
Stanley B. Zdonik Correlation maps: a compressed access
method for exploiting soft functional
dependencies . . . . . . . . . . . . . . 1222--1233
Karl Schnaitter and
Neoklis Polyzotis and
Lise Getoor Index interactions in physical design
tuning: modeling, analysis, and
applications . . . . . . . . . . . . . . 1234--1245
Songyun Duan and
Vamsidhar Thummala and
Shivnath Babu Tuning database configuration parameters
with iTuned . . . . . . . . . . . . . . 1246--1257
Marcos Vaz Salles and
Tuan Cao and
Benjamin Sowell and
Alan Demers and
Johannes Gehrke and
Christoph Koch and
Walker White An evaluation of checkpoint recovery for
massively multiplayer online games . . . 1258--1269
Emmanuel Müller and
Stephan Günnemann and
Ira Assent and
Thomas Seidl Evaluating clustering in subspace
projections of high dimensional data . . 1270--1281
Oktie Hassanzadeh and
Fei Chiang and
Hyun Chul Lee and
Renée J. Miller Framework for evaluating clustering
algorithms in duplicate detection . . . 1282--1293
Hongfei Guo and
Dan Jones and
Jennifer Beckmann and
Praveen Seshadri Declarative management in Microsoft SQL
server . . . . . . . . . . . . . . . . . 1294--1305
Amr El-Helw and
Ihab F. Ilyas and
Calisto Zuzarte StatAdvisor: recommending statistical
views . . . . . . . . . . . . . . . . . 1306--1317
Mustafa Canim and
George A. Mihaila and
Bishwaranjan Bhattacharjee and
Kenneth A. Ross and
Christian A. Lang An object placement advisor for DB2
using solid state storage . . . . . . . 1318--1329
Manish Bhide and
Manoj K. Agarwal and
Amir Bar-Or and
Sriram Padmanabhan and
Srinivas K. Mittapalli and
Girish Venkatachaliah XPEDIA: XML processing for data
integration . . . . . . . . . . . . . . 1330--1341
Roger Bamford and
Vinayak Borkar and
Matthias Brantner and
Peter M. Fischer and
Daniela Florescu and
David Graf and
Donald Kossmann and
Tim Kraska and
Dan Muresan and
Sorin Nasoi and
Markos Zacharioudakis XQuery reloaded . . . . . . . . . . . . 1342--1353
Ning Zhang and
Nipun Agarwal and
Sivasankaran Chandrasekar and
Sam Idicula and
Vijay Medi and
Sabina Petride and
Balasubramanyam Sthanikam Binary XML storage and query processing
in Oracle 11g . . . . . . . . . . . . . 1354--1365
Srikanth Bellamkonda and
Rafi Ahmed and
Andrew Witkowski and
Angela Amor and
Mohamed Zait and
Chun-Chieh Lin Enhanced subquery optimizations in
Oracle . . . . . . . . . . . . . . . . . 1366--1377
Changkyu Kim and
Tim Kaldewey and
Victor W. Lee and
Eric Sedlar and
Anthony D. Nguyen and
Nadathur Satish and
Jatin Chhugani and
Andrea Di Blas and
Pradeep Dubey Sort vs. Hash revisited: fast join
implementation on modern multi-core CPUs 1378--1389
Yu Xu and
Pekka Kostamaa Efficient outer join data skew handling
in parallel DBMS . . . . . . . . . . . . 1390--1396
Eric Friedman and
Peter Pawlowski and
John Cieslewicz SQL/MapReduce: a practical approach to
self-describing, polymorphic, and
parallelizable user-defined functions 1402--1413
Alan F. Gates and
Olga Natkovich and
Shubham Chopra and
Pradeep Kamath and
Shravan M. Narayanamurthy and
Christopher Olston and
Benjamin Reed and
Santhosh Srinivasan and
Utkarsh Srivastava Building a high-level dataflow system on
top of Map-Reduce: the Pig experience 1414--1425
Biswanath Panda and
Joshua S. Herbach and
Sugato Basu and
Roberto J. Bayardo PLANET: massively parallel learning of
tree ensembles with MapReduce . . . . . 1426--1437
Thomas Legler and
Wolfgang Lehner and
Jan Schaffner and
Jens Krüger Robust and distributed top-n
frequent-pattern mining with SAP BW
accelerator . . . . . . . . . . . . . . 1438--1449
Nicolas Dieu and
Adrian Dragusanu and
Françoise Fabret and
François Llirbat and
Eric Simon 1,000 tables under the form . . . . . . 1450--1461
Bishwaranjan Bhattacharjee and
Lipyeow Lim and
Timothy Malkemus and
George Mihaila and
Kenneth Ross and
Sherman Lau and
Cathy McArthur and
Zoltan Toth and
Reza Sherkat Efficient index compression in DB2 LUW 1462--1473
Zoé Lacroix and
Christophe Legendre and
Spyro Mousses Storing scientific workflows in a
database . . . . . . . . . . . . . . . . 1474--1480
Jeffrey Cohen and
Brian Dolan and
Mark Dunlap and
Joseph M. Hellerstein and
Caleb Welton MAD skills: new analysis practices for
big data . . . . . . . . . . . . . . . . 1481--1492
Michael Ley DBLP: some lessons learned . . . . . . . 1493--1500
Niloy Mukherjee and
Amit Ganesh and
Vinayagam Djegaradjane and
Sujatha Muthulingam and
Wei Zhang and
Krishna Kunchithapadam and
Scott Lynn and
Bharath Aleti and
Kam Shergill and
Shaoyu Wang Oracle SecureFiles: prepared for the
digital deluge . . . . . . . . . . . . . 1501--1511
Robert Baumgartner and
Georg Gottlob and
Marcus Herzog Scalable Web data extraction for online
market intelligence . . . . . . . . . . 1512--1523
Anand Rajaraman Kosmix: high-performance topic
exploration using the deep Web . . . . . 1524--1529
Rimma V. Nehme and
Karen E. Works and
Elke A. Rundensteiner and
Elisa Bertino Query mesh: multi-route query processing
technology . . . . . . . . . . . . . . . 1530--1533
P. Cudre-Mauroux and
H. Kimura and
K.-T. Lim and
J. Rogers and
R. Simakov and
E. Soroush and
P. Velikhov and
D. L. Wang and
M. Balazinska and
J. Becla and
D. DeWitt and
B. Heath and
D. Maier and
S. Madden and
J. Patel and
M. Stonebraker and
S. Zdonik A demonstration of SciDB: a
science-oriented DBMS . . . . . . . . . 1534--1537
Kuien Liu and
Ke Deng and
Zhiming Ding and
Mingshu Li and
Xiaofang Zhou MOIR/MT: monitoring large-scale road
network traffic in real-time . . . . . . 1538--1541
Romain Colle and
Leonidas Galanis and
Supiti Buranawatanachoke and
Stratos Papadomanolakis and
Yujun Wang Oracle Database Replay . . . . . . . . . 1542--1545
Nedyalko Borisov and
Shivnath Babu and
Sandeep Uttamchandani and
Ramani Routray and
Aameek Singh DIADS: a problem diagnosis tool for
databases and storage area networks . . 1546--1549
Melanie Herschel and
Mauricio A. Hernández and
Wang-Chiew Tan Artemis: a system for analyzing missing
answers . . . . . . . . . . . . . . . . 1550--1553
Eugene Wu and
Philippe Cudre-Mauroux and
Samuel Madden Demonstration of the TrajStore system 1554--1557
M. H. Ali and
C. Gerea and
B. S. Raman and
B. Sezgin and
T. Tarnavski and
T. Verona and
P. Wang and
P. Zabback and
A. Ananthanarayan and
A. Kirilov and
M. Lu and
A. Raizman and
R. Krishnan and
R. Schindlauer and
T. Grabs and
S. Bjeletich and
B. Chandramouli and
J. Goldstein and
S. Bhat and
Ying Li and
V. Di Nicola and
X. Wang and
David Maier and
S. Grell and
O. Nano and
I. Santos Microsoft CEP Server and online
behavioral targeting . . . . . . . . . . 1558--1561
Stefan Krompass and
Harumi Kuno and
Janet L. Wiener and
Kevin Wilkinson and
Umeshwar Dayal and
Alfons Kemper A testbed for managing dynamic mixed
workloads . . . . . . . . . . . . . . . 1562--1565
Yanif Ahmad and
Christoph Koch DBToaster: a SQL compiler for
high-performance delta processing in
main-memory databases . . . . . . . . . 1566--1569
Nicoleta Preda and
Fabian M. Suchanek and
Gjergji Kasneci and
Thomas Neumann and
Maya Ramanath and
Gerhard Weikum ANGIE: active knowledge for interactive
exploration . . . . . . . . . . . . . . 1570--1573
Hanna Köpcke and
Andreas Thor and
Erhard Rahm Comparative evaluation of entity
resolution approaches with FEVER . . . . 1574--1577
Falk Brauer and
Wojciech Barczynski and
Gregor Hackenbroich and
Marcus Schramm and
Adrian Mocan and
Felix Förster RankIE: document retrieval on ranked
entity graphs . . . . . . . . . . . . . 1578--1581
Giansalvatore Mecca and
Paolo Papotti and
Salvatore Raunich and
Marcello Buoncristiano Concise and expressive mappings with
+Spicy . . . . . . . . . . . . . . . . . 1582--1585
Isabel F. Cruz and
Flavio Palandri Antonelli and
Cosmin Stroe AgreementMaker: efficient matching for
large real-world schemas and ontologies 1586--1589
Oktie Hassanzadeh and
Reynold Xin and
Renée J. Miller and
Anastasios Kementsietsidis and
Lipyeow Lim and
Min Wang Linkage Query Writer . . . . . . . . . . 1590--1593
Xiaoyuan Wang and
Xingzhi Sun and
Feng Cao and
Li Ma and
Nick Kanellos and
Kang Zhang and
Yue Pan and
Yong Yu SMDM: enhancing enterprise-wide master
data management using semantic Web
technologies . . . . . . . . . . . . . . 1594--1597
Michael N. Gubanov and
Lucian Popa and
Howard Ho and
Hamid Pirahesh and
Jeng-Yih Chang and
Shr-Chang Chen IBM UFO repository: object-oriented data
integration . . . . . . . . . . . . . . 1598--1601
Huajun Chen and
Bin Lu and
Yuan Ni and
Guotong Xie and
Chunying Zhou and
Jinhua Mi and
Zhaohui Wu Mashup by surfing a Web of data APIs . . 1602--1605
Reinhard Pichler and
Vadim Savenkov DEMo: data exchange modeling tool . . . 1606--1609
Julie Letchner and
Christopher Ré and
Magdalena Balazinska and
Matthai Philipose Lahar demonstration: warehousing
Markovian streams . . . . . . . . . . . 1610--1613
Peng Sun and
Ziyang Liu and
Sivaramakrishnan Natarajan and
Susan B. Davidson and
Yi Chen WOLVES: achieving correct provenance
analysis by detecting and resolving
unsound workflow views . . . . . . . . . 1614--1617
Chenyun Dai and
Gabriel Ghinita and
Elisa Bertino and
Ji-Won Byun and
Ninghui Li TIAMAT: a tool for interactive analysis
of microdata anonymization techniques 1618--1621
Yintao Yu and
Cindy X. Lin and
Yizhou Sun and
Chen Chen and
Jiawei Han and
Binbin Liao and
Tianyi Wu and
ChengXiang Zhai and
Duo Zhang and
Bo Zhao iNextCube: information network-enhanced
text cube . . . . . . . . . . . . . . . 1622--1625
Ashish Thusoo and
Joydeep Sen Sarma and
Namit Jain and
Zheng Shao and
Prasad Chakka and
Suresh Anthony and
Hao Liu and
Pete Wyckoff and
Raghotham Murthy Hive: a warehousing solution over a
map-reduce framework . . . . . . . . . . 1626--1629
Arjun Satish and
Ramesh Jain and
Amarnath Gupta Tolkien: an event based storytelling
system . . . . . . . . . . . . . . . . . 1630--1633
Emre Sarigöl and
Oriana Riva and
Patrick Stuedi and
Gustavo Alonso Enabling social networking in ad hoc
networks of mobile phones . . . . . . . 1634--1637
Zhuowei Bao and
Sarah Cohen-Boulakia and
Susan B. Davidson and
Pierrick Girard PDiffView: viewing the difference in
provenance of workflow results . . . . . 1638--1641
Daniel Deutch and
Tova Milo and
Tom Yam Goal-oriented Web-site navigation for
on-line shoppers . . . . . . . . . . . . 1642--1645
Fernando Pereira and
Anand Rajaraman and
Sunita Sarawagi and
William Tunstall-Pedoe and
Gerhard Weikum and
Alon Halevy Answering Web questions using structured
data: dream or reality? . . . . . . . . 1646--1646
Philip A. Bernstein and
Daniel J. Abadi and
Michael J. Cafarella and
Joseph M. Hellerstein and
Donald Kossmann and
Samuel Madden How best to build Web-scale data
managers? . . . . . . . . . . . . . . . 1647--1647
Stefan Manegold and
Martin L. Kersten and
Peter Boncz Database architecture evolution: mammals
flourished long before dinosaurs became
extinct . . . . . . . . . . . . . . . . 1648--1653
Xin Luna Dong and
Felix Naumann Data fusion: resolving data conflicts
for integration . . . . . . . . . . . . 1654--1655
Jeffrey Heer and
Joseph M. Hellerstein Data visualization and social data
analysis . . . . . . . . . . . . . . . . 1656--1657
Surajit Chaudhuri and
Gautam Das Keyword querying and ranking in
databases . . . . . . . . . . . . . . . 1658--1659
Marios Hadjieleftheriou and
Chen Li Efficient approximate search on string
collections . . . . . . . . . . . . . . 1660--1661
Divesh Srivastava and
Suresh Venkatasubramanian Information theory for data management 1662--1663
Daniel J. Abadi and
Peter A. Boncz and
Stavros Harizopoulos Column-oriented database systems . . . . 1664--1665
Divesh Srivastava and
Lukasz Golab and
Rick Greer and
Theodore Johnson and
Joseph Seidel and
Vladislav Shkapenyuk and
Oliver Spatscheck and
Jennifer Yates Enabling real time data analysis . . . . 1--2
Paul Matsudaira High-end biological imaging generates
very large $3$D+ and dynamic datasets 3--3
Junghoo Cho and
Hector Garcia-Molina Dealing with Web data: history and look
ahead . . . . . . . . . . . . . . . . . 4--4
Bettina Kemme and
Gustavo Alonso Database replication: a tale of research
across communities . . . . . . . . . . . 5--12
Mustafa Canim and
Murat Kantarcio\uglu and
Bijit Hore and
Sharad Mehrotra Building disclosure risk aware query
optimizers for relational databases . . 13--24
Tristan Allard and
Nicolas Anciaux and
Luc Bouganim and
Yanli Guo and
Lionel Le Folgoc and
Benjamin Nguyen and
Philippe Pucheral and
Indrajit Ray and
Indrakshi Ray and
Shaoyi Yin Secure personal data servers: a vision
paper . . . . . . . . . . . . . . . . . 25--35
Daniel Fabbri and
Kristen LeFevre and
Qiang Zhu PolicyReplay: misconfiguration-response
queries for data breach reporting . . . 36--47
Carlo Curino and
Evan Jones and
Yang Zhang and
Sam Madden Schism: a workload-driven approach to
database replication and partitioning 48--57
Lu Qin and
Jeffrey Xu Yu and
Lijun Chang Ten thousand SQLs: parallel keyword
queries computing . . . . . . . . . . . 58--69
Alexander Thomson and
Daniel J. Abadi The case for determinism in database
systems . . . . . . . . . . . . . . . . 70--80
Bogdan Alexe and
Mauricio Hernández and
Lucian Popa and
Wang-Chiew Tan MapMerge: correlating independent schema
mappings . . . . . . . . . . . . . . . . 81--92
Sergio Greco and
Francesca Spezzano Chase termination: a constraints
rewriting approach . . . . . . . . . . . 93--104
Bruno Marnette and
Giansalvatore Mecca and
Paolo Papotti Scalable data exchange with functional
dependencies . . . . . . . . . . . . . . 105--116
Yaron Kanza and
Roy Levin and
Eliyahu Safra and
Yehoshua Sagiv Interactive route search in the presence
of order constraints . . . . . . . . . . 117--128
Willis Lang and
Jignesh M. Patel Energy management for MapReduce clusters 129--139
Akanksha Baid and
Ian Rae and
Jiexing Li and
AnHai Doan and
Jeffrey Naughton Toward scalable keyword search over
relational data . . . . . . . . . . . . 140--149
Barzan Mozafari and
Kai Zeng and
Carlo Zaniolo From regular expressions to nested
words: unifying languages and query
execution for relational and XML
sequences . . . . . . . . . . . . . . . 150--161
Torsten Grust and
Jan Rittinger and
Tom Schreiber Avalanche-safe LINQ compilation . . . . 162--172
Wenfei Fan and
Jianzhong Li and
Shuai Ma and
Nan Tang and
Wenyuan Yu Towards certain fixes with editing rules
and master data . . . . . . . . . . . . 173--184
Melanie Herschel and
Mauricio A. Hernández Explaining missing answers to SPJUA
queries . . . . . . . . . . . . . . . . 185--196
George Beskales and
Ihab F. Ilyas and
Lukasz Golab Sampling the repairs of functional
dependency violations under hard
constraints . . . . . . . . . . . . . . 197--207
David Menestrina and
Steven Euijong Whang and
Hector Garcia-Molina Evaluating entity resolution results . . 208--219
Badrish Chandramouli and
Jonathan Goldstein and
David Maier High-performance dynamic pattern
matching over disordered streams . . . . 220--231
Irina Botan and
Roozbeh Derakhshan and
Nihal Dindar and
Laura Haas and
Renée J. Miller and
Nesime Tatbul SECRET: a model for analysis of the
execution semantics of stream processing
systems . . . . . . . . . . . . . . . . 232--243
Haopeng Zhang and
Yanlei Diao and
Neil Immerman Recognizing patterns in streams with
imprecise timestamps . . . . . . . . . . 244--255
Thomas Neumann and
Gerhard Weikum x-RDF-3X: fast querying, high update
rates, and consistency for RDF databases 256--263
Wenfei Fan and
Jianzhong Li and
Shuai Ma and
Nan Tang and
Yinghui Wu and
Yunpeng Wu Graph pattern matching: from intractable
to polynomial time . . . . . . . . . . . 264--275
Hilmi Yildirim and
Vineet Chaoji and
Mohammed J. Zaki GRAIL: scalable reachability index for
large graphs . . . . . . . . . . . . . . 276--284
Yingyi Bu and
Bill Howe and
Magdalena Balazinska and
Michael D. Ernst HaLoop: efficient iterative data
processing on large clusters . . . . . . 285--296
Michael Benedikt and
Georg Gottlob The impact of virtual views on
containment . . . . . . . . . . . . . . 297--308
James F. Terwilliger and
Lois M. L. Delcambre and
David Maier and
Jeremy Steinhauer and
Scott Britell Updatable and evolvable transforms for
virtual databases . . . . . . . . . . . 309--319
Daniel Deutch and
Ohad Greenshpan and
Tova Milo Navigating in complex mashed-up
applications . . . . . . . . . . . . . . 320--329
Sergey Melnik and
Andrey Gubarev and
Jing Jing Long and
Geoffrey Romer and
Shiva Shivakumar and
Matt Tolton and
Theo Vassilakis Dremel: interactive analysis of
Web-scale datasets . . . . . . . . . . . 330--339
Peixiang Zhao and
Jiawei Han On graph query optimization in large
networks . . . . . . . . . . . . . . . . 340--351
Davide Martinenghi and
Marco Tagliasacchi Proximity rank join . . . . . . . . . . 352--363
Akrivi Vlachou and
Christos Doulkeridis and
Kjetil Nòrvåg and
Yannis Kotidis Identifying the most influential data
objects with reverse top-$k$ queries . . 364--372
Xin Cao and
Gao Cong and
Christian S. Jensen Retrieving top-$k$ prestige-based
relevant spatial Web objects . . . . . . 373--384
Lei Li and
B. Aditya Prakash and
Christos Faloutsos Parsimonious linear fingerprinting for
time series . . . . . . . . . . . . . . 385--396
Rui Zhang and
Martin Stradling The HV-tree: a memory hierarchy aware
version index . . . . . . . . . . . . . 397--408
Sakti Pramanik and
Alok Watve and
Chad R. Meiners and
Alex Liu Transforming range queries to equivalent
box queries to optimize page access . . 409--416
Songtao Guo and
Xin Luna Dong and
Divesh Srivastava and
Remi Zajac Record linkage with uniqueness
constraints and erroneous values . . . . 417--428
Ekaterini Ioannou and
Wolfgang Nejdl and
Claudia Niederée and
Yannis Velegrakis On-the-fly entity-aware query processing
in the presence of linkage . . . . . . . 429--438
Mohamed Yakout and
Ahmed K. Elmagarmid and
Hazem Elmeleegy and
Mourad Ouzzani and
Alan Qi Behavior based record linkage . . . . . 439--448
Wook-Shin Han and
Jinsoo Lee and
Minh-Duc Pham and
Jeffrey Xu Yu iGraph: a framework for comparisons of
disk-based graph indexing techniques . . 449--459
Jörg Schad and
Jens Dittrich and
Jorge-Arnulfo Quiané-Ruiz Runtime measurements in the cloud:
observing, analyzing, and reducing
variance . . . . . . . . . . . . . . . . 460--471
Dawei Jiang and
Beng Chin Ooi and
Lei Shi and
Sai Wu The performance of MapReduce: an
in-depth study . . . . . . . . . . . . . 472--483
Hanna Köpcke and
Andreas Thor and
Erhard Rahm Evaluation of entity resolution
approaches on real-world match problems 484--493
Tomasz Nykiel and
Michalis Potamias and
Chaitanya Mishra and
George Kollios and
Nick Koudas MRShare: sharing across multiple queries
in MapReduce . . . . . . . . . . . . . . 494--505
Hoang Tam Vo and
Chun Chen and
Beng Chin Ooi Towards elastic transactional cloud
storage with range query support . . . . 506--514
Jens Dittrich and
Jorge-Arnulfo Quiané-Ruiz and
Alekh Jindal and
Yagiz Kargin and
Vinay Setty and
Jörg Schad Hadoop++: making a yellow elephant run
like a cheetah (without it even
noticing) . . . . . . . . . . . . . . . 515--529
Nicolas Bruno and
Vivek Narasayya and
Ravi Ramamurthy Slicing long-running queries . . . . . . 530--541
Kostas Tzoumas and
Amol Deshpande and
Christian S. Jensen Sharing-aware horizontal partitioning
for exploiting correlations during query
processing . . . . . . . . . . . . . . . 542--553
Andrea Cal\`\i and
Georg Gottlob and
Andreas Pieris Advanced processing for ontological
queries . . . . . . . . . . . . . . . . 554--565
Aditya Parameswaran and
Hector Garcia-Molina and
Anand Rajaraman Towards the Web of concepts: extracting
concepts from large datasets . . . . . . 566--577
Pankaj Gulhane and
Rajeev Rastogi and
Srinivasan H. Sengamedu and
Ashwin Tengli Exploiting content redundancy for Web
information extraction . . . . . . . . . 578--587
Bin Liu and
Laura Chiticariu and
Vivian Chu and
H. V. Jagadish and
Frederick R. Reiss Automatic rule refinement for
information extraction . . . . . . . . . 588--597
HweeHwa Pang and
Xuhua Ding and
Xiaokui Xiao Embellishing text search queries to
protect user privacy . . . . . . . . . . 598--607
Rhonda Chaytor and
Ke Wang Small domain randomization: same
privacy, more utility . . . . . . . . . 608--618
Stavros Papadopoulos and
Spiridon Bakiras and
Dimitris Papadias Nearest neighbor search with strong
location privacy . . . . . . . . . . . . 619--629
Hideaki Kimura and
Samuel Madden and
Stanley B. Zdonik UPI: a primary index for uncertain
databases . . . . . . . . . . . . . . . 630--637
Jian Li and
Amol Deshpande Ranking continuous probabilistic
datasets . . . . . . . . . . . . . . . . 638--649
Xiang Lian and
Lei Chen Set similarity join on probabilistic
data . . . . . . . . . . . . . . . . . . 650--659
Louis Woods and
Jens Teubner and
Gustavo Alonso Complex event detection at wire speed
with FPGAs . . . . . . . . . . . . . . . 660--669
Wenbin Fang and
Bingsheng He and
Qiong Luo Database compression on graphics
processors . . . . . . . . . . . . . . . 670--680
Ryan Johnson and
Ippokratis Pandis and
Radu Stoica and
Manos Athanassoulis and
Anastasia Ailamaki Aether: a scalable approach to logging 681--692
Kathy Macropol and
Ambuj Singh Scalable discovery of best clusters on
large graphs . . . . . . . . . . . . . . 693--702
Alexander Smola and
Shravan Narayanamurthy An architecture for parallel topic
models . . . . . . . . . . . . . . . . . 703--710
Venkatesh Ganti and
Yeye He and
Dong Xin Keyword++: a framework to improve
keyword search over entity databases . . 711--722
Zhenhui Li and
Bolin Ding and
Jiawei Han and
Roland Kays Swarm: mining relaxed temporal moving
object clusters . . . . . . . . . . . . 723--734
Su Chen and
Beng Chin Ooi and
Zhenjie Zhang An adaptive updating protocol for
reducing moving object database workload 735--746
Georgios Kellaris and
Kyriakos Mouratidis Shortest path computation on air indexes 747--757
Jia Xu and
Zhenjie Zhang and
Anthony K. H. Tung and
Ge Yu Efficient and effective similarity
search over probabilistic data based on
Earth Mover's Distance . . . . . . . . . 758--769
Michael Benedikt and
Evgeny Kharlamov and
Dan Olteanu and
Pierre Senellart Probabilistic XML via Markov Chains . . 770--781
Subi Arumugam and
Fei Xu and
Ravi Jampani and
Christopher Jermaine and
Luis L. Perez and
Peter J. Haas MCDB-R: risk analysis in the database 782--793
Michael Wick and
Andrew McCallum and
Gerome Miklau Scalable probabilistic databases with
factor graphs and MCMC . . . . . . . . . 794--804
Meihui Zhang and
Marios Hadjieleftheriou and
Beng Chin Ooi and
Cecilia M. Procopiuc and
Divesh Srivastava On multi-column foreign key discovery 805--814
Reynold Cheng and
Eric Lo and
Xuan S. Yang and
Ming-Hay Luk and
Xiang Li and
Xike Xie Explore or exploit?: effective
strategies for disambiguating large
databases . . . . . . . . . . . . . . . 815--825
Mohamed A. Soliman and
Ihab F. Ilyas and
Mina Saleeb Building ranked mashups of unstructured
sources with uncertain information . . . 826--837
Chedy Ra\"\issi and
Jian Pei and
Thomas Kister Computing closed skycubes . . . . . . . 838--847
Eric Lo and
Nick Cheng and
Wing-Kai Hon Generating databases for query workloads 848--859
Minji Wu and
Laure Berti-Équille and
Amélie Marian and
Cecilia M. Procopiuc and
Divesh Srivastava Processing top-$k$ join queries . . . . 860--870
Xavier Martinez-Palau and
David Dominguez-Sal and
Josep Lluis Larriba-Pey Two-way replacement selection . . . . . 871--881
Sebastian Maneth and
Kim Nguyen XPath whole query optimization . . . . . 882--893
Nils Grimsmo and
Truls A. Bjòrklund and
Magnus Lie Hetland Fast optimal twig joins . . . . . . . . 894--905
Michael Benedikt and
James Cheney Destabilizers and independence of XML
updates . . . . . . . . . . . . . . . . 906--917
Ziyang Liu and
Qihong Shao and
Yi Chen Searching workflows with hierarchical
views . . . . . . . . . . . . . . . . . 918--927
Ippokratis Pandis and
Ryan Johnson and
Nikos Hardavellas and
Anastasia Ailamaki Data-oriented transaction execution . . 928--939
Daniel Deutch and
Tova Milo and
Neoklis Polyzotis and
Tom Yam Optimal top-$k$ query evaluation for
weighted business processes . . . . . . 940--951
Guozhang Wang and
Marcos Vaz Salles and
Benjamin Sowell and
Xun Wang and
Tuan Cao and
Alan Demers and
Johannes Gehrke and
Walker White Behavioral simulations in MapReduce . . 952--963
Tingjian Ge and
Stan Zdonik A*-tree: a structure for storage and
modeling of uncertain multidimensional
arrays . . . . . . . . . . . . . . . . . 964--974
Charu C. Aggarwal and
Yao Li and
Philip S. Yu and
Ruoming Jin On dense pattern mining in graph streams 975--984
Man Lung Yiu and
Leong Hou U. and
Simonas Saltenis and
Kostas Tzoumas Efficient proximity detection among
mobile users via self-tuning policies 985--996
Michalis Potamias and
Francesco Bonchi and
Aristides Gionis and
George Kollios k-nearest neighbors in uncertain graphs 997--1008
Xin Cao and
Gao Cong and
Christian S. Jensen Mining significant semantic locations
from GPS data . . . . . . . . . . . . . 1009--1020
Michael Hay and
Vibhor Rastogi and
Gerome Miklau and
Dan Suciu Boosting the accuracy of differentially
private histograms through consistency 1021--1032
Jianneng Cao and
Panagiotis Karras and
Chedy Ra\"\issi and
Kian-Lee Tan $ \rho $-uncertainty: inference-proof
transaction anonymization . . . . . . . 1033--1044
Graham Cormode and
Divesh Srivastava and
Ninghui Li and
Tiancheng Li Minimizing minimality and maximizing
utility: analyzing method-based attacks
on anonymized data . . . . . . . . . . . 1045--1056
Daisy Zhe Wang and
Michael J. Franklin and
Minos Garofalakis and
Joseph M. Hellerstein Querying probabilistic information
extraction . . . . . . . . . . . . . . . 1057--1067
Prithviraj Sen and
Amol Deshpande and
Lise Getoor Read-once functions and query evaluation
in probabilistic databases . . . . . . . 1068--1079
Parag Agrawal and
Anish Das Sarma and
Jeffrey Ullman and
Jennifer Widom Foundations of uncertain-data
integration . . . . . . . . . . . . . . 1080--1090
Michael Mathioudakis and
Nilesh Bansal and
Nick Koudas Identifying, attributing and describing
spatial bursts . . . . . . . . . . . . . 1091--1102
Hideaki Kimura and
George Huo and
Alexander Rasin and
Samuel Madden and
Stanley B. Zdonik CORADD: correlation aware database
designer for materialized views and
indexes . . . . . . . . . . . . . . . . 1103--1113
Danupon Nanongkai and
Atish Das Sarma and
Ashwin Lall and
Richard J. Lipton and
Jun Xu Regret-minimizing representative
databases . . . . . . . . . . . . . . . 1114--1124
Benjamin Arai and
Gautam Das and
Dimitrios Gunopulos and
Vagelis Hristidis and
Nick Koudas An access cost-aware approach for object
retrieval over multiple sources . . . . 1125--1136
M. Abhirama and
Sourjya Bhaumik and
Atreyee Dey and
Harsh Shrimal and
Jayant R. Haritsa On the stability of plan costs and the
costs of plan stability . . . . . . . . 1137--1148
Herodotos Herodotou and
Shivnath Babu Xplus: a SQL-tuning-aware query
optimizer . . . . . . . . . . . . . . . 1149--1160
Wenfei Fan and
Jianzhong Li and
Shuai Ma and
Hongzhi Wang and
Yinghui Wu Graph homomorphism revisited for graph
matching . . . . . . . . . . . . . . . . 1161--1172
Ramakrishnan Kandhan and
Nikhil Teletia and
Jignesh M. Patel SigMatch: fast and scalable
multi-pattern matching . . . . . . . . . 1173--1184
Shijie Zhang and
Jiong Yang and
Wei Jin SAPPER: subgraph indexing and
approximate matching in large graphs . . 1185--1194
Yinan Li and
Bingsheng He and
Robin Jun Yang and
Qiong Luo and
Ke Yi Tree indexing on solid state drives . . 1195--1206
Sai Wu and
Dawei Jiang and
Beng Chin Ooi and
Kun-Lung Wu Efficient B-tree based indexing for
cloud data processing . . . . . . . . . 1207--1218
Jiannan Wang and
Jianhua Feng and
Guoliang Li Trie-join: efficient trie-based string
similarity joins with edit-distance
constraints . . . . . . . . . . . . . . 1219--1230
Mehdi Sharifzadeh and
Cyrus Shahabi VoR-tree: R-trees with Voronoi diagrams
for efficient processing of spatial
nearest neighbor queries . . . . . . . . 1231--1242
P. Deepak and
Prasad M. Deshpande Efficient RkNN retrieval with arbitrary
non-metric similarity measures . . . . . 1243--1254
Shiming Zhang and
Nikos Mamoulis and
David W. Cheung and
Ben Kao Efficient skyline evaluation over
partially ordered domains . . . . . . . 1255--1266
Mingzhu Wei and
Elke A. Rundensteiner and
Murali Mani Achieving high output quality under
limited resources through
structure-based spilling in XML streams 1267--1278
Svilen R. Mihaylov and
Marie Jacob and
Zachary G. Ives and
Sudipto Guha Dynamic join optimization in multi-hop
wireless sensor networks . . . . . . . . 1279--1290
Mert Akdere and
U\ugur Çetintemel and
Eli Upfal Database-support for continuous
prediction queries over streaming data 1291--1301
Thanh T. L. Tran and
Andrew McGregor and
Yanlei Diao and
Liping Peng and
Anna Liu Conditioning and aggregating uncertain
data streams: going beyond expectations 1302--1313
Boris Glavic and
Gustavo Alonso and
Renée J. Miller and
Laura M. Haas TRAMP: understanding the behavior of
schema mappings through provenance . . . 1314--1325
Steven Euijong Whang and
Hector Garcia-Molina Entity resolution with evolving rules 1326--1337
Girija Limaye and
Sunita Sarawagi and
Soumen Chakrabarti Annotating and searching Web tables
using entities, types and relationships 1338--1347
Srikanta Bedathur and
Klaus Berberich and
Jens Dittrich and
Nikos Mamoulis and
Gerhard Weikum Interesting-phrase mining for ad-hoc
text analytics . . . . . . . . . . . . . 1348--1357
Xin Luna Dong and
Laure Berti-Equille and
Yifan Hu and
Divesh Srivastava Global detection of complex copying
relationships between sources . . . . . 1358--1369
Sabrina De Capitani di Vimercati and
Sara Foresti and
Sushil Jajodia and
Stefano Paraboschi and
Pierangela Samarati Fragments and loose associations:
respecting privacy in data publishing 1370--1381
Francesco Fusco and
Marc Ph. Stoecklin and
Michail Vlachos NET-FLi: on-the-fly compression,
archiving and indexing of streaming
network traffic . . . . . . . . . . . . 1382--1393
Qiong Zou and
Huayong Wang and
Robert Soulé and
Martin Hirzel and
Henrique Andrade and
Bu\ugra Gedik and
Kun-Lung Wu From a stream of relational queries to
distributed stream processing . . . . . 1394--1405
James T. L. Mah and
Danny C. C. Poo and
Shaojiang Cai UASMAs (universal automated SNP mapping
algorithms): a set of algorithms to
instantaneously map SNPs in real time to
aid functional SNP discovery . . . . . . 1406--1413
Biplob Debnath and
Sudipta Sengupta and
Jin Li FlashStore: high throughput persistent
key--value store . . . . . . . . . . . . 1414--1425
Reynold S. Xin and
William McLaren and
Patrick Dantressangle and
Steve Schormann and
Sam Lightstone and
Maria Schwenger MEET DB2: automated database migration
evaluation . . . . . . . . . . . . . . . 1426--1434
Mustafa Canim and
George A. Mihaila and
Bishwaranjan Bhattacharjee and
Kenneth A. Ross and
Christian A. Lang SSD bufferpool extensions for database
systems . . . . . . . . . . . . . . . . 1435--1446
Charles Loboz and
Slawek Smyl and
Suman Nath DataGarage: warehousing massive
performance data on commodity servers 1447--1458
Songting Chen Cheetah: a high performance, custom data
warehouse on top of MapReduce . . . . . 1459--1468
Gustavo H. Orair and
Carlos H. C. Teixeira Wagner Meira, Jr. and
Ye Wang and
Srinivasan Parthasarathy Distance-based outlier detection:
consolidation and renewed bearing . . . 1469--1480
Young-Seok Kim and
Heegyu Jin and
Kyoung-Gu Woo Adaptive logging for mobile device . . . 1481--1492
Peter Pesti and
Ling Liu and
Bhuvan Bamba and
Arun Iyengar and
Matt Weber RoadTrack: scaling location updates for
mobile clients on road networks with
query awareness . . . . . . . . . . . . 1493--1504
Xiance Si and
Edward Y. Chang and
Zoltán Gyöngyi and
Maosong Sun Confucius and its intelligent disciples:
integrating social with search . . . . . 1505--1516
Jayant R. Haritsa The Picasso database query optimizer
visualizer . . . . . . . . . . . . . . . 1517--1520
Ziyang Liu and
Sivaramakrishnan Natarajan and
Bin He and
Hui-I Hsiao and
Yi Chen CODS: evolving data efficiently and
scalably in column oriented databases 1521--1524
Mohammad Sadoghi and
Martin Labrecque and
Harsh Singh and
Warren Shum and
Hans-Arno Jacobsen Efficient event processing through
reconfigurable hardware for algorithmic
trading . . . . . . . . . . . . . . . . 1525--1528
Justin J. Levandoski and
Mohamed F. Mokbel and
Mohamed E. Khalefa CareDB: a context and preference-aware
location-based database system . . . . . 1529--1532
Donald Kossmann and
Tim Kraska and
Simon Loesing and
Stephan Merkli and
Raman Mittal and
Flavio Pfaffhauser Cloudy: a modular cloud storage system 1533--1536
Seyed Jalal Kazemitabar and
Ugur Demiryurek and
Mohamed Ali and
Afsin Akdogan and
Cyrus Shahabi Geospatial stream query processing using
Microsoft SQL Server StreamInsight . . . 1537--1540
Curtis E. Dyreson and
Sourav S. Bhowmick and
Kirankanth Mallampalli Using XMorph to transform XML data . . . 1541--1544
Di Wang and
Elke A. Rundensteiner and
Han Wang and
Richard T. Ellison III Active complex event processing:
applications in real-time health care 1545--1548
Tom Schreiber and
Simone Bonetti and
Torsten Grust and
Manuel Mayr and
Jan Rittinger Thirteen new players in the team: a
FERRY-based LINQ to SQL provider . . . . 1549--1552
Serge Abiteboul and
Pierre Bourhis and
Bogdan Marinoiu and
Alban Galland AXART: enabling collaborative work with
AXML artifacts . . . . . . . . . . . . . 1553--1556
Christopher McConnell and
Fan Ping and
Jeong-Hyon Hwang iFlow: an approach for fast and reliable
Internet-scale stream processing
utilizing detouring and replication . . 1557--1560
Verena Kantere and
Maher Manoubi and
Iluju Kiringa and
Timos Sellis and
John Mylopoulos Peer coordination through distributed
triggers . . . . . . . . . . . . . . . . 1561--1564
Hao Wu and
Guoliang Li and
Chen Li and
Lizhu Zhou Seaform: search-as-you-type in forms . . 1565--1568
Jannik Strötgen and
Michael Gertz TimeTrails: a system for exploring
spatio-temporal information in documents 1569--1572
Jeffrey Pound and
Ihab F. Ilyas and
Grant Weddell QUICK: expressive and flexible search
over knowledge bases and text
collections . . . . . . . . . . . . . . 1573--1576
Marcin Kwietniewski and
Jarek Gryz and
Stephanie Hazlewood and
Paul Van Run Transforming XML documents as schemas
evolve . . . . . . . . . . . . . . . . . 1577--1580
Ziyang Liu and
Sivaramakrishnan Natarajan and
Peng Sun and
Stephen Booher and
Tim Meehan and
Robert Winkler and
Yi Chen XSACT: a comparison tool for structured
search results . . . . . . . . . . . . . 1581--1584
Talel Abdessalem and
Bogdan Cautis and
Nora Derouiche ObjectRunner: lightweight, targeted
extraction and querying of structured
Web data . . . . . . . . . . . . . . . . 1585--1588
Shady Elbassuoni and
Katja Hose and
Steffen Metzger and
Ralf Schenkel ROXXI: Reviving witness dOcuments to
eXplore eXtracted Information . . . . . 1589--1592
Arash Termehchy and
Marianne Winslett EXTRUCT: using deep structural
information in XML keyword search . . . 1593--1596
Javad Akbarnejad and
Gloria Chatzopoulou and
Magdalini Eirinaki and
Suju Koshy and
Sarika Mittal and
Duc On and
Neoklis Polyzotis and
Jothi S. Vindhiya Varman SQL QueRIE recommendations . . . . . . . 1597--1600
Hock Hee Ang and
Vivekanand Gopalkrishnan and
Wee Keong Ng and
Steven C. H. Hoi P2PDocTagger: content management through
automated P2P collaborative tagging . . 1601--1604
Vinay Setty and
Srikanta Bedathur and
Klaus Berberich and
Gerhard Weikum InZeit: efficiently identifying
insightful time points . . . . . . . . . 1605--1608
Aixin Sun and
Sourav S. Bhowmick and
Yao Liu iAVATAR: an interactive tool for finding
and visualizing visual-representative
tags in image search . . . . . . . . . . 1609--1612
Thomas Kabisch and
Eduard C. Dragut and
Clement Yu and
Ulf Leser Deep Web integration with VisQI . . . . 1613--1616
Xin Luna Dong and
Laure Berti-Equille and
Yifan Hu and
Divesh Srivastava SOLOMON: seeking the truth via copying
detection . . . . . . . . . . . . . . . 1617--1620
Martin Hentschel and
Laura Haas and
Renée J. Miller Just-in-time data integration in action 1621--1624
Alexander Alexandrov and
Max Heimel and
Volker Markl and
Dominic Battré and
Fabian Hueske and
Erik Nijkamp and
Stephan Ewen and
Odej Kao and
Daniel Warneke Massively parallel data analysis with
PACTs on Nephele . . . . . . . . . . . . 1625--1628
Morten Middelfart and
Torben Bach Pedersen Using sentinel technology in the TARGIT
BI suite . . . . . . . . . . . . . . . . 1629--1632
Stephan Günnemann and
Ines Färber and
Hardy Kremer and
Thomas Seidl CoDA: interactive cluster based concept
discovery . . . . . . . . . . . . . . . 1633--1636
Sonia Bergamaschi and
Elton Domnori and
Francesco Guerra and
Mirko Orsini and
Raquel Trillo Lado and
Yannis Velegrakis Keymantic: semantic keyword-based
searching in data integration systems 1637--1640
Lukasz Golab and
Howard Karloff and
Flip Korn and
Divesh Srivastava Data Auditor: exploring data quality and
semantics using pattern tableaux . . . . 1641--1644
Anil K. Nori Distributed caching platforms . . . . . 1645--1646
Divyakant Agrawal and
Sudipto Das and
Amr El Abbadi Big data and cloud computing: new wine
or just new bottles? . . . . . . . . . . 1647--1648
Hanan Samet Techniques for similarity searching in
multimedia databases . . . . . . . . . . 1649--1650
Opher Etzion Event processing: past, present and
future . . . . . . . . . . . . . . . . . 1651--1652
Matthias Renz and
Reynold Cheng and
Hans-Peter Kriegel Similarity search and mining in
uncertain databases . . . . . . . . . . 1653--1654
S. Muthukrishnan Data management and mining in Internet
AD systems . . . . . . . . . . . . . . . 1655--1656
Patrick Kling and
M. Tamer Özsu and
Khuzaima Daudjee Generating efficient execution plans for
vertically partitioned XML databases . . 1--11
Xiang Lian and
Lei Chen A generic framework for handling
uncertain data with local correlations 12--21
Nodira Khoussainova and
YongChul Kwon and
Magdalena Balazinska and
Dan Suciu SnipSuggest: context-aware
autocompletion for SQL . . . . . . . . . 22--33
Alexandra Meliou and
Wolfgang Gatterbauer and
Katherine F. Moore and
Dan Suciu The complexity of causality and
responsibility for query answers and
non-answers . . . . . . . . . . . . . . 34--45
Guy Sagy and
Daniel Keren and
Izchak Sharfman and
Assaf Schuster Distributed threshold querying of
general functions by a difference of
monotonic representation . . . . . . . . 46--57
Nan Wang and
Jingbo Zhang and
Kian-Lee Tan and
Anthony K. H. Tung On triangulation-based dense
neighborhood graph discovery . . . . . . 58--68
Michael Rice and
Vassilis J. Tsotras Graph indexing of road networks for
shortest path queries with label
restrictions . . . . . . . . . . . . . . 69--80
Li Qian and
Kristen LeFevre and
H. V. Jagadish CRIUS: user-friendly database design . . 81--92
João B. Rocha-Junior and
Akrivi Vlachou and
Christos Doulkeridis and
Kjetil Nòrvåg Efficient processing of top-$k$ spatial
preference queries . . . . . . . . . . . 93--104
Martin Grund and
Jens Krüger and
Hasso Plattner and
Alexander Zeier and
Philippe Cudre-Mauroux and
Samuel Madden HYRISE: a main memory hybrid storage
engine . . . . . . . . . . . . . . . . . 105--116
Carlo A. Curino and
Hyun Jin Moon and
Alin Deutsch and
Carlo Zaniolo Update rewriting and integrity
constraint maintenance in a schema
evolution support system: PRISM++ . . . 117--128
Ermelinda Oro and
Massimo Ruffolo and
Steffen Staab SXPath: extending XPath towards spatial
querying on Web documents . . . . . . . 129--140
Mingxuan Yuan and
Lei Chen and
Philip S. Yu Personalized privacy protection in
social networks . . . . . . . . . . . . 141--150
Guilherme A. Toda and
Eli Cortez and
Altigran S. da Silva and
Edleno de Moura A probabilistic approach for
automatically filling form-based Web
interfaces . . . . . . . . . . . . . . . 151--160
Panagiotis Papadimitriou and
Hector Garcia-Molina and
Ali Dasdan and
Santanu Kolay Output URL bidding . . . . . . . . . . . 161--172
Bahman Bahmani and
Abdur Chowdhury and
Ashish Goel Fast incremental and personalized
PageRank . . . . . . . . . . . . . . . . 173--184
Jongwuk Lee and
Seung-won Hwang QSkycube: efficient skycube computation
using point-based space partitioning . . 185--196
Bin Liu and
Chee-Yong Chan ZINC: efficient indexing for skyline
computation . . . . . . . . . . . . . . 197--207
Vibhor Rastogi and
Nilesh Dalvi and
Minos Garofalakis Large-scale collective entity matching 208--218
Nilesh Dalvi and
Ravi Kumar and
Mohamed Soliman Automatic wrappers for large scale Web
extraction . . . . . . . . . . . . . . . 219--230
Xintian Yang and
Srinivasan Parthasarathy and
P. Sadayappan Fast sparse matrix-vector multiplication
on GPUs: implications for graph mining 231--242
Jun Rao and
Eugene J. Shekita and
Sandeep Tata Using Paxos to build a scalable,
consistent, and highly available
datastore . . . . . . . . . . . . . . . 243--254
Bolin Ding and
Arnd Christian König Fast set intersection in memory . . . . 255--266
Aditya Parameswaran and
Anish Das Sarma and
Hector Garcia-Molina and
Neoklis Polyzotis and
Jennifer Widom Human-assisted graph search: it's okay
to ask questions . . . . . . . . . . . . 267--278
Mohamed Yakout and
Ahmed K. Elmagarmid and
Jennifer Neville and
Mourad Ouzzani and
Ihab F. Ilyas Guided data repair . . . . . . . . . . . 279--289
Petros Venetis and
Hector Gonzalez and
Christian S. Jensen and
Alon Halevy Hyper-local, directions-based ranking of
places . . . . . . . . . . . . . . . . . 290--301
M. Levent Koc and
Christopher Ré Incrementally maintaining classification
using an RDBMS . . . . . . . . . . . . . 302--313
Bingsheng He and
Jeffrey Xu Yu High-throughput transaction executions
on graphics processors . . . . . . . . . 314--325
Zhao Cao and
Charles Sutton and
Yanlei Diao and
Prashant Shenoy Distributed inference and query
processing for RFID tracking and
monitoring . . . . . . . . . . . . . . . 326--337
Hongrae Lee and
Raymond T. Ng and
Kyuseok Shim Similarity join size estimation using
locality sensitive hashing . . . . . . . 338--349
Ziyang Liu and
Sivaramakrishnan Natarajan and
Yi Chen Query expansion based on clustered
results . . . . . . . . . . . . . . . . 350--361
Debabrata Dash and
Neoklis Polyzotis and
Anastasia Ailamaki CoPhy: a scalable, portable, and
interactive index advisor for large
workloads . . . . . . . . . . . . . . . 362--372
Feng Niu and
Christopher Ré and
AnHai Doan and
Jude Shavlik Tuffy: scaling up statistical inference
in Markov logic networks using an RDBMS 373--384
Eaman Jahani and
Michael J. Cafarella and
Christopher Ré Automatic optimization for MapReduce
programs . . . . . . . . . . . . . . . . 385--396
De-Nian Yang and
Yi-Ling Chen and
Wang-Chien Lee and
Ming-Syan Chen On social-temporal group query with
acquaintance constraint . . . . . . . . 397--408
Hoa Nguyen and
Ariel Fuxman and
Stelios Paparizos and
Juliana Freire and
Rakesh Agrawal Synthesizing products for online
catalogs . . . . . . . . . . . . . . . . 409--418
Avrilia Floratou and
Jignesh M. Patel and
Eugene J. Shekita and
Sandeep Tata Column-oriented storage techniques for
MapReduce . . . . . . . . . . . . . . . 419--429
David Lomet and
Kostas Tzoumas and
Michael Zwilling Implementing performance competitive
logical recovery . . . . . . . . . . . . 430--439
Ashwin Machanavajjhala and
Aleksandra Korolova and
Atish Das Sarma Personalized social recommendations:
accurate or private . . . . . . . . . . 440--450
Gabriele Capannini and
Franco Maria Nardini and
Raffaele Perego and
Fabrizio Silvestri Efficient diversification of Web search
results . . . . . . . . . . . . . . . . 451--459
Gianmarco De Francisci Morales and
Aristides Gionis and
Mauro Sozio Social content matching in MapReduce . . 460--469
Naiyong Ao and
Fan Zhang and
Di Wu and
Douglas S. Stones and
Gang Wang and
Xiaoguang Liu and
Jing Liu and
Sheng Lin Efficient parallel lists intersection
and index compression algorithms using
graphics processing units . . . . . . . 470--481
Lei Zou and
Jinghui Mo and
Lei Chen and
M. Tamer Özsu and
Dongyan Zhao gStore: answering SPARQL queries via
subgraph matching . . . . . . . . . . . 482--493
Sudipto Das and
Shoji Nishimura and
Divyakant Agrawal and
Amr El Abbadi Albatross: lightweight elasticity in
shared storage databases for the cloud
using live data migration . . . . . . . 494--505
Sarana Nutanong and
Edwin H. Jacox and
Hanan Samet An incremental Hausdorff distance
calculation algorithm . . . . . . . . . 506--517
Barbara Blaustein and
Adriane Chapman and
Len Seligman and
M. David Allen and
Arnon Rosenthal Surrogate parenthood: protected and
informative graphs . . . . . . . . . . . 518--525
Petros Venetis and
Alon Halevy and
Jayant Madhavan and
Marius Pasca and
Warren Shen and
Fei Wu and
Gengxin Miao and
Chung Wu Recovering semantics of tables on the
web . . . . . . . . . . . . . . . . . . 528--538
Thomas Neumann Efficiently compiling efficient query
plans for modern hardware . . . . . . . 539--550
Ruoming Jin and
Lin Liu and
Bolin Ding and
Haixun Wang Distance-constraint reachability
computation in uncertain graphs . . . . 551--562
Yun Chi and
Hyun Jin Moon and
Hakan Hacigümüs iCBS: incremental cost-based scheduling
under piecewise linear SLAs . . . . . . 563--574
Mohamed Y. Eltabakh and
Yuanyuan Tian and
Fatma Özcan and
Rainer Gemulla and
Aljoscha Krettek and
John McPherson CoHadoop: flexible data placement and
its exploitation in Hadoop . . . . . . . 575--585
Stratos Idreos and
Stefan Manegold and
Harumi Kuno and
Goetz Graefe Merging what's cracked, cracking what's
merged: adaptive indexing in main-memory
column-stores . . . . . . . . . . . . . 586--597
Chonghai Wang and
Li Yan Yuan and
Jia-Huai You and
Osmar R. Zaiane and
Jian Pei On pruning for top-$k$ ranking in
uncertain databases . . . . . . . . . . 598--609
Ippokratis Pandis and
Pinar Tözün and
Ryan Johnson and
Anastasia Ailamaki PLP: page latch-free shared-everything
OLTP . . . . . . . . . . . . . . . . . . 610--621
Jiannan Wang and
Guoliang Li and
Jeffrey Xu Yu and
Jianhua Feng Entity matching: how similar is similar 622--633
Di Wang and
Elke A. Rundensteiner and
Richard T. Ellison III Active complex event processing over
event streams . . . . . . . . . . . . . 634--645
Ceren Budak and
Divyakant Agrawal and
Amr El Abbadi Structural trend analysis for online
social networks . . . . . . . . . . . . 646--656
Hideaki Kimura and
Vivek Narasayya and
Manoj Syamala Compression aware physical database
design . . . . . . . . . . . . . . . . . 657--668
Thomas Bernecker and
Tobias Emrich and
Hans-Peter Kriegel and
Matthias Renz and
Stefan Zankl and
Andreas Züfle Efficient probabilistic reverse nearest
neighbor query processing on uncertain
data . . . . . . . . . . . . . . . . . . 669--680
Mehdi Kargar and
Aijun An Keyword search in graphs: finding
$r$-cliques . . . . . . . . . . . . . . 681--692
Daniel Fabbri and
Kristen LeFevre Explanation-based auditing . . . . . . . 1--12
Adam Marcus and
Eugene Wu and
David Karger and
Samuel Madden and
Robert Miller Human-powered sorts and joins . . . . . 13--24
Graham Cormode and
Justin Thaler and
Ke Yi Verifying computations with streaming
interactive proofs . . . . . . . . . . . 25--36
Dan Lin and
Christian S. Jensen and
Rui Zhang and
Lu Xiao and
Jiaheng Lu A moving-object index for efficient
query processing with peer-wise location
privacy . . . . . . . . . . . . . . . . 37--48
Essam Mansour and
Amin Allam and
Spiros Skiadopoulos and
Panos Kalnis ERA: efficient serial and parallel
suffix tree construction for very long
strings . . . . . . . . . . . . . . . . 49--60
Jens Krueger and
Changkyu Kim and
Martin Grund and
Nadathur Satish and
David Schwalb and
Jatin Chhugani and
Hasso Plattner and
Pradeep Dubey and
Alexander Zeier Fast updates on read-optimized databases
using multi-core CPUs . . . . . . . . . 61--72
Amit Goyal and
Francesco Bonchi and
Laks V. S. Lakshmanan A data-based approach to social
influence maximization . . . . . . . . . 73--84
Andrew Pavlo and
Evan P. C. Jones and
Stanley Zdonik On predictive modeling for optimizing
transaction execution in parallel OLTP
systems . . . . . . . . . . . . . . . . 85--96
François Goasdoué and
Konstantinos Karanasos and
Julien Leblay and
Ioana Manolescu View selection in Semantic Web databases 97--108
Jeffrey Jestes and
Ke Yi and
Feifei Li Building wavelet histograms on large
data in MapReduce . . . . . . . . . . . 109--120
Di Yang and
Elke A. Rundensteiner and
Matthew O. Ward Summarization and matching of
density-based clusters in streaming
environments . . . . . . . . . . . . . . 121--132
Thanh Nguyen and
Viviane Moreira and
Huong Nguyen and
Hoa Nguyen and
Juliana Freire Multilingual schema matching for
Wikipedia infoboxes . . . . . . . . . . 133--144
Guimei Liu and
Haojun Zhang and
Limsoon Wong Controlling false positives in
association rule mining . . . . . . . . 145--156
Fabian M. Suchanek and
Serge Abiteboul and
Pierre Senellart PARIS: probabilistic alignment of
relations, instances, and schema . . . . 157--168
Sayan Ranu and
Ambuj K. Singh Answering top-$k$ queries over a mixture
of attractive and repulsive dimensions 169--180
Michael Armbrust and
Kristal Curtis and
Tim Kraska and
Armando Fox and
Michael J. Franklin and
David A. Patterson PIQL: success-tolerant query processing
in the cloud . . . . . . . . . . . . . . 181--192
Peixiang Zhao and
Charu C. Aggarwal and
Min Wang gSketch: on query estimation in graph
streams . . . . . . . . . . . . . . . . 193--204
Brian E. Ruttenberg and
Ambuj K. Singh Indexing the earth mover's distance
using normal distributions . . . . . . . 205--216
Rani Qumsiyeh and
Maria S. Pera and
Yiu-Kai Ng Generating exact- and ranked
partially-matched answers to questions
in advertisements . . . . . . . . . . . 217--228
Georgios J. Fakas and
Zhi Cai and
Nikos Mamoulis Size-$l$ object summaries for relational
keyword search . . . . . . . . . . . . . 229--240
Lujun Fang and
Anish Das Sarma and
Cong Yu and
Philip Bohannon REX: explaining relationships between
entity pairs . . . . . . . . . . . . . . 241--252
Guoliang Li and
Dong Deng and
Jiannan Wang and
Jianhua Feng Pass-join: a partition-based method for
similarity joins . . . . . . . . . . . . 253--264
Christopher Hoobin and
Simon J. Puglisi and
Justin Zobel Relative Lempel--Ziv factorization for
efficient storage and retrieval of Web
collections . . . . . . . . . . . . . . 265--273
Ning Zhang and
Junichi Tatemura and
Jignesh M. Patel and
Hakan Hacigümüs Towards cost-effective storage
provisioning for DBMSs . . . . . . . . . 274--285
Hongchan Roh and
Sanghyun Park and
Sungho Kim and
Mincheol Shin and
Sang-Won Lee B+-tree index optimization by exploiting
internal parallelism of flash-based
solid state drives . . . . . . . . . . . 286--297
Per-Åke Larson and
Spyros Blanas and
Cristian Diaconu and
Craig Freedman and
Jignesh M. Patel and
Mike Zwilling High-performance concurrency control
mechanisms for main-memory databases . . 298--309
Shuai Ma and
Yang Cao and
Wenfei Fan and
Jinpeng Huai and
Tianyu Wo Capturing topology in graph pattern
matching . . . . . . . . . . . . . . . . 310--321
Arun Kumar and
Christopher Ré Probabilistic management of OCR data
using an RDBMS . . . . . . . . . . . . . 322--333
Mateusz Pawlik and
Nikolaus Augsten RTED: a robust algorithm for the tree
edit distance . . . . . . . . . . . . . 334--345
Yael Amsterdamer and
Susan B. Davidson and
Daniel Deutch and
Tova Milo and
Julia Stoyanovich and
Val Tannen Putting lipstick on pig: enabling
database-style workflow provenance . . . 346--357
Jun Gao and
Ruoming Jin and
Jiashuai Zhou and
Jeffrey Xu Yu and
Xiao Jiang and
Tengjiao Wang Relational approach for shortest path
discovery over large graphs . . . . . . 358--369
Marina Barsky and
Sangkyum Kim and
Tim Weninger and
Jiawei Han Mining flipping correlations from large
datasets with taxonomies . . . . . . . . 370--381
Arnd Christian König and
Bolin Ding and
Surajit Chaudhuri and
Vivek Narasayya A statistical approach towards robust
progress estimation . . . . . . . . . . 382--393
Yizhou Sun and
Charu C. Aggarwal and
Jiawei Han Relation strength-aware clustering of
heterogeneous information networks with
incomplete attributes . . . . . . . . . 394--405
Lingkun Wu and
Xiaokui Xiao and
Dingxiong Deng and
Gao Cong and
Andy Diwen Zhu and
Shuigeng Zhou Shortest path and distance queries on
road networks: an experimental
evaluation . . . . . . . . . . . . . . . 406--417
Dóra Erd\Hos and
Vatche Ishakian and
Andrei Lapets and
Evimaria Terzi and
Azer Bestavros The filter-placement problem and its
application to minimizing information
multiplicity . . . . . . . . . . . . . . 418--429
Venu Satuluri and
Srinivasan Parthasarathy Bayesian locality sensitive hashing for
fast similarity search . . . . . . . . . 430--441
Yasuhiro Fujiwara and
Makoto Nakatsuji and
Makoto Onizuka and
Masaru Kitsuregawa Fast and exact top-$k$ search for random
walk with restart . . . . . . . . . . . 442--453
Bahman Bahmani and
Ravi Kumar and
Sergei Vassilvitskii Densest subgraph in streaming and
MapReduce . . . . . . . . . . . . . . . 454--465
Arlei Silva and
Wagner Meira, Jr. and
Mohammed J. Zaki Mining attribute-structure correlated
patterns in large attributed graphs . . 466--477
Karl Schnaitter and
Neoklis Polyzotis Semi-automatic index tuning: keeping
DBAs in the loop . . . . . . . . . . . . 478--489
Robert Fink and
Larisa Han and
Dan Olteanu Aggregation in probabilistic databases
via knowledge compilation . . . . . . . 490--501
Felix Halim and
Stratos Idreos and
Panagiotis Karras and
Roland H. C. Yap Stochastic database cracking: towards
robust adaptive indexing in main-memory
column-stores . . . . . . . . . . . . . 502--513
Chao Li and
Gerome Miklau An adaptive mechanism for accurate query
answering under differential privacy . . 514--525
Georgios Giannikis and
Gustavo Alonso and
Donald Kossmann SharedDB: killing one thousand queries
with one stone . . . . . . . . . . . . . 526--537
Joachim Selke and
Christoph Lofi and
Wolf-Tilo Balke Pushing the boundaries of crowd-enabled
databases with query-driven schema
expansion . . . . . . . . . . . . . . . 538--549
Bo Zhao and
Benjamin I. P. Rubinstein and
Jim Gemmell and
Jiawei Han A Bayesian approach to discovering truth
from conflicting sources for data
integration . . . . . . . . . . . . . . 550--561
Prasang Upadhyaya and
Magdalena Balazinska and
Dan Suciu How to price shared optimizations in the
cloud . . . . . . . . . . . . . . . . . 562--573
Albert Angel and
Nikos Sarkas and
Nick Koudas and
Divesh Srivastava Dense subgraph maintenance under
streaming edge weight updates for
real-time story identification . . . . . 574--585
Iman Elghandour and
Ashraf Aboulnaga ReStore: reusing results of MapReduce
jobs . . . . . . . . . . . . . . . . . . 586--597
Nodira Khoussainova and
Magdalena Balazinska and
Dan Suciu PerfXplain: debugging MapReduce job
performance . . . . . . . . . . . . . . 598--609
Francesco Gullo and
Andrea Tagarelli Uncertain centroid based partitional
clustering of uncertain data . . . . . . 610--621
Bahman Bahmani and
Benjamin Moseley and
Andrea Vattani and
Ravi Kumar and
Sergei Vassilvitskii Scalable $k$-means$ + + $ . . . . . . . 622--633
Michael Benedikt and
Pierre Bourhis and
Clemens Ley Querying schemas with access
restrictions . . . . . . . . . . . . . . 634--645
Goetz Graefe and
Harumi Kuno Definition, detection, and recovery of
single-page failures, a fourth class of
database failures . . . . . . . . . . . 646--655
Goetz Graefe and
Felix Halim and
Stratos Idreos and
Harumi Kuno and
Stefan Manegold Concurrency control for adaptive
indexing . . . . . . . . . . . . . . . . 656--667
Qiang Zeng and
Hai Zhuge Comments on ``Stack-based Algorithms for
Pattern Matching on DAGs'' . . . . . . . 668--679
Nilesh Dalvi and
Ashwin Machanavajjhala and
Bo Pang An analysis of structured data on the
web . . . . . . . . . . . . . . . . . . 680--691
Kyriakos Mouratidis and
Man Lung Yiu Shortest path computation with no
information leakage . . . . . . . . . . 692--703
Ahmed Metwally and
Christos Faloutsos V-SMART-join: a scalable MapReduce
framework for all-pair similarity joins
of multisets and vectors . . . . . . . . 704--715
Yucheng Low and
Danny Bickson and
Joseph Gonzalez and
Carlos Guestrin and
Aapo Kyrola and
Joseph M. Hellerstein Distributed GraphLab: a framework for
machine learning and data mining in the
cloud . . . . . . . . . . . . . . . . . 716--727
Qiang Zeng and
Xiaorui Jiang and
Hai Zhuge Adding logical operators to tree pattern
queries on graph-structured data . . . . 728--739
Rishabh Singh and
Sumit Gulwani Learning semantic string transformations
from examples . . . . . . . . . . . . . 740--751
Changbin Liu and
Lu Ren and
Boon Thau Loo and
Yun Mao and
Prithwish Basu Cologne: a declarative distributed
constraint optimization platform . . . . 752--763
Yi Zhang and
Jun Yang Optimizing I/O for big array analytics 764--775
Peter Bailis and
Shivaram Venkataraman and
Michael J. Franklin and
Joseph M. Hellerstein and
Ion Stoica Probabilistically bounded staleness for
practical partial quorums . . . . . . . 776--787
Zhao Sun and
Hongzhi Wang and
Haixun Wang and
Bin Shao and
Jianzhong Li Efficient subgraph matching on billion
node graphs . . . . . . . . . . . . . . 788--799
Ye Yuan and
Guoren Wang and
Lei Chen and
Haixun Wang Efficient subgraph similarity search on
large probabilistic graph databases . . 800--811
Jia Wang and
James Cheng Truss decomposition in massive networks 812--823
Ju Fan and
Guoliang Li and
Lizhu Zhou and
Shanshan Chen and
Jun Hu Seal: spatio-textual similarity search 824--835
Theodoros Lappas and
Marcos R. Vieira and
Dimitrios Gunopulos and
Vassilis J. Tsotras On the spatiotemporal burstiness of
terms . . . . . . . . . . . . . . . . . 836--847
Houtan Shirani-Mehr and
Farnoush Banaei-Kashani and
Cyrus Shahabi Efficient reachability query evaluation
in large spatiotemporal contact datasets 848--859
Thi Nguyen and
Zhen He and
Rui Zhang and
Phillip Ward Boosting moving object indexing through
velocity partitioning . . . . . . . . . 860--871
Nicole Bidoit-Tollu and
Dario Colazzo and
Federico Ulliana Type-based detection of XML query-update
independence . . . . . . . . . . . . . . 872--883
Benjamin Sowell and
Wojciech Golab and
Mehul A. Shah Minuet: a scalable distributed
multiversion B-tree . . . . . . . . . . 884--895
Hongzhi Yin and
Bin Cui and
Jing Li and
Junjie Yao and
Chen Chen Challenging the long tail recommendation 896--907
Rakesh Pimplikar and
Sunita Sarawagi Answering table queries on the Web using
column keywords . . . . . . . . . . . . 908--919
Michael T. Goodrich and
Charalampos Papamanthou and
Duy Nguyen and
Roberto Tamassia and
Cristina Videira Lopes and
Olga Ohrimenko and
Nikos Triandopoulos Efficient verification of web-content
searching through authenticated web
crawlers . . . . . . . . . . . . . . . . 920--931
Lukas Blunschi and
Claudio Jossen and
Donald Kossmann and
Magdalini Mori and
Kurt Stockinger SODA: generating SQL for business users 932--943
Manolis Terrovitis and
Nikos Mamoulis and
John Liagouris and
Spiros Skiadopoulos Privacy preservation by disassociation 944--955
Bhargav Kanagal and
Amr Ahmed and
Sandeep Pandey and
Vanja Josifovski and
Jeff Yuan and
Lluis Garcia-Pueyo Supercharging recommender systems using
taxonomies for learning user purchase
behavior . . . . . . . . . . . . . . . . 956--967
Yanif Ahmad and
Oliver Kennedy and
Christoph Koch and
Milos Nikolic DBToaster: higher-order delta processing
for dynamic, frequently fresh views . . 968--979
Manoj K. Agarwal and
Krithi Ramamritham and
Manish Bhide Real time discovery of dense clusters in
highly dynamic graphs: identifying real
world events in highly dynamic
environments . . . . . . . . . . . . . . 980--991
Odysseas Papapetrou and
Minos Garofalakis and
Antonios Deligiannakis Sketch-based querying of distributed
sliding-window data streams . . . . . . 992--1003
Hoang Tam Vo and
Sheng Wang and
Divyakant Agrawal and
Gang Chen and
Beng Chin Ooi LogBase: a scalable log-structured
database system in the cloud . . . . . . 1004--1015
Wei Lu and
Yanyan Shen and
Su Chen and
Beng Chin Ooi Efficient processing of $k$ nearest
neighbor joins using MapReduce . . . . . 1016--1027
Nikolay Laptev and
Kai Zeng and
Carlo Zaniolo Early accurate results for advanced
analytics on MapReduce . . . . . . . . . 1028--1039
Xuan Liu and
Meiyu Lu and
Beng Chin Ooi and
Yanyan Shen and
Sai Wu and
Meihui Zhang CDAS: a crowdsourcing data analytics
system . . . . . . . . . . . . . . . . . 1040--1051
Mayank Sachan and
Arnab Bhattacharya Mining statistically significant
substrings using the chi-square
statistic . . . . . . . . . . . . . . . 1052--1063
Martina-Cezara Albutiu and
Alfons Kemper and
Thomas Neumann Massively parallel sort-merge joins in
main memory multi-core database systems 1064--1075
Tian Luo and
Rubao Lee and
Michael Mesnier and
Feng Chen and
Xiaodong Zhang hStorage-DB: heterogeneity-aware data
management to exploit the full
capability of hybrid storage systems . . 1076--1087
Dong-Wan Choi and
Chin-Wan Chung and
Yufei Tao A scalable algorithm for maximizing
range sum in spatial databases . . . . . 1088--1099
Ahmed M. Aly and
Walid G. Aref and
Mourad Ouzzani Spatial queries with two kNN predicates 1100--1111
Cheng Sheng and
Nan Zhang and
Yufei Tao and
Xin Jin Optimal algorithms for crawling a hidden
database in the web . . . . . . . . . . 1112--1123
Lu Qin and
Jeffrey Xu Yu and
Lijun Chang Diversifying top-$k$ results . . . . . . 1124--1135
Xin Cao and
Lisi Chen and
Gao Cong and
Xiaokui Xiao Keyword-aware optimal route search . . . 1136--1147
Bogdan Cautis and
Evgeny Kharlamov Answering queries using views over
probabilistic XML: complexity and
tractability . . . . . . . . . . . . . . 1148--1159
Abhay Jha and
Dan Suciu Probabilistic databases with MarkoViews 1160--1171
Konstantinos Mamouras and
Sigal Oren and
Lior Seeman and
Lucja Kot and
Johannes Gehrke The complexity of social coordination 1172--1183
Xiaofei Zhang and
Lei Chen and
Min Wang Efficient multi-way theta-join
processing using MapReduce . . . . . . . 1184--1195
Harold Lim and
Herodotos Herodotou and
Shivnath Babu Stubby: a transformation-based optimizer
for MapReduce workflows . . . . . . . . 1196--1207
Zhuowei Bao and
Susan B. Davidson and
Tova Milo Labeling workflow views with
fine-grained dependencies . . . . . . . 1208--1219
Jaros\law Szlichta and
Parke Godfrey and
Jarek Gryz Fundamentals of order dependencies . . . 1220--1231
Nurzhan Bakibayev and
Dan Olteanu and
Jakub Závodný FDB: a query engine for factorised
relational databases . . . . . . . . . . 1232--1243
Yu Cao and
Chee-Yong Chan and
Jie Li and
Kian-Lee Tan Optimization of analytic window
functions . . . . . . . . . . . . . . . 1244--1255
Fabian Hueske and
Mathias Peters and
Matthias J. Sax and
Astrid Rheinländer and
Rico Bergmann and
Aljoscha Krettek and
Kostas Tzoumas Opening the black boxes in data flow
optimization . . . . . . . . . . . . . . 1256--1267
Stephan Ewen and
Kostas Tzoumas and
Moritz Kaufmann and
Volker Markl Spinning fast iterative data flows . . . 1268--1279
Svilen R. Mihaylov and
Zachary G. Ives and
Sudipto Guha REX: recursive, delta-based data-centric
computation . . . . . . . . . . . . . . 1280--1291
James Cheng and
Zechao Shang and
Hong Cheng and
Haixun Wang and
Jeffrey Xu Yu K-reach: who is in your small world . . 1292--1303
Wenfei Fan and
Xin Wang and
Yinghui Wu Performance guarantees for distributed
reachability queries . . . . . . . . . . 1304--1316
Pirooz Chubak and
Davood Rafiei Efficient indexing and querying over
syntactically annotated trees . . . . . 1316--1327
Vince Bárány and
Balder ten Cate and
Martin Otto Queries with guarded negation . . . . . 1328--1339
Ninghui Li and
Wahbeh Qardaji and
Dong Su and
Jianneng Cao PrivBasis: frequent itemset mining with
differential privacy . . . . . . . . . . 1340--1351
Ganzhao Yuan and
Zhenjie Zhang and
Marianne Winslett and
Xiaokui Xiao and
Yin Yang and
Zhifeng Hao Low-rank mechanism: optimizing batch
queries under differential privacy . . . 1352--1363
Jun Zhang and
Zhenjie Zhang and
Xiaokui Xiao and
Yin Yang and
Marianne Winslett Functional mechanism: regression
analysis under differential privacy . . 1364--1375
Paolo Boldi and
Francesco Bonchi and
Aristides Gionis and
Tamir Tassa Injecting uncertainty in graphs for
identity obfuscation . . . . . . . . . . 1376--1387
Jianneng Cao and
Panagiotis Karras Publishing microdata with a robust
privacy guarantee . . . . . . . . . . . 1388--1399
Ziyu Guan and
Xifeng Yan and
Lance M. Kaplan Measuring two-event structural
correlations on graphs . . . . . . . . . 1400--1411
Jeffrey Jestes and
Jeff M. Phillips and
Feifei Li and
Mingwang Tang Ranking large temporal data . . . . . . 1412--1423
Florian Funke and
Alfons Kemper and
Thomas Neumann Compacting transactional data in hybrid
OLTP&OLAP databases . . . . . . . . . . . 1424--1435
Alexander Hall and
Olaf Bachmann and
Robert Büssow and
Silviu Ganceanu and
Marc Nunkesser Processing a trillion cells per mouse
click . . . . . . . . . . . . . . . . . 1436--1446
Danica Porobic and
Ippokratis Pandis and
Miguel Branco and
Pinar Tözün and
Anastasia Ailamaki OLTP on hardware islands . . . . . . . . 1447--1458
Stacy Patterson and
Aaron J. Elmore and
Faisal Nawab and
Divyakant Agrawal and
Amr El Abbadi Serializability, not serial: concurrency
control and availability in
multi-datacenter datastores . . . . . . 1459--1470
Alvin Cheung and
Samuel Madden and
Owen Arden and
Andrew C. Myers Automatic partitioning of database
applications . . . . . . . . . . . . . . 1471--1482
Jiannan Wang and
Tim Kraska and
Michael J. Franklin and
Jianhua Feng CrowdER: crowdsourcing entity resolution 1483--1494
Caleb Chen Cao and
Jieying She and
Yongxin Tong and
Lei Chen Whom to ask?: jury selection for
decision making tasks on micro-blog
services . . . . . . . . . . . . . . . . 1495--1506
Xiaochun Yang and
Honglei Liu and
Bin Wang ALAE: accelerating local alignment with
affine gap exactly in biosequence
databases . . . . . . . . . . . . . . . 1507--1518
K. Selçuk Candan and
Rosaria Rossini and
Xiaolan Wang and
Maria Luisa Sapino sDTW: computing DTW distances using
locally relevant constraints based on
salient feature alignments . . . . . . . 1519--1530
Farhan Tauheed and
Thomas Heinis and
Felix Schürmann and
Henry Markram and
Anastasia Ailamaki SCOUT: prefetching for latent structure
following queries . . . . . . . . . . . 1531--1542
Kaibo Wang and
Yin Huai and
Rubao Lee and
Fusheng Wang and
Xiaodong Zhang and
Joel H. Saltz Accelerating pathology image data
cross-comparison on CPU--GPU hybrid
systems . . . . . . . . . . . . . . . . 1543--1554
Jiexing Li and
Arnd Christian König and
Vivek Narasayya and
Surajit Chaudhuri Robust estimation of resource
consumption for SQL queries using
statistical techniques . . . . . . . . . 1555--1566
Mahashweta Das and
Saravanan Thirumuruganathan and
Sihem Amer-Yahia and
Gautam Das and
Cong Yu Who tags what?: an analysis framework 1567--1578
Haohan Zhu and
George Kollios and
Vassilis Athitsos A generic framework for efficient and
effective subsequence retrieval . . . . 1579--1590
Jens Dittrich and
Jorge-Arnulfo Quiané-Ruiz and
Stefan Richter and
Stefan Schuh and
Alekh Jindal and
Jörg Schad Only aggressive elephants are fast
elephants . . . . . . . . . . . . . . . 1591--1602
Rui Li and
Shengjie Wang and
Kevin Chen-Chuan Chang Multiple location profiling for users
and relationships from social network
and content . . . . . . . . . . . . . . 1603--1614
Woon-Hak Kang and
Sang-Won Lee and
Bongki Moon Flash-based extended cache for higher
throughput and faster recovery . . . . . 1615--1626
Michael A. Bender and
Martin Farach-Colton and
Rob Johnson and
Russell Kraner and
Bradley C. Kuszmaul and
Dzejla Medjedovic and
Pablo Montes and
Pradeep Shetty and
Richard P. Spillane and
Erez Zadok Don't thrash: how to cache your hash on
flash . . . . . . . . . . . . . . . . . 1627--1637
Robert Isele and
Christian Bizer Learning expressive linkage rules using
genetic programming . . . . . . . . . . 1638--1649
Yongxin Tong and
Lei Chen and
Yurong Cheng and
Philip S. Yu Mining frequent itemsets over uncertain
databases . . . . . . . . . . . . . . . 1650--1661
Michele Dallachiesa and
Besmira Nushi and
Katsiaryna Mirylenka and
Themis Palpanas Uncertain time-series similarity: return
to the basics . . . . . . . . . . . . . 1662--1673
Tamraparni Dasu and
Ji Meng Loh Statistical distortion: consequences of
data cleaning . . . . . . . . . . . . . 1674--1683
Willis Lang and
Stavros Harizopoulos and
Jignesh M. Patel and
Mehul A. Shah and
Dimitris Tsirogiannis Towards energy-efficient database
cluster design . . . . . . . . . . . . . 1684--1695
Christian S. Jensen Data management on the spatial web . . . 1696--1696
Brenda Dietrich Data analytics opportunities in a
smarter planet . . . . . . . . . . . . . 1697--1697
Kenan Sahin Challenges in economic massive content
storage and management (MCSAM) in the
era of self-organizing, self-expanding
and self-linking data clusters . . . . . 1698--1698
Gurmeet Singh Manku and
Rajeev Motwani Approximate frequency counts over data
streams . . . . . . . . . . . . . . . . 1699--1699
Joseph M. Hellerstein and
Christoper Ré and
Florian Schoppmann and
Daisy Zhe Wang and
Eugene Fratkin and
Aleksander Gorajek and
Kee Siong Ng and
Caleb Welton and
Xixuan Feng and
Kun Li and
Arun Kumar The MADlib analytics library: or MAD
skills, the SQL . . . . . . . . . . . . 1700--1711
Avrilia Floratou and
Nikhil Teletia and
David J. DeWitt and
Jignesh M. Patel and
Donghui Zhang Can the elephants handle the NoSQL
onslaught? . . . . . . . . . . . . . . . 1712--1723
Tilmann Rabl and
Sergio Gómez-Villamor and
Mohammad Sadoghi and
Victor Muntés-Mulero and
Hans-Arno Jacobsen and
Serge Mankovskii Solving big data challenges for
enterprise application performance
management . . . . . . . . . . . . . . . 1724--1735
Avraham Shinnar and
David Cunningham and
Vijay Saraswat and
Benjamin Herta M3R: increased performance for in-memory
Hadoop jobs . . . . . . . . . . . . . . 1736--1747
Philipp Rösch and
Lars Dannecker and
Franz Färber and
Gregor Hackenbroich A storage advisor for hybrid-store
databases . . . . . . . . . . . . . . . 1748--1758
Micha\l \'Switakowski and
Peter Boncz and
Marcin Zukowski From cooperative scans to predictive
buffer management . . . . . . . . . . . 1759--1770
George Lee and
Jimmy Lin and
Chuang Liu and
Andrew Lorek and
Dmitriy Ryaboy The unified logging infrastructure for
data analytics at Twitter . . . . . . . 1771--1780
Tomas Talius and
Robin Dhamankar and
Andrei Dumitrache and
Hanuma Kodavalla Transaction log based application error
recovery and point in-time query . . . . 1781--1789
Andrew Lamb and
Matt Fuller and
Ramakrishna Varadarajan and
Nga Tran and
Ben Vandiver and
Lyric Doshi and
Chuck Bear The Vertica Analytic Database: C-Store 7
years later . . . . . . . . . . . . . . 1790--1801
Yanpei Chen and
Sara Alspaugh and
Randy Katz Interactive analytical processing in big
data systems: a cross-industry study of
MapReduce workloads . . . . . . . . . . 1802--1813
Wang Lam and
Lu Liu and
Sts Prasad and
Anand Rajaraman and
Zoheb Vacheri and
AnHai Doan Muppet: MapReduce-style processing of
fast data . . . . . . . . . . . . . . . 1814--1825
Gabriela Jacques-Silva and
Bugra Gedik and
Rohit Wagle and
Kun-Lung Wu and
Vibhore Kumar Building user-defined runtime adaptation
routines for stream processing
applications . . . . . . . . . . . . . . 1826--1837
Junchen Jiang and
Hongji Bao and
Edward Y. Chang and
Yuqian Li MOIST: a scalable and parallel moving
object indexer with school tracking . . 1838--1849
Dan R. K. Ports and
Kevin Grittner Serializable snapshot isolation in
PostgreSQL . . . . . . . . . . . . . . . 1850--1861
Karin Murthy and
Prasad M. Deshpande and
Atreyee Dey and
Ramanujam Halasipuram and
Mukesh Mohania and
P. Deepak and
Jennifer Reed and
Scott Schumacher Exploiting evidence from unstructured
data to enhance master data management 1862--1873
Lili Wu and
Roshan Sumbaly and
Chris Riccomini and
Gordon Koo and
Hyung Jin Kim and
Jay Kreps and
Sam Shah Avatara: OLAP for web-scale analytics
products . . . . . . . . . . . . . . . . 1874--1877
Lars Kolb and
Andreas Thor and
Erhard Rahm Dedoop: efficient deduplication with
Hadoop . . . . . . . . . . . . . . . . . 1878--1881
Xiufeng Liu and
Christian Thomsen and
Torben Bach Pedersen MapReduce-based dimensional ETL made
easy . . . . . . . . . . . . . . . . . . 1882--1885
Huiqi Xu and
Zhen Li and
Shumin Guo and
Keke Chen CloudVista: interactive and economical
visual cluster analysis for big data in
the cloud . . . . . . . . . . . . . . . 1886--1889
Alexander Alexandrov and
Kostas Tzoumas and
Volker Markl Myriad: scalable and expressive data
generation . . . . . . . . . . . . . . . 1890--1893
Eugene Wu and
Samuel Madden and
Michael Stonebraker A demonstration of DBWipes: clean as you
query . . . . . . . . . . . . . . . . . 1894--1897
Sattam Alsubaiee and
Yasser Altowim and
Hotham Altwaijry and
Alexander Behm and
Vinayak Borkar and
Yingyi Bu and
Michael Carey and
Raman Grover and
Zachary Heilbron and
Young-Seok Kim and
Chen Li and
Nicola Onose and
Pouria Pirzadeh and
Rares Vernica and
Jian Wen ASTERIX: an open source system for ``Big
Data'' management and analysis (demo) 1898--1901
Sameer Agarwal and
Anand P. Iyer and
Aurojit Panda and
Samuel Madden and
Barzan Mozafari and
Ion Stoica Blink and it's done: interactive queries
on very large data . . . . . . . . . . . 1902--1905
Abhishek Roy and
Yanlei Diao and
Evan Mauceli and
Yiping Shen and
Bai-Lin Wu Massive genomic data processing and deep
analysis . . . . . . . . . . . . . . . . 1906--1909
Erietta Liarou and
Stratos Idreos and
Stefan Manegold and
Martin Kersten MonetDB/DataCell: online analytics in a
streaming column-store . . . . . . . . . 1910--1913
Xin Cao and
Gao Cong and
Christian S. Jensen and
Jun Jie Ng and
Beng Chin Ooi and
Nhan-Tue Phan and
Dingming Wu SWORS: a system for the efficient
retrieval of relevant spatial web
objects . . . . . . . . . . . . . . . . 1914--1917
Atsuyuki Morishima and
Norihide Shinagawa and
Tomomi Mitsuishi and
Hideto Aoki and
Shun Fukusumi CyLog/Crowd4U: a declarative platform
for complex data-centric crowdsourcing 1918--1921
Yasin N. Silva and
Spencer Pearson Exploiting database similarity joins for
metric spaces . . . . . . . . . . . . . 1922--1925
Mrunal Gawade and
Martin Kersten Stethoscope: a platform for interactive
visual analysis of query execution plans 1926--1929
Alexios Kotsifakos and
Panagiotis Papapetrou and
Jaakko Hollmén and
Dimitrios Gunopulos and
Vassilis Athitsos and
George Kollios Hum-a-song: a subsequence matching with
gaps-range-tolerances query-by-humming
system . . . . . . . . . . . . . . . . . 1930--1933
YongChul Kwon and
Magdalena Balazinska and
Bill Howe and
Jerome Rolia SkewTune in action: mitigating skew in
MapReduce applications . . . . . . . . . 1934--1937
Azza Abouzied and
Joseph M. Hellerstein and
Avi Silberschatz Playful query specification with
DataPlay . . . . . . . . . . . . . . . . 1938--1941
Ioannis Alagiannis and
Renata Borovica and
Miguel Branco and
Stratos Idreos and
Anastasia Ailamaki NoDB in action: adaptive query
processing on raw data . . . . . . . . . 1942--1945
Florian Wenzel and
Markus Endres and
Stefan Mandl and
Werner Kießling Complex preference queries supporting
spatial applications for user groups . . 1946--1949
Nurzhan Bakibayev and
Dan Olteanu and
Jakub Závodný Demonstration of the FDB query engine
for factorised databases . . . . . . . . 1950--1953
Zichen Xu and
Yi-Cheng Tu and
Xiaorui Wang PET: reducing database energy cost via
query optimization . . . . . . . . . . . 1954--1957
Andrés Letelier and
Jorge Pérez and
Reinhard Pichler and
Sebastian Skritek SPAM: a SPARQL analysis and manipulation
tool . . . . . . . . . . . . . . . . . . 1958--1961
Paraschos Koutris and
Prasang Upadhyaya and
Magdalena Balazinska and
Bill Howe and
Dan Suciu QueryMarket demonstration: pricing for
online data markets . . . . . . . . . . 1962--1965
Siqiang Luo and
Yifeng Luo and
Shuigeng Zhou and
Gao Cong and
Jihong Guan DISKs: a system for distributed spatial
group keyword search on road networks 1966--1969
Stefan Endrullis and
Andreas Thor and
Erhard Rahm WETSUIT: an efficient mashup tool for
searching and fusing web entities . . . 1970--1973
Mohamed E. Khalefa and
Ulrike Fischer and
Torben Bach Pedersen and
Wolfgang Lehner Model-based integration of past & future
in TimeTravel . . . . . . . . . . . . . 1974--1977
Julian Eberius and
Maik Thiele and
Katrin Braunschweig and
Wolfgang Lehner DrillBeyond: enabling business analysts
to explore the Web of Open Data . . . . 1978--1981
Ndapandula Nakashole and
Gerhard Weikum and
Fabian Suchanek Discovering and exploring relations on
the web . . . . . . . . . . . . . . . . 1982--1985
Saravanan Thirumuruganathan and
Mahashweta Das and
Shrikant Desai and
Sihem Amer-Yahia and
Gautam Das and
Cong Yu MapRat: meaningful explanation,
interactive exploration and
geo-visualization of collaborative
ratings . . . . . . . . . . . . . . . . 1986--1989
Hyunjung Park and
Hector Garcia-Molina and
Richard Pang and
Neoklis Polyzotis and
Aditya Parameswaran and
Jennifer Widom Deco: a system for declarative
crowdsourcing . . . . . . . . . . . . . 1990--1993
Wim Martens and
Matthias Niewerth and
Frank Neven and
Thomas Schwentick Developing and analyzing XSDs through
BonXai . . . . . . . . . . . . . . . . . 1994--1997
Aaron J. Elmore and
Sudipto Das and
Divyakant Agrawal and
Amr El Abbadi InfoPuzzle: exploring group decision
making in mobile peer-to-peer databases 1998--2001
Jianqiu Xu and
Ralf Hartmut Güting Manage and query generic moving objects
in SECONDO . . . . . . . . . . . . . . . 2002--2005
Pei Li and
Christina Tziviskou and
Haidong Wang and
Xin Luna Dong and
Xiaoguang Liu and
Andrea Maurino and
Divesh Srivastava Chronos: facilitating history discovery
by linking temporal records . . . . . . 2006--2009
Manolis Koubarakis and
Mihai Datcu and
Charalambos Kontoes and
Ugo Di Giammatteo and
Stefan Manegold and
Eva Klien TELEIOS: a database-powered virtual
earth observatory . . . . . . . . . . . 2010--2013
Jens Dittrich and
Jorge-Arnulfo Quiané-Ruiz Efficient big data processing in Hadoop
MapReduce . . . . . . . . . . . . . . . 2014--2015
Kyuseok Shim MapReduce algorithms for big data
analysis . . . . . . . . . . . . . . . . 2016--2017
Lise Getoor and
Ashwin Machanavajjhala Entity resolution: theory, practice &
open challenges . . . . . . . . . . . . 2018--2019
Jiri Schindler I/O characteristics of NoSQL databases 2020--2021
Yizhou Sun and
Jiawei Han and
Xifeng Yan and
Philip S. Yu Mining knowledge from interconnected
data: a heterogeneous information
network analysis approach . . . . . . . 2022--2023
B. Aditya Prakash and
Christos Faloutsos Understanding and managing cascades on
large graphs . . . . . . . . . . . . . . 2024--2025
Asuman Dogac Interoperability in eHealth systems . . 2026--2027
Divyakant Agrawal and
Amr El Abbadi and
Shiyuan Wang Secure and privacy-preserving data
services in the cloud: a data centric
view . . . . . . . . . . . . . . . . . . 2028--2029
Sudipto Guha and
Andrew McGregor Graph synopses, sketches, and streams: a
survey . . . . . . . . . . . . . . . . . 2030--2031
Alexandros Labrinidis and
H. V. Jagadish Challenges and opportunities with big
data . . . . . . . . . . . . . . . . . . 2032--2033
Amr El Abbadi and
Mohamed F. Mokbel Panel discussion on social networks and
mobility in the cloud . . . . . . . . . 2034--2035
Panagiotis Bouros and
Shen Ge and
Nikos Mamoulis Spatio-textual similarity joins . . . . 1--12
Marina Drosou and
Evaggelia Pitoura DisC diversity: result diversification
based on dissimilarity and coverage . . 13--24
Chen Zeng and
Jeffrey F. Naughton and
Jin-Yi Cai On differentially private frequent
itemset mining . . . . . . . . . . . . . 25--36
Xin Luna Dong and
Barna Saha and
Divesh Srivastava Less is more: selecting sources wisely
for integration . . . . . . . . . . . . 37--48
Wenchao Zhou and
Suyog Mapara and
Yiqing Ren and
Yang Li and
Andreas Haeberlen and
Zachary Ives and
Boon Thau Loo and
Micah Sherr Distributed time-aware provenance . . . 49--60
Diego Calvanese and
Giuseppe De Giacomo and
Maurizio Lenzerini and
Moshe Y. Vardi Query processing under GLAV mappings for
relational and graph databases . . . . . 61--72
Kyriakos Mouratidis and
HweeHwa Pang Computing immutable regions for subspace
top-$k$ queries . . . . . . . . . . . . 73--84
Feng Zhao and
Anthony K. H. Tung Large scale cohesive subgraphs discovery
for social network visual analysis . . . 85--96
Xian Li and
Xin Luna Dong and
Kenneth Lyons and
Weiyi Meng and
Divesh Srivastava Truth finding on the Deep Web: is the
problem solved? . . . . . . . . . . . . 97--108
Adam Marcus and
David Karger and
Samuel Madden and
Robert Miller and
Sewoong Oh Counting with the crowd . . . . . . . . 109--120
Tao Zou and
Ronan Le Bras and
Marcos Vaz Salles and
Alan Demers and
Johannes Gehrke ClouDiA: a deployment advisor for public
clouds . . . . . . . . . . . . . . . . . 121--132
Jinsoo Lee and
Wook-Shin Han and
Romans Kasperovics and
Jeong-Hoon Lee An in-depth comparison of subgraph
isomorphism algorithms in graph
databases . . . . . . . . . . . . . . . 133--144
Kun Ren and
Alexander Thomson and
Daniel J. Abadi Lightweight locking for main memory
database systems . . . . . . . . . . . . 145--156
Ye Zhang and
Wai-Kit Wong and
S. M. Yiu and
Nikos Mamoulis and
David W. Cheung Lightweight privacy-preserving
peer-to-peer data integration . . . . . 157--168
Yang Li and
Pegah Kamousi and
Fangqiu Han and
Shengqi Yang and
Xifeng Yan and
Subhash Suri Memory efficient minimum substring
partitioning . . . . . . . . . . . . . . 169--180
Arijit Khan and
Yinghui Wu and
Charu C. Aggarwal and
Xifeng Yan NeMa: fast graph search with label
similarity . . . . . . . . . . . . . . . 181--192
Xika Lin and
Abhishek Mukherji and
Elke A. Rundensteiner and
Carolina Ruiz and
Matthew O. Ward PARAS: a parameter space framework for
online association mining . . . . . . . 193--204
Zhepeng Yan and
Nan Zheng and
Zachary G. Ives and
Partha Pratim Talukdar and
Cong Yu Actively soliciting feedback for query
answers in keyword search-based data
integration . . . . . . . . . . . . . . 205--216
Lisi Chen and
Gao Cong and
Christian S. Jensen and
Dingming Wu Spatial keyword query processing: an
experimental evaluation . . . . . . . . 217--228
Milad Eftekhar and
Nick Koudas Partitioning and ranking tagged data
sources . . . . . . . . . . . . . . . . 229--240
Antonio Badia and
Bin Cao Efficient implementation of generalized
quantification in relational query
languages . . . . . . . . . . . . . . . 241--252
Rui Liu and
Ashraf Aboulnaga and
Kenneth Salem DAX: a widely distributed multitenant
storage service for DBMS hosting . . . . 253--264
Kai Zeng and
Jiacheng Yang and
Haixun Wang and
Bin Shao and
Zhongyuan Wang A distributed graph engine for web scale
RDF data . . . . . . . . . . . . . . . . 265--276
Anish Das Sarma and
Foto N. Afrati and
Semih Salihoglu and
Jeffrey D. Ullman Upper and lower bounds on the cost of a
map-reduce computation . . . . . . . . . 277--288
Stephen Tu and
M. Frans Kaashoek and
Samuel Madden and
Nickolai Zeldovich Processing analytical queries over
encrypted data . . . . . . . . . . . . . 289--300
Georgios Kellaris and
Stavros Papadopoulos Practical differential privacy via
grouping and smoothing . . . . . . . . . 301--312
Raghav Kaushik and
Yupeng Fu and
Ravishankar Ramamurthy On scaling up sensitive data auditing 313--324
Maheswaran Sathiamoorthy and
Megasthenis Asteris and
Dimitris Papailiopoulos and
Alexandros G. Dimakis and
Ramkumar Vadali and
Scott Chen and
Dhruba Borthakur XORing elephants: novel erasure codes
for big data . . . . . . . . . . . . . . 325--336
Steffen Rendle Scaling factorization machines to
relational data . . . . . . . . . . . . 337--348
Steven Euijong Whang and
Peter Lofgren and
Hector Garcia-Molina Question selection for crowd entity
resolution . . . . . . . . . . . . . . . 349--360
Alekh Jindal and
Endre Palatinus and
Vladimir Pavlov and
Jens Dittrich A comparison of knives for bread slicing 361--372
Chuan Xiao and
Jianbin Qin and
Wei Wang and
Yoshiharu Ishikawa and
Koji Tsuda and
Kunihiko Sadakane Efficient error-tolerant query
autocompletion . . . . . . . . . . . . . 373--384
Alexander Shraer and
Maxim Gurevich and
Marcus Fontoura and
Vanja Josifovski Top-$k$ publish-subscribe for social
annotation of news . . . . . . . . . . . 385--396
Phokion G. Kolaitis and
Enela Pema and
Wang-Chiew Tan Efficient querying of inconsistent
databases with binary integer
programming . . . . . . . . . . . . . . 397--408
Aristides Gionis and
Flavio Junqueira and
Vincent Leroy and
Marco Serafini and
Ingmar Weber Piggybacking on social networks . . . . 409--420
Marco D. Adelfio and
Hanan Samet Schema extraction for tabular data on
the Web . . . . . . . . . . . . . . . . 421--432
Ahmet Erdem Saríyüce and
Bugra Gedik and
Gabriela Jacques-Silva and
Kun-Lung Wu and
Ümit V. Çatalyürek Streaming algorithms for $k$-core
decomposition . . . . . . . . . . . . . 433--444
Oktie Hassanzadeh and
Ken Q. Pu and
Soheil Hassas Yeganeh and
Renée J. Miller and
Lucian Popa and
Mauricio A. Hernández and
Howard Ho Discovering linkage points over Web data 445--456
Ada Wai-Chee Fu and
Huanhuan Wu and
James Cheng and
Raymond Chi-Wing Wong IS-Label: an independent-set based
labeling scheme for point-to-point
distance querying . . . . . . . . . . . 457--468
Thanh T. L. Tran and
Yanlei Diao and
Charles Sutton and
Anna Liu Supporting user-defined functions on
uncertain data . . . . . . . . . . . . . 469--480
Fanwei Zhu and
Yuan Fang and
Kevin Chen-Chuan Chang and
Jing Ying Incremental and accuracy-aware
Personalized PageRank through scheduled
approximation . . . . . . . . . . . . . 481--492
Weiguo Zheng and
Lei Zou and
Yansong Feng and
Lei Chen and
Dongyan Zhao Efficient simrank-based similarity join
over large graphs . . . . . . . . . . . 493--504
Guimei Liu and
Andre Suchitra and
Limsoon Wong A performance study of three disk-based
structures for indexing and querying
frequent itemsets . . . . . . . . . . . 505--516
Pingpeng Yuan and
Pu Liu and
Buwen Wu and
Hai Jin and
Wenya Zhang and
Ling Liu TripleBit: a fast and compact system for
large scale RDF data . . . . . . . . . . 517--528
Sumeet Bajaj and
Radu Sion CorrectDB: SQL engine with practical
query authentication . . . . . . . . . . 529--540
Xin Liu and
Kenneth Salem Hybrid storage management for database
systems . . . . . . . . . . . . . . . . 541--552
Eugene Wu and
Samuel Madden Scorpion: explaining away outliers in
aggregate queries . . . . . . . . . . . 553--564
Rajeev Gupta and
Krithi Ramamritham and
Mukesh Mohania Ratio threshold queries over distributed
data sources . . . . . . . . . . . . . . 565--576
Ting Deng and
Wenfei Fan On the complexity of query result
diversification . . . . . . . . . . . . 577--588
Sourav Dutta and
Ankur Narang and
Suman K. Bera Streaming quotient filter: a near
optimal approximate duplicate detection
approach for data streams . . . . . . . 589--600
Flip Korn and
Barna Saha and
Divesh Srivastava and
Shanshan Ying On repairing structural problems in
semi-structured data . . . . . . . . . . 601--612
Faraz Makari Manshadi and
Baruch Awerbuch and
Rainer Gemulla and
Rohit Khandekar and
Julián Mestre and
Mauro Sozio A distributed algorithm for large-scale
generalized matching . . . . . . . . . . 613--624
Floris Geerts and
Giansalvatore Mecca and
Paolo Papotti and
Donatello Santoro The LLUNATIC data-cleaning framework . . 625--636
Iraklis Psaroudakis and
Manos Athanassoulis and
Anastasia Ailamaki Sharing data and work across concurrent
analytical queries . . . . . . . . . . . 637--648
Haichuan Shang and
Masaru Kitsuregawa Skyline operator on anti-correlated
distributions . . . . . . . . . . . . . 649--660
Hatem Mahmoud and
Faisal Nawab and
Alexander Pucher and
Divyakant Agrawal and
Amr El Abbadi Low-latency multi-datacenter databases
using replicated commit . . . . . . . . 661--672
Yun Chi and
Hakan Hacígümüs and
Wang-Pin Hsiung and
Jeffrey F. Naughton Distribution-based query scheduling . . 673--684
Wenfei Fan and
Floris Geerts and
Frank Neven Making queries tractable on big data
with preprocessing: through the eyes of
complexity theory . . . . . . . . . . . 685--696
Haim Kaplan and
Ilia Lotosh and
Tova Milo and
Slava Novgorodov Answering planning queries with the
crowd . . . . . . . . . . . . . . . . . 697--708
Max Heimel and
Michael Saecker and
Holger Pirk and
Stefan Manegold and
Volker Markl Hardware-oblivious parallelism for
in-memory column-stores . . . . . . . . 709--720
Risi Thonangi and
Jun Yang Permuting data on random-access block
storage . . . . . . . . . . . . . . . . 721--732
Radu Stoica and
Anastasia Ailamaki Improving flash write performance by
using update frequency . . . . . . . . . 733--744
Lu Li and
Chee-Yong Chan Efficient indexing for diverse query
results . . . . . . . . . . . . . . . . 745--756
Chen Jason Zhang and
Lei Chen and
H. V. Jagadish and
Chen Caleb Cao Reducing uncertainty of schema matching
via crowdsourcing . . . . . . . . . . . 757--768
Bin Yang and
Chenjuan Guo and
Christian S. Jensen Travel cost inference from sparse,
spatio temporally correlated time series
using Markov models . . . . . . . . . . 769--780
Hyunjung Park and
Jennifer Widom Query optimization over crowdsourced
data . . . . . . . . . . . . . . . . . . 781--792
Yang Wang and
Peng Wang and
Jian Pei and
Wei Wang and
Sheng Huang A data-adaptive and dynamic segmentation
index for whole matching on time series 793--804
Mirko Bronzi and
Valter Crescenzi and
Paolo Merialdo and
Paolo Papotti Extraction and integration of partially
overlapping web sources . . . . . . . . 805--816
Yuan Yuan and
Rubao Lee and
Xiaodong Zhang The Yin and Yang of processing data
warehousing queries on GPU devices . . . 817--828
Dayu Yuan and
Prasenjit Mitra and
C. Lee Giles Mining and indexing graphs for
supergraph search . . . . . . . . . . . 829--840
Jianmin Wang and
Shaoxu Song and
Xiaochen Zhu and
Xuemin Lin Efficient recovery of missing events . . 841--852
Kai Ren and
YongChul Kwon and
Magdalena Balazinska and
Bill Howe Hadoop's adolescence: an analysis of
Hadoop usage in scientific workloads . . 853--864
Essam Mansour and
Ahmed El-Roby and
Panos Kalnis and
Aron Ahmadia and
Ashraf Aboulnaga RACE: a scalable and elastic parallel
system for discovering repeats in very
long sequences . . . . . . . . . . . . . 865--876
Justin Levandoski and
David Lomet and
Sudipta Sengupta LLAMA: a cache/storage subsystem for
modern hardware . . . . . . . . . . . . 877--888
Jiong He and
Mian Lu and
Bingsheng He Revisiting co-processing for hash joins
on the coupled CPU--GPU architecture . . 889--900
Miao Qiao and
Lu Qin and
Hong Cheng and
Jeffrey Xu Yu and
Wentao Tian Top-$k$ nearest keyword search on large
graphs . . . . . . . . . . . . . . . . . 901--912
Nikos Armenatzoglou and
Stavros Papadopoulos and
Dimitris Papadias A general framework for geo-social query
processing . . . . . . . . . . . . . . . 913--924
Wentao Wu and
Yun Chi and
Hakan Hacígümüs and
Jeffrey F. Naughton Towards predicting query execution time
for concurrent and dynamic database
workloads . . . . . . . . . . . . . . . 925--936
Minos Garofalakis and
Daniel Keren and
Vasilis Samoladas Sketch-based geometric monitoring of
distributed stream queries . . . . . . . 937--948
Cheng Long and
Raymond Chi-Wing Wong and
H. V. Jagadish Direction-preserving trajectory
simplification . . . . . . . . . . . . . 949--960
Nicolas Bruno and
Sapna Jain and
Jingren Zhou Continuous cloud-scale query
optimization and processing . . . . . . 961--972
Andrii Cherniak and
Huma Zaidi and
Vladimir Zadorozhny Optimization strategies for A/B testing
on HADOOP . . . . . . . . . . . . . . . 973--984
Khaled Elmeleegy Piranha: optimizing short jobs in Hadoop 985--996
Mohammad Sadoghi and
Kenneth A. Ross and
Mustafa Canim and
Bishwaranjan Bhattacharjee Making updates disk-I/O friendly using
SSDs . . . . . . . . . . . . . . . . . . 997--1008
Ablimit Aji and
Fusheng Wang and
Hoang Vo and
Rubao Lee and
Qiaoling Liu and
Xiaodong Zhang and
Joel Saltz Hadoop GIS: a high performance spatial
data warehousing system over MapReduce 1009--1020
Bhuvan Bamba and
Siva Ravada and
Ying Hu and
Richard Anderson Statistics collection in Oracle Spatial
and Graph: fast histogram construction
for complex geometry objects . . . . . . 1021--1032
Tyler Akidau and
Alex Balikov and
Kaya Bekiroglu and
Slava Chernyak and
Josh Haberman and
Reuven Lax and
Sam McVeety and
Daniel Mills and
Paul Nordstrom and
Sam Whittle MillWheel: fault-tolerant stream
processing at Internet scale . . . . . . 1033--1044
Ian Rae and
Eric Rollins and
Jeff Shute and
Sukhdeep Sodhi and
Radek Vingralek Online, asynchronous schema change in F1 1045--1056
Lior Abraham and
John Allen and
Oleksandr Barykin and
Vinayak Borkar and
Bhuwan Chopra and
Ciprian Gerea and
Daniel Merl and
Josh Metzler and
David Reiss and
Subbu Subramanian and
Janet L. Wiener and
Okay Zed Scuba: diving into data at Facebook . . 1057--1067
Jeff Shute and
Radek Vingralek and
Bart Samwel and
Ben Handy and
Chad Whipkey and
Eric Rollins and
Mircea Oancea and
Kyle Littlefield and
David Menestrina and
Stephan Ellner and
John Cieslewicz and
Ian Rae and
Traian Stancescu and
Himani Apte F1: a distributed SQL database that
scales . . . . . . . . . . . . . . . . . 1068--1079
Vijayshankar Raman and
Gopi Attaluri and
Ronald Barber and
Naresh Chainani and
David Kalmuk and
Vincent KulandaiSamy and
Jens Leenstra and
Sam Lightstone and
Shaorong Liu and
Guy M. Lohman and
Tim Malkemus and
Rene Mueller and
Ippokratis Pandis and
Berni Schiefer and
David Sharpe and
Richard Sidle and
Adam Storm and
Liping Zhang DB2 with BLU acceleration: so much more
than just a column store . . . . . . . . 1080--1091
Michael Ovsiannikov and
Silvius Rus and
Damian Reeves and
Paul Sutter and
Sriram Rao and
Jim Kelly The Quantcast File System . . . . . . . 1092--1101
Srikanth Bellamkonda and
Hua-Gang Li and
Unmesh Jagtap and
Yali Zhu and
Vince Liang and
Thierry Cruanes Adaptive and big data scale parallel
execution in Oracle . . . . . . . . . . 1102--1113
Kedar Bellare and
Carlo Curino and
Ashwin Machanavajihala and
Peter Mika and
Mandar Rahurkar and
Aamod Sane WOO: a scalable and multi-tenant
platform for continuous knowledge base
synthesis . . . . . . . . . . . . . . . 1114--1125
Abhishek Gattani and
Digvijay S. Lamba and
Nikesh Garera and
Mitul Tiwari and
Xiaoyong Chai and
Sanjib Das and
Sri Subramaniam and
Anand Rajaraman and
Venky Harinarayan and
AnHai Doan Entity extraction, linking,
classification, and tagging for social
media: a Wikipedia-based approach . . . 1126--1137
Hazem Elmeleegy and
Yinan Li and
Yan Qi and
Peter Wilmot and
Mingxi Wu and
Santanu Kolay and
Ali Dasdan and
Songting Chen Overview of turn data management
platform for digital advertising . . . . 1138--1149
Michael Curtiss and
Iain Becker and
Tudor Bosman and
Sergey Doroshenko and
Lucian Grijincu and
Tom Jackson and
Sandhya Kunnatur and
Soren Lassen and
Philip Pronin and
Sriram Sankar and
Guanghao Shen and
Gintaras Woss and
Chao Yang and
Ning Zhang Unicorn: a system for searching the
social graph . . . . . . . . . . . . . . 1150--1161
Sergio Ramazzina and
Chiara L. Ballari and
Daniela Somenzi A new service for customer care based on
the Trentorise bigdata platform . . . . 1162--1163
Fabrizio Antonelli and
Antonino Casella and
Cristiana Chitic and
Roberto Larcher and
Giovanni Torrisi Exploiting the diversity, mass and speed
of territorial data by TELCO Operator
for better user services . . . . . . . . 1164--1165
Ivan Bedini and
Benedikt Elser and
Yannis Velegrakis The Trento big data platform for public
administration and large companies: use
cases and opportunities . . . . . . . . 1166--1167
Nga Tran and
Sreenath Bodagala and
Jaimin Dave Designing query optimizers for big data
problems of the future . . . . . . . . . 1168--1169
Monica Franceschini How to maximize the value of big data
with the open source SpagoBI suite
through a comprehensive approach . . . . 1170--1171
Edward Y. Chang Context-aware computing: opportunities
and open issues . . . . . . . . . . . . 1172--1173
Oktie Hassanzadeh and
Anastasios Kementsietsidis and
Benny Kimelfeld and
Rajasekar Krishnamurthy and
Fatma Özcan and
Ippokratis Pandis Next generation data analytics at IBM
research . . . . . . . . . . . . . . . . 1174--1175
Mauro Brunato and
Roberto Battiti Learning and intelligent optimization
(LION): one ring to rule them all . . . 1176--1177
David Lomet Microsoft SQL server's integrated
database approach for modern
applications and hardware . . . . . . . 1178--1179
Hakan Hacígümüs and
Jagan Sankaranarayanan and
Junichi Tatemura and
Jeff LeFevre and
Neoklis Polyzotis Odyssey: a multistore system for
evolutionary analytics . . . . . . . . . 1180--1181
Paolo Bouquet and
Andrea Molinari A global Entity Name System (ENS) for
data ecosystems . . . . . . . . . . . . 1182--1183
Vishal Sikka and
Franz Färber and
Anil Goel and
Wolfgang Lehner SAP HANA: the evolution from a modern
main-memory data platform to an
enterprise application platform . . . . 1184--1185
Raghunath Nambiar and
Meikel Poess Keeping the TPC relevant! . . . . . . . 1186--1187
Xin Luna Dong and
Divesh Srivastava Big data integration . . . . . . . . . . 1188--1189
Stratis D. Viglas Just-in-time compilation for SQL query
processing . . . . . . . . . . . . . . . 1190--1191
Anastasia Ailamaki and
Ryan Johnson and
Ippokratis Pandis and
Pínar Tözün Toward scalable transaction processing:
evolution of Shore-MT . . . . . . . . . 1192--1193
Aaron J. Elmore and
Carlo Curino and
Divyakant Agrawal and
Amr El Abbadi Towards database virtualization for
database as a service . . . . . . . . . 1194--1195
Mohamed F. Mokbel and
Mohamed Sarwat Mobility and social networking: a data
management perspective . . . . . . . . . 1196--1197
Andy Yuan Xue and
Rui Zhang and
Yu Zheng and
Xing Xie and
Jianhui Yu and
Yong Tang DesTeller: a system for destination
prediction based on trajectories with
privacy protection . . . . . . . . . . . 1198--1201
Zhe Chen and
Michael Cafarella and
Jun Chen and
Daniel Prevo and
Junfeng Zhuang Senbazuru: a prototype spreadsheet
database management system . . . . . . . 1202--1205
Grégory Smits and
Olivier Pivert and
Thomas Girault ReqFlex: fuzzy queries for everyone . . 1206--1209
Martin Kaufmann and
Panagiotis Vagenas and
Peter M. Fischer and
Donald Kossmann and
Franz Färber Comprehensive and interactive temporal
query processing with SAP HANA . . . . . 1210--1213
Torsten Grust and
Nils Schweinsberg and
Alexander Ulrich Functions are data too:
defunctionalization for PL/SQL . . . . . 1214--1217
Amr Ebaid and
Ahmed Elmagarmid and
Ihab F. Ilyas and
Mourad Ouzzani and
Jorge-Arnulfo Quiane-Ruiz and
Nan Tang and
Si Yin NADEEF: a generalized data cleaning
system . . . . . . . . . . . . . . . . . 1218--1221
Sonia Bergamaschi and
Francesco Guerra and
Matteo Interlandi and
Raquel Trillo-Lado and
Yannis Velegrakis QUEST: a keyword search system for
relational data based on semantic and
machine learning techniques . . . . . . 1222--1225
Kenneth S. Bògh and
Anders Skovsgaard and
Christian S. Jensen GroupFinder: a new approach to top-$k$
point-of-interest group retrieval . . . 1226--1229
Ahmed Eldawy and
Mohamed F. Mokbel A demonstration of SpatialHadoop: an
efficient MapReduce framework for
spatial data . . . . . . . . . . . . . . 1230--1233
Mehmet Ali Abbasoglu and
Bugra Gedik and
Hakan Ferhatosmanoglu Aggregate profile clustering for telco
analytics . . . . . . . . . . . . . . . 1234--1237
Luying Chen and
Stefano Ortona and
Giorgio Orsi and
Michael Benedikt ROSeAnn: reconciling opinions of
semantic annotators . . . . . . . . . . 1238--1241
Mohamed Sarwat and
James Avery and
Mohamed F. Mokbel RecDB in action: recommendation made
easy in relational databases . . . . . . 1242--1245
Marina Drosou and
Evaggelia Pitoura POIKILO: a tool for evaluating the
results of diversification models and
algorithms . . . . . . . . . . . . . . . 1246--1249
Yael Amsterdamer and
Yael Grossman and
Tova Milo and
Pierre Senellart CrowdMiner: mining association rules
from the crowd . . . . . . . . . . . . . 1250--1253
Chen Chen and
Hongzhi Yin and
Junjie Yao and
Bin Cui TeRec: a temporal recommender system
over tweet stream . . . . . . . . . . . 1254--1257
Alexander Shkapsky and
Kai Zeng and
Carlo Zaniolo Graph queries in a next-generation
Datalog system . . . . . . . . . . . . . 1258--1261
Abdeltawab M. Hendawi and
Jie Bao and
Mohamed F. Mokbel iRoad: a framework for scalable
predictive query processing on road
networks . . . . . . . . . . . . . . . . 1262--1265
Mithila Nagendra and
K. Selçuk Candan SkySuite: a framework of skyline-join
operators for static and stream
environments . . . . . . . . . . . . . . 1266--1269
Jianlong Zhong and
Bingsheng He Parallel graph processing on graphics
processors made easy . . . . . . . . . . 1270--1273
Stefan Richter and
Jens Dittrich and
Stefan Schuh and
Tobias Frey Mosquito: another one bites the data
upload stream . . . . . . . . . . . . . 1274--1277
Sergej Hardock and
Ilia Petrov and
Robert Gottstein and
Alejandro Buchmann NoFTL: database systems on FTL-less
flash storage . . . . . . . . . . . . . 1278--1281
Dimitrios Kotsakos and
Panos Sakkos and
Vana Kalogeraki and
Dimitirios Gunopulos SmartMonitor: using smart devices to
perform structural health monitoring . . 1282--1285
Yagíz Kargín and
Milena Ivanova and
Ying Zhang and
Stefan Manegold and
Martin Kersten Lazy ETL in action: ETL technology dates
scientific data . . . . . . . . . . . . 1286--1289
Niv Dayan and
Martin Kjær Svendsen and
Matias Bjòrling and
Philippe Bonnet and
Luc Bouganim EagleTree: exploring the design space of
SSD-based algorithms . . . . . . . . . . 1290--1293
Saket Sathe and
Arthur Oviedo and
Dipanjan Chakraborty and
Karl Aberer EnviroMeter: a platform for querying
community-sensed data . . . . . . . . . 1294--1297
Alper Okcan and
Mirek Riedewald and
Biswanath Panda and
Daniel Fink Scolopax: exploratory analysis of
scientific data . . . . . . . . . . . . 1298--1301
Daniel Deutch and
Yuval Moskovitch and
Val Tannen PROPOLIS: provisioned analysis of
data-centric processes . . . . . . . . . 1302--1305
Pradap Konda and
Arun Kumar and
Christopher Ré and
Vaishnavi Sashikanth Feature selection in enterprise
analytics: a demonstration using an
R-based data analytics system . . . . . 1306--1309
Mohammadreza Najafi and
Mohammad Sadoghi and
Hans-Arno Jacobsen Flexible query processor on FPGAs . . . 1310--1313
Cristina Civili and
Marco Console and
Giuseppe De Giacomo and
Domenico Lembo and
Maurizio Lenzerini and
Lorenzo Lepore and
Riccardo Mancini and
Antonella Poggi and
Riccardo Rosati and
Marco Ruzzi and
Valerio Santarelli and
Domenico Fabio Savo MASTRO STUDIO: managing ontology-based
data access applications . . . . . . . . 1314--1317
David Fuhry and
Yang Zhang and
Venu Satuluri and
Arnab Nandi and
Srinivasan Parthasarathy PLASMA-HD: probing the lattice structure
and makeup of high-dimensional data . . 1318--1321
Matthew Moyers and
Emad Soroush and
Spencer C. Wallace and
Simon Krughoff and
Jake Vanderplas and
Magdalena Balazinska and
Andrew Connolly A demonstration of iterative parallel
array processing in support of telescope
image analysis . . . . . . . . . . . . . 1322--1325
Hamed Abdelhaq and
Christian Sengstock and
Michael Gertz EvenTweet: online localized event
detection from Twitter . . . . . . . . . 1326--1329
Hamid Mousavi and
Shi Gao and
Carlo Zaniolo IBminer: a text mining tool for
constructing and populating InfoBox
databases and knowledge bases . . . . . 1330--1333
Nicholas L. Farnan and
Adam J. Lee and
Panos K. Chrysanthis and
Ting Yu PAQO: a preference-aware query optimizer
for PostgreSQL . . . . . . . . . . . . . 1334--1337
Suvarna Bothe and
Panagiotis Karras and
Akrivi Vlachou eSkyline: processing skyline queries
over encrypted data . . . . . . . . . . 1338--1341
Lilong Jiang and
Michael Mandel and
Arnab Nandi GestureQuery: a multitouch database
query interface . . . . . . . . . . . . 1342--1345
Di Yang and
Kaiyu Zhao and
Maryam Hasan and
Hanyuan Lu and
Elke Rundensteiner and
Matthew Ward Mining and linking patterns across live
data streams and stream archives . . . . 1346--1349
Hanan Samet and
Marco D. Adelfio and
Brendan C. Fruin and
Michael D. Lieberman and
Jagan Sankaranarayanan PhotoStand: a map query interface for a
database of news photos . . . . . . . . 1350--1353
K. Ashwin Kumar and
Jonathan Gluck and
Amol Deshpande and
Jimmy Lin Hone: ``Scaling down'' Hadoop on
shared-memory systems . . . . . . . . . 1354--1357
Dolan Antenucci and
Erdong Li and
Shaobo Liu and
Bochun Zhang and
Michael J. Cafarella and
Christopher Ré Ringtail: a generalized nowcasting
system . . . . . . . . . . . . . . . . . 1358--1361
Min Xie and
Laks V. S. Lakshmanan and
Peter T. Wood IPS: an interactive package
configuration system for trip planning 1362--1365
Jingbo Zhou and
Anthony K. H. Tung and
Wei Wu and
Wee Siong Ng R2-D2: a system to support probabilistic
path prediction in dynamic environments
via ``Semi-lazy'' learning . . . . . . . 1366--1369
Byung-Gon Chun and
Tyson Condie and
Carlo Curino and
Chris Douglas and
Sergiy Matusevych and
Brandon Myers and
Shravan Narayanamurthy and
Raghu Ramakrishnan and
Sriram Rao and
Josh Rosen and
Russell Sears and
Markus Weimer REEF: retainable evaluator execution
framework . . . . . . . . . . . . . . . 1370--1373
Shuhao Zhang and
Jiong He and
Bingsheng He and
Mian Lu OmniDB: towards portable and efficient
query processing on parallel CPU/GPU
architectures . . . . . . . . . . . . . 1374--1377
Ognjen Savkovi\'c and
Paramita Mirza and
Alex Tomasi and
Werner Nutt Complete approximations of incomplete
queries . . . . . . . . . . . . . . . . 1378--1381
Georgia Koutrika and
Qian Lin and
Jerry Liu User analytics with UbeOne: insights
into web printing . . . . . . . . . . . 1382--1385
Ivo Santos and
Marcel Tilly and
Badrish Chandramouli and
Jonathan Goldstein DiAl: distributed streaming analytics
anywhere, anytime . . . . . . . . . . . 1386--1389
Rada Chirkova and
Jun Yang Big and useful: what's in the data for
me? . . . . . . . . . . . . . . . . . . 1390--1391
Tomás Bartos Universal indexing of arbitrary
similarity models . . . . . . . . . . . 1392--1397
Sebastian Breß and
Gunter Saake Why it is time for a HyPE: a hybrid
query processing engine for efficient
GPU coprocessing in DBMS . . . . . . . . 1398--1403
Alireza Rezaei Mahdiraji and
Peter Baumann Database support for unstructured meshes 1404--1409
Aastha Madaan and
Subhash Bhalla Domain specific multistage query
language for medical document
repositories . . . . . . . . . . . . . . 1410--1415
Io Taxidou and
Peter Fischer Realtime analysis of information
diffusion in social media . . . . . . . 1416--1421
Luca Bonomi and
Li Xiong Mining frequent patterns with
differential privacy . . . . . . . . . . 1422--1427
Anett Hoppe and
C. Nicolle and
A. Roxin Automatic ontology-based user profile
learning from heterogeneous Web
resources in a big data context . . . . 1428--1433
Akon Dey and
Alan Fekete and
Uwe Röhm Scalable transactions across
heterogeneous NoSQL key--value data
stores . . . . . . . . . . . . . . . . . 1434--1439
Nhung Ngo and
Enrico Franconi Getting unique solution in data exchange 1440--1443
Martin Kaufmann and
Donald Kossmann Storing and processing temporal data in
a main memory column store . . . . . . . 1444--1449
Stepan Kozak and
Pavel Zezula Efficiency and security in similarity
cloud services . . . . . . . . . . . . . 1450--1455
Thibault Sellam and
Martin Kersten Fast cartography for data explorers . . 1456--1461
Gonçalo Simões and
Helena Galhardas and
Luis Gravano When speed has a price: fast information
extraction using approximate algorithms 1462--1473
Craig Chasseur and
Jignesh M. Patel Design and evaluation of storage
organizations for read-optimized main
memory databases . . . . . . . . . . . . 1474--1485
Luying Chen and
Stefano Ortona and
Giorgio Orsi and
Michael Benedikt Aggregating semantic annotators . . . . 1486--1497
Xu Chu and
Ihab F. Ilyas and
Paolo Papotti Discovering denial constraints . . . . . 1498--1509
Wenfei Fan and
Xin Wang and
Yinghui Wu Diversified top-$k$ graph pattern
matching . . . . . . . . . . . . . . . . 1510--1521
Weixiong Rao and
Lei Chen and
Pan Hui and
Sasu Tarkoma Bitlist: new full-text index for low
space cost and efficient keyword search 1522--1533
Sebastian Wandelt and
Johannes Starlinger and
Marc Bux and
Ulf Leser RCSI: scalable similarity search in
thousand(s) of genomes . . . . . . . . . 1534--1545
Yufei Tao and
Xiaocheng Hu and
Dong-Wan Choi and
Chin-Wan Chung Approximate MaxRS in spatial databases 1546--1557
Benny Kimelfeld and
Jan Vondrák and
David P. Woodruff Multi-tuple deletion propagation:
approximations and complexity . . . . . 1558--1569
Badrish Chandramouli and
Suman Nath and
Wenchao Zhou Supporting distributed feed-following
apps over edge devices . . . . . . . . . 1570--1581
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Rank discovery from web databases . . . 1582--1593
Theodoros Rekatsinas and
Amol Deshpande and
Ashwin Machanavajjhala SPARSI: partitioning sensitive data
amongst multiple adversaries . . . . . . 1594--1605
Dong Deng and
Yu Jiang and
Guoliang Li and
Jian Li and
Cong Yu Scalable column concept determination
for Web tables using large knowledge
bases . . . . . . . . . . . . . . . . . 1606--1617
Xin Huang and
Hong Cheng and
Rong-Hua Li and
Lu Qin and
Jeffrey Xu Yu top-$k$ structural diversity search in
large networks . . . . . . . . . . . . . 1618--1629
Federico Cavalieri and
Alessandro Solimando and
Giovanna Guerrini Synthetising changes in XML documents as
PULs . . . . . . . . . . . . . . . . . . 1630--1641
Lei Zhang and
Thanh Tran and
Achim Rettinger Probabilistic query rewriting for
efficient and effective keyword search
on graph data . . . . . . . . . . . . . 1642--1653
Martin Schäler and
Alexander Grebhahn and
Reimar Schröter and
Sandro Schulze and
Veit Köppen and
Gunter Saake QuEval: beyond high-dimensional indexing
\`a la carte . . . . . . . . . . . . . . 1654--1665
Yuhong Li and
Leong Hou U. and
Man Lung Yiu and
Zhiguo Gong Discovering longest-lasting correlation
in sequence databases . . . . . . . . . 1666--1677
Adrian Daniel Popescu and
Andrey Balmin and
Vuk Ercegovac and
Anastasia Ailamaki PREDIcT: towards predicting the runtime
of large scale iterative analytics . . . 1678--1689
Xiaohan Zhao and
Adelbert Chang and
Atish Das Sarma and
Haitao Zheng and
Ben Y. Zhao On the embeddability of random walk
distances . . . . . . . . . . . . . . . 1690--1701
Tobias Mühlbauer and
Wolf Rödiger and
Robert Seilbeck and
Angelika Reiser and
Alfons Kemper and
Thomas Neumann Instant loading for main memory
databases . . . . . . . . . . . . . . . 1702--1713
Karolina Alexiou and
Donald Kossmann and
Per-Åke Larson Adaptive range filters for cold data:
avoiding trips to Siberia . . . . . . . 1714--1725
Badrish Chandramouli and
Jonathan Goldstein and
Abdul Quamar Scalable progressive analytics on big
data in the Cloud . . . . . . . . . . . 1726--1737
Peter Ogden and
David Thomas and
Peter Pietzuch Scalable XML query processing using
parallel pushdown transducers . . . . . 1738--1749
Yin Huai and
Siyuan Ma and
Rubao Lee and
Owen O'Malley and
Xiaodong Zhang Understanding insights into the basic
structure and essential issues of table
placement methods in clusters . . . . . 1750--1761
Davide Mottin and
Alice Marascu and
Senjuti Basu Roy and
Gautam Das and
Themis Palpanas and
Yannis Velegrakis A probabilistic optimization framework
for the empty-answer problem . . . . . . 1762--1773
Yinghui Wu and
Shengqi Yang and
Mudhakar Srivatsa and
Arun Iyengar and
Xifeng Yan Summarizing answer graphs induced by
keyword queries . . . . . . . . . . . . 1774--1785
Huizhong Duan and
ChengXiang Zhai and
Jinxing Cheng and
Abhishek Gattani Supporting keyword search in product
database: a probabilistic approach . . . 1786--1797
Supriya Nirkhiwale and
Alin Dobra and
Christopher Jermaine A sampling algebra for aggregate
estimation . . . . . . . . . . . . . . . 1798--1809
Maximilian Dylla and
Iris Miliaraki and
Martin Theobald A temporal-probabilistic database model
for information extraction . . . . . . . 1810--1821
Pit Fender and
Guido Moerkotte Counter strike: generic top-down join
enumeration for hypergraphs . . . . . . 1822--1833
Daniar Achakeev and
Bernhard Seeger Efficient bulk updates on multiversion
B-trees . . . . . . . . . . . . . . . . 1834--1845
Hotham Altwaijry and
Dmitri V. Kalashnikov and
Sharad Mehrotra Query-driven approach to entity
resolution . . . . . . . . . . . . . . . 1846--1857
Jaros\law Szlichta and
Parke Godfrey and
Jarek Gryz and
Calisto Zuzarte Expressiveness and complexity of order
dependencies . . . . . . . . . . . . . . 1858--1869
A. Pavan and
Kanat Tangwongsan and
Srikanta Tirthapura and
Kun-Lung Wu Counting and sampling triangles from a
graph stream . . . . . . . . . . . . . . 1870--1881
Benjamin Sowell and
Marcos Vaz Salles and
Tuan Cao and
Alan Demers and
Johannes Gehrke An experimental analysis of iterated
spatial joins in main memory . . . . . . 1882--1893
Kisung Lee and
Ling Liu Scaling queries over big RDF graphs with
semantic hash partitioning . . . . . . . 1894--1905
Jiwon Seo and
Jongsoo Park and
Jaeho Shin and
Monica S. Lam Distributed socialite: a datalog-based
language for large-scale graph analysis 1906--1917
Mohamed Sarwat and
Sameh Elnikety and
Yuxiong He and
Mohamed F. Mokbel Horton+: a distributed system for
processing declarative reachability
queries over partitioned graphs . . . . 1918--1929
Narayanan Sundaram and
Aizana Turmukhametova and
Nadathur Satish and
Todd Mostak and
Piotr Indyk and
Samuel Madden and
Pradeep Dubey Streaming similarity search over one
billion tweets using parallel
locality-sensitive hashing . . . . . . . 1930--1941
Justin DeBrabant and
Andrew Pavlo and
Stephen Tu and
Michael Stonebraker and
Stan Zdonik Anti-caching: a new approach to database
management system architecture . . . . . 1942--1953
Wahbeh Qardaji and
Weining Yang and
Ninghui Li Understanding hierarchical methods for
differentially private histograms . . . 1954--1965
Rui Li and
Shengjie Wang and
Kevin Chen-Chuan Chang Towards social data platform: automatic
topic-focused monitor for Twitter stream 1966--1977
Ruoming Jin and
Guan Wang Simple, fast, and scalable reachability
oracle . . . . . . . . . . . . . . . . . 1978--1989
Nurzhan Bakibayev and
Tomás Kociský and
Dan Olteanu and
Jakub Závodný Aggregation and ordering in factorised
databases . . . . . . . . . . . . . . . 1990--2001
Yoonjae Park and
Jun-Ki Min and
Kyuseok Shim Parallel computation of skyline and
reverse skyline queries using MapReduce 2002--2013
Wenlei Xie and
Guozhang Wang and
David Bindel and
Alan Demers and
Johannes Gehrke Fast iterative graph computation with
block updates . . . . . . . . . . . . . 2014--2025
Xiaoli Wang and
Xiaofeng Ding and
Anthony K. H. Tung and
Zhenjie Zhang Efficient and effective KNN sequence
search with approximate $n$-grams . . . 1--12
Weiren Yu and
Xuemin Lin and
Wenjie Zhang and
Lijun Chang and
Jian Pei More is simpler: effectively and
efficiently assessing node-pair
similarities based on hyperlinks . . . . 13--24
Marc Gyssens and
Jan Paredaens and
Dirk Van Gucht and
Jef Wijsen and
Yuqing Wu An approach towards the study of
symmetric queries . . . . . . . . . . . 25--36
Sudipto Das and
Vivek R. Narasayya and
Feng Li and
Manoj Syamala CPU sharing techniques for performance
isolation in multi-tenant relational
database-as-a-service . . . . . . . . . 37--48
Qian Chen and
Haibo Hu and
Jianliang Xu Authenticating top-$k$ queries in
location-based services with
confidentiality . . . . . . . . . . . . 49--60
Zichao Qi and
Yanghua Xiao and
Bin Shao and
Haixun Wang Toward a distance oracle for
billion-node graphs . . . . . . . . . . 61--72
Manohar Kaul and
Raymond Chi-Wing Wong and
Bin Yang and
Christian S. Jensen Finding shortest paths on terrains by
killing two birds with one stone . . . . 73--84
Cagri Balkesen and
Gustavo Alonso and
Jens Teubner and
M. Tamer Özsu Multi-core, main-memory joins: sort vs.
hash revisited . . . . . . . . . . . . . 85--96
Felix Martin Schuhknecht and
Alekh Jindal and
Jens Dittrich The uncracked pieces in database
cracking . . . . . . . . . . . . . . . . 97--108
Bahaeddin Eravci and
Hakan Ferhatosmanoglu Diversity based relevance feedback for
time series search . . . . . . . . . . . 109--120
Steven Pelley and
Thomas F. Wenisch and
Brian T. Gold and
Bill Bridge Storage management in the NVRAM era . . 121--132
Mariam Salloum and
Xin Luna Dong and
Divesh Srivastava and
Vassilis J. Tsotras Online ordering of overlapping data
sources . . . . . . . . . . . . . . . . 133--144
Guoping Wang and
Chee-Yong Chan Multi-query optimization in MapReduce
framework . . . . . . . . . . . . . . . 145--156
Zhenhui Li and
Bolin Ding and
Fei Wu and
Tobias Kin Hou Lei and
Roland Kays and
Margaret C. Crofoot Attraction and avoidance detection from
movements . . . . . . . . . . . . . . . 157--168
Xiang Zhao and
Chuan Xiao and
Xuemin Lin and
Qing Liu and
Wenjie Zhang A partition-based approach to structure
similarity search . . . . . . . . . . . 169--180
Peter Bailis and
Aaron Davidson and
Alan Fekete and
Ali Ghodsi and
Joseph M. Hellerstein and
Ion Stoica Highly available transactions: virtues
and limitations . . . . . . . . . . . . 181--192
Yuanyuan Tian and
Andrey Balmin and
Severin Andreas Corsten and
Shirish Tatikonda and
John McPherson From ``think like a vertex'' to ``think
like a graph'' . . . . . . . . . . . . . 193--204
Johannes Niedermayer and
Andreas Züfle and
Tobias Emrich and
Matthias Renz and
Nikos Mamoulis and
Lei Chen and
Hans-Peter Kriegel Probabilistic nearest neighbor queries
on uncertain moving object trajectories 205--216
Konstantinos Karanasos and
Asterios Katsifodimos and
Ioana Manolescu Delta: scalable data dissemination under
capacity constraints . . . . . . . . . . 217--228
Ceren Budak and
Theodore Georgiou and
Divyakant Agrawal and
Amr El Abbadi GeoScope: online detection of
geo-correlated information trends in
social networks . . . . . . . . . . . . 229--240
Makoto Onizuka and
Hiroyuki Kato and
Soichiro Hidaka and
Keisuke Nakano and
Zhenjiang Hu Optimization for iterative queries on
MapReduce . . . . . . . . . . . . . . . 241--252
Hong-Han Shuai and
De-Nian Yang and
Philip S. Yu and
Ming-Syan Chen Willingness optimization for social
group activity . . . . . . . . . . . . . 253--264
Lei Cao and
Elke A. Rundensteiner High performance stream query processing
with correlation-aware partitioning . . 265--276
Djellel Eddine Difallah and
Andrew Pavlo and
Carlo Curino and
Philippe Cudre-Mauroux OLTP-Bench: an extensible testbed for
benchmarking relational databases . . . 277--288
Arnab Nandi and
Lilong Jiang and
Michael Mandel Gestural query specification . . . . . . 289--300
Arvid Heise and
Jorge-Arnulfo Quiané-Ruiz and
Ziawasch Abedjan and
Anja Jentzsch and
Felix Naumann Scalable discovery of unique column
combinations . . . . . . . . . . . . . . 301--312
Yu Tang and
Leong Hou U. and
Yilun Cai and
Nikos Mamoulis and
Reynold Cheng Earth Mover's Distance based similarity
search at scale . . . . . . . . . . . . 313--324
Aditya Parameswaran and
Neoklis Polyzotis and
Hector Garcia-Molina SeeDB: visualizing database queries
efficiently . . . . . . . . . . . . . . 325--328
Hatem A. Mahmoud and
Vaibhav Arora and
Faisal Nawab and
Divyakant Agrawal and
Amr El Abbadi MaaT: effective and scalable
coordination of distributed transactions
in the cloud . . . . . . . . . . . . . . 329--340
Chao Li and
Michael Hay and
Gerome Miklau and
Yue Wang A data- and workload-aware algorithm for
range queries under differential privacy 341--352
Sergio Greco and
Fabian Pijcke and
Jef Wijsen Certain query answering in partially
consistent databases . . . . . . . . . . 353--364
Davide Mottin and
Matteo Lissandrini and
Yannis Velegrakis and
Themis Palpanas Exemplar queries: give me an example of
what you need . . . . . . . . . . . . . 365--376
Nitish Korula and
Silvio Lattanzi An efficient reconciliation algorithm
for social networks . . . . . . . . . . 377--388
Sean Chester and
Alex Thomo and
S. Venkatesh and
Sue Whitesides Computing $k$-regret minimizing sets . . 389--400
Adams Wei Yu and
Nikos Mamoulis and
Hao Su Reverse top-$k$ search using random walk
with restart . . . . . . . . . . . . . . 401--412
Stratis D. Viglas Write-limited sorts and joins for
persistent memory . . . . . . . . . . . 413--424
N. Anciaux and
L. Bouganim and
T. Delot and
S. Ilarri and
L. Kloul and
N. Mitton and
P. Pucheral Folk-IS: opportunistic data services in
least developed countries . . . . . . . 425--428
Georgios Giannikis and
Darko Makreshanski and
Gustavo Alonso and
Donald Kossmann Shared workload optimization . . . . . . 429--440
Mohammed Elseidy and
Abdallah Elguindy and
Aleksandar Vitorovic and
Christoph Koch Scalable and adaptive online joins . . . 441--452
Kristi Morton and
Magdalena Balazinska and
Dan Grossman and
Jock Mackinlay Support the data enthusiast: challenges
for next-generation data-analysis
systems . . . . . . . . . . . . . . . . 453--456
Daniel Deutch and
Yuval Moskovitch and
Val Tannen A provenance framework for
data-dependent process analysis . . . . 457--468
Yueh-Hsuan Chiang and
AnHai Doan and
Jeffrey F. Naughton Tracking entities in the dynamic world:
a fast algorithm for matching temporal
records . . . . . . . . . . . . . . . . 469--480
Neil Conway and
Peter Alvaro and
Emily Andrews and
Joseph M. Hellerstein Edelweiss: automatic storage reclamation
for distributed programming . . . . . . 481--492
Nikos Ntarmos and
Ioannis Patlakas and
Peter Triantafillou Rank join queries in NoSQL databases . . 493--504
Rahul Gupta and
Alon Halevy and
Xuezhi Wang and
Steven Euijong Whang and
Fei Wu Biperpedia: an ontology for search
applications . . . . . . . . . . . . . . 505--516
Mohammed Elseidy and
Ehab Abdelhamid and
Spiros Skiadopoulos and
Panos Kalnis GraMi: frequent subgraph and pattern
mining in a single large graph . . . . . 517--528
Sheng Wang and
David Maier and
Beng Chin Ooi Lightweight indexing of observational
data in log-structured storage . . . . . 529--540
Dawei Jiang and
Gang Chen and
Beng Chin Ooi and
Kian-Lee Tan and
Sai Wu epiC: an extensible and scalable system
for processing big data . . . . . . . . 541--552
Matthias Boehm and
Shirish Tatikonda and
Berthold Reinwald and
Prithviraj Sen and
Yuanyuan Tian and
Douglas R. Burdick and
Shivakumar Vaithyanathan Hybrid parallelization strategies for
large-scale machine learning in SystemML 553--564
Shengqi Yang and
Yinghui Wu and
Huan Sun and
Xifeng Yan Schemaless and structureless graph
querying . . . . . . . . . . . . . . . . 565--576
Semih Salihoglu and
Jennifer Widom Optimizing graph algorithms on
Pregel-like systems . . . . . . . . . . 577--588
You Wu and
Pankaj K. Agarwal and
Chengkai Li and
Jun Yang and
Cong Yu Toward computational fact-checking . . . 589--600
Marcelo Arenas and
Gonzalo Díaz and
Achille Fokoue and
Anastasios Kementsietsidis and
Kavitha Srinivas A principled approach to bridging the
gap between graph data and their schemas 601--612
Dongxiang Zhang and
Chee-Yong Chan and
Kian-Lee Tan An efficient publish/subscribe index for
e-commerce databases . . . . . . . . . . 613--624
Yu Jiang and
Guoliang Li and
Jianhua Feng and
Wen-Syan Li String similarity joins: an experimental
evaluation . . . . . . . . . . . . . . . 625--636
Davide Proserpio and
Sharon Goldberg and
Frank McSherry Calibrating data to sensitivity in
private data analysis: a platform for
differentially-private analysis of
weighted datasets . . . . . . . . . . . 637--648
Wei Wang and
Beng Chin Ooi and
Xiaoyan Yang and
Dongxiang Zhang and
Yueting Zhuang Effective multi-modal retrieval based on
stacked auto-encoders . . . . . . . . . 649--660
Renchu Song and
Weiwei Sun and
Baihua Zheng and
Yu Zheng PRESS: a novel framework of trajectory
compression in road networks . . . . . . 661--672
Yajun Yang and
Hong Gao and
Jeffrey Xu Yu and
Jianzhong Li Finding the cost-optimal path with time
constraint over time-dependent graphs 673--684
Aditya Parameswaran and
Stephen Boyd and
Hector Garcia-Molina and
Ashish Gupta and
Neoklis Polyzotis and
Jennifer Widom Optimal crowd-powered rating and
filtering algorithms . . . . . . . . . . 685--696
Anja Gruenheid and
Xin Luna Dong and
Divesh Srivastava Incremental record linkage . . . . . . . 697--708
Pratanu Roy and
Jens Teubner and
Rainer Gemulla Low-latency handshake join . . . . . . . 709--720
Huanhuan Wu and
James Cheng and
Silu Huang and
Yiping Ke and
Yi Lu and
Yanyan Xu Path problems in temporal graphs . . . . 721--732
Xin Cao and
Gao Cong and
Christian S. Jensen and
Man Lung Yiu Retrieving regions of interest for user
exploration . . . . . . . . . . . . . . 733--744
Yingfan Liu and
Jiangtao Cui and
Zi Huang and
Hui Li and
Heng Tao Shen SK--LSH: an efficient index structure
for approximate nearest neighbor search 745--756
Bing-Rong Lin and
Daniel Kifer On arbitrage-free pricing for general
data queries . . . . . . . . . . . . . . 757--768
Chao Zhang and
Jiawei Han and
Lidan Shou and
Jiajun Lu and
Thomas La Porta Splitter: mining fine-grained sequential
patterns in semantic trajectories . . . 769--780
Avrilia Floratou and
Frank Bertsch and
Jignesh M. Patel and
Georgios Laskaris Towards building wind tunnels for data
center design . . . . . . . . . . . . . 781--784
Zhao Zhang and
Cheqing Jin and
Qiangqiang Kang Reverse $k$-ranks query . . . . . . . . 785--796
Uwe Jugel and
Zbigniew Jerzak and
Gregor Hackenbroich and
Gregor Hackenbroich and
Volker Markl M4: a visualization-oriented time series
data aggregation . . . . . . . . . . . . 797--808
Mina Ghashami and
Jeff M. Phillips and
Feifei Li Continuous matrix approximation on
distributed data . . . . . . . . . . . . 809--820
Kun Ren and
Alexander Thomson and
Daniel J. Abadi An evaluation of the advantages and
disadvantages of deterministic database
systems . . . . . . . . . . . . . . . . 821--832
Hao Zhang and
Bogdan Marius Tudor and
Gang Chen and
Beng Chin Ooi Efficient in-memory data management: an
analysis . . . . . . . . . . . . . . . . 833--836
Günes Aluç and
M. Tamer Özsu and
Khuzaima Daudjee Workload matters: why RDF databases need
a new design . . . . . . . . . . . . . . 837--840
Sattam Alsubaiee and
Alexander Behm and
Vinayak Borkar and
Zachary Heilbron and
Young-Seok Kim and
Michael J. Carey and
Markus Dreseler and
Chen Li Storage management in AsterixDB . . . . 841--852
Yannis Klonatos and
Christoph Koch and
Tiark Rompf and
Hassan Chafi Building efficient query engines in a
high-level language . . . . . . . . . . 853--864
Tianzheng Wang and
Ryan Johnson Scalable logging through emerging
non-volatile memory . . . . . . . . . . 865--876
Bingsheng He When data management systems meet
approximate hardware: challenges and
opportunities . . . . . . . . . . . . . 877--880
Xin Luna Dong and
Evgeniy Gabrilovich and
Geremy Heitz and
Wilko Horn and
Kevin Murphy and
Shaohua Sun and
Wei Zhang From data fusion to knowledge fusion . . 881--892
Stefan Funke and
André Nusser and
Sabine Storandt On $k$-path covers and their
applications . . . . . . . . . . . . . . 893--902
Eugene Wu and
Leilani Battle and
Samuel R. Madden The case for data visualization
management systems: vision paper . . . . 903--906
Yinan Li and
Jignesh M. Patel WideTable: an accelerator for analytical
data processing . . . . . . . . . . . . 907--918
Hien To and
Gabriel Ghinita and
Cyrus Shahabi A framework for protecting worker
location privacy in spatial
crowdsourcing . . . . . . . . . . . . . 919--930
Ahmed Eldawy and
Justin Levandoski and
Per-Åke Larson Trekking through Siberia: managing cold
data in a memory-optimized database . . 931--942
Jennie Duggan The case for personal data-driven
decision making . . . . . . . . . . . . 943--946
Prima Chairunnanda and
Khuzaima Daudjee and
M. Tamer Özsu ConfluxDB: multi-master replication for
partitioned snapshot isolation databases 947--958
Bernardo Gonçalves and
Fabio Porto $ \gamma $-DB: managing scientific
hypotheses as uncertain data . . . . . . 959--962
Louis Woods and
Zsolt István and
Gustavo Alonso Ibex: an intelligent storage engine with
support for advanced SQL offloading . . 963--974
Hyokun Yun and
Hsiang-Fu Yu and
Cho-Jui Hsieh and
S. V. N. Vishwanathan and
Inderjit Dhillon NOMAD: non-locking, stochastic
multi-machine algorithm for asynchronous
and decentralized matrix completion . . 975--986
Shaoxu Song and
Hong Cheng and
Jeffrey Xu Yu and
Lei Chen Repairing vertex labels under
neighborhood constraints . . . . . . . . 987--998
Yasser Altowim and
Dmitri V. Kalashnikov and
Sharad Mehrotra Progressive approach to relational
entity resolution . . . . . . . . . . . 999--1010
Kaibo Wang and
Kai Zhang and
Yuan Yuan and
Siyuan Ma and
Rubao Lee and
Xiaoning Ding and
Xiaodong Zhang Concurrent analytical query processing
with GPUs . . . . . . . . . . . . . . . 1011--1022
Takanori Maehara and
Takuya Akiba and
Yoichi Iwata and
Ken-ichi Kawarabayashi Computing personalized PageRank quickly
by exploiting graph structures . . . . . 1023--1034
Marco Serafini and
Essam Mansour and
Ashraf Aboulnaga and
Kenneth Salem and
Taha Rafiq and
Umar Farooq Minhas Accordion: elastic scalability for
database systems supporting distributed
transactions . . . . . . . . . . . . . . 1035--1046
Minyang Han and
Khuzaima Daudjee and
Khaled Ammar and
M. Tamer Özsu and
Xingfang Wang and
Tianqi Jin An experimental comparison of
Pregel-like graph processing systems . . 1047--1058
Akash Das Sarma and
Yeye He and
Surajit Chaudhuri ClusterJoin: a similarity joins
framework using map-reduce . . . . . . . 1059--1070
Norases Vesdapunt and
Kedar Bellare and
Nilesh Dalvi Crowdsourcing algorithms for entity
resolution . . . . . . . . . . . . . . . 1071--1082
Wenfei Fan and
Xin Wang and
Yinghui Wu and
Dong Deng Distributed graph simulation:
impossibility and possibility . . . . . 1083--1094
Fabian Nagel and
Gavin Bierman and
Stratis D. Viglas Code generation for efficient query
processing in managed runtimes . . . . . 1095--1106
Weimo Liu and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Aggregate estimation over dynamic hidden
web databases . . . . . . . . . . . . . 1107--1118
Manos Karpathiotakis and
Miguel Branco and
Ioannis Alagiannis and
Anastasia Ailamaki Adaptive query processing on RAW data 1119--1130
Foto N. Afrati and
Dan Delorey and
Mosha Pasumansky and
Jeffrey D. Ullman Storing and querying tree-structured
records in Dremel . . . . . . . . . . . 1131--1142
Johannes Starlinger and
Bryan Brancotte and
Sarah Cohen-Boulakia and
Ulf Leser Similarity search for scientific
workflows . . . . . . . . . . . . . . . 1143--1154
Georgios Kellaris and
Stavros Papadopoulos and
Xiaokui Xiao and
Dimitris Papadias Differentially private event sequences
over infinite streams . . . . . . . . . 1155--1166
Nikhil Londhe and
Vishrawas Gopalakrishnan and
Aidong Zhang and
Hung Q. Ngo and
Rohini Srihari Matching titles with cross title
web-search enrichment and community
detection . . . . . . . . . . . . . . . 1167--1178
Shaoxu Song and
Lei Chen and
Hong Cheng On concise set of relative candidate
keys . . . . . . . . . . . . . . . . . . 1179--1190
Hao Wei and
Jeffrey Xu Yu and
Can Lu and
Ruoming Jin Reachability querying: an independent
permutation labeling approach . . . . . 1191--1202
Minhao Jiang and
Ada Wai-Chee Fu and
Raymond Chi-Wing Wong and
Yanyan Xu Hop doubling label indexing for
point-to-point distance querying on
scale-free networks . . . . . . . . . . 1203--1214
Fabian M. Suchanek and
Nicoleta Preda Semantic culturomics . . . . . . . . . . 1215--1218
Jörn Kuhlenkamp and
Markus Klems and
Oliver Röss Benchmarking scalability and elasticity
of distributed database systems . . . . 1219--1230
Yang Cao and
Wenfei Fan and
Tianyu Wo and
Wenyuan Yu Bounded conjunctive queries . . . . . . 1231--1242
Anil Shanbhag and
S. Sudarshan Optimizing join enumeration in
transformation-based query optimizers 1243--1254
Marie Jacob and
Benny Kimelfeld and
Julia Stoyanovich A system for management and analysis of
preference data . . . . . . . . . . . . 1255--1258
Ashish Gupta and
Fan Yang and
Jason Govig and
Adam Kirsch and
Kelvin Chan and
Kevin Lai and
Shuo Wu and
Sandeep Govind Dhoot and
Abhilash Rajesh Kumar and
Ankur Agiwal and
Sanjay Bhansali and
Mingsheng Hong and
Jamie Cameron and
Masood Siddiqi and
David Jones and
Jeff Shute and
Andrey Gubarev and
Shivakumar Venkataraman and
Divyakant Agrawal Mesa: geo-replicated, near real-time,
scalable data warehousing . . . . . . . 1259--1270
John Liagouris and
Nikos Mamoulis and
Panagiotis Bouros and
Manolis Terrovitis An effective encoding scheme for spatial
RDF data . . . . . . . . . . . . . . . . 1271--1282
Ce Zhang and
Christopher Ré DimmWitted: a study of main-memory
statistical analytics . . . . . . . . . 1283--1294
Avrilia Floratou and
Umar Farooq Minhas and
Fatma Özcan SQL-on-Hadoop: full circle back to
shared-nothing database architectures 1295--1306
Marco Guarnieri and
David Basin Optimal security-aware query processing 1307--1318
Juwei Shi and
Jia Zou and
Jiaheng Lu and
Zhao Cao and
Shiqiang Li and
Chen Wang MRTuner: a toolkit to enable holistic
optimization for MapReduce jobs . . . . 1319--1330
Mohammad Sadoghi and
Mustafa Canim and
Bishwaranjan Bhattacharjee and
Fabian Nagel and
Kenneth A. Ross Reducing database locking contention
through multi-version concurrency . . . 1331--1342
Xueyuan Su and
Garret Swart and
Brian Goetz and
Brian Oliver and
Paul Sandoz Changing engines in midstream: a Java
stream computational model for big data
processing . . . . . . . . . . . . . . . 1343--1354
Jae-Gil Lee and
Gopi Attaluri and
Ronald Barber and
Naresh Chainani and
Oliver Draese and
Frederick Ho and
Stratos Idreos and
Min-Soo Kim and
Sam Lightstone and
Guy Lohman and
Konstantinos Morfonios and
Keshava Murthy and
Ippokratis Pandis and
Lin Qiao and
Vijayshankar Raman and
Vincent Kulandai Samy and
Richard Sidle and
Knut Stolze and
Liping Zhang Joins on encoded and partitioned data 1355--1366
Meikel Poess and
Tilmann Rabl and
Hans-Arno Jacobsen and
Brian Caufield TPC--DI: the first industry benchmark
for data integration . . . . . . . . . . 1367--1378
Pankaj Gupta and
Venu Satuluri and
Ajeet Grewal and
Siva Gurumurthy and
Volodymyr Zhabiuk and
Quannan Li and
Jimmy Lin Real-time Twitter recommendation: online
motif detection in large dynamic graphs 1379--1380
Sang K. Cha and
Kunsoo Park and
Changbin Song and
Kihong Kim and
Cheol Ryu and
Sunho Lee Interval disaggregate: a new operator
for business planning . . . . . . . . . 1381--1392
Zhuo Zhang and
Chao Li and
Yangyu Tao and
Renyu Yang and
Hong Tang and
Jie Xu Fuxi: a fault-tolerant resource
management and job scheduling system at
Internet scale . . . . . . . . . . . . . 1393--1404
David Simmen and
Karl Schnaitter and
Jeff Davis and
Yingjie He and
Sangeet Lohariwala and
Ajay Mysore and
Vinayak Shenoi and
Mingfeng Tan and
Yu Xiao Large-scale graph analytics in Aster 6:
bringing context to big data discovery 1405--1416
Zhimin Chen and
Vivek Narasayya and
Surajit Chaudhuri Fast foreign-key detection in Microsoft
SQL server PowerPivot for Excel . . . . 1417--1428
Meng-Chieh Yu and
Tong Yu and
Shao-Chen Wang and
Chih-Jen Lin and
Edward Y. Chang Big data small footprint: the design of
a low-power classifier for detecting
transportation modes . . . . . . . . . . 1429--1440
Oscar Boykin and
Sam Ritchie and
Ian O'Connell and
Jimmy Lin Summingbird: a framework for integrating
batch and online MapReduce computations 1441--1451
Rafi Ahmed and
Rajkumar Sen and
Meikel Poess and
Sunil Chakkappen Of snowstorms and bushy trees . . . . . 1452--1461
Srinivas Vemuri and
Maneesh Varshney and
Krishna Puttaswamy and
Rui Liu Execution primitives for scalable joins
and aggregations in MapReduce . . . . . 1462--1473
Javier Arauz CAP limits in telecom subscriber
database design . . . . . . . . . . . . 1474--1483
Nicolas Bruno and
YongChul Kwon and
Ming-Chuan Wu Advanced join strategies for large-scale
distributed computation . . . . . . . . 1484--1495
Yue Liu and
Songlin Hu and
Tilmann Rabl and
Wantao Liu and
Hans-Arno Jacobsen and
Kaifeng Wu and
Jian Chen and
Jintao Li DGFIndex for smart grid: enhancing Hive
with a cost-effective multidimensional
range index . . . . . . . . . . . . . . 1496--1507
Ying Yan and
Liang Jeff Chen and
Zheng Zhang Error-bounded sampling for analytics on
big sparse data . . . . . . . . . . . . 1508--1519
Vinitha Reddy Gankidi and
Nikhil Teletia and
Jignesh M. Patel and
Alan Halverson and
David J. DeWitt Indexing HDFS data in PDW: splitting the
data from the index . . . . . . . . . . 1520--1528
Chong Sun and
Narasimhan Rampalli and
Frank Yang and
AnHai Doan Chimera: large-scale classification
using machine learning, rules, and
crowdsourcing . . . . . . . . . . . . . 1529--1540
Angela Bonifati and
Radu Ciucanu and
S\LAwek Staworko Interactive join query inference with
JIM . . . . . . . . . . . . . . . . . . 1541--1544
Yuxin Zheng and
Zhifeng Bao and
Lidan Shou and
Anthony K. H. Tung MESA: a map service to support fuzzy
type-ahead search over geo-textual data 1545--1548
Henan Wang and
Guoliang Li and
Huiqi Hu and
Shuo Chen and
Bingwen Shen and
Hao Wu and
Wen-Syan Li and
Kian-Lee Tan R3: a real-time route recommendation
system . . . . . . . . . . . . . . . . . 1549--1552
Michael Benedikt and
Julien Leblay and
Efthymia Tsamoura PDQ: proof-driven query answering over
Web-based data . . . . . . . . . . . . . 1553--1556
Naeemul Hassan and
Afroza Sultana and
You Wu and
Gensheng Zhang and
Chengkai Li and
Jun Yang and
Cong Yu Data in, fact out: automated monitoring
of facts by FactWatcher . . . . . . . . 1557--1560
Mingxuan Yuan and
Ke Deng and
Jia Zeng and
Yanhua Li and
Bing Ni and
Xiuqiang He and
Fei Wang and
Wenyuan Dai and
Qiang Yang OceanST: a distributed analytic system
for large-scale spatiotemporal mobile
broadband data . . . . . . . . . . . . . 1561--1564
Floris Geerts and
Giansalvatore Mecca and
Paolo Papotti and
Donatello Santoro That's all folks!: Llunatic goes open
source . . . . . . . . . . . . . . . . . 1565--1568
Weimo Liu and
Saad Bin Suhaim and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das and
Ali Jaoua HDBTracker: monitoring the aggregates on
dynamic hidden web databases . . . . . . 1569--1572
Fan Xia and
Ye Li and
Chengcheng Yu and
Haixin Ma and
Weining Qian BSMA: a benchmark for analytical queries
over social media data . . . . . . . . . 1573--1576
André Petermann and
Martin Junghanns and
Robert Müller and
Erhard Rahm Graph-based data integration and
business intelligence with BIIIG . . . . 1577--1580
Manasi Vartak and
Samuel Madden and
Aditya Parameswaran and
Neoklis Polyzotis SeeDB: automatically generating query
visualizations . . . . . . . . . . . . . 1581--1584
Anshuman Dutt and
Sumit Neelam and
Jayant R. Haritsa QUEST: an exploratory approach to robust
query processing . . . . . . . . . . . . 1585--1588
Chuan Lei and
Zhongfang Zhuang and
Elke A. Rundensteiner and
Mohamed Y. Eltabakh Redoop infrastructure for recurring big
data queries . . . . . . . . . . . . . . 1589--1592
Matteo Brucato and
Rahul Ramakrishna and
Azza Abouzied and
Alexandra Meliou PackageBuilder: from tuples to packages 1593--1596
Yael Amsterdamer and
Susan B. Davidson and
Tova Milo and
Slava Novgorodov and
Amit Somech Ontology assisted crowd mining . . . . . 1597--1600
Lisi Chen and
Yan Cui and
Gao Cong and
Xin Cao SOPS: a system for efficient processing
of spatial-keyword publish/subscribe . . 1601--1604
Masumi Shirakawa and
Takahiro Hara and
Shojiro Nishio MLJ: language-independent real-time
search of tweets reported by media
outlets and journalists . . . . . . . . 1605--1608
Sebastian Breß and
Bastian Köcher and
Max Heimel and
Volker Markl and
Michael Saecker and
Gunter Saake Ocelot/HyPE: optimized data processing
on heterogeneous hardware . . . . . . . 1609--1612
Fei Wu and
Tobias Kin Hou Lei and
Zhenhui Li and
Jiawei Han MoveMine 2.0: mining object
relationships from movement data . . . . 1613--1616
Liwen Sun and
Sanjay Krishnan and
Reynold S. Xin and
Michael J. Franklin A partitioning framework for aggressive
data skipping . . . . . . . . . . . . . 1617--1620
Lei Cao and
Qingyang Wang and
Elke A. Rundensteiner Interactive outlier exploration in big
data streams . . . . . . . . . . . . . . 1621--1624
Quoc-Cuong To and
Benjamin Nguyen and
Philippe Pucheral SQL/AA: executing SQL on an asymmetric
architecture . . . . . . . . . . . . . . 1625--1628
Zhao Chen and
Rui Fu and
Ziyuan Zhao and
Zheng Liu and
Leihao Xia and
Lei Chen and
Peng Cheng and
Caleb Chen Cao and
Yongxin Tong and
Chen Jason Zhang gMission: a general spatial
crowdsourcing platform . . . . . . . . . 1629--1632
Ugur Cetintemel and
Jiang Du and
Tim Kraska and
Samuel Madden and
David Maier and
John Meehan and
Andrew Pavlo and
Michael Stonebraker and
Erik Sutherland and
Nesime Tatbul and
Kristin Tufte and
Hao Wang and
Stanley Zdonik S-Store: a streaming NewSQL system for
big velocity applications . . . . . . . 1633--1636
Runquan Xie and
Feida Zhu and
Hui Ma and
Wei Xie and
Chen Lin CLEar: a real-time online observatory
for bursty and viral events . . . . . . 1637--1640
Young-Kyoon Suh and
Richard T. Snodgrass and
Rui Zhang AZDBLab: a laboratory information system
for large-scale empirical DBMS studies 1641--1644
Qi Wang and
Manohar Kaul and
Cheng Long and
Raymond Chi-Wing Wong Terrain-Toolkit: a multi-functional tool
for terrain data . . . . . . . . . . . . 1645--1648
Yupeng Fu and
Kian Win Ong and
Yannis Papakonstantinou and
Erick Zamora Forward: data-centric ULS using
declarative templates that efficiently
wrap third-party JavaScript components 1649--1652
Xika Lin and
Abhishek Mukherji and
Elke A. Rundensteiner and
Matthew O. Ward SPIRE: supporting parameter-driven
interactive rule mining and exploration 1653--1656
Michael R. Anderson and
Michael Cafarella and
Yixing Jiang and
Guan Wang and
Bochun Zhang An integrated development environment
for faster feature engineering . . . . . 1657--1660
Pengcheng Xiong and
Hakan Hacigümüs Pronto: a software-defined networking
based system for performance management
of analytical queries on distributed
data stores . . . . . . . . . . . . . . 1661--1664
Rui Zhang and
Reshu Jain and
Prasenjit Sarkar and
Lukas Rupprecht Getting your big data priorities
straight: a demonstration of
priority-based QoS using
social-network-driven stock
recommendation . . . . . . . . . . . . . 1665--1668
Alekh Jindal and
Praynaa Rawlani and
Eugene Wu and
Samuel Madden and
Amol Deshpande and
Mike Stonebraker Vertexica: your relational friend for
graph analytics! . . . . . . . . . . . . 1669--1672
Abdul Quamar and
Amol Deshpande and
Jimmy Lin NScale: neighborhood-centric analytics
on large graphs . . . . . . . . . . . . 1673--1676
Haoran Li and
Li Xiong and
Lifan Zhang and
Xiaoqian Jiang DPSynthesizer: differentially private
data synthesizer for privacy preserving
data sharing . . . . . . . . . . . . . . 1677--1680
Longbo Kong and
Zhi Liu and
Yan Huang SPOT: locating social media users based
on social network context . . . . . . . 1681--1684
Zohreh Alavi and
Lu Zhou and
James Powers and
Keke Chen RASP-QS: efficient and confidential
query services in the cloud . . . . . . 1685--1688
Mayuresh Kunjir and
Prajakta Kalmegh and
Shivnath Babu Thoth: towards managing a multi-system
cluster . . . . . . . . . . . . . . . . 1689--1692
Lei Zhang and
Achim Rettinger X-LiSA: cross-lingual semantic
annotation . . . . . . . . . . . . . . . 1693--1696
Prasanth Jayachandran and
Karthik Tunga and
Niranjan Kamat and
Arnab Nandi Combining user interaction, speculative
query execution and sampling in the DICE
system . . . . . . . . . . . . . . . . . 1697--1700
Han Su and
Kai Zheng and
Kai Zeng and
Jiamin Huang and
Xiaofang Zhou STMaker: a system to make sense of
trajectory data . . . . . . . . . . . . 1701--1704
Uwe Jugel and
Zbigniew Jerzak and
Gregor Hackenbroich and
Volker Markl Faster visual analytics through
pixel-perfect aggregation . . . . . . . 1705--1708
Arijit Khan and
Sameh Elnikety Systems for big-graphs . . . . . . . . . 1709--1710
Avigdor Gal Uncertain entity resolution:
re-evaluating entity resolution in the
big data era: tutorial . . . . . . . . . 1711--1712
Fabian M. Suchanek and
Gerhard Weikum Knowledge bases in the age of big data
analytics . . . . . . . . . . . . . . . 1713--1714
Alexandra Meliou and
Sudeepa Roy and
Dan Suciu Causality and explanations in databases 1715--1716
Yunyao Li and
Ziyang Liu and
Huaiyu Zhu Enterprise search in the big data era:
recent developments and open challenges 1717--1718
Yunyao Li and
Erich Neuhold VLDB 2014 Ph.D. workshop: an overview 1719--1719
Shivakumar Venkataraman and
Divyakant Agrawal Datacenters as computers: Google
engineering & database research
perspectives . . . . . . . . . . . . . . 1720--1721
Hasso Plattner The impact of columnar in-memory
databases on enterprise systems:
implications of eliminating
transaction-maintained aggregates . . . 1722--1729
Volker Markl Breaking the chains: on declarative data
analysis and data independence in the
big data era . . . . . . . . . . . . . . 1730--1733
Thomas Neumann Engineering high-performance database
engines . . . . . . . . . . . . . . . . 1734--1741
Wei Cao and
Feng Yu and
Jiasen Xie Realization of the low cost and high
performance MySQL cloud database . . . . 1742--1747
An Qin and
Dianming Hu and
Jun Liu and
Wenjun Yang and
Dai Tan Fatman: cost-saving and reliable
archival storage based on volunteer
resources . . . . . . . . . . . . . . . 1748--1753
Shiming Zhang and
Yin Yang and
Wei Fan and
Marianne Winslett Design and implementation of a real-time
interactive analytics system for large
spatio-temporal data . . . . . . . . . . 1754--1759
Chaoyue Dai and
Feng Qian and
Wei Jiang and
Zhoutian Wang and
Zenghong Wu A personalized recommendation system for
NetEase dating site . . . . . . . . . . 1760--1765
Zheng Jye Ling and
Quoc Trung Tran and
Ju Fan and
Gerald C. H. Koh and
Thi Nguyen and
Chuen Seng Tan and
James W. L. Yip and
Meihui Zhang GEMINI: an integrative healthcare
analytics system . . . . . . . . . . . . 1766--1771
Yongqiang Zou and
Xing Jin and
Yi Li and
Zhimao Guo and
Eryu Wang and
Bin Xiao Mariana: Tencent deep learning platform
and its applications . . . . . . . . . . 1772--1777
Sai Wu and
Chun Chen and
Gang Chen and
Ke Chen and
Lidan Shou and
Hui Cao and
He Bai YZStack: provisioning customizable
solution for big data . . . . . . . . . 1778--1783
Yannis Klonatos and
Christoph Koch and
Tiark Rompf and
Hassan Chafi Errata for ``Building efficient query
engines in a high-level language'':
PVLDB \bf 7(10):853--864 . . . . . . . . 1784--1784
Wei Lu and
Shanshan Chen and
Keqian Li and
Laks V. S. Lakshmanan Show me the money: dynamic
recommendations for revenue maximization 1785--1796
Peng Lu and
Gang Chen and
Beng Chin Ooi and
Hoang Tam Vo and
Sai Wu ScalaGiST: scalable generalized search
trees for MapReduce systems [innovative
systems paper] . . . . . . . . . . . . . 1797--1808
Mohan Yang and
Bolin Ding and
Surajit Chaudhuri and
Kaushik Chakrabarti Finding patterns in a knowledge base
using keywords to compose table answers 1809--1820
Da Yan and
James Cheng and
Kai Xing and
Yi Lu and
Wilfred Ng and
Yingyi Bu Pregel algorithms for graph connectivity
problems with performance guarantees . . 1821--1832
Zechao Shang and
Jeffrey Xu Yu Auto-approximation of graph computing 1833--1844
Tim Furche and
Georg Gottlob and
Giovanni Grasso and
Xiaonan Guo and
Giorgio Orsi and
Christian Schallhart and
Cheng Wang DIADEM: thousands of websites to a
single database . . . . . . . . . . . . 1845--1856
Wentao Wu and
Xi Wu and
Hakan Hacigümüs and
Jeffrey F. Naughton Uncertainty aware query execution time
prediction . . . . . . . . . . . . . . . 1857--1868
George Konstantinidis and
José Luis Ambite Optimizing the chase: scalable data
integration under constraints . . . . . 1869--1880
Manos Athanassoulis and
Anastasia Ailamaki BF-tree: approximate tree indexing . . . 1881--1892
Pinar Tözün and
Islam Atta and
Anastasia Ailamaki and
Andreas Moshovos ADDICT: advanced instruction chasing for
transactions . . . . . . . . . . . . . . 1893--1904
Sattam Alsubaiee and
Yasser Altowim and
Hotham Altwaijry and
Alexander Behm and
Vinayak Borkar and
Yingyi Bu and
Michael Carey and
Inci Cetindil and
Madhusudan Cheelangi and
Khurram Faraaz and
Eugenia Gabrielova and
Raman Grover and
Zachary Heilbron and
Young-Seok Kim and
Chen Li and
Guangqiang Li and
Ji Mahn Ok and
Nicola Onose and
Pouria Pirzadeh and
Vassilis Tsotras and
Rares Vernica and
Jian Wen and
Till Westmann AsterixDB: a scalable, open source BDMS 1905--1916
Ning Xu and
Lei Chen and
Bin Cui LogGP: a log-based dynamic graph
partitioning method . . . . . . . . . . 1917--1928
George Papadakis and
George Papastefanatos and
Georgia Koutrika Supervised meta-blocking . . . . . . . . 1929--1940
Min Xie and
Laks V. S. Lakshmanan and
Peter T. Wood Generating top-$k$ packages via
preference elicitation . . . . . . . . . 1941--1952
Rui Li and
Alex X. Liu and
Ann L. Wang and
Bezawada Bruhadeshwar Fast range query processing with strong
privacy protection for cloud computing 1953--1964
Yihan Gao and
Aditya Parameswaran Finish them!: pricing algorithms for
human computation . . . . . . . . . . . 1965--1976
Michele Catasta and
Alberto Tonon and
Djellel Eddine Difallah and
Gianluca Demartini and
Karl Aberer and
Philippe Cudre-Mauroux TransactiveDB: tapping into collective
human memories . . . . . . . . . . . . . 1977--1980
Da Yan and
James Cheng and
Yi Lu and
Wilfred Ng Blogel: a block-centric framework for
distributed computation on real-world
graphs . . . . . . . . . . . . . . . . . 1981--1992
John Liagouris and
Manolis Terrovitis Efficient identification of implicit
facts in incomplete OWL2-EL knowledge
bases . . . . . . . . . . . . . . . . . 1993--2004
Chen Jason Zhang and
Yongxin Tong and
Lei Chen Where to: crowd-aided path selection . . 2005--2016
Yan Huang and
Favyen Bastani and
Ruoming Jin and
Xiaoyang Sean Wang Large scale real-time ridesharing with
service guarantee on road networks . . . 2017--2028
Yifang Sun and
Wei Wang and
Jianbin Qin and
Ying Zhang and
Xuemin Lin SRS: solving $c$-approximate nearest
neighbor queries in high dimensional
Euclidean space with a tiny index . . . 1--12
Michele Dallachiesa and
Themis Palpanas and
Ihab F. Ilyas Top-$k$ nearest neighbor search in
uncertain data series . . . . . . . . . 13--24
Jiexing Li and
Jeffrey Naughton and
Rimma V. Nehme Resource bricolage for parallel database
systems . . . . . . . . . . . . . . . . 25--36
Goetz Graefe and
Haris Volos and
Hideaki Kimura and
Harumi Kuno and
Joseph Tucek and
Mark Lillibridge and
Alistair Veitch In-memory performance for big data . . . 37--48
Cheng Long and
Raymond Chi-Wing Wong and
H. V. Jagadish Trajectory simplification: on minimizing
the direction-based error . . . . . . . 49--60
Kareem El Gebaly and
Parag Agrawal and
Lukasz Golab and
Flip Korn and
Divesh Srivastava Interpretable and informative
explanations of outcomes . . . . . . . . 61--72
Fei Li and
H. V. Jagadish Constructing an interactive natural
language interface for relational
databases . . . . . . . . . . . . . . . 73--84
Yuanyuan Zhu and
Jeffrey Xu Yu and
Lu Qin Leveraging graph dimensions in online
graph search . . . . . . . . . . . . . . 85--96
Darius Sidlauskas and
Christian S. Jensen Spatial joins in main memory:
implementation matters! . . . . . . . . 97--100
Xiaoyang Wang and
Ying Zhang and
Wenjie Zhang and
Xuemin Lin and
Wei Wang Selectivity estimation on streaming
spatio-textual data using local
correlations . . . . . . . . . . . . . . 101--112
Chuanwen Li and
Yu Gu and
Jianzhong Qi and
Ge Yu and
Rui Zhang and
Wang Yi Processing moving $k$ NN queries using
influential neighbor sets . . . . . . . 113--124
Barzan Mozafari and
Purna Sarkar and
Michael Franklin and
Michael Jordan and
Samuel Madden Scaling up crowd-sourcing to very large
datasets: a case for active learning . . 125--136
Dingyu Yang and
Dongxiang Zhang and
Kian-Lee Tan and
Jian Cao and
Frédéric Le Mouël CANDS: continuous optimal navigation via
distributed stream processing . . . . . 137--148
Nurjahan Begum and
Eamonn Keogh Rare time series motif discovery from
unbounded streams . . . . . . . . . . . 149--160
Yingyi Bu and
Vinayak Borkar and
Jianfeng Jia and
Michael J. Carey and
Tyson Condie Pregelix: Big(ger) graph analytics on a
dataflow engine . . . . . . . . . . . . 161--172
Shriram Sridharan and
Jignesh M. Patel Profiling R on a contemporary processor 173--184
Peter Bailis and
Alan Fekete and
Michael J. Franklin and
Ali Ghodsi and
Joseph M. Hellerstein and
Ion Stoica Coordination avoidance in database
systems . . . . . . . . . . . . . . . . 185--196
Qiang Zeng and
Jignesh M. Patel and
David Page QuickFOIL: scalable inductive logic
programming . . . . . . . . . . . . . . 197--208
Xiangyao Yu and
George Bezerra and
Andrew Pavlo and
Srinivas Devadas and
Michael Stonebraker Staring into the abyss: an evaluation of
concurrency control with one thousand
cores . . . . . . . . . . . . . . . . . 209--220
Immanuel Trummer and
Christoph Koch Multi-objective parametric query
optimization . . . . . . . . . . . . . . 221--232
Jana Giceva and
Gustavo Alonso and
Timothy Roscoe and
Tim Harris Deployment of query plans on multicores 233--244
Rebecca Taft and
Essam Mansour and
Marco Serafini and
Jennie Duggan and
Aaron J. Elmore and
Ashraf Aboulnaga and
Andrew Pavlo and
Michael Stonebraker E-store: fine-grained elastic
partitioning for distributed transaction
processing systems . . . . . . . . . . . 245--256
Saravanan Thirumuruganathan and
Habibur Rahman and
Sofiane Abbar and
Gautam Das Beyond itemsets: mining frequent
featuresets over structured items . . . 257--268
Jun Zhang and
Chaokun Wang and
Jianmin Wang and
Jeffrey Xu Yu Inferring continuous dynamic social
influence and personal preference for
temporal behavior prediction . . . . . . 269--280
Yi Lu and
James Cheng and
Da Yan and
Huanhuan Wu Large-scale distributed graph computing
systems: an experimental evaluation . . 281--292
Hiroshi Inoue and
Moriyoshi Ohara and
Kenjiro Taura Faster set intersection with SIMD
instructions by reducing branch
mispredictions . . . . . . . . . . . . . 293--304
Ahmed El-Kishky and
Yanglei Song and
Chi Wang and
Clare R. Voss and
Jiawei Han Scalable topical phrase mining from text
corpora . . . . . . . . . . . . . . . . 305--316
Wenbo Tao and
Minghe Yu and
Guoliang Li Efficient top-$k$ simrank-based
similarity join . . . . . . . . . . . . 317--328
Jiong He and
Shuhao Zhang and
Bingsheng He In-cache query co-processing on coupled
CPU--GPU architectures . . . . . . . . . 329--340
Yasuhiro Fujiwara and
Go Irie and
Shari Kuroyama and
Makoto Onizuka Scaling manifold ranking based image
retrieval . . . . . . . . . . . . . . . 341--352
R. Barber and
G. Lohman and
I. Pandis and
V. Raman and
R. Sidle and
G. Attaluri and
N. Chainani and
S. Lightstone and
D. Sharpe Memory-efficient hash joins . . . . . . 353--364
Bogdan Alexe and
Mary Roth and
Wang-Chiew Tan Preference-aware integration of temporal
data . . . . . . . . . . . . . . . . . . 365--376
Chang Zhou and
Jun Gao and
Binbin Sun and
Jeffrey Xu Yu MOCgraph: scalable distributed graph
processing using message online
computing . . . . . . . . . . . . . . . 377--388
Jian Huang and
Karsten Schwan and
Moinuddin K. Qureshi NVRAM-aware logging in transaction
systems . . . . . . . . . . . . . . . . 389--400
Badrish Chandramouli and
Jonathan Goldstein and
Mike Barnett and
Robert DeLine and
Danyel Fisher and
John C. Platt and
James F. Terwilliger and
John Wernsing Trill: a high-performance incremental
query processor for diverse analytics 401--412
Chunyao Song and
Tingjian Ge and
Cindy Chen and
Jie Wang Event pattern matching over graph
streams . . . . . . . . . . . . . . . . 413--424
Qi Li and
Yaliang Li and
Jing Gao and
Lu Su and
Bo Zhao and
Murat Demirbas and
Wei Fan and
Jiawei Han A confidence-aware approach for truth
discovery on long-tail data . . . . . . 425--436
Yanyan Shen and
Gang Chen and
H. V. Jagadish and
Wei Lu and
Beng Chin Ooi and
Bogdan Marius Tudor Fast failure recovery in distributed
graph processing systems . . . . . . . . 437--448
Manuel Then and
Moritz Kaufmann and
Fernando Chirigati and
Tuan-Anh Hoang-Vu and
Kien Pham and
Alfons Kemper and
Thomas Neumann and
Huy T. Vo The more the merrier: efficient
multi-source graph traversal . . . . . . 449--460
Sebastian Wandelt and
Ulf Leser MRCSI: compressing and searching string
collections with multiple references . . 461--472
Rui Ding and
Qiang Wang and
Yingnong Dang and
Qiang Fu and
Haidong Zhang and
Dongmei Zhang YADING: fast clustering of large-scale
time series data . . . . . . . . . . . . 473--484
Ting Wu and
Lei Chen and
Pan Hui and
Chen Jason Zhang and
Weikai Li Hear the whole story: towards the
diversity of opinion in crowdsourcing
markets . . . . . . . . . . . . . . . . 485--496
Andreas Chatzistergiou and
Marcelo Cintra and
Stratis D. Viglas REWIND: recovery write-ahead system for
in-memory non-volatile data-structures 497--508
Rong-Hua Li and
Lu Qin and
Jeffrey Xu Yu and
Rui Mao Influential community search in large
networks . . . . . . . . . . . . . . . . 509--520
Albert Kim and
Eric Blais and
Aditya Parameswaran and
Piotr Indyk and
Sam Madden and
Ronitt Rubinfeld Rapid sampling for visualizations with
ordering guarantees . . . . . . . . . . 521--532
Lijun Chang and
Xuemin Lin and
Wenjie Zhang and
Jeffrey Xu Yu and
Ying Zhang and
Lu Qin Optimal enumeration: efficient top-$k$
tree matching . . . . . . . . . . . . . 533--544
Arnon Lazerson and
Izchak Sharfman and
Daniel Keren and
Assaf Schuster and
Minos Garofalakis and
Vasilis Samoladas Monitoring distributed streams using
convex decompositions . . . . . . . . . 545--556
Kun Li and
Daisy Zhe Wang and
Alin Dobra and
Christopher Dudley UDA-GIST: an in-database framework to
unify data-parallel and state-parallel
analytics . . . . . . . . . . . . . . . 557--568
Weiren Yu and
Julie A. McCann Efficient partial-pairs SimRank search
on large networks . . . . . . . . . . . 569--580
Wolfgang Gatterbauer and
Stephan Günnemann and
Danai Koutra and
Christos Faloutsos Linearized and single-pass belief
propagation . . . . . . . . . . . . . . 581--592
Loc Do and
Hady W. Lauw and
Ke Wang Mining revenue-maximizing bundling
configuration . . . . . . . . . . . . . 593--604
Shiyu Yang and
Muhammad Aamir Cheema and
Xuemin Lin and
Wei Wang Reverse $k$ nearest neighbors query
processing: experiments and analysis . . 605--616
Xuguang Ren and
Junhu Wang Exploiting vertex relationships in
speeding up subgraph isomorphism over
large graphs . . . . . . . . . . . . . . 617--628
Wolfgang Gatterbauer and
Dan Suciu Approximate lifted inference with
probabilistic databases . . . . . . . . 629--640
Norases Vesdapunt and
Kedar Bellare and
Nilesh Dalvi Errata for ``Crowdsourcing algorithms
for entity resolution'': (PVLDB \bf
7(12): 1071--1082) . . . . . . . . . . . 641--641
Saurabh Jha and
Bingsheng He and
Mian Lu and
Xuntao Cheng and
Huynh Phung Huynh Improving main memory hash joins on
Intel Xeon Phi processors: an
experimental approach . . . . . . . . . 642--653
Mohammad Hammoud and
Dania Abed Rabbou and
Reza Nouri and
Seyed-Mehdi-Reza Beheshti and
Sherif Sakr DREAM: distributed RDF engine with
adaptive query planner and minimal
communication . . . . . . . . . . . . . 654--665
Shuo Chen and
Ju Fan and
Guoliang Li and
Jianhua Feng and
Kian-lee Tan and
Jinhui Tang Online topic-aware influence
maximization . . . . . . . . . . . . . . 666--677
Azade Nazi and
Zhuojie Zhou and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Walk, not wait: faster sampling over
online social networks . . . . . . . . . 678--689
Michael Benedikt and
Julien Leblay and
Efthymia Tsamoura Querying with access patterns and
integrity constraints . . . . . . . . . 690--701
Kanat Tangwongsan and
Martin Hirzel and
Scott Schneider and
Kun-Lung Wu General incremental sliding-window
aggregation . . . . . . . . . . . . . . 702--713
Chuan Lei and
Zhongfang Zhuang and
Elke A. Rundensteiner and
Mohamed Eltabakh Shared execution of recurring workloads
in MapReduce . . . . . . . . . . . . . . 714--725
Vivek Narasayya and
Ishai Menache and
Mohit Singh and
Feng Li and
Manoj Syamala and
Surajit Chaudhuri Sharing buffer pool memory in
multi-tenant relational
database-as-a-service . . . . . . . . . 726--737
Yunjun Gao and
Qing Liu and
Gang Chen and
Baihua Zheng and
Linlin Zhou Answering why-not questions on reverse
top-$k$ queries . . . . . . . . . . . . 738--749
Dimitrios Papadopoulos and
Charalampos Papamanthou and
Roberto Tamassia and
Nikos Triandopoulos Practical authenticated pattern matching
with optimal proof size . . . . . . . . 750--761
Dumitrel Loghin and
Bogdan Marius Tudor and
Hao Zhang and
Beng Chin Ooi and
Yong Meng Teo A performance study of big data on small
nodes . . . . . . . . . . . . . . . . . 762--773
Thorsten Papenbrock and
Sebastian Kruse and
Jorge-Arnulfo Quiané-Ruiz and
Felix Naumann Divide & conquer-based inclusion
dependency discovery . . . . . . . . . . 774--785
Shimin Chen and
Qin Jin Persistent B+-trees in non-volatile main
memory . . . . . . . . . . . . . . . . . 786--797
Yubao Wu and
Ruoming Jin and
Jing Li and
Xiang Zhang Robust local community detection: on
free rider effect and its elimination 798--809
Hua Fan and
Aditya Ramaraju and
Marlon McKenzie and
Wojciech Golab and
Bernard Wong Understanding the causes of consistency
anomalies in Apache Cassandra . . . . . 810--813
Cigdem Aslay and
Wei Lu and
Francesco Bonchi and
Amit Goyal and
Laks V. S. Lakshmanan Viral marketing meets social
advertising: ad allocation with minimum
regret . . . . . . . . . . . . . . . . . 814--825
Lingyang Chu and
Shuhui Wang and
Siyuan Liu and
Qingming Huang and
Jian Pei ALID: scalable dominant cluster
detection . . . . . . . . . . . . . . . 826--837
Yingxia Shao and
Bin Cui and
Lei Chen and
Mingming Liu and
Xing Xie An efficient similarity search framework
for SimRank over large dynamic graphs 838--849
Muhammad Yousuf Ahmad and
Bettina Kemme Compaction management in distributed
key--value datastores . . . . . . . . . 850--861
Rachid Guerraoui and
Anne-Marie Kermarrec and
Rhicheek Patra and
Mahsa Taziki D2P: distance-based differential privacy
in recommenders . . . . . . . . . . . . 862--873
Ioannis Mitliagkas and
Michael Borokhovich and
Alexandros G. Dimakis and
Constantine Caramanis FrogWild!: fast PageRank approximations
on graph engines . . . . . . . . . . . . 874--885
Andrea Vattani and
Flavio Chierichetti and
Keegan Lowenstein Optimal probabilistic cache stampede
prevention . . . . . . . . . . . . . . . 886--897
Navneet Potti and
Jignesh M. Patel DAQ: a new paradigm for approximate
query processing . . . . . . . . . . . . 898--909
Nicolas Anciaux and
Saliha Lallali and
Iulian Sandu Popa and
Philippe Pucheral A scalable search engine for mass
storage smart objects . . . . . . . . . 910--921
Lanjun Wang and
Shuo Zhang and
Juwei Shi and
Limei Jiao and
Oktie Hassanzadeh and
Jia Zou and
Chen Wangz Schema management for document stores 922--933
Felix Martin Schuhknecht and
Pankaj Khanchandani and
Jens Dittrich On the surprising difficulty of simple
things: the case of radix partitioning 934--937
Xin Luna Dong and
Evgeniy Gabrilovich and
Kevin Murphy and
Van Dang and
Wilko Horn and
Camillo Lugaresi and
Shaohua Sun and
Wei Zhang Knowledge-based trust: estimating the
trustworthiness of web sources . . . . . 938--949
Minyang Han and
Khuzaima Daudjee Giraph unchained: barrierless
asynchronous parallel execution in
Pregel-like graph processing systems . . 950--961
Kenneth S. Bògh and
Sean Chester and
Ira Assent Work-efficient parallel skyline
computation for the GPU . . . . . . . . 962--973
Longbin Lai and
Lu Qin and
Xuemin Lin and
Lijun Chang Scalable subgraph enumeration in
MapReduce . . . . . . . . . . . . . . . 974--985
Jan Finis and
Robert Brunel and
Alfons Kemper and
Thomas Neumann and
Norman May and
Franz Faerber Indexing highly dynamic hierarchical
data . . . . . . . . . . . . . . . . . . 986--997
Meng Wang and
Chaokun Wang and
Jeffrey Xu Yu and
Jun Zhang Community detection in social networks:
an in-depth benchmarking study with a
procedure-oriented framework . . . . . . 998--1009
Ehsan Kazemi and
S. Hamed Hassani and
Matthias Grossglauser Growing a graph matching from a handful
of seeds . . . . . . . . . . . . . . . . 1010--1021
Peng Cheng and
Xiang Lian and
Zhao Chen and
Rui Fu and
Lei Chen and
Jinsong Han and
Jizhong Zhao Reliable diversity-based spatial
crowdsourcing by moving workers . . . . 1022--1033
Zhuojie Zhou and
Nan Zhang and
Gautam Das Leveraging history for faster sampling
of online social networks . . . . . . . 1034--1045
Yufei Ding and
Xipeng Shen and
Madanlal Musuvathi and
Todd Mytkowicz TOP: a framework for enabling
algorithmic optimizations for
distance-related problems . . . . . . . 1046--1057
Viktor Leis and
Kan Kundhikanjana and
Alfons Kemper and
Thomas Neumann Efficient processing of window functions
in analytical SQL queries . . . . . . . 1058--1069
Yuchen Li and
Dongxiang Zhang and
Kian-Lee Tan Real-time targeted influence
maximization for online advertisements 1070--1081
Thorsten Papenbrock and
Jens Ehrlich and
Jannik Marten and
Tommy Neubert and
Jan-Peer Rudolph and
Martin Schönberg and
Jakob Zwiener and
Felix Naumann Functional dependency discovery: an
experimental evaluation of seven
algorithms . . . . . . . . . . . . . . . 1082--1093
Alexander Kalinin and
Ugur Cetintemel and
Stan Zdonik Searchlight: enabling integrated search
and exploration over large
multidimensional data . . . . . . . . . 1094--1105
Md Farhadur Rahman and
Weimo Liu and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Privacy implications of database ranking 1106--1117
Henning Köhler and
Sebastian Link and
Xiaofang Zhou Possible and certain SQL keys . . . . . 1118--1129
Yu Tang and
Yilun Cai and
Nikos Mamoulis Scaling similarity joins over
tree-structured data . . . . . . . . . . 1130--1141
Habibur Rahman and
Saravanan Thirumuruganathan and
Senjuti Basu Roy and
Sihem Amer-Yahia and
Gautam Das Worker skill estimation in team-based
tasks . . . . . . . . . . . . . . . . . 1142--1153
Xi He and
Graham Cormode and
Ashwin Machanavajjhala and
Cecilia M. Procopiuc and
Divesh Srivastava DPT: differentially private trajectory
synthesis using hierarchical reference
systems . . . . . . . . . . . . . . . . 1154--1165
Boduo Li and
Yanlei Diao and
Prashant Shenoy Supporting scalable analytics with
latency constraints . . . . . . . . . . 1166--1177
Hiroaki Shiokawa and
Yasuhiro Fujiwara and
Makoto Onizuka SCAN++: efficient algorithm for finding
clusters, hubs and outliers on
large-scale graphs . . . . . . . . . . . 1178--1189
Jose M. Faleiro and
Daniel J. Abadi Rethinking serializable multiversion
concurrency control . . . . . . . . . . 1190--1201
Bryan Brancotte and
Bo Yang and
Guillaume Blin and
Sarah Cohen-Boulakia and
Alain Denise and
Sylvie Hamel Rank aggregation with ties: experiments
and analysis . . . . . . . . . . . . . . 1202--1213
Narayanan Sundaram and
Nadathur Satish and
Md Mostofa Ali Patwary and
Subramanya R. Dulloor and
Michael J. Anderson and
Satya Gautam Vadlamudi and
Dipankar Das and
Pradeep Dubey GraphMat: high performance graph
analytics made productive . . . . . . . 1214--1225
Kai Zhang and
Kaibo Wang and
Yuan Yuan and
Lei Guo and
Rubao Lee and
Xiaodong Zhang Mega-KV: a case for GPUs to maximize the
throughput of in-memory key--value
stores . . . . . . . . . . . . . . . . . 1226--1237
Jinha Kim and
Hyungyu Shin and
Wook-Shin Han and
Sungpack Hong and
Hassan Chafi Taming subgraph isomorphism for RDF
query processing . . . . . . . . . . . . 1238--1249
Lilong Jiang and
Arnab Nandi SnapToQuery: providing interactive
feedback during exploratory query
specification . . . . . . . . . . . . . 1250--1261
Yang Zhou and
Ling Liu and
Kisung Lee and
Qi Zhang GraphTwist: fast iterative graph
computation with two-tier optimizations 1262--1273
Hiroshi Inoue and
Kenjiro Taura SIMD- and cache-friendly algorithm for
sorting an array of structures . . . . . 1274--1285
Shaoxu Song and
Aoqian Zhang and
Lei Chen and
Jianmin Wang Enriching data imputation with extensive
similarity neighbors . . . . . . . . . . 1286--1297
Darko Makreshanski and
Justin Levandoski and
Ryan Stutsman To lock, swap, or elide: on the
interplay of hardware transactional
memory and lock-free indexing . . . . . 1298--1309
Jaeho Shin and
Sen Wu and
Feiran Wang and
Christopher De Sa and
Ce Zhang and
Christopher Ré Incremental knowledge base construction
using DeepDive . . . . . . . . . . . . . 1310--1321
Li Qian and
Jinyang Gao and
H. V. Jagadish Learning user preferences by adaptive
pairwise comparison . . . . . . . . . . 1322--1333
Weimo Liu and
Md Farhadur Rahman and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Aggregate estimations over location
based services . . . . . . . . . . . . . 1334--1345
Souvik Bhattacherjee and
Amit Chavan and
Silu Huang and
Amol Deshpande and
Aditya Parameswaran Principles of dataset versioning:
exploring the recreation/storage
tradeoff . . . . . . . . . . . . . . . . 1346--1357
Yeye He and
Kris Ganjam and
Xu Chu SEMA--JOIN: joining semantically-related
tables using big table corpora . . . . . 1358--1369
Sanjay Krishnan and
Jiannan Wang and
Michael J. Franklin and
Ken Goldberg and
Tim Kraska Stale view cleaning: getting fresh
answers from stale materialized views 1370--1381
Parth Nagarkar and
K. Selçuk Candan and
Aneesha Bhat Compressed spatial hierarchical bitmap
(cSHB) indexes for efficiently
processing spatial range query workloads 1382--1393
Daniel Deutch and
Amir Gilad and
Yuval Moskovitch Selective provenance for datalog
programs using top-$k$ queries . . . . . 1394--1405
Yoonjae Park and
Jun-Ki Min and
Kyuseok Shim Processing of probabilistic skyline
queries using MapReduce . . . . . . . . 1406--1417
Xiaofei Zhang and
Hong Cheng and
Lei Chen Bonding vertex sets over distributed
graph: a betweenness aware approach . . 1418--1429
Yael Amsterdamer and
Anna Kukliansky and
Tova Milo A natural language interface for
querying general and individual
knowledge . . . . . . . . . . . . . . . 1430--1441
Iraklis Psaroudakis and
Tobias Scheuer and
Norman May and
Abdelkader Sellami and
Anastasia Ailamaki Scaling up concurrent main-memory
column-store scans: towards adaptive
NUMA-aware data and task placement . . . 1442--1453
Gihwan Oh and
Sangchul Kim and
Sang-Won Lee and
Bongki Moon SQLite optimization with phase change
memory for mobile applications . . . . . 1454--1465
Andrew Crotty and
Alex Galakatos and
Kayhan Dursun and
Tim Kraska and
Carsten Binnig and
Ugur Cetintemel and
Stan Zdonik An architecture for compiling
UDF-centric workflows . . . . . . . . . 1466--1477
Daniel Margo and
Margo Seltzer A scalable distributed graph partitioner 1478--1489
Artyom Sharov and
Alexander Shraer and
Arif Merchant and
Murray Stokely Take me to your leader!: online
optimization of distributed storage
configurations . . . . . . . . . . . . . 1490--1501
Wenfei Fan and
Xin Wang and
Yinghui Wu and
Jingbo Xu Association rules with graph patterns 1502--1513
Ben Kimmett and
Venkatesh Srinivasan and
Alex Thomo Fuzzy joins in MapReduce: an
experimental study . . . . . . . . . . . 1514--1517
Minsik Cho and
Daniel Brand and
Rajesh Bordawekar and
Ulrich Finkler and
Vincent Kulandaisamy and
Ruchir Puri PARADIS: an efficient parallel algorithm
for in-place radix sort . . . . . . . . 1518--1529
David Vengerov and
Andre Cavalheiro Menck and
Mohamed Zait and
Sunil P. Chakkappen Join size estimation subject to filter
conditions . . . . . . . . . . . . . . . 1530--1541
Jingjing Wang and
Magdalena Balazinska and
Daniel Halperin Asynchronous and fault-tolerant
recursive datalog evaluation in
shared-nothing engines . . . . . . . . . 1542--1553
Kyriakos Mouratidis and
Jilian Zhang and
HweeHwa Pang Maximum rank query . . . . . . . . . . . 1554--1565
Foteini Katsarou and
Nikos Ntarmos and
Peter Triantafillou Performance and scalability of indexed
subgraph query processing methods . . . 1566--1577
Ying Yang and
Niccol\`o Meneghetti and
Ronny Fehling and
Zhen Hua Liu and
Oliver Kennedy Lenses: an on-demand approach to ETL . . 1578--1589
Wenfei Fan and
Zhe Fan and
Chao Tian and
Xin Luna Dong Keys for graphs . . . . . . . . . . . . 1590--1601
Ahmed Eldawy and
Louai Alarabi and
Mohamed F. Mokbel Spatial partitioning techniques in
SpatialHadoop . . . . . . . . . . . . . 1602--1605
Tomohiro Manabe and
Keishi Tajima Extracting logical hierarchical
structure of HTML documents based on
headings . . . . . . . . . . . . . . . . 1606--1617
Bilegsaikhan Naidan and
Leonid Boytsov and
Eric Nyberg Permutation search methods are
efficient, yet faster search is possible 1618--1629
Niloy Mukherjee and
Shasank Chavan and
Maria Colgan and
Dinesh Das and
Mike Gleeson and
Sanket Hase and
Allison Holloway and
Hui Jin and
Jesse Kamp and
Kartik Kulkarni and
Tirthankar Lahiri and
Juan Loaiza and
Neil Macnaughton and
Vineet Marwah and
Atrayee Mullick and
Andy Witkowski and
Jiaqi Yan and
Mohamed Zait Distributed architecture of Oracle
database in-memory . . . . . . . . . . . 1630--1641
Daniel Haas and
Jason Ansel and
Lydia Gu and
Adam Marcus Argonaut: macrotask crowdsourcing for
complex data processing . . . . . . . . 1642--1653
Guozhang Wang and
Joel Koshy and
Sriram Subramanian and
Kartik Paramasivam and
Mammad Zadeh and
Neha Narkhede and
Jun Rao and
Jay Kreps and
Joe Stein Building a replicated logging system
with Apache Kafka . . . . . . . . . . . 1654--1655
Alessandra Loro and
Anja Gruenheid and
Donald Kossmann and
Damien Profeta and
Philippe Beaudequin Indexing and selecting hierarchical
business logic . . . . . . . . . . . . . 1656--1667
Dharma Shukla and
Shireesh Thota and
Karthik Raman and
Madhan Gajendran and
Ankur Shah and
Sergii Ziuzin and
Krishnan Sundaram and
Miguel Gonzalez Guajardo and
Anna Wawrzyniak and
Samer Boshra and
Renato Ferreira and
Mohamed Nassar and
Michael Koltachev and
Ji Huang and
Sudipta Sengupta and
Justin Levandoski and
David Lomet Schema-agnostic indexing with Azure
DocumentDB . . . . . . . . . . . . . . . 1668--1679
Eric Boutin and
Paul Brett and
Xiaoyu Chen and
Jaliya Ekanayake and
Tao Guan and
Anna Korsun and
Zhicheng Yin and
Nan Zhang and
Jingren Zhou JetScope: reliable and interactive
analytics at cloud scale . . . . . . . . 1680--1691
Xueyang Hu and
Mingxuan Yuan and
Jianguo Yao and
Yu Deng and
Lei Chen and
Qiang Yang and
Haibing Guan and
Jia Zeng Differential privacy in telco big data
platform . . . . . . . . . . . . . . . . 1692--1703
Amr El-Helw and
Venkatesh Raghavan and
Mohamed A. Soliman and
George Caragea and
Zhongxian Gu and
Michalis Petropoulos Optimization of common table expressions
in MPP database systems . . . . . . . . 1704--1715
Anil K. Goel and
Jeffrey Pound and
Nathan Auch and
Peter Bumbulis and
Scott MacLean and
Franz Färber and
Francis Gropengiesser and
Christian Mathis and
Thomas Bodner and
Wolfgang Lehner Towards scalable real-time analytics: an
architecture for scale-out of OLxP
workloads . . . . . . . . . . . . . . . 1716--1727
Tamraparni Dasu and
Vladislav Shkapenyuk and
Divesh Srivastava and
Deborah F. Swayne FIT to monitor feed quality . . . . . . 1728--1739
Per-Åke Larson and
Adrian Birka and
Eric N. Hanson and
Weiyun Huang and
Michal Nowakiewicz and
Vassilis Papadimos Real-time analytical processing with SQL
server . . . . . . . . . . . . . . . . . 1740--1751
You Wu and
Boulos Harb and
Jun Yang and
Cong Yu Efficient evaluation of object-centric
exploration queries for visualization 1752--1763
Lin Qiao and
Yinan Li and
Sahil Takiar and
Ziyang Liu and
Narasimha Veeramreddy and
Min Tu and
Ying Dai and
Issac Buenrostro and
Kapil Surlaker and
Shirshanka Das and
Chavdar Botev Gobblin: unifying data ingestion for
Hadoop . . . . . . . . . . . . . . . . . 1764--1769
Dinesh Das and
Jiaqi Yan and
Mohamed Zait and
Satyanarayana R. Valluri and
Nirav Vyas and
Ramarajan Krishnamachari and
Prashant Gaharwar and
Jesse Kamp and
Niloy Mukherjee Query optimization in Oracle 12c
database in-memory . . . . . . . . . . . 1770--1781
Todd J. Green and
Dan Olteanu and
Geoffrey Washburn Live programming in the LogicBlox
system: a MetaLogiQL approach . . . . . 1782--1791
Tyler Akidau and
Robert Bradshaw and
Craig Chambers and
Slava Chernyak and
Rafael J. Fernández-Moctezuma and
Reuven Lax and
Sam McVeety and
Daniel Mills and
Frances Perry and
Eric Schmidt and
Sam Whittle The dataflow model: a practical approach
to balancing correctness, latency, and
cost in massive-scale, unbounded,
out-of-order data processing . . . . . . 1792--1803
Avery Ching and
Sergey Edunov and
Maja Kabiljo and
Dionysios Logothetis and
Sambavi Muthukrishnan One trillion edges: graph processing at
Facebook-scale . . . . . . . . . . . . . 1804--1815
Tuomas Pelkonen and
Scott Franklin and
Justin Teller and
Paul Cavallaro and
Qi Huang and
Justin Meza and
Kaushik Veeraraghavan Gorilla: a fast, scalable, in-memory
time series database . . . . . . . . . . 1816--1827
Rahul Potharaju and
Joseph Chan and
Luhui Hu and
Cristina Nita-Rotaru and
Mingshi Wang and
Liyuan Zhang and
Navendu Jain ConfSeer: leveraging customer support
knowledge bases for automated
misconfiguration detection . . . . . . . 1828--1839
Michael Armbrust and
Tathagata Das and
Aaron Davidson and
Ali Ghodsi and
Andrew Or and
Josh Rosen and
Ion Stoica and
Patrick Wendell and
Reynold Xin and
Matei Zaharia Scaling spark in the real world:
performance and usability . . . . . . . 1840--1843
Majed Sahli and
Essam Mansour and
Panos Kalnis StarDB: a large-scale DBMS for strings 1844--1847
Razen Harbi and
Ibrahim Abdelaziz and
Panos Kalnis and
Nikos Mamoulis Evaluating SPARQL queries on massive RDF
datasets . . . . . . . . . . . . . . . . 1848--1851
Ngai Meng Kou and
Leong Hou U. and
Nikos Mamoulis and
Yuhong Li and
Ye Li and
Zhiguo Gong A topic-based reviewer assignment system 1852--1855
Miguel Liroz-Gistau and
Reza Akbarinia and
Patrick Valduriez FP--Hadoop: efficient execution of
parallel jobs over skewed data . . . . . 1856--1859
Thorsten Papenbrock and
Tanja Bergmann and
Moritz Finke and
Jakob Zwiener and
Felix Naumann Data profiling with Metanome . . . . . . 1860--1863
Arun Kumar and
Mona Jalal and
Boqun Yan and
Jeffrey Naughton and
Jignesh M. Patel Demonstration of Santoku: optimizing
machine learning over normalized data 1864--1867
Boon Siew Seah and
Sourav S. Bhowmick and
Aixin Sun PRISM: concept-preserving summarization
of top-$k$ social image search results 1868--1871
Tobias Müller and
Torsten Grust Provenance for SQL through abstract
interpretation: value-less, but
worthwhile . . . . . . . . . . . . . . . 1872--1875
Zhian He and
Wai Kit Wong and
Ben Kao and
David Wai Lok Cheung and
Rongbin Li and
Siu Ming Yiu and
Eric Lo SDB: a secure query processing system
with data interoperability . . . . . . . 1876--1879
Ibrahim Abdelaziz and
Razen Harbi and
Semih Salihoglu and
Panos Kalnis and
Nikos Mamoulis SPARTex: a vertex-centric framework for
RDF data analytics . . . . . . . . . . . 1880--1883
Lu Chen and
Yunjun Gao and
Zhihao Xing and
Christian S. Jensen and
Gang Chen I2RS: a distributed geo-textual image
retrieval and recommendation system . . 1884--1887
Damian Bursztyn and
François Goasdoué and
Ioana Manolescu Reformulation-based query answering in
RDF: alternatives and performance . . . 1888--1891
Marc Bux and
Jörgen Brandt and
Carsten Lipka and
Kamal Hakimzadeh and
Jim Dowling and
Ulf Leser SAASFEE: scalable scientific workflow
execution engine . . . . . . . . . . . . 1892--1895
Ahmed Eldawy and
Mohamed F. Mokbel and
Christopher Jonathan A demonstration of HadoopViz: an
extensible MapReduce system for
visualizing big spatial data . . . . . . 1896--1899
Moria Bergman and
Tova Milo and
Slava Novgorodov and
Wang-Chiew Tan QOCO: a query oriented data cleaning
system with oracles . . . . . . . . . . 1900--1903
Shanshan Ying and
Flip Korn and
Barna Saha and
Divesh Srivastava TreeScope: finding structural anomalies
in semi-structured data . . . . . . . . 1904--1907
A. Elmore and
J. Duggan and
M. Stonebraker and
M. Balazinska and
U. Cetintemel and
V. Gadepally and
J. Heer and
B. Howe and
J. Kepner and
T. Kraska and
S. Madden and
D. Maier and
T. Mattson and
S. Papadopoulos and
J. Parkhurst and
N. Tatbul and
M. Vartak and
S. Zdonik A demonstration of the BigDAWG polystore
system . . . . . . . . . . . . . . . . . 1908--1911
Kostas Zoumpatianos and
Stratos Idreos and
Themis Palpanas RINSE: interactive data series
exploration with ADS+ . . . . . . . . . 1912--1915
Anant Bhardwaj and
Amol Deshpande and
Aaron J. Elmore and
David Karger and
Sam Madden and
Aditya Parameswaran and
Harihar Subramanyam and
Eugene Wu and
Rebecca Zhang Collaborative data analytics with
DataHub . . . . . . . . . . . . . . . . 1916--1919
Jaeho Shin and
Christopher Ré and
Michael Cafarella Mindtagger: a demonstration of data
labeling in knowledge base construction 1920--1923
Danai Koutra and
Di Jin and
Yuanchi Ning and
Christos Faloutsos Perseus: an interactive large-scale
graph mining and visualization tool . . 1924--1927
Manas Joglekar and
Hector Garcia-Molina and
Aditya Parameswaran Smart drill-down: a new data exploration
operator . . . . . . . . . . . . . . . . 1928--1931
Curtis E. Dyreson and
Sourav S. Bhowmick and
Ryan Grapp Virtual eXist-db: liberating
hierarchical queries from the shackles
of access path dependence . . . . . . . 1932--1935
Eli Cortez and
Philip A. Bernstein and
Yeye He and
Lev Novik Annotating database schemas to help
enterprise search . . . . . . . . . . . 1936--1939
Nandish Jayaram and
Sidharth Goyal and
Chengkai Li VIIQ: auto-suggestion enabled visual
interface for interactive graph query
formulation . . . . . . . . . . . . . . 1940--1943
Qingyuan Liu and
Eduard C. Dragut and
Arjun Mukherjee and
Weiyi Meng FLORIN: a system to support (near)
real-time applications on user generated
content on daily news . . . . . . . . . 1944--1947
Yunyao Li and
Elmer Kim and
Marc A. Touchette and
Ramiya Venkatachalam and
Hao Wang VINERy: a visual IDE for information
extraction . . . . . . . . . . . . . . . 1948--1951
Xu Chu and
John Morcos and
Ihab F. Ilyas and
Mourad Ouzzani and
Paolo Papotti and
Nan Tang and
Yin Ye KATARA: reliable data cleaning with
knowledge bases and crowdsourcing . . . 1952--1955
Foteini Alvanaki and
Romulo Goncalves and
Milena Ivanova and
Martin Kersten and
Kostis Kyzirakos GIS navigation boosted by column stores 1956--1959
Patricia C. Arocena and
Radu Ciucanu and
Boris Glavic and
Renée J. Miller Gain control over your integration
evaluations . . . . . . . . . . . . . . 1960--1963
Yanlei Diao and
Kyriaki Dimitriadou and
Zhan Li and
Wenzhao Liu and
Olga Papaemmanouil and
Kemi Peng and
Liping Peng AIDE: an automatic user navigation
system for interactive data exploration 1964--1967
Ahmed M. Aly and
Ahmed S. Abdelhamid and
Ahmed R. Mahmood and
Walid G. Aref and
Mohamed S. Hassan and
Hazem Elmeleegy and
Mourad Ouzzani A demonstration of AQWA: adaptive
query-workload-aware partitioning of big
spatial data . . . . . . . . . . . . . . 1968--1971
Jens Dittrich and
Patrick Bender Janiform intra-document analytics for
reproducible research . . . . . . . . . 1972--1975
Erich Schubert and
Alexander Koos and
Tobias Emrich and
Andreas Züfle and
Klaus Arthur Schmid and
Arthur Zimek A framework for clustering uncertain
data . . . . . . . . . . . . . . . . . . 1976--1979
Nicole Bidoit and
Melanie Herschel and
Katerina Tzompanaki EFQ: why-not answer polynomials in
action . . . . . . . . . . . . . . . . . 1980--1983
Xiaolan Wang and
Mary Feng and
Yue Wang and
Xin Luna Dong and
Alexandra Meliou Error diagnosis and data profiling with
DataXRay . . . . . . . . . . . . . . . . 1984--1987
Quan Pham and
Severin Thaler and
Tanu Malik and
Ian Foster and
Boris Glavic Sharing and reproducing database
applications . . . . . . . . . . . . . . 1988--1991
Marcin Wylot and
Philippe Cudré-Mauroux and
Paul Groth A demonstration of TripleProv: tracking
and querying provenance over Web data 1992--1995
Stefano Ortona and
Giorgio Orsi and
Marcello Buoncristiano and
Tim Furche WADaR: joint wrapper and data repair . . 1996--1999
Mangesh Bendre and
Bofan Sun and
Ding Zhang and
Xinyan Zhou and
Kevin Chen-Chuan Chang and
Aditya Parameswaran DataSpread: unifying databases and
spreadsheets . . . . . . . . . . . . . . 2000--2003
Daniel Haas and
Sanjay Krishnan and
Jiannan Wang and
Michael J. Franklin and
Eugene Wu Wisteria: nurturing scalable data
cleaning infrastructure . . . . . . . . 2004--2007
Ashoke S. and
Jayant R. Haritsa CODD: a dataless approach to big data
testing . . . . . . . . . . . . . . . . 2008--2011
Sejla Cebiri\'c and
François Goasdoué and
Ioana Manolescu Query-oriented summarization of RDF
graphs . . . . . . . . . . . . . . . . . 2012--2015
Yodsawalai Chodpathumwan and
Amirhossein Aleyasen and
Arash Termehchy and
Yizhou Sun Universal-DB: towards representation
independent graph analytics . . . . . . 2016--2019
Ahmed R. Mahmood and
Ahmed M. Aly and
Thamir Qadah and
El Kindi Rezig and
Anas Daghistani and
Amgad Madkour and
Ahmed S. Abdelhamid and
Mohamed S. Hassan and
Walid G. Aref and
Saleh Basalamah Tornado: a distributed spatio-textual
stream processing system . . . . . . . . 2020--2023
Andrew Crotty and
Alex Galakatos and
Emanuel Zgraggen and
Carsten Binnig and
Tim Kraska Vizdom: interactive analytics through
pen and touch . . . . . . . . . . . . . 2024--2027
Mariano P. Consens and
Valeria Fionda and
Shahan Khatchadourian and
Giuseppe Pirr\`o S+EPPs: construct and explore
bisimulation summaries, plus optimize
navigational queries; all on existing
SPARQL systems . . . . . . . . . . . . . 2028--2031
Konstantinos Xirogiannopoulos and
Udayan Khurana and
Amol Deshpande GraphGen: exploring interesting graphs
in relational data . . . . . . . . . . . 2032--2035
Dong Young Yoon and
Barzan Mozafari and
Douglas P. Brown DBSeer: pain-free database
administration through workload
intelligence . . . . . . . . . . . . . . 2036--2039
Arun Kejariwal and
Sanjeev Kulkarni and
Karthik Ramasamy Real time analytics: algorithms and
systems . . . . . . . . . . . . . . . . 2040--2041
Arijit Khan and
Lei Chen On uncertain graphs modeling and queries 2042--2043
Xin Luna Dong and
Wang-Chiew Tan A time machine for information: looking
back to look forward . . . . . . . . . . 2044--2045
Mahashweta Das and
Gautam Das Structured analytics in social media . . 2046--2047
Jing Gao and
Qi Li and
Bo Zhao and
Wei Fan and
Jiawei Han Truth discovery and crowdsourcing
aggregation: a unified perspective . . . 2048--2049
Daniel Abadi and
Shivnath Babu and
Fatma Özcan and
Ippokratis Pandis SQL-on-Hadoop systems: tutorial . . . . 2050--2051
Juan Loaiza Engineering database hardware and
software together . . . . . . . . . . . 2052--2052
Magdalena Balazinska Big data research: will industry solve
all the problems? . . . . . . . . . . . 2053--2056
Todd Walter Big plateaus of Big Data on the big
island . . . . . . . . . . . . . . . . . 2057--2057
Anastasia Ailamaki Databases and hardware: the beginning
and sequel of a beautiful friendship . . 2058--2061
Ahmed M. Aly and
Ahmed R. Mahmood and
Mohamed S. Hassan and
Walid G. Aref and
Mourad Ouzzani and
Hazem Elmeleegy and
Thamir Qadah AQWA: adaptive query workload aware
partitioning of big spatial data . . . . 2062--2073
Zuhair Khayyat and
William Lucia and
Meghna Singh and
Mourad Ouzzani and
Paolo Papotti and
Jorge-Arnulfo Quiané-Ruiz and
Nan Tang and
Panos Kalnis Lightning fast and space efficient
inequality joins . . . . . . . . . . . . 2074--2085
Jinfei Liu and
Li Xiong and
Jian Pei and
Jun Luo and
Haoyu Zhang Finding Pareto optimal groups:
group-based skyline . . . . . . . . . . 2086--2097
Taylor Kessler Faulkner and
Will Brackenbury and
Ashwin Lall $k$-regret queries with nonlinear
utilities . . . . . . . . . . . . . . . 2098--2109
Juwei Shi and
Yunjie Qiu and
Umar Farooq Minhas and
Limei Jiao and
Chen Wang and
Berthold Reinwald and
Fatma Özcan Clash of the titans: MapReduce vs. Spark
for large scale data analytics . . . . . 2110--2121
Yu Liu and
Jiaheng Lu and
Hua Yang and
Xiaokui Xiao and
Zhewei Wei Towards maximum independent sets on
massive graphs . . . . . . . . . . . . . 2122--2133
John Meehan and
Nesime Tatbul and
Stan Zdonik and
Cansu Aslantas and
Ugur Cetintemel and
Jiang Du and
Tim Kraska and
Samuel Madden and
David Maier and
Andrew Pavlo and
Michael Stonebraker and
Kristin Tufte and
Hao Wang S-Store: streaming meets transaction
processing . . . . . . . . . . . . . . . 2134--2145
Justin Levandoski and
David Lomet and
Sudipta Sengupta and
Ryan Stutsman and
Rui Wang Multi-version range concurrency control
in Deuteronomy . . . . . . . . . . . . . 2146--2157
Hao Li and
Chee-Yong Chan and
David Maier Query from examples: an iterative,
data-driven approach to query
construction . . . . . . . . . . . . . . 2158--2169
Sainyam Galhotra and
Amitabha Bagchi and
Srikanta Bedathur and
Maya Ramanath and
Vidit Jain Tracking the conductance of rapidly
evolving topic-subgraphs . . . . . . . . 2170--2181
Manasi Vartak and
Sajjadur Rahman and
Samuel Madden and
Aditya Parameswaran and
Neoklis Polyzotis SeeDB: efficient data-driven
visualization recommendations to support
visual analytics . . . . . . . . . . . . 2182--2193
Disheng Qiu and
Luciano Barbosa and
Xin Luna Dong and
Yanyan Shen and
Divesh Srivastava Dexter: large-scale discovery and
extraction of product specifications on
the web . . . . . . . . . . . . . . . . 2194--2205
Qiang Huang and
Jianlin Feng and
Yikai Zhang and
Qiong Fang and
Wilfred Ng Query-aware locality-sensitive hashing
for approximate nearest neighbor search 1--12
Wissam Khaouid and
Marina Barsky and
Venkatesh Srinivasan and
Alex Thomo $K$-core decomposition of large networks
on a single PC . . . . . . . . . . . . . 13--23
Zhenguo Li and
Yixiang Fang and
Qin Liu and
Jiefeng Cheng and
Reynold Cheng and
John C. S. Lui Walking in the cloud: parallel SimRank
at scale . . . . . . . . . . . . . . . . 24--35
Patricia C. Arocena and
Boris Glavic and
Giansalvatore Mecca and
Renée J. Miller and
Paolo Papotti and
Donatello Santoro Messing up with BART: error generation
for evaluating data-cleaning algorithms 36--47
Takanori Hayashi and
Takuya Akiba and
Yuichi Yoshida Fully dynamic betweenness centrality
maintenance on massive networks . . . . 48--59
Wei Lu and
Wei Chen and
Laks V. S. Lakshmanan From competition to complementarity:
comparative influence diffusion and
maximization . . . . . . . . . . . . . . 60--71
Konstantinos Kloudas and
Margarida Mamede and
Nuno Preguiça and
Rodrigo Rodrigues Pixida: optimizing data parallel jobs in
wide-area data analytics . . . . . . . . 72--83
Lu Wang and
Robert Christensen and
Feifei Li and
Ke Yi Spatial online sampling and aggregation 84--95
Stefan Richter and
Victor Alvarez and
Jens Dittrich A seven-dimensional analysis of hashing
methods and its implications on query
processing . . . . . . . . . . . . . . . 96--107
Patricia C. Arocena and
Boris Glavic and
Radu Ciucanu and
Renée J. Miller The iBench integration metadata
generator . . . . . . . . . . . . . . . 108--119
Hotham Altwaijry and
Sharad Mehrotra and
Dmitri V. Kalashnikov QuERy: a framework for integrating
entity resolution with query processing 120--131
Taesung Lee and
Jin-woo Park and
Sanghoon Lee and
Seung-Won Hwang and
Sameh Elnikety and
Yuxiong He Processing and optimizing main memory
spatial-keyword queries . . . . . . . . 132--143
Yongjoo Park and
Michael Cafarella and
Barzan Mozafari Neighbor-sensitive hashing . . . . . . . 144--155
Botong Huang and
Nicholas W. D. Jarrett and
Shivnath Babu and
Sayan Mukherjee and
Jun Yang Cümülön: matrix-based data analytics in the
cloud with spot instances . . . . . . . 156--167
Manohar Kaul and
Raymond Chi-Wing Wong and
Christian S. Jensen New lower and upper bounds for shortest
distance queries on terrains . . . . . . 168--179
Cibele Freire and
Wolfgang Gatterbauer and
Neil Immerman and
Alexandra Meliou The complexity of resilience and
responsibility for self-join-free
conjunctive queries . . . . . . . . . . 180--191
Hao Huang and
Shiva Prasad Kasiviswanathan Streaming anomaly detection using
randomized matrix sketching . . . . . . 192--203
Viktor Leis and
Andrey Gubichev and
Atanas Mirchev and
Peter Boncz and
Alfons Kemper and
Thomas Neumann How good are query optimizers, really? 204--215
Matteo Interlandi and
Kshitij Shah and
Sai Deep Tetali and
Muhammad Ali Gulzar and
Seunghyun Yoo and
Miryung Kim and
Todd Millstein and
Tyson Condie Titian: data provenance support in Spark 216--227
Wolf Rödiger and
Tobias Mühlbauer and
Alfons Kemper and
Thomas Neumann High-speed query processing over
high-speed networks . . . . . . . . . . 228--239
Bo Zong and
Xusheng Xiao and
Zhichun Li and
Zhenyu Wu and
Zhiyun Qian and
Xifeng Yan and
Ambuj K. Singh and
Guofei Jiang Behavior query discovery in
system-generated temporal graphs . . . . 240--251
Onur Kocberber and
Babak Falsafi and
Boris Grot Asynchronous memory access chaining . . 252--263
Samuel Haney and
Ashwin Machanavajjhala and
Bolin Ding Design of policy-aware differentially
private algorithms . . . . . . . . . . . 264--275
Xin Huang and
Laks V. S. Lakshmanan and
Jeffrey Xu Yu and
Hong Cheng Approximate closest community search in
networks . . . . . . . . . . . . . . . . 276--287
Fabien André and
Anne-Marie Kermarrec and
Nicolas Le Scouarnec Cache locality is not enough:
high-performance nearest neighbor search
with product quantization fast scan . . 288--299
Nataliya Prokoshyna and
Jaros\law Szlichta and
Fei Chiang and
Renée J. Miller and
Divesh Srivastava Combining quantitative and logical data
cleaning . . . . . . . . . . . . . . . . 300--311
George Papadakis and
George Alexiou and
George Papastefanatos and
Georgia Koutrika Schema-agnostic vs schema-based
configurations for blocking methods on
homogeneous data . . . . . . . . . . . . 312--323
Alessandro Epasto and
Silvio Lattanzi and
Vahab Mirrokni and
Ismail Oner Sebe and
Ahmed Taei and
Sunita Verma Ego-net community mining applied to
friend suggestion . . . . . . . . . . . 324--335
Ziawasch Abedjan and
Cuneyt G. Akcora and
Mourad Ouzzani and
Paolo Papotti and
Michael Stonebraker Temporal rules discovery for web data
cleaning . . . . . . . . . . . . . . . . 336--347
Sudeepa Roy and
Laurel Orr and
Dan Suciu Explaining query answers with
explanation-ready databases . . . . . . 348--359
Dong Deng and
Guoliang Li and
He Wen and
Jianhua Feng An efficient partition based method for
exact set similarity joins . . . . . . . 360--371
Daniel Haas and
Jiannan Wang and
Eugene Wu and
Michael J. Franklin CLAMShell: speeding up crowds for
low-latency data labeling . . . . . . . 372--383
Donatella Firmani and
Barna Saha and
Divesh Srivastava Online entity resolution using an oracle 384--395
Marco Calautti and
Sergio Greco and
Cristian Molinaro and
Irina Trubitsyna Exploiting equality generating
dependencies in checking chase
termination . . . . . . . . . . . . . . 396--407
Tong Yang and
Alex X. Liu and
Muhammad Shahzad and
Yuankun Zhong and
Qiaobin Fu and
Zi Li and
Gaogang Xie and
Xiaoming Li A shifting Bloom filter framework for
set queries . . . . . . . . . . . . . . 408--419
Fan Yang and
Jinfeng Li and
James Cheng Husky: towards a more efficient and
expressive distributed computing
framework . . . . . . . . . . . . . . . 420--431
Zeyu Li and
Hongzhi Wang and
Wei Shao and
Jianzhong Li and
Hong Gao Repairing data through regular
expressions . . . . . . . . . . . . . . 432--443
Cong Yan and
Alvin Cheung Leveraging lock contention to improve
OLTP application performance . . . . . . 444--455
Farhana M. Choudhury and
J. Shane Culpepper and
Timos Sellis and
Xin Cao Maximizing bichromatic reverse spatial
and textual $k$ nearest neighbor queries 456--467
Julien Subercaze and
Christophe Gravier and
Jules Chevalier and
Frederique Laforest Inferray: fast in-memory RDF inference 468--479
Darko Makreshanski and
Georgios Giannikis and
Gustavo Alonso and
Donald Kossmann MQJoin: efficient shared execution of
main-memory joins . . . . . . . . . . . 480--491
Tenindra Abeywickrama and
Muhammad Aamir Cheema and
David Taniar $k$-nearest neighbors on road networks:
a journey in experimentation and
in-memory implementation . . . . . . . . 492--503
Yuan Yuan and
Kaibo Wang and
Rubao Lee and
Xiaoning Ding and
Jing Xing and
Spyros Blanas and
Xiaodong Zhang BCC: reducing false aborts in optimistic
concurrency control with low cost for
in-memory databases . . . . . . . . . . 504--515
Long Yuan and
Lu Qin and
Xuemin Lin and
Lijun Chang and
Wenjie Zhang I/O efficient ECC graph decomposition
via graph reduction . . . . . . . . . . 516--527
Carsten Binnig and
Andrew Crotty and
Alex Galakatos and
Tim Kraska and
Erfan Zamanian The end of slow networks: it's time for
a redesign . . . . . . . . . . . . . . . 528--539
Jiewen Huang and
Daniel J. Abadi Leopard: lightweight edge-oriented
partitioning and replication for dynamic
graphs . . . . . . . . . . . . . . . . . 540--551
Eric Gribkoff and
Dan Suciu SlimShot: in-database probabilistic
inference for knowledge bases . . . . . 552--563
Da Yan and
James Cheng and
M. Tamer Özsu and
Fan Yang and
Yi Lu and
John C. S. Lui and
Qizhen Zhang and
Wilfred Ng A general-purpose query-centric
framework for querying big graphs . . . 564--575
Matteo Brucato and
Juan Felipe Beltran and
Azza Abouzied and
Alexandra Meliou Scalable package queries in relational
database systems . . . . . . . . . . . . 576--587
Xiang Wang and
Ying Zhang and
Wenjie Zhang and
Xuemin Lin and
Zengfeng Huang Skype: top-$k$ spatial-keyword
publish/subscribe over sliding window 588--599
Abolfazl Asudeh and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das Discovering the skyline of web databases 600--611
Xiaohang Zhang and
Guoliang Li and
Jianhua Feng Crowdsourced top-$k$ algorithms: an
experimental evaluation . . . . . . . . 612--623
Michael Maddox and
David Goehring and
Aaron J. Elmore and
Samuel Madden and
Aditya Parameswaran and
Amol Deshpande Decibel: the relational dataset
branching system . . . . . . . . . . . . 624--635
Willi Mann and
Nikolaus Augsten and
Panagiotis Bouros An empirical evaluation of set
similarity join techniques . . . . . . . 636--647
Immanuel Trummer and
Christoph Koch Multiple query optimization on the
D-Wave 2X adiabatic quantum computer . . 648--659
Immanuel Trummer and
Christoph Koch Parallelizing query optimization on
shared-nothing architectures . . . . . . 660--671
Vasiliki Kalavri and
Tiago Simas and
Dionysios Logothetis The shortest path is not always a
straight line: leveraging semi-metricity
in graph analysis . . . . . . . . . . . 672--683
George Papadakis and
Jonathan Svirsky and
Avigdor Gal and
Themis Palpanas Comparative analysis of approximate
blocking techniques for entity
resolution . . . . . . . . . . . . . . . 684--695
Yiran Zhao and
Shen Li and
Shaohan Hu and
Hongwei Wang and
Shuochao Yao and
Huajie Shao and
Tarek Abdelzaher An experimental evaluation of datacenter
workloads on low-power embedded micro
servers . . . . . . . . . . . . . . . . 696--707
Shaoxu Song and
Yue Cao and
Jianmin Wang Cleaning timestamps with temporal
constraints . . . . . . . . . . . . . . 708--719
Zilong Tan and
Shivnath Babu Tempo: robust and self-tuning resource
management in multi-tenant parallel
databases . . . . . . . . . . . . . . . 720--731
Jonny Daenen and
Frank Neven and
Tony Tan and
Stijn Vansummeren Parallel evaluation of multi-semi-joins 732--743
Jianfei Chen and
Kaiwei Li and
Jun Zhu and
Wenguang Chen WarpLDA: a cache efficient O(1)
algorithm for latent Dirichlet
allocation . . . . . . . . . . . . . . . 744--755
Marius Eich and
Pit Fender and
Guido Moerkotte Faster plan generation through
consideration of functional dependencies
and keys . . . . . . . . . . . . . . . . 756--767
Felix Martin Schuhknecht and
Jens Dittrich and
Ankur Sharma RUMA has it: rewired user-space memory
access is possible! . . . . . . . . . . 768--779
Ryan Marcus and
Olga Papaemmanouil WiSeDB: a learning-based workload
management advisor for cloud databases 780--791
Gianmarco De Francisci Morales and
Aristides Gionis Streaming similarity self-join . . . . . 792--803
Alexander Schätzle and
Martin Przyjaciel-Zablocki and
Simon Skilevic and
Georg Lausen S2RDF: RDF querying with SPARQL on spark 804--815
Rishabh Singh BlinkFill: semi-supervised programming
by example for syntactic string
transformations . . . . . . . . . . . . 816--827
Dong Deng and
Guoliang Li and
He Wen and
H. V. Jagadish and
Jianhua Feng META: an efficient matching-based method
for error-tolerant autocompletion . . . 828--839
Weiguo Zheng and
Lei Zou and
Wei Peng and
Xifeng Yan and
Shaoxu Song and
Dongyan Zhao Semantic SPARQL similarity search over
RDF knowledge graphs . . . . . . . . . . 840--851
Ayush Dubey and
Greg D. Hill and
Robert Escriva and
Emin Gün Sirer Weaver: a high-performance,
transactional graph database based on
refinable timestamps . . . . . . . . . . 852--863
Xu Chu and
Ihab F. Ilyas and
Paraschos Koutris Distributed data deduplication . . . . . 864--875
Marcelo Arenas and
Francisco Maturana and
Cristian Riveros and
Domagoj Vrgoc A framework for annotating CSV-like data 876--887
Abolfazl Asudeh and
Nan Zhang and
Gautam Das Query reranking as a service . . . . . . 888--899
Hongbin Ma and
Bin Shao and
Yanghua Xiao and
Liang Jeff Chen and
Haixun Wang G-SQL: fast query processing via graph
exploration . . . . . . . . . . . . . . 900--911
Mingxing Zhang and
Yongwei Wu and
Kang Chen and
Teng Ma and
Weimin Zheng Measuring and optimizing distributed
array programs . . . . . . . . . . . . . 912--923
Insoon Jo and
Duck-Ho Bae and
Andre S. Yoon and
Jeong-Uk Kang and
Sangyeun Cho and
Daniel D. G. Lee and
Jaeheon Jeong YourSQL: a high-performance database
system leveraging in-storage computing 924--935
Lu Lu and
Xuanhua Shi and
Yongluan Zhou and
Xiong Zhang and
Hai Jin and
Cheng Pei and
Ligang He and
Yuanzhen Geng Lifetime-based memory management for
distributed data processing systems . . 936--947
Sanjay Krishnan and
Jiannan Wang and
Eugene Wu and
Michael J. Franklin and
Ken Goldberg ActiveClean: interactive data cleaning
for statistical modeling . . . . . . . . 948--959
Ahmed Elgohary and
Matthias Boehm and
Peter J. Haas and
Frederick R. Reiss and
Berthold Reinwald Compressed linear algebra for
large-scale machine learning . . . . . . 960--971
Manos Karpathiotakis and
Ioannis Alagiannis and
Anastasia Ailamaki Fast queries over heterogeneous data
through engine customization . . . . . . 972--983
Sourav S. Bhowmick and
Byron Choi and
Curtis Dyreson Data-driven visual graph query interface
construction and maintenance: challenges
and opportunities . . . . . . . . . . . 984--992
Ziawasch Abedjan and
Xu Chu and
Dong Deng and
Raul Castro Fernandez and
Ihab F. Ilyas and
Mourad Ouzzani and
Paolo Papotti and
Michael Stonebraker and
Nan Tang Detecting data errors: where are we and
what needs to be done? . . . . . . . . . 993--1004
Hai Liu and
Dongqing Xiao and
Pankaj Didwania and
Mohamed Y. Eltabakh Exploiting soft and hard correlations in
big data query optimization . . . . . . 1005--1016
Minsuk Kahng and
Shamkant B. Navathe and
John T. Stasko and
Duen Horng Polo Chau Interactive browsing and navigation in
relational databases . . . . . . . . . . 1017--1028
Renata Borovica-Gaji\'c and
Raja Appuswamy and
Anastasia Ailamaki Cheap data analytics using cold storage
devices . . . . . . . . . . . . . . . . 1029--1040
Julian Shun and
Farbod Roosta-Khorasani and
Kimon Fountoulakis and
Michael W. Mahoney Parallel local graph clustering . . . . 1041--1052
Yongxin Tong and
Jieying She and
Bolin Ding and
Lei Chen and
Tianyu Wo and
Ke Xu Online minimum matching in real-time
spatial data: experiments and analysis 1053--1064
Robert Brunel and
Norman May and
Alfons Kemper Index-assisted hierarchical computations
in main-memory RDBMS . . . . . . . . . . 1065--1076
Naoto Ohsaka and
Takuya Akiba and
Yuichi Yoshida and
Ken-ichi Kawarabayashi Dynamic influence analysis in evolving
networks . . . . . . . . . . . . . . . . 1077--1088
Luan Tran and
Liyue Fan and
Cyrus Shahabi Distance-based outlier detection in data
streams . . . . . . . . . . . . . . . . 1089--1100
Michalis Mountantonakis and
Yannis Tzitzikas On measuring the lattice of
commonalities among several linked
datasets . . . . . . . . . . . . . . . . 1101--1112
Zhao Chang and
Dong Xie and
Feifei Li Oblivious RAM: a dissection and
experimental evaluation . . . . . . . . 1113--1124
Fisnik Kastrati and
Guido Moerkotte Optimization of conjunctive predicates
for main memory column stores . . . . . 1125--1136
Zaheer Chothia and
John Liagouris and
Frank McSherry and
Timothy Roscoe Explaining outputs in modern data
analytics . . . . . . . . . . . . . . . 1137--1148
Peter Buneman and
Slawek Staworko RDF graph alignment with bisimulation 1149--1160
Damian Bursztyn and
François Goasdoué and
Ioana Manolescu Teaching an RDBMS about ontological
constraints . . . . . . . . . . . . . . 1161--1172
Giovanni Simonini and
Sonia Bergamaschi and
H. V. Jagadish BLAST: a loosely schema-aware
meta-blocking approach for entity
resolution . . . . . . . . . . . . . . . 1173--1184
Erkang Zhu and
Fatemeh Nargesian and
Ken Q. Pu and
Renée J. Miller LSH ensemble: Internet-scale domain
search . . . . . . . . . . . . . . . . . 1185--1196
Pradap Konda and
Sanjib Das and
Paul Suganthan G. C. and
AnHai Doan and
Adel Ardalan and
Jeffrey R. Ballard and
Han Li and
Fatemah Panahi and
Haojun Zhang and
Jeff Naughton and
Shishir Prasad and
Ganesh Krishnan and
Rohit Deep and
Vijay Raghavendra Magellan: toward building entity
matching management systems . . . . . . 1197--1208
Diptikalyan Saha and
Avrilia Floratou and
Karthik Sankaranarayanan and
Umar Farooq Minhas and
Ashish R. Mittal and
Fatma Özcan ATHENA: an ontology-driven system for
natural language querying over
relational data stores . . . . . . . . . 1209--1220
Richard Wesley and
Fei Xu Incremental computation of common
windowed holistic aggregates . . . . . . 1221--1232
Yixiang Fang and
Reynold Cheng and
Siqiang Luo and
Jiafeng Hu Effective community search for large
attributed graphs . . . . . . . . . . . 1233--1244
Willis Lang and
Karthik Ramachandra and
David J. DeWitt and
Shize Xu and
Qun Guo and
Ajay Kalhan and
Peter Carlin Not for the timid: on the impact of
aggressive over-booking in the cloud . . 1245--1256
Martin Sevenich and
Sungpack Hong and
Oskar van Rest and
Zhe Wu and
Jayanta Banerjee and
Hassan Chafi Using domain-specific languages for
analytic graph databases . . . . . . . . 1257--1268
Shaosu Liu and
Bin Song and
Sriharsha Gangam and
Lawrence Lo and
Khaled Elmeleegy Kodiak: leveraging materialized views
for very low-latency analytics over
high-dimensional web-scale data . . . . 1269--1280
Aneesh Sharma and
Jerry Jiang and
Praveen Bommannavar and
Brian Larson and
Jimmy Lin GraphJet: real-time content
recommendations at Twitter . . . . . . . 1281--1292
Edward Ma and
Vishrut Gupta and
Meichun Hsu and
Indrajit Roy \pkgdmapply: a functional primitive to
express distributed machine learning
algorithms in R . . . . . . . . . . . . 1293--1304
Pedro Pedreira and
Chris Croswhite and
Luis Bona Cubrick: indexing millions of records
per second for interactive analytics . . 1305--1316
Alexandru Iosup and
Tim Hegeman and
Wing Lung Ngai and
Stijn Heldens and
Arnau Prat-Pérez and
Thomas Manhardto and
Hassan Chafio and
Mihai Capota and
Narayanan Sundaram and
Michael Anderson and
Ilie Gabriel Tanase and
Yinglong Xia and
Lifeng Nai and
Peter Boncz LDBC graphalytics: a benchmark for
large-scale graph analysis on parallel
and distributed platforms . . . . . . . 1317--1328
Hermano Lustosa and
Fabio Porto and
Patrick Valduriez and
Pablo Blanco Database system support of simulation
data . . . . . . . . . . . . . . . . . . 1329--1340
Gabriela Jacques-Silva and
Fang Zheng and
Daniel Debrunner and
Kun-Lung Wu and
Victor Dogaru and
Eric Johnson and
Michael Spicer and
Ahmet Erdem Sariyüce Consistent regions: guaranteed tuple
processing in IBM streams . . . . . . . 1341--1352
Mohammed Al-Kateb and
Paul Sinclair and
Grace Au and
Carrie Ballinger Hybrid row-column partitioning in
Teradata\reg . . . . . . . . . . . . . . 1353--1364
Ricardo Fernandes and
Piotr Zaczkowski and
Bernd Göttler and
Conor Ettinoffe and
Anis Moussa TrafficDB: HERE's high performance
shared-memory data store . . . . . . . . 1365--1376
Alex Scotti and
Mark Hannum and
Michael Ponomarenko and
Dorin Hogea and
Akshat Sikarwar and
Mohit Khullar and
Adi Zaimi and
James Leddy and
Rivers Zhang and
Fabio Angius and
Lingzhi Deng Comdb2: Bloomberg's highly available
relational database system . . . . . . . 1377--1388
V. Srinivasan and
Brian Bulkowski and
Wei-Ling Chu and
Sunil Sayyaparaju and
Andrew Gooding and
Rajkumar Iyer and
Ashish Shinde and
Thomas Lopatic Aerospike: architecture of a real-time
operational DBMS . . . . . . . . . . . . 1389--1400
Jack Chen and
Samir Jindel and
Robert Walzer and
Rajkumar Sen and
Nika Jimsheleishvilli and
Michael Andrews The MemSQL query optimizer: a modern
optimizer for real-time analytics in a
distributed database . . . . . . . . . . 1401--1412
Sarath Lakshman and
Sriram Melkote and
John Liang and
Ravi Mayuram Nitro: a fast, scalable in-memory
storage engine for NoSQL global
secondary index . . . . . . . . . . . . 1413--1424
Matthias Boehm and
Michael W. Dusenberry and
Deron Eriksson and
Alexandre V. Evfimievski and
Faraz Makari Manshadi and
Niketan Pansare and
Berthold Reinwald and
Frederick R. Reiss and
Prithviraj Sen and
Arvind C. Surve and
Shirish Tatikonda SystemML: declarative machine learning
on spark . . . . . . . . . . . . . . . . 1425--1436
Aurosish Mishra and
Shasank Chavan and
Allison Holloway and
Tirthankar Lahiri and
Zhen Hua Liu and
Sunil Chakkappen and
Dennis Lui and
Vinita Subramanian and
Ramesh Kumar and
Maria Colgan and
Jesse Kamp and
Niloy Mukherjee and
Vineet Marwah Accelerating analytics with dynamic
in-memory expressions . . . . . . . . . 1437--1448
Satyajit Bhadange and
Akhil Arora and
Arnab Bhattacharya GARUDA: a system for large-scale mining
of statistically significant connected
subgraphs . . . . . . . . . . . . . . . 1449--1452
Huan Li and
Hua Lu and
Xin Chen and
Gang Chen and
Ke Chen and
Lidan Shou Vita: a versatile toolkit for generating
indoor mobility data for real-world
buildings . . . . . . . . . . . . . . . 1453--1456
Guillaume Bagan and
Angela Bonifati and
Radu Ciucanu and
George H. L. Fletcher and
Aurélien Lemay and
Nicky Advokaat Generating flexible workloads for graph
databases . . . . . . . . . . . . . . . 1457--1460
Xiaofeng Zhou and
Yang Chen and
Daisy Zhe Wang ArchimedesOne: query processing over
probabilistic knowledge bases . . . . . 1461--1464
Tova Milo and
Slava Novgorodov and
Wang-Chiew Tan Rudolf: interactive rule refinement
system for fraud detection . . . . . . . 1465--1468
Antonio Maccioni and
Matteo Collina Graph databases in the browser: using
LevelGraph to explore New Delhi . . . . 1469--1472
Thibault Sellam and
Martin Kersten Ziggy: characterizing query results for
data explorers . . . . . . . . . . . . . 1473--1476
Thibault Sellam and
Robin Cijvat and
Richard Koopmanschap and
Martin Kersten Blaeu: mapping and navigating large
tables with cluster analysis . . . . . . 1477--1480
Ahmed El-Roby and
Khaled Ammar and
Ashraf Aboulnaga and
Jimmy Lin Sapphire: querying RDF data made simple 1481--1484
Yael Amsterdamer and
Tova Milo and
Amit Somech and
Brit Youngmann December: a declarative tool for crowd
member selection . . . . . . . . . . . . 1485--1488
Xi He and
Nisarg Raval and
Ashwin Machanavajjhala A demonstration of VisDPT: visual
exploration of differentially private
trajectories . . . . . . . . . . . . . . 1489--1492
Tobias Scheuer and
Norman May and
Alexander Böhm and
Daniel Scheibli JexLog: a sonar for the abyss . . . . . 1493--1496
Kosetsu Ikeda and
Atsuyuki Morishima and
Habibur Rahman and
Senjuti Basu Roy and
Saravanan Thirumuruganathan and
Sihem Amer-Yahia and
Gautam Das Collaborative crowdsourcing with Crowd4u 1497--1500
Lei Chen and
Jianliang Xu and
Christian S. Jensen and
Yafei Li YASK: a why-not question answering
engine for spatial keyword query
services . . . . . . . . . . . . . . . . 1501--1504
Peipei Yi and
Byron Choi and
Sourav S. Bhowmick and
Jianliang Xu AutoG: a visual query autocompletion
framework for graph databases . . . . . 1505--1508
Xiaoye Miao and
Yunjun Gao and
Gang Chen and
Huiyong Cui and
Chong Guo and
Weida Pan Si$^2$ p: a restaurant recommendation
system using preference queries over
incomplete information . . . . . . . . . 1509--1512
R. Bonaque and
T. D. Cao and
B. Cautis and
F. Goasdoué and
J. Letelier and
I. Manolescu and
O. Mendoza and
S. Ribeiro and
X. Tannier Mixed-instance querying: a lightweight
integration architecture for data
journalism . . . . . . . . . . . . . . . 1513--1516
Dennis Butterstein and
Torsten Grust Precision performance surgery for
CostgreSQL: LLVM-based Expression
Compilation, Just in Time . . . . . . . 1517--1520
Mohamed Yahya and
Klaus Berberich and
Maya Ramanath and
Gerhard Weikum Exploratory querying of extended
knowledge graphs . . . . . . . . . . . . 1521--1524
Kiril Panev and
Sebastian Michel and
Evica Milchevski and
Koninika Pal Exploring databases via reverse
engineering ranking queries with PALEO 1525--1528
Ramon Bespinyowong and
Wei Chen and
H. V. Jagadish and
Yuxin Ma ExRank: an exploratory ranking interface 1529--1532
Gonzalo Diaz and
Marcelo Arenas and
Michael Benedikt SPARQLByE: querying RDF data by example 1533--1536
Daniel Deutch and
Nave Frost and
Amir Gilad NLProv: natural language provenance . . 1537--1540
Bikash Chandra and
Mathew Joseph and
Bharath Radhakrishnan and
Shreevidhya Acharya and
S. Sudarshan Partial marking for automated grading of
SQL queries . . . . . . . . . . . . . . 1541--1544
Kaiqi Zhao and
Yiding Liu and
Quan Yuan and
Lisi Chen and
Zhida Chen and
Gao Cong Towards personalized maps: mining user
preferences from geo-textual data . . . 1545--1548
Kaiyu Feng and
Kaiqi Zhao and
Yiding Liu and
Gao Cong A system for region search and
exploration . . . . . . . . . . . . . . 1549--1552
Aleksandar Vitorovic and
Mohammed Elseidy and
Khayyam Guliyev and
Khue Vu Minh and
Daniel Espino and
Mohammad Dashti and
Yannis Klonatos and
Christoph Koch Squall: scalable real-time analytics . . 1553--1556
Udayan Khurana and
Srinivasan Parthasarathy and
Deepak Turaga Graph-based exploration of non-graph
datasets . . . . . . . . . . . . . . . . 1557--1560
Minjian Liu and
Qing Wang Rogas: a declarative framework for
network analytics . . . . . . . . . . . 1561--1564
Mingjie Tang and
Yongyang Yu and
Qutaibah M. Malluhi and
Mourad Ouzzani and
Walid G. Aref LocationSpark: a distributed in-memory
data management system for big spatial
data . . . . . . . . . . . . . . . . . . 1565--1568
Anil Shanbhag and
Alekh Jindal and
Yi Lu and
Samuel Madden Amoeba: a shape changing storage system
for big data . . . . . . . . . . . . . . 1569--1572
Dan Olteanu and
Maximilian Schleich F: regression models over factorized
views . . . . . . . . . . . . . . . . . 1573--1576
Miguel Rodríguez and
Sean Goldberg and
Daisy Zhe Wang SigmaKB: multiple probabilistic
knowledge base fusion . . . . . . . . . 1577--1580
Pradap Konda and
Sanjib Das and
Paul Suganthan G. C. and
AnHai Doan and
Adel Ardalan and
Jeffrey R. Ballard and
Han Li and
Fatemah Panahi and
Haojun Zhang and
Jeff Naughton and
Shishir Prasad and
Ganesh Krishnan and
Rohit Deep and
Vijay Raghavendra Magellan: toward building entity
matching management systems over data
science stacks . . . . . . . . . . . . . 1581--1584
Wail Y. Alkowaileet and
Sattam Alsubaiee and
Michael J. Carey and
Till Westmann and
Yingyi Bu Large-scale complex analytics on
semi-structured datasets using AsterixDB
and Spark . . . . . . . . . . . . . . . 1585--1588
Jose Picado and
Parisa Ataei and
Arash Termehchy and
Alan Fern Schema independent and scalable
relational learning by Castor . . . . . 1589--1592
Rajeshkumar Kannapalli and
Azade Nazi and
Mahashweta Das and
Gautam Das AD-WIRE: add-on for Web item reviewing
system . . . . . . . . . . . . . . . . . 1593--1596
Vineet Chaoji and
Rajeev Rastogi and
Gourav Roy Machine learning in the real world . . . 1597--1600
Alexander Böhm and
Jens Dittrich and
Niloy Mukherjee and
Ippokratis Pandis and
Rajkumar Sen Operational analytics data management
systems . . . . . . . . . . . . . . . . 1601--1604
Xu Chu and
Ihab F. Ilyas Qualitative data cleaning . . . . . . . 1605--1608
Per-Åke Larson and
Justin Levandoski Modern main-memory database systems . . 1609--1610
Ashwin Machanavajjhala and
Xi He and
Michael Hay Differential privacy in the wild: a
tutorial on current practices & open
challenges . . . . . . . . . . . . . . . 1611--1614
Sihem Amer-Yahia and
Senjuti Basu Roy Human factors in crowdsourcing . . . . . 1615--1618
Ion Stoica Trends and challenges in big data
processing . . . . . . . . . . . . . . . 1619--1619
Anand Rajaraman Data-driven disruption: the view from
Silicon Valley . . . . . . . . . . . . . 1620--1620
Xin Luna Dong Leave no valuable data behind: the crazy
ideas and the business . . . . . . . . . 1621--1621
Mohamed Mokbel and
Chi-Yin Chow and
Walid Aref Location data management: a tale of two
systems and the ``next destination''! 1622--1622
Badrish Chandramouli and
Raul Castro Fernandez and
Jonathan Goldstein and
Ahmed Eldawy and
Abdul Quamar Quill: efficient, transferable, and rich
analytics at scale . . . . . . . . . . . 1623--1634
Brett Walenz and
Jun Yang Perturbation analysis of database
queries . . . . . . . . . . . . . . . . 1635--1646
Jing Li and
Hung-Wei Tseng and
Chunbin Lin and
Yannis Papakonstantinou and
Steven Swanson HippogriffDB: balancing I/O and GPU
bandwidth in big data analytics . . . . 1647--1658
Steffen Zeuch and
Holger Pirk and
Johann-Christoph Freytag Non-invasive progressive optimization
for in-memory databases . . . . . . . . 1659--1670
J. W. Zhang and
Y. C. Tay Dscaler: synthetically scaling a given
relational database . . . . . . . . . . 1671--1682
Sheng Wang and
David Maier and
Beng Chin Ooi Fast and adaptive indexing of
multi-dimensional observational data . . 1683--1694
Prasang Upadhyaya and
Magdalena Balazinska and
Dan Suciu Price-optimal querying with data APIs 1695--1706
Holger Pirk and
Oscar Moll and
Matei Zaharia and
Sam Madden Voodoo --- a vector algebra for portable
database performance on modern hardware 1707--1718
Dawei Jiang and
Qingchao Cai and
Gang Chen and
H. V. Jagadish and
Beng Chin Ooi and
Kian-Lee Tan and
Anthony K. H. Tung Cohort query processing . . . . . . . . 1--12
Yubao Wu and
Yuchen Bian and
Xiang Zhang Remember where you came from: on the
second-order random walk based proximity
measures . . . . . . . . . . . . . . . . 13--24
Lars George and
Bruno Cadonna and
Matthias Weidlich IL-Miner: instance-level discovery of
complex event patterns . . . . . . . . . 25--36
Iraklis Psaroudakis and
Tobias Scheuer and
Norman May and
Abdelkader Sellami and
Anastasia Ailamaki Adaptive NUMA-aware data placement and
task scheduling for analytical workloads
in main-memory column-stores . . . . . . 37--48
Tianzheng Wang and
Hideaki Kimura Mostly-optimistic concurrency control
for highly contended dynamic workloads
on a thousand cores . . . . . . . . . . 49--60
Sibo Wang and
Xiaokui Xiao and
Yin Yang and
Wenqing Lin Effective indexing for approximate
constrained shortest path queries on
large road networks . . . . . . . . . . 61--72
Qun Huang and
Patrick P. C. Lee Toward high-performance distributed
stream processing via approximate fault
tolerance . . . . . . . . . . . . . . . 73--84
Jian Dai and
Bin Yang and
Chenjuan Guo and
Christian S. Jensen and
Jilin Hu Path cost distribution estimation using
trajectory data . . . . . . . . . . . . 85--96
Ahmet Erdem Sariyüce and
Ali Pinar Fast hierarchy construction for dense
subgraphs . . . . . . . . . . . . . . . 97--108
Xuhong Zhang and
Jun Wang and
Jiangling Yin Sapprox: enabling efficient and accurate
approximations on sub-datasets with
distribution-aware online sampling . . . 109--120
Xuguang Ren and
Junhu Wang Multi-query optimization for subgraph
isomorphism search . . . . . . . . . . . 121--132
Michael Simpson and
Venkatesh Srinivasan and
Alex Thomo Efficient computation of feedback arc
set at web-scale . . . . . . . . . . . . 133--144
Dolan Antenucci and
Michael R. Anderson and
Michael Cafarella A declarative query processing system
for nowcasting . . . . . . . . . . . . . 145--156
Alessandro Lulli and
Matteo Dell'Amico and
Pietro Michiardi and
Laura Ricci NG-DBSCAN: scalable density-based
clustering for arbitrary data . . . . . 157--168
Rodica Neamtu and
Ramoza Ahsan and
Elke Rundensteiner and
Gabor Sarkozy Interactive time series exploration
powered by the marriage of similarity
distances . . . . . . . . . . . . . . . 169--180
Youhuan Li and
Lei Zou and
Huaming Zhang and
Dongyan Zhao Computing longest increasing
subsequences over sequential data
streams . . . . . . . . . . . . . . . . 181--192
Fernando Chirigati and
Jialu Liu and
Flip Korn and
You (Will) Wu and
Cong Yu and
Hao Zhang Knowledge exploration using tables on
the web . . . . . . . . . . . . . . . . 193--204
Sibo Wang and
Youze Tang and
Xiaokui Xiao and
Yin Yang and
Zengxiang Li HubPPR: effective indexing for
approximate Personalized Pagerank . . . 205--216
Longbin Lai and
Lu Qin and
Xuemin Lin and
Ying Zhang and
Lijun Chang and
Shiyu Yang Scalable distributed subgraph
enumeration . . . . . . . . . . . . . . 217--228
Yasuhiro Fujiwara and
Yasutoshi Ida and
Junya Arai and
Mai Nishimura and
Sotetsu Iwamura Fast algorithm for the lasso based $ L_1
$-graph construction . . . . . . . . . . 229--240
Ennan Zhai and
Zhenhua Li and
Zhenyu Li and
Fan Wu and
Guihai Chen Resisting tag spam by leveraging
implicit user behaviors . . . . . . . . 241--252
Xiaowei Chen and
Yongkun Li and
Pinghui Wang and
John C. S. Lui A general framework for estimating
graphlet statistics via random walk . . 253--264
Chunbin Lin and
Benjamin Mandel and
Yannis Papakonstantinou and
Matthias Springer Fast in-memory SQL analytics on typed
graphs . . . . . . . . . . . . . . . . . 265--276
Zheng Li and
Tingjian Ge Stochastic data acquisition for
answering queries as time goes by . . . 277--288
Haipeng Dai and
Muhammad Shahzad and
Alex X. Liu and
Yuankun Zhong Finding persistent items in data streams 289--300
Shuotao Xu and
Sungjin Lee and
Sang-Woo Jun and
Ming Liu and
Jamey Hicks and
Arvind Bluecache: a scalable distributed
flash-based key--value store . . . . . . 301--312
Qi Fan and
Dongxiang Zhang and
Huayu Wu and
Kian-Lee Tan A general and parallel platform for
mining co-movement patterns over
large-scale trajectories . . . . . . . . 313--324
Zhou Shao and
Muhammad Aamir Cheema and
David Taniar and
Hua Lu VIP-Tree: an effective index for indoor
spatial queries . . . . . . . . . . . . 325--336
Joy Arulraj and
Matthew Perron and
Andrew Pavlo Write-behind logging . . . . . . . . . . 337--348
Stavros Papadopoulos and
Kushal Datta and
Samuel Madden and
Timothy Mattson The TileDB array data storage manager 349--360
Yudian Zheng and
Guoliang Li and
Reynold Cheng DOCS: a domain-aware crowdsourcing
system using knowledge bases . . . . . . 361--372
Yue Wang and
Alexandra Meliou and
Gerome Miklau Lifting the haze off the cloud: a
consumer-centric market for database
computation in the cloud . . . . . . . . 373--384
Jia Yu and
Mohamed Sarwat Two birds, one stone: a fast, yet
lightweight, indexing scheme for modern
database systems . . . . . . . . . . . . 385--396
Zheng Li and
Tingjian Ge History is a mirror to the future:
best-effort approximate complex event
matching with insufficient resources . . 397--408
Hillel Avni and
Trevor Brown Persistent hybrid transactional memory
for databases . . . . . . . . . . . . . 409--420
Liwen Sun and
Michael J. Franklin and
Jiannan Wang and
Eugene Wu Skipping-oriented partitioning for
columnar layouts . . . . . . . . . . . . 421--432
Sneha Aman Singh and
Divesh Srivastava and
Srikanta Tirthapura Estimating quantiles from the union of
historical and streaming data . . . . . 433--444
Marco Serafini and
Rebecca Taft and
Aaron J. Elmore and
Andrew Pavlo and
Ashraf Aboulnaga and
Michael Stonebraker Clay: fine-grained adaptive partitioning
for general database schemas . . . . . . 445--456
Tarique Siddiqui and
Albert Kim and
John Lee and
Karrie Karahalios and
Aditya Parameswaran Effortless data exploration with
zenvisage: an expressive and interactive
visual analytics system . . . . . . . . 457--468
Matteo Ceccarello and
Andrea Pietracaprina and
Geppino Pucci and
Eli Upfal MapReduce and streaming algorithms for
diversity maximization in metric spaces
of bounded doubling dimension . . . . . 469--480
Vincent Bindschaedler and
Reza Shokri and
Carl A. Gunter Plausible deniability for
privacy-preserving data synthesis . . . 481--492
Shiv Verma and
Luke M. Leslie and
Yosub Shin and
Indranil Gupta An experimental comparison of
partitioning strategies in distributed
graph processing . . . . . . . . . . . . 493--504
Badrish Chandramouli and
Jonathan Goldstein Shrink: prescribing resiliency solutions
for streaming . . . . . . . . . . . . . 505--516
Claude Barthels and
Ingo Müller and
Timo Schneider and
Gustavo Alonso and
Torsten Hoefler Distributed join algorithms on thousands
of cores . . . . . . . . . . . . . . . . 517--528
Junling Liu and
Ke Deng and
Huanliang Sun and
Yu Ge and
Xiaofang Zhou and
Christian S. Jensen Clue-based spatio-textual query . . . . 529--540
Yudian Zheng and
Guoliang Li and
Yuanbing Li and
Caihua Shan and
Reynold Cheng Truth inference in crowdsourcing: is the
problem solved? . . . . . . . . . . . . 541--552
Rachael Harding and
Dana Van Aken and
Andrew Pavlo and
Michael Stonebraker An evaluation of distributed concurrency
control . . . . . . . . . . . . . . . . 553--564
Wanyun Cui and
Yanghua Xiao and
Haixun Wang and
Yangqiu Song and
Seung-won Hwang and
Wei Wang KBQA: learning question answering over
QA corpora and knowledge bases . . . . . 565--576
Daniel Deutch and
Nave Frost and
Amir Gilad Provenance for natural language queries 577--588
Yi Lu and
Anil Shanbhag and
Alekh Jindal and
Samuel Madden AdaptDB: adaptive partitioning for
distributed joins . . . . . . . . . . . 589--600
Zhipeng Zhang and
Yingxia Shao and
Bin Cui and
Ce Zhang An experimental evaluation of
SimRank-based similarity search
algorithms . . . . . . . . . . . . . . . 601--612
Jose M. Faleiro and
Daniel J. Abadi and
Joseph M. Hellerstein High performance transactions via early
write visibility . . . . . . . . . . . . 613--624
Dhivya Eswaran and
Stephan Günnemann and
Christos Faloutsos and
Disha Makhija and
Mohit Kumar ZooBP: belief propagation for
heterogeneous networks . . . . . . . . . 625--636
Min Lyu and
Dong Su and
Ninghui Li Understanding the sparse vector
technique for differential privacy . . . 637--648
Fan Zhang and
Wenjie Zhang and
Ying Zhang and
Lu Qin and
Xuemin Lin OLAK: an efficient algorithm to prevent
unraveling in social networks . . . . . 649--660
Meraj Khan and
Larry Xu and
Arnab Nandi and
Joseph M. Hellerstein Data tweening: incremental visualization
of data transforms . . . . . . . . . . . 661--672
Johes Bater and
Gregory Elliott and
Craig Eggen and
Satyender Goel and
Abel Kho and
Jennie Rogers SMCQL: secure querying for federated
databases . . . . . . . . . . . . . . . 673--684
Erfan Zamanian and
Carsten Binnig and
Tim Harris and
Tim Kraska The end of a myth: distributed
transactions can scale . . . . . . . . . 685--696
Haohan Zhu and
Xianrui Meng and
George Kollios NED: an inter-graph node metric based on
edit distance . . . . . . . . . . . . . 697--708
Yixiang Fang and
Reynold Cheng and
Xiaodong Li and
Siqiang Luo and
Jiafeng Hu Effective community search over large
spatial graphs . . . . . . . . . . . . . 709--720
Jaros\law Szlichta and
Parke Godfrey and
Lukasz Golab and
Mehdi Kargar and
Divesh Srivastava Effective and complete discovery of
order dependencies via set-based
axiomatization . . . . . . . . . . . . . 721--732
Tomas Karnagel and
Dirk Habich and
Wolfgang Lehner Adaptive work placement for query
processing on heterogeneous computing
resources . . . . . . . . . . . . . . . 733--744
Fan Yang and
Fanhua Shang and
Yuzhen Huang and
James Cheng and
Jinfeng Li and
Yunjian Zhao and
Ruihao Zhao LFTF: a framework for efficient tensor
analytics at scale . . . . . . . . . . . 745--756
Shalmoli Gupta and
Ravi Kumar and
Kefu Lu and
Benjamin Moseley and
Sergei Vassilvitskii Local search methods for $k$-means with
outliers . . . . . . . . . . . . . . . . 757--768
Guillaume Casanova and
Elias Englmeier and
Michael E. Houle and
Peer Kröger and
Michael Nett and
Erich Schubert and
Arthur Zimek Dimensional testing for reverse
$k$-nearest neighbor search . . . . . . 769--780
Yingjun Wu and
Joy Arulraj and
Jiexi Lin and
Ran Xian and
Andrew Pavlo An empirical evaluation of in-memory
multi-version concurrency control . . . 781--792
You Wu and
Junyang Gao and
Pankaj K. Agarwal and
Jun Yang Finding diverse, high-value
representatives on a surface of answers 793--804
Yanhao Wang and
Qi Fan and
Yuchen Li and
Kian-Lee Tan Real-time influence maximization on
dynamic social streams . . . . . . . . . 805--816
Hongyun Cai and
Vincent W. Zheng and
Fanwei Zhu and
Kevin Chen-Chuan Chang and
Zi Huang From community detection to community
profiling . . . . . . . . . . . . . . . 817--828
Ayush Jain and
Akash Das Sarma and
Aditya Parameswaran and
Jennifer Widom Understanding workers, developing
effective tasks, and enhancing
marketplace dynamics: a study of a large
crowdsourcing marketplace . . . . . . . 829--840
Xuelian Lin and
Shuai Ma and
Han Zhang and
Tianyu Wo and
Jinpeng Huai One-pass error bounded trajectory
simplification . . . . . . . . . . . . . 841--852
Jianguo Wang and
Chunbin Lin and
Ruining He and
Moojin Chae and
Yannis Papakonstantinou and
Steven Swanson MILC: inverted list compression in
memory . . . . . . . . . . . . . . . . . 853--864
Botong Huang and
Jun Yang Cümülön--D: data analytics in a dynamic
spot market . . . . . . . . . . . . . . 865--876
Manuel Then and
Timo Kersten and
Stephan Günnemann and
Alfons Kemper and
Thomas Neumann Automatic algorithm transformation for
efficient multi-snapshot analytics on
temporal graphs . . . . . . . . . . . . 877--888
Jianqiao Zhu and
Navneet Potti and
Saket Saurabh and
Jignesh M. Patel Looking ahead makes query plans robust:
making the initial case with in-memory
star schema data warehouse workloads . . 889--900
Michael Anderson and
Shaden Smith and
Narayanan Sundaram and
Mihai Capota and
Zheguang Zhao and
Subramanya Dulloor and
Nadathur Satish and
Theodore L. Willke Bridging the gap between HPC and big
data frameworks . . . . . . . . . . . . 901--912
Keke Huang and
Sibo Wang and
Glenn Bevilacqua and
Xiaokui Xiao and
Laks V. S. Lakshmanan Revisiting the stop-and-stare algorithms
for influence maximization . . . . . . . 913--924
Xubo Wang and
Lu Qin and
Xuemin Lin and
Ying Zhang and
Lijun Chang Leveraging set relations in exact set
similarity join . . . . . . . . . . . . 925--936
Minhao Jiang and
Ada Wai-Chee Fu and
Raymond Chi-Wing Wong READS: a random walk approach for
efficient and accurate dynamic SimRank 937--948
Xin Huang and
Laks V. S. Lakshmanan Attribute-driven community search . . . 949--960
Jiecao Chen and
Qin Zhang Bias-aware sketches . . . . . . . . . . 961--972
Yang Cao and
Wenfei Fan Data driven approximation with bounded
resources . . . . . . . . . . . . . . . 973--984
Zuhair Khayyat and
William Lucia and
Meghna Singh and
Mourad Ouzzani and
Paolo Papotti and
Jorge-Arnulfo Quiané-Ruiz and
Nan Tang and
Panos Kalnis Errata for ``Lightning Fast and Space
Efficient Inequality Joins'' (PVLDB
8(13): 2074--2085) . . . . . . . . . . . 985--985
Chengjie Qin and
Martin Torres and
Florin Rusu Scalable asynchronous gradient descent
optimization for out-of-core models . . 986--997
Fan Zhang and
Ying Zhang and
Lu Qin and
Wenjie Zhang and
Xuemin Lin When engagement meets similarity:
efficient $ (k, r)$-core computation on
social networks . . . . . . . . . . . . 998--1009
Yiding Liu and
Tuan-Anh Nguyen Pham and
Gao Cong and
Quan Yuan An experimental evaluation of
point-of-interest recommendation in
location-based social networks . . . . . 1010--1021
Mark Raasveldt and
Hannes Mühleisen Don't hold my data hostage: a case for
client protocol redesign . . . . . . . . 1022--1033
Erkang Zhu and
Yeye He and
Surajit Chaudhuri Auto-join: joining tables by leveraging
transformations . . . . . . . . . . . . 1034--1045
Aoqian Zhang and
Shaoxu Song and
Jianmin Wang and
Philip S. Yu Time series data cleaning: from anomaly
detection to anomaly repairing . . . . . 1046--1057
Lu Chen and
Yunjun Gao and
Baihua Zheng and
Christian S. Jensen and
Hanyu Yang and
Keyu Yang Pivot-based metric indexing . . . . . . 1058--1069
Rachid Guerraoui and
Anne-Marie Kermarrec and
Tao Lin and
Rhicheek Patra Heterogeneous recommendations: what you
might like to read after watching
interstellar . . . . . . . . . . . . . . 1070--1081
Dong Deng and
Albert Kim and
Samuel Madden and
Michael Stonebraker SilkMoth: an efficient method for
finding related sets with maximum
matching constraints . . . . . . . . . . 1082--1093
Yeounoh Chung and
Sanjay Krishnan and
Tim Kraska A data quality metric (DQM): how to
estimate the number of undetected errors
in data sets . . . . . . . . . . . . . . 1094--1105
Matthaios Olma and
Manos Karpathiotakis and
Ioannis Alagiannis and
Manos Athanassoulis and
Anastasia Ailamaki Slalom: coasting through raw data via
adaptive partitioning and indexing . . . 1106--1117
Yinan Li and
Nikos R. Katsipoulakis and
Badrish Chandramouli and
Jonathan Goldstein and
Donald Kossmann Mison: a fast JSON parser for data
analytics . . . . . . . . . . . . . . . 1118--1129
Silu Huang and
Liqi Xu and
Jialin Liu and
Aaron J. Elmore and
Aditya Parameswaran OrpheusDB: bolt-on versioning for
relational databases . . . . . . . . . . 1130--1141
Alex Galakatos and
Andrew Crotty and
Emanuel Zgraggen and
Carsten Binnig and
Tim Kraska Revisiting reuse for approximate query
processing . . . . . . . . . . . . . . . 1142--1153
Laurel Orr and
Magdalena Balazinska and
Dan Suciu Probabilistic database summarization for
interactive data exploration . . . . . . 1154--1165
Ismail Oukid and
Daniel Booss and
Adrien Lespinasse and
Wolfgang Lehner and
Thomas Willhalm and
Grégoire Gomes Memory management techniques for
large-scale persistent-main-memory
systems . . . . . . . . . . . . . . . . 1166--1177
Shuo Shang and
Lisi Chen and
Zhewei Wei and
Christian S. Jensen and
Kai Zheng and
Panos Kalnis Trajectory similarity join in spatial
networks . . . . . . . . . . . . . . . . 1178--1189
Theodoros Rekatsinas and
Xu Chu and
Ihab F. Ilyas and
Christopher Ré HoloClean: holistic data repairs with
probabilistic inference . . . . . . . . 1190--1201
Zsolt István and
David Sidler and
Gustavo Alonso Caribou: intelligent distributed storage 1202--1213
Lingjiao Chen and
Arun Kumar and
Jeffrey Naughton and
Jignesh M. Patel Towards linear algebra over normalized
data . . . . . . . . . . . . . . . . . . 1214--1225
Parmita Mehta and
Sven Dorkenwald and
Dongfang Zhao and
Tomer Kaftan and
Alvin Cheung and
Magdalena Balazinska and
Ariel Rokem and
Andrew Connolly and
Jacob Vanderplas and
Yusra AlSayyad Comparative evaluation of big-data
systems on scientific image analytics
workloads . . . . . . . . . . . . . . . 1226--1237
Cigdem Aslay and
Francesco Bonchi and
Laks V. S. Lakshmanan and
Wei Lu Revenue maximization in incentivized
social advertising . . . . . . . . . . . 1238--1249
Lukas Rupprecht and
William Culhane and
Peter Pietzuch SquirrelJoin: network-aware distributed
join processing with lazy partitioning 1250--1261
Sajjadur Rahman and
Maryam Aliakbarpour and
Ha Kyung Kong and
Eric Blais and
Karrie Karahalios and
Aditya Parameswaran and
Ronitt Rubinfield I've seen ``enough'': incrementally
improving visualizations to support
rapid decision making . . . . . . . . . 1262--1273
Lei Li and
Wen Hua and
Xingzhong Du and
Xiaofang Zhou Minimal on-road time route scheduling on
time-dependent graphs . . . . . . . . . 1274--1285
Nikos R. Katsipoulakis and
Alexandros Labrinidis and
Panos K. Chrysanthis A holistic view of stream partitioning
costs . . . . . . . . . . . . . . . . . 1286--1297
Esra Akbas and
Peixiang Zhao Truss-based community search: a
truss-equivalence based indexing
approach . . . . . . . . . . . . . . . . 1298--1309
José Cambronero and
John K. Feser and
Micah J. Smith and
Samuel Madden Query optimization for dynamic
imputation . . . . . . . . . . . . . . . 1310--1321
Neil G. Marchant and
Benjamin I. P. Rubinstein In search of an entity resolution OASIS:
optimal asymptotic sequential importance
sampling . . . . . . . . . . . . . . . . 1322--1333
Yongxin Tong and
Libin Wang and
Zimu Zhou and
Bolin Ding and
Lei Chen and
Jieping Ye and
Ke Xu Flexible online task assignment in
real-time spatial data . . . . . . . . . 1334--1345
Panagiotis Bouros and
Nikos Mamoulis A forward scan based plane sweep
algorithm for parallel interval joins 1346--1357
Kexin Rong and
Peter Bailis ASAP: prioritizing attention via time
series smoothing . . . . . . . . . . . . 1358--1369
Furong Li and
Xin Luna Dong and
Anno Langen and
Yang Li Knowledge verification for long-tail
verticals . . . . . . . . . . . . . . . 1370--1381
Shiladitya Pande and
Sayan Ranu and
Arnab Bhattacharya SkyGraph: retrieving regions of interest
using skyline subgraph queries . . . . . 1382--1393
Wei Chit Tan and
Meihui Zhang and
Hazem Elmeleegy and
Divesh Srivastava Reverse engineering aggregation queries 1394--1405
Lele Yut and
Ce Zhang and
Yingxia Shao and
Bin Cui LDA*: a robust and large-scale topic
modeling system . . . . . . . . . . . . 1406--1417
Igor Kabiljo and
Brian Karrer and
Mayank Pundir and
Sergey Pupyrev and
Alon Shalita Social hash partitioner: a scalable
distributed hypergraph partitioner . . . 1418--1429
Nesreen K. Ahmed and
Nick Duffield and
Theodore L. Willke and
Ryan A. Rossi On sampling from massive graph streams 1430--1441
Tong Yang and
Yang Zhou and
Hao Jin and
Shigang Chen and
Xiaoming Li Pyramid sketch: a sketch framework for
frequency estimation of data streams . . 1442--1453
Paolo Ciaccia and
Davide Martinenghi Reconciling skyline and ranking queries 1454--1465
Stella Giannakopoulou and
Manos Karpathiotakis and
Benjamin Gaidioz and
Anastasia Ailamaki CleanM: an optimizable query language
for unified scale-out data cleaning . . 1466--1477
Dong Xie and
Feifei Li and
Jeff M. Phillips Distributed trajectory similarity search 1478--1489
Bikash Chandra and
S. Sudarshan Runtime optimization of join location in
parallel data management systems . . . . 1490--1501
Oliver Lehmberg and
Christian Bizer Stitching web tables for improving
matching quality . . . . . . . . . . . . 1502--1513
Michael Shekelyan and
Anton Dignös and
Johann Gamper DigitHist: a histogram-based data
summary with tight error bounds . . . . 1514--1525
Markus Pilman and
Kevin Bocksrocker and
Lucas Braun and
Renato Marroquín and
Donald Kossmann Fast scans on key--value stores . . . . 1526--1537
Can Lu and
Jeffrey Xu Yu and
Hao Wei and
Yikai Zhang Finding the maximum clique in massive
graphs . . . . . . . . . . . . . . . . . 1538--1549
Yuankai Zhang and
Adam O'Neill and
Micah Sherr and
Wenchao Zhou Privacy-preserving network provenance 1550--1561
Daniel A. Garcia-Ulloa and
Li Xiong and
Vaidy Sunderam Truth discovery for spatio-temporal
events from crowdsourced data . . . . . 1562--1573
Immanuel Trummer and
Jiancheng Zhu and
Mark Bryan Data vocalization: optimizing voice
output of relational data . . . . . . . 1574--1585
Daniel Kang and
John Emmons and
Firas Abuzaid and
Peter Bailis and
Matei Zaharia NoScope: optimizing neural network
queries over video at scale . . . . . . 1586--1597
Juchang Lee and
SeungHyun Moon and
Kyu Hwan Kim and
Deok Hoe Kim and
Sang Kyun Cha and
Wook-Shin Han Parallel replication across formats in
SAP HANA for scaling out mixed OLTP/OLAP
workloads . . . . . . . . . . . . . . . 1598--1609
Rittika Shamsuddin and
Amit Sawant and
Balakrishnan Prabhakaran Developing a low dimensional patient
class profile in accordance to their
respiration-induced tumor motion . . . . 1610--1621
Mohamed Ziauddin and
Andrew Witkowski and
You Jung Kim and
Dmitry Potapov and
Janaki Lahorani and
Murali Krishna Dimensions based data clustering and
zone maps . . . . . . . . . . . . . . . 1622--1633
Shadi A. Noghabi and
Kartik Paramasivam and
Yi Pan and
Navina Ramesh and
Jon Bringhurst and
Indranil Gupta and
Roy H. Campbell Samza: stateful scalable stream
processing at LinkedIn . . . . . . . . . 1634--1645
Eric Falk and
Vijay K. Gurbani and
Radu State Query-able Kafka: an agile data
analytics pipeline for mobile wireless
networks . . . . . . . . . . . . . . . . 1646--1657
Anisoara Nica and
Reza Sherkat and
Mihnea Andrei and
Xun Cheng and
Martin Heidel and
Christian Bensberg and
Heiko Gerwens Statisticum: data statistics management
in SAP HANA . . . . . . . . . . . . . . 1658--1669
Felix Gessert and
Michael Schaarschmidt and
Wolfram Wingerath and
Erik Witt and
Eiko Yoneki and
Norbert Ritter Quaestor: query web caching for
database-as-a-service providers . . . . 1670--1681
Vaidas Gasiunas and
David Dominguez-Sal and
Ralph Acker and
Aharon Avitzur and
Ilan Bronshtein and
Rushan Chen and
Eli Ginot and
Norbert Martinez-Bazan and
Michael Müller and
Alexander Nozdrin and
Weijie Ou and
Nir Pachter and
Dima Sivov and
Eliezer Levy Fiber-based architecture for NFV cloud
databases . . . . . . . . . . . . . . . 1682--1693
Joos-Hendrik Böse and
Valentin Flunkert and
Jan Gasthaus and
Tim Januschowski and
Dustin Lange and
David Salinas and
Sebastian Schelter and
Matthias Seeger and
Yuyang Wang Probabilistic demand forecasting at
scale . . . . . . . . . . . . . . . . . 1694--1705
Jinho Lee and
Heesu Kim and
Sungjoo Yoo and
Kiyoung Choi and
H. Peter Hofstee and
Gi-Joon Nam and
Mark R. Nutter and
Damir Jamsek ExtraV: boosting graph processing near
storage with a coherent accelerator . . 1706--1717
Paris Carbone and
Stephan Ewen and
Gyula Fóra and
Seif Haridi and
Stefan Richter and
Kostas Tzoumas State management in Apache Flink\reg:
consistent stateful distributed stream
processing . . . . . . . . . . . . . . . 1718--1729
Jianjun Zheng and
Qian Lin and
Jiatao Xu and
Cheng Wei and
Chuwei Zeng and
Pingan Yang and
Yunfan Zhang PaxosStore: high-availability storage
made practical in WeChat . . . . . . . . 1730--1741
Panagiotis Antonopoulos and
Hanuma Kodavalla and
Alex Tran and
Nitish Upreti and
Chaitali Shah and
Mirek Sztajno Resumable online index rebuild in SQL
server . . . . . . . . . . . . . . . . . 1742--1753
Mihnea Andrei and
Christian Lemke and
Günter Radestock and
Robert Schulze and
Carsten Thiel and
Rolando Blanco and
Akanksha Meghlan and
Muhammad Sharique and
Sebastian Seifert and
Surendra Vishnoi and
Daniel Booss and
Thomas Peh and
Ivan Schreter and
Werner Thesing and
Mehul Wagle and
Thomas Willhalm SAP HANA adoption of non-volatile memory 1754--1765
Mingming Zhang and
Tianyu Wo and
Tao Xie and
Xuelian Lin and
Yaxiao Liu CarStream: an industrial system of big
data processing for Internet-of-Vehicles 1766--1777
Daniele Bonetta and
Matthias Brantner FAD.js: fast JSON data access using
JIT-based speculative optimizations . . 1778--1789
Kareem S. Aggour and
Jenny Weisenberg Williams and
Justin McHugh and
Vijay S. Kumar Colt: concept lineage tool for data flow
metadata capture and analysis . . . . . 1790--1801
Chin-Chia Michael Yeh and
Nickolas Kavantzas and
Eamonn Keogh Matrix profile IV: using weakly labeled
time series to predict outcomes . . . . 1802--1812
Sunil Chakkappen and
Suratna Budalakoti and
Ramarajan Krishnamachari and
Satyanarayana R. Valluri and
Alan Wood and
Mohamed Zait Adaptive statistics in Oracle 12c . . . 1813--1824
Avrilia Floratou and
Ashvin Agrawal and
Bill Graham and
Sriram Rao and
Karthik Ramasamy Dhalion: self-regulating stream
processing in Heron . . . . . . . . . . 1825--1836
Erkang Zhu and
Ken Q. Pu and
Fatemeh Nargesian and
Renée J. Miller Interactive navigation of open data
linkages . . . . . . . . . . . . . . . . 1837--1840
João Felipe Pimentel and
Leonardo Murta and
Vanessa Braganholo and
Juliana Freire noWorkflow: a tool for collecting,
analyzing, and managing provenance from
Python scripts . . . . . . . . . . . . . 1841--1844
Chao Wang and
Yihao Feng and
Qi Guo and
Zhaoxian Li and
Kexin Liu and
Zijian Tang and
Anthony K. H. Tung and
Lifu Wu and
Yuxin Zheng ARShop: a cloud-based augmented reality
system for shopping . . . . . . . . . . 1845--1848
Christopher R. Aberger and
Andrew Lamb and
Kunle Olukotun and
Christopher Ré Mind the gap: bridging multi-domain
query workloads with EmptyHeaded . . . . 1849--1852
Antonio Maccioni and
Riccardo Torlone Crossing the finish line faster when
paddling the data lake with KAYAK . . . 1853--1856
Xing Niu and
Bahareh Sadat Arab and
Seokki Lee and
Su Feng and
Xun Zou and
Dieter Gawlick and
Vasudha Krishnaswamy and
Zhen Hua Liu and
Boris Glavic Debugging transactions and tracking
their provenance with reenactment . . . 1857--1860
Kai Huang and
Sourav S. Bhowmick and
Shuigeng Zhou and
Byron Choi picasso: exploratory search of connected
subgraph substructures in graph
databases . . . . . . . . . . . . . . . 1861--1864
Ruichu Cai and
Zijie Lu and
Li Wang and
Zhenjie Zhang and
Tom Z. J. Fur and
Marianne Winslett DITIR: distributed index for high
throughput trajectory insertion and
real-time temporal range query . . . . . 1865--1868
Zhifei Pang and
Sai Wu and
Gang Chen and
Ke Chen and
Lidan Shou FlashView: an interactive visual
explorer for raw data . . . . . . . . . 1869--1872
Julien Subercaze and
Christophe Gravier and
Syed Gillani and
Abderrahmen Kammoun and
Frédérique Laforest Upsortable: programming top-$k$ queries
over data streams . . . . . . . . . . . 1873--1876
Javad Chamanara and
Birgitta König-Ries and
H. V. Jagadish QUIS: in-situ heterogeneous data source
querying . . . . . . . . . . . . . . . . 1877--1880
Abdussalam Alawini and
Susan B. Davidson and
Wei Hu and
Yinjun Wu Automating data citation in CiteDB . . . 1881--1884
Yixiang Fang and
Reynold Cheng and
Siqiang Luo and
Jiafeng Hu and
Kai Huang C-explorer: browsing communities in
large graphs . . . . . . . . . . . . . . 1885--1888
Wenfei Fan and
Jingbo Xu and
Yinghui Wu and
Wenyuan Yu and
Jiaxin Jiang GRAPE: parallelizing sequential graph
computations . . . . . . . . . . . . . . 1889--1892
Alireza Khoshkbarforoushha and
Rajiv Ranjan and
Qing Wang and
Carsten Friedrich Flower: a data analytics flow elasticity
manager . . . . . . . . . . . . . . . . 1893--1896
Zhiyi Wang and
Dongyan Zhou and
Shimin Chen STEED: an analytical database system for
tree-structured data . . . . . . . . . . 1897--1900
Yonghui Xiao and
Li Xiong and
Si Zhang and
Yang Cao LocLok: location cloaking with
differential privacy via hidden Markov
model . . . . . . . . . . . . . . . . . 1901--1904
Xiangnan Ren and
Olivier Curé and
Li Ke and
Jeremy Lhez and
Badre Belabbess and
Tendry Randriamalala and
Yufan Zheng and
Gabriel Kepeklian Strider: an adaptive, inference-enabled
distributed RDF stream processing engine 1905--1908
Yan Li and
Ngai Meng Kou and
Hao Wang and
Leong Hou U. and
Zhiguo Gong A confidence-aware top-$k$ query
processing toolkit on crowdsourcing . . 1909--1912
Valeria Fionda and
Giuseppe Pirr\`o Explaining and querying knowledge graphs
by relatedness . . . . . . . . . . . . . 1913--1916
Mayuresh Kunjir and
Shivnath Babu Thoth in action: memory management in
modern data analytics . . . . . . . . . 1917--1920
Maximilian E. Schüle and
Pascal M. N. Schliski and
Thomas Hutzelmann and
Tobias Rosenberger and
Viktor Leis and
Dimitri Vorona and
Alfons Kemper and
Thomas Neumann Monopedia: staying single is good enough
--- the hyper way for web scale
applications . . . . . . . . . . . . . . 1921--1924
Ji Sun and
Zeyuan Shang and
Guoliang Li and
Dong Deng and
Zhifeng Bao Dima: a distributed in-memory
similarity-based query processing system 1925--1928
Melisachew W. Chekol and
Giuseppe Pirr\`o and
Joerg Schoenfisch and
Heiner Stuckenschmidt TeCoRe: temporal conflict resolution in
knowledge graphs . . . . . . . . . . . . 1929--1932
Xupeng Li and
Bin Cui and
Yiru Chen and
Wentao Wu and
Ce Zhang MLog: towards declarative in-database
machine learning . . . . . . . . . . . . 1933--1936
Çagatay Demiralp and
Peter J. Haas and
Srinivasan Parthasarathy and
Tejaswini Pedapati Foresight: recommending visual insights 1937--1940
Steven Jacobs and
Md Yusuf Sarwar Uddin and
Michael Carey and
Vagelis Hristidis and
Vassilis J. Tsotras and
N. Venkatasubramanian and
Yao Wu and
Syed Safir and
Purvi Kaul and
Xikui Wang and
Mohiuddin Abdul Qader and
Yawei Li A BAD demonstration: towards Big Active
Data . . . . . . . . . . . . . . . . . . 1941--1944
Naeemul Hassan and
Gensheng Zhang and
Fatma Arslan and
Josue Caraballo and
Damian Jimenez and
Siddhant Gawsane and
Shohedul Hasan and
Minumol Joseph and
Aaditya Kulkarni and
Anil Kumar Nayak and
Vikas Sable and
Chengkai Li and
Mark Tremayne ClaimBuster: the first-ever end-to-end
fact-checking system . . . . . . . . . . 1945--1948
Shaleen Deep and
Paraschos Koutris and
Yash Bidasaria QIRANA demonstration: real time scalable
query pricing . . . . . . . . . . . . . 1949--1952
Meraj Khan and
Larry Xu and
Arnab Nandi and
Joseph M. Hellerstein DataTweener: a demonstration of a
tweening engine for incremental
visualization of data transforms . . . . 1953--1956
Babak Salimi and
Corey Cole and
Dan R. K. Ports and
Dan Suciu ZaliQL: causal inference from
observational data at scale . . . . . . 1957--1960
Louai Alarabi and
Mohamed F. Mokbel A demonstration of ST-Hadoop: a
MapReduce framework for big
spatio-temporal data . . . . . . . . . . 1961--1964
S. Bharadwaj and
L. Chiticariu and
M. Danilevsky and
S. Dhingra and
S. Divekar and
A. Carreno-Fuentes and
H. Gupta and
N. Gupta and
S.-D. Han and
M. Hernández and
H. Ho and
P. Jain and
S. Joshi and
H. Karanam and
S. Krishnan and
R. Krishnamurthy and
Y. Li and
S. Manivannan and
A. Mittal and
F. Özcan and
A. Quamar and
P. Raman and
D. Saha and
K. Sankaranarayanan and
J. Sen and
P. Sen and
S. Vaithyanathan and
M. Vasa and
H. Wang and
H. Zhu Creation and interaction with
large-scale domain-specific knowledge
bases . . . . . . . . . . . . . . . . . 1965--1968
Christopher Jonathan and
Mohamed F. Mokbel A demonstration of Stella: a
crowdsourcing-based geotagging framework 1969--1972
Oscar Moll and
Aaron Zalewski and
Sudeep Pillai and
Sam Madden and
Michael Stonebraker and
Vijay Gadepally Exploring big volume sensor data with
Vroom . . . . . . . . . . . . . . . . . 1973--1976
Davide Mottin and
Matteo Lissandrini and
Yannis Velegrakis and
Themis Palpanas New trends on exploratory methods for
data analytics . . . . . . . . . . . . . 1977--1980
Arijit Khan and
Sourav S. Bhowmick and
Francesco Bonchi Summarizing static and dynamic big
graphs . . . . . . . . . . . . . . . . . 1981--1984
Kyriakos Mouratidis Geometric approaches for top-$k$ queries 1985--1987
Yongxin Tong and
Lei Chen and
Cyrus Shahabi Spatial crowdsourcing: challenges,
techniques, and applications . . . . . . 1988--1991
Ahmed Eldawy and
Mohamed F. Mokbel The era of big spatial data . . . . . . 1992--1995
Nikos Giatrakos and
Alexander Artikis and
Antonios Deligiannakis and
Minos Garofalakis Complex event recognition in the big
data era . . . . . . . . . . . . . . . . 1996--1999
C. Mohan Tutorial: blockchains and databases . . 2000--2001
Victor Zakhary and
Divyakant Agrawal and
Amr El Abbadi Caching at the web scale . . . . . . . . 2002--2005
Guoliang Li Human-in-the-loop data integration . . . 2006--2017
Wolfgang Lehner The data center under your desk: how
disruptive is modern hardware for DB
system design? . . . . . . . . . . . . . 2018--2019
Tova Milo 7 secrets that my mother didn't tell me 2020--2020
Qin Lv and
William Josephson and
Zhe Wang and
Moses Charikar and
Kai Li Intelligent probing for locality
sensitive hashing: multi-probe LSH and
beyond . . . . . . . . . . . . . . . . . 2021--2024
Dai Qin and
Angela Demke Brown and
Ashvin Goel Scalable replay-based replication for
fast databases . . . . . . . . . . . . . 2025--2036
Kai Ren and
Qing Zheng and
Joy Arulraj and
Garth Gibson SlimDB: a space-efficient key--value
storage engine for semi-sorted data . . 2037--2048
Ibrahim Abdelaziz and
Razen Harbi and
Zuhair Khayyat and
Panos Kalnis A survey and experimental comparison of
distributed SPARQL engines for very
large RDF data . . . . . . . . . . . . . 2049--2060
Andreas Kunft and
Asterios Katsifodimos and
Sebastian Schelter and
Tilmann Rabl and
Volker Markl Blockjoin: efficient matrix partitioning
through joins . . . . . . . . . . . . . 2061--2072
Dong-Wan Choi and
Jian Pei and
Thomas Heinis Efficient mining of regional movement
patterns in semantic trajectories . . . 2073--2084
Martin Kiefer and
Max Heimel and
Sebastian Breß and
Volker Markl Estimating join selectivities using
bandwidth-optimized kernel density
models . . . . . . . . . . . . . . . . . 2085--2096
Prashanth Menon and
Todd C. Mowry and
Andrew Pavlo Relaxed operator fusion for in-memory
databases: making compilation,
vectorization, and prefetching work
together at last . . . . . . . . . . . . 1--13
Yu Liu and
Bolong Zheng and
Xiaodong He and
Zhewei Wei and
Xiaokui Xiao and
Kai Zheng and
Jiaheng Lu Probesim: scalable single-source and
top-$k$ SimRank computations on dynamic
graphs . . . . . . . . . . . . . . . . . 14--26
Paolo Guagliardo and
Leonid Libkin A formal semantics of SQL queries, its
validation, and applications . . . . . . 27--39
Jinhyun Kim and
Jun-Ki Min and
Kyuseok Shim Efficient Haar$^+$ synopsis construction
for the maximum absolute error measure 40--52
Wenbo Tao and
Dong Deng and
Michael Stonebraker Approximate string joins with
abbreviations . . . . . . . . . . . . . 53--65
Dat Ba Nguyen and
Abdalghani Abujabal and
Nam Khanh Tran and
Martin Theobald and
Gerhard Weikum Query-driven on-the-fly knowledge base
construction . . . . . . . . . . . . . . 66--79
Olga Poppe and
Chuan Lei and
Elke A. Rundensteiner and
David Maier GRETA: graph-based real-time event trend
aggregation . . . . . . . . . . . . . . 80--92
Wentian Guo and
Yuchen Li and
Mo Sha and
Kian-Lee Tan Parallel Personalized PageRank on
dynamic graphs . . . . . . . . . . . . . 93--106
Mo Sha and
Yuchen Li and
Bingsheng He and
Kian-Lee Tan Accelerating dynamic graph analytics on
GPUs . . . . . . . . . . . . . . . . . . 107--120
Raja Appuswamy and
Angelos C. Anadiotis and
Danica Porobic and
Mustafa K. Iman and
Anastasia Ailamaki Analyzing the impact of system
architecture on the scalability of OLTP
engines for high-contention workloads 121--134
Hyungsoo Jung and
Hyuck Han and
Sooyong Kang Scalable database logging for multicores 135--148
Angela Bonifati and
Wim Martens and
Thomas Timm An analytical study of large SPARQL
query logs . . . . . . . . . . . . . . . 149--161
Pinghui Wang and
Yiyan Qi and
Yu Sun and
Xiangliang Zhang and
Jing Tao and
Xiaohong Guan Approximately counting triangles in
large graph streams including edge
duplicates with a fixed memory usage . . 162--175
Miao Qiao and
Hao Zhang and
Hong Cheng Subgraph matching: on compression and
computation . . . . . . . . . . . . . . 176--188
Rohit Singh and
Venkata Vamsikrishna Meduri and
Ahmed Elmagarmid and
Samuel Madden and
Paolo Papotti and
Jorge-Arnulfo Quiané-Ruiz and
Armando Solar-Lezama and
Nan Tang Synthesizing entity matching rules by
examples . . . . . . . . . . . . . . . . 189--202
Liang He and
Bin Shao and
Yatao Li and
Huanhuan Xia and
Yanghua Xiao and
Enhong Chen and
Liang Jeff Chen Stylus: a strongly-typed store for
serving massive RDF data . . . . . . . . 203--216
Ekaterini Ioannou and
Minos Garofalakis Holistic query evaluation over
information extraction pipelines . . . . 217--229
Georgios Psaropoulos and
Thomas Legler and
Norman May and
Anastasia Ailamaki Interleaving with coroutines: a
practical approach for robust index
joins . . . . . . . . . . . . . . . . . 230--242
Dong Wen and
Lu Qin and
Ying Zhang and
Lijun Chang and
Xuemin Lin Efficient structural graph clustering:
an index-based approach . . . . . . . . 243--255
Sabrina De Capitani di Vimercati and
Sara Foresti and
Sushil Jajodia and
Giovanni Livraga and
Stefano Paraboschi and
Pierangela Samarati An authorization model for multi
provider queries . . . . . . . . . . . . 256--268
Alexander Ratner and
Stephen H. Bach and
Henry Ehrenberg and
Jason Fries and
Sen Wu and
Christopher Ré Snorkel: rapid training data creation
with weak supervision . . . . . . . . . 269--282
Yuliang Li and
Alin Deutsch and
Victor Vianu VERIFAS: a practical verifier for
artifact systems . . . . . . . . . . . . 283--296
Zhihao Jia and
Yongkee Kwon and
Galen Shipman and
Pat McCormick and
Mattan Erez and
Alex Aiken A distributed multi-GPU system for fast
graph processing . . . . . . . . . . . . 297--310
Tobias Bleifuß and
Sebastian Kruse and
Felix Naumann Efficient denial constraint discovery
with Hydra . . . . . . . . . . . . . . . 311--323
Tahir Azim and
Manos Karpathiotakis and
Anastasia Ailamaki ReCache: reactive caching for fast
analytics over heterogeneous data . . . 324--337
Long Yuan and
Lu Qin and
Xuemin Lin and
Lijun Chang and
Wenjie Zhang Effective and efficient dynamic graph
coloring . . . . . . . . . . . . . . . . 338--351
Eleni Tzirita Zacharatou and
Harish Doraiswamy and
Anastasia Ailamaki and
Cláudio T. Silva and
Juliana Freiref GPU rasterization for real-time spatial
aggregation over arbitrary polygons . . 352--365
Vraj Shah and
Arun Kumar and
Xiaojin Zhu Are key--foreign key joins safe to avoid
when learning high-capacity classifiers? 366--379
Zheng Liu and
Lei Chen Worker recommendation for crowdsourced
Q&A services: a triple-factor aware
approach . . . . . . . . . . . . . . . . 380--392
Shufeng Gong and
Yanfeng Zhang and
Ge Yu Clustering stream data by exploring the
evolution of density mountain . . . . . 393--405
Tianzheng Wang and
Ryan Johnson and
Ippokratis Pandis Query fresh: log shipping on steroids 406--419
Siddhartha Sahu and
Amine Mhedhbi and
Semih Salihoglu and
Jimmy Lin and
M. Tamer Özsu The ubiquity of large graphs and
surprising challenges of graph
processing . . . . . . . . . . . . . . . 420--431
Karthik Ramachandra and
Kwanghyun Park and
K. Venkatesh Emani and
Alan Halverson and
César Galindo-Legaria and
Conor Cunningham Froid: optimization of imperative
programs in a relational database . . . 432--444
Ye Li and
Leong Hou U. and
Man Lung Yiu and
Ngai Meng Kou An experimental study on hub labeling
based shortest path algorithms . . . . . 445--457
Alexander Merritt and
Ada Gavrilovska and
Yuan Chen and
Dejan Milojicic Concurrent log-structured memory for
many-core key--value stores . . . . . . 458--471
Matteo Ceccarello and
Carlo Fantozzi and
Andrea Pietracaprina and
Geppino Pucci and
Fabio Vandin Clustering uncertain graphs . . . . . . 472--484
Ibrahim Abdelaziz and
Essam Mansour and
Mourad Ouzzani and
Ashraf Aboulnaga and
Panos Kalnis Lusail: a system for querying linked
data at scale . . . . . . . . . . . . . 485--498
Hazar Harmouch and
Felix Naumann Cardinality estimation: an experimental
survey . . . . . . . . . . . . . . . . . 499--512
Jong-Hyeok Park and
Gihwan Oh and
Sang-Won Lee SQL statement logging for making SQLite
truly lite . . . . . . . . . . . . . . . 513--525
Noah Johnson and
Joseph P. Near and
Dawn Song Towards practical differential privacy
for SQL queries . . . . . . . . . . . . 526--539
Alexander Shraer and
Alexandre Aybes and
Bryan Davis and
Christos Chrysafis and
Dave Browning and
Eric Krugler and
Eric Stone and
Harrison Chandler and
Jacob Farkas and
John Quinn and
Jonathan Ruben and
Michael Ford and
Mike McMahon and
Nathan Williams and
Nicolas Favre-Felix and
Nihar Sharma and
Ori Herrnstadt and
Paul Seligman and
Raghav Pisolkar and
Scott Dugas and
Scott Gray and
Sytze Harkema and
Valentin Kravtsov and
Vanessa Hong and
Wan Ling Yih and
Yizuo Tian Cloudkit: structured storage for mobile
applications . . . . . . . . . . . . . . 540--552
Joy Arulraj and
Justin Levandoski and
Umar Farooq Minhas and
Per-Ake Larson Bztree: a high-performance latch-free
range index for non-volatile memory . . 553--565
Yuzhen Huang and
Tatiana Jin and
Yidi Wu and
Zhenkun Cai and
Xiao Yan and
Fan Yang and
Jinfeng Li and
Yuying Guo and
James Cheng FlexPS: flexible parallelism control in
parameter server architecture . . . . . 566--579
Navid Yaghmazadeh and
Xinyu Wang and
Isil Dillig Automated migration of hierarchical data
to relational tables using
programming-by-example . . . . . . . . . 580--593
Siqiang Luo and
Ben Kao and
Guoliang Li and
Jiafeng Hu and
Reynold Cheng and
Yudian Zheng TOAIN: a throughput optimizing adaptive
index for answering dynamic $k$ NN
queries on road networks . . . . . . . . 594--606
Tian Li and
Jie Zhong and
Ji Liu and
Wentao Wu and
Ce Zhang Ease.ml: towards multi-tenant resource
sharing for machine learning workloads 607--620
Jianzhong Qi and
Yufei Tao and
Yanchuan Chang and
Rui Zhang Theoretically optimal and empirically
efficient R-trees with strong
parallelizability . . . . . . . . . . . 621--634
Xueling Lin and
Lei Chen Domain-aware multi-truth discovery from
conflicting sources . . . . . . . . . . 635--647
Boyu Tian and
Jiamin Huang and
Barzan Mozafari and
Grant Schoenebeck Contention-aware lock scheduling for
transactional databases . . . . . . . . 648--662
Jignesh M. Patel and
Harshad Deshmukh and
Jianqiao Zhu and
Navneet Potti and
Zuyu Zhang and
Marc Spehlmann and
Hakan Memisoglu and
Saket Saurabh Quickstep: a data platform based on the
scaling-up approach . . . . . . . . . . 663--676
Haridimos Kondylakis and
Niv Dayan and
Kostas Zoumpatianos and
Themis Palpanas Coconut: a scalable bottom-up approach
for building data series indexes . . . . 677--690
Khaled Ammar and
Frank McSherry and
Semih Salihoglu and
Manas Joglekar Distributed evaluation of subgraph
queries using worst-case optimal
low-memory dataflows . . . . . . . . . . 691--704
Teng Li and
Zhiyuan Xu and
Jian Tang and
Yanzhi Wang Model-free control for distributed
stream data processing using deep
reinforcement learning . . . . . . . . . 705--718
Fotis Psallidas and
Eugene Wu Smoke: fine-grained lineage at
interactive speed . . . . . . . . . . . 719--732
Muhammad Idris and
Martín Ugarte and
Stijn Vansummeren and
Hannes Voigt and
Wolfgang Lehner Conjunctive queries with inequalities
under updates . . . . . . . . . . . . . 733--745
Zhicheng Yint and
Jin Sun and
Ming Li and
Jaliya Ekanayake and
Haibo Lin and
Marc Friedman and
José A. Blakeley and
Clemens Szyperski and
Nikhil R. Devanur Bubble execution: resource-aware
reliable analytics at cloud scale . . . 746--758
Sebastian Kruse and
Felix Naumann Efficient discovery of approximate
dependencies . . . . . . . . . . . . . . 759--772
Yue Wang and
Alexandra Meliou and
Gerome Miklau RC-index: diversifying answers to range
queries . . . . . . . . . . . . . . . . 773--786
Xin Ding and
Lu Chen and
Yunjun Gao and
Christian S. Jensen and
Hujun Bao UlTraMan: a unified platform for big
trajectory data management and analytics 787--799
Alekh Jindal and
Konstantinos Karanasos and
Sriram Rao and
Hiren Patel Selecting subexpressions to materialize
at datacenter scale . . . . . . . . . . 800--812
Fatemeh Nargesian and
Erkang Zhu and
Ken Q. Pu and
Renée J. Miller Table union search on open data . . . . 813--825
Jianfei Chen and
Jun Zhu and
Jie Lu and
Shixia Liu Scalable training of hierarchical topic
models . . . . . . . . . . . . . . . . . 826--839
Mustafa Coskun and
Ananth Grama and
Mehmet Koyutürk Indexed fast network proximity querying 840--852
Libin Zheng and
Lei Chen and
Jieping Ye Order dispatch in price-aware
ridesharing . . . . . . . . . . . . . . 853--865
Kyriakos Mouratidis and
Bo Tang Exact processing of uncertain top-$k$
queries in multi-criteria settings . . . 866--879
Laure Berti-Équille and
Hazar Harmouch and
Felix Naumann and
Noël Novelli and
Saravanan Thirumuruganathan Discovery of genuine functional
dependencies from relational data with
missing values . . . . . . . . . . . . . 880--892
Qingchao Cai and
Zhongle Xie and
Meihui Zhang and
Gang Chen and
H. V. Jagadish and
Beng Chin Ooi Effective temporal dependence discovery
in time series data . . . . . . . . . . 893--905
Akhil Arora and
Sakshi Sinha and
Piyush Kumar and
Arnab Bhattacharya HD-index: pushing the
scalability-accuracy boundary for
approximate kNN search in
high-dimensional spaces . . . . . . . . 906--919
Yousuf Ahmad and
Omar Khattab and
Arsal Malik and
Ahmad Musleh and
Mohammad Hammoud and
Mucahid Kutlu and
Mostafa Shehata and
Tamer Elsayed LA3: a scalable link- and locality-aware
linear algebra-based graph analytics
system . . . . . . . . . . . . . . . . . 920--933
Dongxiang Zhang and
Mengting Ding and
Dingyu Yang and
Yi Liu and
Ju Fan and
Heng Tao Shen Trajectory simplification: an
experimental study and quality analysis 934--946
Dolan Antenucci and
Michael Cafarella Constraint-based explanation and repair
of filter-based transformations . . . . 947--960
Xiaolan Wang and
Aaron Feng and
Behzad Golshan and
Alon Halevy and
George Mihaila and
Hidekazu Oiwa and
Wang-Chiew Tan Scalable semantic querying of text . . . 961--974
Luigi Bellomarini and
Emanuel Sallinger and
Georg Gottlob The Vadalog system: datalog-based
reasoning for knowledge graphs . . . . . 975--987
Sourav Medya and
Sayan Ranu and
Jithin Vachery and
Ambuj Singh Noticeable network delay minimization
via node upgrades . . . . . . . . . . . 988--1001
Shoumik Palkar and
James Thomas and
Deepak Narayanan and
Pratiksha Thaker and
Rahul Palamuttam and
Parimajan Negi and
Anil Shanbhag and
Malte Schwarzkopf and
Holger Pirk and
Saman Amarasinghe and
Samuel Madden and
Matei Zaharia Evaluating end-to-end optimization for
data analytics applications in Weld . . 1002--1015
Magnus Müller and
Guido Moerkotte and
Oliver Kolb Improved selectivity estimation by
combining knowledge from sampling and
synopses . . . . . . . . . . . . . . . . 1016--1028
Kai Han and
Keke Huang and
Xiaokui Xiao and
Jing Tang and
Aixin Sun and
Xueyan Tang Efficient algorithms for adaptive
influence maximization . . . . . . . . . 1029--1040
Alex D. Breslow and
Nuwan S. Jayasena Morton filters: faster, space-efficient
cuckoo filters via biasing, compression,
and decoupled logical sparsity . . . . . 1041--1055
Fei Bi and
Lijun Chang and
Xuemin Lin and
Wenjie Zhang An optimal and progressive approach to
online search of top-$k$ influential
communities . . . . . . . . . . . . . . 1056--1068
Andreas Meister and
Guido Moerkotte and
Gunter Saake Errata for ``Analysis of two existing
and one new dynamic programming
algorithm for the generation of optimal
bushy join trees without cross
products'' . . . . . . . . . . . . . . . 1069--1070
Noseong Park and
Mahmoud Mohammadi and
Kshitij Gorde and
Sushil Jajodia and
Hongkyu Park and
Youngmin Kim Data synthesis based on generative
adversarial networks . . . . . . . . . . 1071--1083
Colin Lockard and
Xin Luna Dong and
Arash Einolghozati and
Prashant Shiralkar CERES: distantly supervised relation
extraction from the semi-structured web 1084--1096
Azade Nazi and
Bolin Ding and
Vivek Narasayya and
Surajit Chaudhuri Efficient estimation of inclusion
coefficient using hyperloglog sketches 1097--1109
Fabian Fier and
Nikolaus Augsten and
Panagiotis Bouros and
Ulf Leser and
Johann-Christoph Freytag Set similarity joins on MapReduce: an
experimental survey . . . . . . . . . . 1110--1122
Bailu Ding and
Sudipto Das and
Wentao Wu and
Surajit Chaudhuri and
Vivek Narasayya Plan Stitch: harnessing the best of many
plans . . . . . . . . . . . . . . . . . 1123--1136
Sheng Wang and
Tien Tuan Anh Dinh and
Qian Lin and
Zhongle Xie and
Meihui Zhang and
Qingchao Cai and
Gang Chen and
Beng Chin Ooi and
Pingcheng Ruan Forkbase: an efficient storage engine
for blockchain and forkable applications 1137--1150
Khaled Ammar and
M. Tamer Özsu Experimental analysis of distributed
graph systems . . . . . . . . . . . . . 1151--1164
Yeye He and
Xu Chu and
Kris Ganjam and
Yudian Zheng and
Vivek Narasayya and
Surajit Chaudhuri Transform-data-by-example (TDE): an
extensible search engine for data
transformations . . . . . . . . . . . . 1165--1177
Dan O'Keeffe and
Theodoros Salonidis and
Peter Pietzuch Frontier: resilient edge processing for
the Internet of Things . . . . . . . . . 1178--1191
Brandon Haynes and
Amrita Mazumdar and
Armin Alaghi and
Magdalena Balazinska and
Luis Ceze and
Alvin Cheung LightDB: a DBMS for virtual reality
video . . . . . . . . . . . . . . . . . 1192--1205
Ryan McKenna and
Gerome Miklau and
Michael Hay and
Ashwin Machanavajjhala Optimizing error of high-dimensional
statistical queries under differential
privacy . . . . . . . . . . . . . . . . 1206--1219
Yu Liu and
Hantian Zhang and
Luyuan Zeng and
Wentao Wu and
Ce Zhang MLbench: benchmarking machine learning
services against human experts . . . . . 1220--1232
Lu Chen and
Chengfei Liu and
Rui Zhou and
Jianxin Li and
Xiaochun Yang and
Bin Wang Maximum co-located community search in
large scale social networks . . . . . . 1233--1246
Ramon Antonio Rodriges Zalipynis ChronosDB: distributed, file based,
geospatial array DBMS . . . . . . . . . 1247--1261
Stephen Macke and
Yiming Zhang and
Silu Huang and
Aditya Parameswaran Adaptive sampling for rapidly matching
histograms . . . . . . . . . . . . . . . 1262--1275
Abolfazl Asudeh and
Azade Nazi and
Jees Augustine and
Saravanan Thirumuruganathan and
Nan Zhang and
Gautam Das and
Divesh Srivastava Leveraging similarity joins for signal
reconstruction . . . . . . . . . . . . . 1276--1288
Xiangyao Yu and
Yu Xia and
Andrew Pavlo and
Daniel Sanchez and
Larry Rudolph and
Srinivas Devadas Sundial: harmonizing concurrency control
and caching in a distributed OLTP
database management system . . . . . . . 1289--1302
Luo Mai and
Kai Zeng and
Rahul Potharaju and
Le Xu and
Steve Suh and
Shivaram Venkataraman and
Paolo Costa and
Terry Kim and
Saravanan Muthukrishnan and
Vamsi Kuppa and
Sudheer Dhulipalla and
Sriram Rao Chi: a scalable and programmable control
plane for distributed stream processing
systems . . . . . . . . . . . . . . . . 1303--1316
Divya Mahajan and
Joon Kyung Kim and
Jacob Sacks and
Adel Ardalan and
Arun Kumar and
Hadi Esmaeilzadeh In-RDBMS hardware acceleration of
advanced analytics . . . . . . . . . . . 1317--1331
Ilya Kolchinsky and
Assaf Schuster Join query optimization techniques for
complex event processing applications 1332--1345
Ilya Kolchinsky and
Assaf Schuster Efficient adaptive detection of complex
event patterns . . . . . . . . . . . . . 1346--1359
Florian Wolf and
Michael Brendle and
Norman May and
Paul R. Willems and
Kai-Uwe Sattler and
Michael Grossniklaus Robustness metrics for relational query
execution plans . . . . . . . . . . . . 1360--1372
Weiguo Zheng and
Jeffrey Xu Yu and
Lei Zou and
Hong Cheng Question answering over knowledge
graphs: question understanding via
template decomposition . . . . . . . . . 1373--1386
Joeri Rammelaere and
Floris Geerts Explaining repaired data with CFDs . . . 1387--1399
Joseph Vinish D'silva and
Florestan De Moor and
Bettina Kemme AIDA: abstraction for advanced
in-database analytics . . . . . . . . . 1400--1413
Divy Agrawal and
Sanjay Chawla and
Bertty Contreras-Rojas and
Ahmed Elmagarmid and
Yasser Idris and
Zoi Kaoudi and
Sebastian Kruse and
Ji Lucas and
Essam Mansour and
Mourad Ouzzani and
Paolo Papotti and
Jorge-Arnulfo Quiané-Ruiz and
Nan Tang and
Saravanan Thirumuruganathan and
Anis Troudi RHEEM: enabling cross-platform data
processing: may the big data be with
you! . . . . . . . . . . . . . . . . . . 1414--1427
Peng Cheng and
Xun Jian and
Lei Chen An experimental evaluation of task
assignment in spatial crowdsourcing . . 1428--1440
Rohit Kumar and
Toon Calders 2SCENT: an efficient algorithm for
enumerating all simple temporal cycles 1441--1453
Muhammad Ebraheem and
Saravanan Thirumuruganathan and
Shafiq Joty and
Mourad Ouzzani and
Nan Tang Distributed representations of tuples
for entity resolution . . . . . . . . . 1454--1467
Sona Hasani and
Saravanan Thirumuruganathan and
Abolfazl Asudeh and
Nick Koudas and
Gautam Das Efficient construction of approximate
ad-hoc ML models through materialization
and reuse . . . . . . . . . . . . . . . 1468--1481
Shumo Chu and
Brendan Murphy and
Jared Roesch and
Alvin Cheung and
Dan Suciu Axiomatic foundations and algorithms for
deciding semantic equivalences of SQL
queries . . . . . . . . . . . . . . . . 1482--1495
Faisal M. Almutairi and
Fan Yang and
Hyun Ah Song and
Christos Faloutsos and
Nicholas Sidiropoulos and
Vladimir Zadorozhny Homerun: scalable sparse-spectrum
reconstruction of aggregated historical
data . . . . . . . . . . . . . . . . . . 1496--1508
Yu-Hsuan Kuo and
Cho-Chun Chiu and
Daniel Kifer and
Michael Hay and
Ashwin Machanavajjhala Differentially private hierarchical
count-of-counts histograms . . . . . . . 1509--1521
Feng Zhang and
Jidong Zhai and
Xipeng Shen and
Onur Mutlu and
Wenguang Chen Efficient document analytics on
compressed data: method, challenges,
algorithms, insights . . . . . . . . . . 1522--1535
Tobias Müller and
Benjamin Dietrich and
Torsten Grust You say 'what', I hear 'where' and
'why': (mis-)interpreting SQL to derive
fine-grained provenance . . . . . . . . 1536--1549
Lars-Christian Schulz and
David Broneske and
Gunter Saake An eight-dimensional systematic
evaluation of optimized search
algorithms on modern processors . . . . 1550--1562
Immanuel Trummer and
Mark Bryan and
Ramya Narasimha Vocalizing large time series efficiently 1563--1575
Shoumik Palkar and
Firas Abuzaid and
Peter Bailis and
Matei Zaharia Filter before you parse: faster
analytics on raw data with sparser . . . 1576--1589
Zainab Abbas and
Vasiliki Kalavri and
Paris Carbone and
Vladimir Vlassov Streaming graph partitioning: an
experimental study . . . . . . . . . . . 1590--1603
Qingchao Cai and
Wentian Guo and
Hao Zhang and
Divyakant Agrawal and
Gang Chen and
Beng Chin Ooi and
Kian-Lee Tan and
Yong Meng Teo and
Sheng Wang Efficient distributed memory management
with RDMA and caching . . . . . . . . . 1604--1617
Diego Didona and
Rachid Guerraoui and
Jingjing Wang and
Willy Zwaenepoel Causal consistency and latency
optimality: friend or foe? . . . . . . . 1618--1632
Yongxin Tong and
Yuxiang Zeng and
Zimu Zhou and
Lei Chen and
Jieping Ye and
Ke Xu A unified approach to route planning for
shared mobility . . . . . . . . . . . . 1633--1646
Edward Gan and
Jialin Ding and
Kai Sheng Tai and
Vatsal Sharan and
Peter Bailis Moment-based quantile sketches for
efficient high cardinality aggregation
queries . . . . . . . . . . . . . . . . 1647--1660
Varun Pandey and
Andreas Kipf and
Thomas Neumann and
Alfons Kemper How good are modern spatial analytics
systems? . . . . . . . . . . . . . . . . 1661--1673
Kexin Rong and
Clara E. Yoon and
Karianne J. Bergen and
Hashem Elezabi and
Peter Bailis and
Philip Levis and
Gregory C. Beroza Locality-sensitive hashing for
earthquake detection: a case study of
scaling data-driven science . . . . . . 1674--1687
Sòren Kejser Jensen and
Torben Bach Pedersen and
Christian Thomsen ModelarDB: modular model-based time
series management with Spark and
Cassandra . . . . . . . . . . . . . . . 1688--1701
Christopher Jonathan and
Umar Farooq Minhas and
James Hunter and
Justin Levandoski and
Gor Nishanov Exploiting coroutines to attack the
``killer nanoseconds'' . . . . . . . . . 1702--1714
Vincent Bindschaedler and
Paul Grubbs and
David Cash and
Thomas Ristenpart and
Vitaly Shmatikov The tao of inference in
privacy-protected databases . . . . . . 1715--1728
Ioannis Demertzis and
Rajdeep Talapatra and
Charalampos Papamanthou Efficient searchable encryption through
compression . . . . . . . . . . . . . . 1729--1741
Shen Li and
Paul Gerver and
John MacMillan and
Daniel Debrunner and
William Marshall and
Kun-Lung Wu Challenges and experiences in building
an efficient Apache Beam runner for IBM
streams . . . . . . . . . . . . . . . . 1742--1754
Matthias Boehm and
Berthold Reinwald and
Dylan Hutchison and
Prithviraj Sen and
Alexandre V. Evfimievski and
Niketan Pansare On optimizing operator fusion plans for
large-scale machine learning in systemML 1755--1768
Robin Rehrmann and
Carsten Binnig and
Alexander Böhm and
Kihong Kim and
Wolfgang Lehner and
Amr Rizk OLTPshare: the case for sharing in OLTP
workloads . . . . . . . . . . . . . . . 1769--1780
Sebastian Schelter and
Dustin Lange and
Philipp Schmidt and
Meltem Celikel and
Felix Biessmann and
Andreas Grafberger Automating large-scale data quality
verification . . . . . . . . . . . . . . 1781--1794
Ohad Shacham and
Yonatan Gottesman and
Aran Bergman and
Edward Bortnikov and
Eshcar Hillel and
Idit Keidar Taking Omid to the clouds: fast,
scalable transactions for real-time
cloud analytics . . . . . . . . . . . . 1795--1808
Gabriela Jacques-Silva and
Ran Lei and
Luwei Cheng and
Guoqiang Jerry Chen and
Kuen Ching and
Tanji Hu and
Yuan Mei and
Kevin Wilfong and
Rithin Shetty and
Serhat Yilmaz and
Anirban Banerjee and
Benjamin Heintz and
Shridar Iyer and
Anshul Jaiswal Providing streaming joins as a service
at Facebook . . . . . . . . . . . . . . 1809--1821
Le Cai and
Jianjun Chen and
Jun Chen and
Yu Chen and
Kuorong Chiang and
Marko Dimitrijevic and
Yonghua Ding and
Yu Dong and
Ahmad Ghazal and
Jacques Hebert and
Kamini Jagtiani and
Suzhen Lin and
Ye Liu and
Demai Ni and
Chunfeng Pei and
Jason Sun and
Yongyan Wang and
Li Zhang and
Mingyi Zhang and
Cheng Zhu Fusion insight librA: Huawei's
enterprise cloud data analytics platform 1822--1834
Bart Samwel and
John Cieslewicz and
Ben Handy and
Jason Govig and
Petros Venetis and
Chanjun Yang and
Keith Peters and
Jeff Shute and
Daniel Tenedorio and
Himani Apte and
Felix Weigel and
David Wilhite and
Jiacheng Yang and
Jun Xu and
Jiexing Li and
Zhan Yuan and
Craig Chasseur and
Qiang Zeng and
Ian Rae and
Anurag Biyani and
Andrew Harn and
Yang Xia and
Andrey Gubichev and
Amr El-Helw and
Orri Erling and
Zhepeng Yan and
Mohan Yang and
Yiqun Wei and
Thanh Do and
Colin Zheng and
Goetz Graefe and
Somayeh Sardashti and
Ahmed M. Aly and
Divy Agrawal and
Ashish Gupta and
Shiv Venkataraman F1 query: declarative querying at scale 1835--1848
Wei Cao and
Zhenjun Liu and
Peng Wang and
Sen Chen and
Caifeng Zhu and
Song Zheng and
Yuhui Wang and
Guoqing Ma PolarFS: an ultra-low latency and
failure resilient distributed file
system for shared storage cloud database 1849--1862
Edward Bortnikov and
Anastasia Braginsky and
Eshcar Hillel and
Idit Keidar and
Gali Sheffi Accordion: better memory organization
for LSM key--value stores . . . . . . . 1863--1875
Xiafei Qiu and
Wubin Cen and
Zhengping Qian and
You Peng and
Ying Zhang and
Xuemin Lin and
Jingren Zhou Real-time constrained cycle detection in
large dynamic graphs . . . . . . . . . . 1876--1888
Aditya Gurajada and
Dheren Gala and
Fei Zhou and
Amit Pathak and
Zhan-Feng Ma BTrim: hybrid in-memory database
architecture for extreme transaction
processing in VLDBs . . . . . . . . . . 1889--1901
Avinesh P. V. S. and
Benjamin Hättasch and
Orkan Özyurt and
Carsten Binnig and
Christian M. Meyer Sherlock: a system for interactive
summarization of large text collections 1902--1905
Hans Walter Behrens and
K. Selçuk Candan and
Xilun Chen and
Ashish Gadkari and
Yash Garg and
Mao-Lin Li and
Xinsheng Li and
Sicong Liu and
Nicholas Martinez and
Jiayong Mo and
Elliot Nester and
Silvestro Poccia and
Manjusha Ravindranath and
Maria Luisa Sapino Datastorm-FE: a data- and decision-flow
and coordination engine for coupled
simulation ensembles . . . . . . . . . . 1906--1909
Bohan Zhang and
Dana Van Aken and
Justin Wang and
Tao Dai and
Shuli Jiang and
Jacky Lao and
Siyuan Sheng and
Andrew Pavlo and
Geoffrey J. Gordon A demonstration of the OtterTune
automatic database management system
tuning service . . . . . . . . . . . . . 1910--1913
Elena Kakoulli and
Nikolaos D. Karmiris and
Herodotos Herodotou OctopusFS in action: tiered storage
management for data intensive computing 1914--1917
Huan Li and
Hua Lu and
Feichao Shi and
Gang Chen and
Ke Chen and
Lidan Shou TRIPS: a system for translating raw
indoor positioning data into visual
mobility semantics . . . . . . . . . . . 1918--1921
Xiangyu Ke and
Michelle Teo and
Arijit Khan and
Vijaya Krishna Yalavarthi A demonstration of PERC: probabilistic
entity resolution with crowd errors . . 1922--1925
Guoliang Li and
Chengliang Chai and
Ju Fan and
Xueping Weng and
Jian Li and
Yudian Zheng and
Yuanbing Li and
Xiang Yu and
Xiaohang Zhang and
Haitao Yuan CDB: a crowd-powered database system . . 1926--1929
Badrish Chandramouli and
Guna Prasaad and
Donald Kossmann and
Justin Levandoski and
James Hunter and
Mike Barnett FASTER: an embedded concurrent
key--value store for state management 1930--1933
Gensheng Zhang and
Chengkai Li Maverick: a system for discovering
exceptional facts from knowledge graphs 1934--1937
Lu Chen and
Yunjun Gao and
Zixian Liu and
Xiaokui Xiao and
Christian S. Jensen and
Yifan Zhu PTrider: a price-and-time-aware
ridesharing system . . . . . . . . . . . 1938--1941
Amin Beheshti and
Boualem Benatallah and
Reza Nouri and
Alireza Tabebordbar CoreKG: a knowledge lake service . . . . 1942--1945
Stefano Ortona and
Venkata Vamsikrishna Meduri and
Paolo Papotti RuDiK: rule discovery in knowledge bases 1946--1949
George Papadakis and
Leonidas Tsekouras and
Emmanouil Thanos and
George Giannakopoulos and
Themis Palpanas and
Manolis Koubarakis The return of jedAI: end-to-end entity
resolution for structured and
semi-structured data . . . . . . . . . . 1950--1953
Seokki Lee and
Bertram Ludäscher and
Boris Glavic Provenance summaries for answers and
non-answers . . . . . . . . . . . . . . 1954--1957
Doris Xin and
Litian Ma and
Jialin Liu and
Stephen Macke and
Shuchen Song and
Aditya Parameswaran Helix: accelerating human-in-the-loop
machine learning . . . . . . . . . . . . 1958--1961
Tarique Siddiqui and
Paul Luh and
Zesheng Wang and
Karrie Karahalios and
Aditya Parameswaran Shapesearch: flexible pattern-based
querying of trend line visualizations 1962--1965
Miao Xie and
Sourav S Bhowmick and
Hao Su and
Gao Cong and
Wook-Shin Han PANDA: a system for partial
topology-based search on large networks 1966--1969
Wei Lu and
Xinyi Zhang and
Zhiyu Shui and
Zhe Peng and
Xiao Zhang and
Xiaoyong Du and
Hao Huang and
Xiaoyu Wang and
Anqun Pan and
Haixiang Li MSQL+: a plugin toolkit for similarity
search under metric spaces in
distributed relational database systems 1970--1973
Anupam Sanghi and
Raghav Sood and
Dharmendra Singh and
Jayant R. Haritsa and
Srikanta Tirthapura HYDRA: a dynamic big data regenerator 1974--1977
Fuad Jamour and
Ibrahim Abdelaziz and
Panos Kalnis A demonstration of MAGiQ: matrix algebra
approach for solving RDF graph queries 1978--1981
Wei Chit Tan and
Meihui Zhang and
Hazem Elmeleegy and
Divesh Srivastava REGAL+: reverse engineering SPJA queries 1982--1985
Daniel Deutch and
Nave Frost and
Amir Gilad and
Tomer Haimovich NLproveNAns: natural language provenance
for non-answers . . . . . . . . . . . . 1986--1989
Chen Xu and
Rudi Poepsel Lemaitre and
Juan Soto and
Volker Markl Fault-tolerance for distributed
iterative dataflows in action . . . . . 1990--1993
Efrat Abramovitz and
Daniel Deutch and
Amir Gilad QuestPro: queries in SPARQL through
provenance . . . . . . . . . . . . . . . 1994--1997
Ariel Jarovsky and
Tova Milo and
Slava Novgorodov and
Wang-Chiew Tan GOLDRUSH: rule sharing system for fraud
detection . . . . . . . . . . . . . . . 1998--2001
Christian Aebeloe and
Gabriela Montoya and
Vinay Setty and
Katja Hose Discovering diversified paths in
knowledge bases . . . . . . . . . . . . 2002--2005
Martin Junghanns and
Max Kießling and
Niklas Teichmann and
Kevin Gómez and
André Petermann and
Erhard Rahm Declarative and distributed graph
analytics with GRADOOP . . . . . . . . . 2006--2009
J. W. Zhang and
Yu Wang and
Y. C. Tay A collaborative framework for tweaking
properties in a synthetic dataset . . . 2010--2013
Manasa Jammi and
Jaydeep Sen and
Ashish Mittal and
Sagar Verma and
Vardaan Pahuja and
Rema Ananthanarayanan and
Pranay Lohia and
Hima Karanam and
Diptikalyan Saha and
Karthik Sankaranarayanan Tooling framework for instantiating
natural language querying system . . . . 2014--2017
Xiaolan Wang and
Jiyu Komiya and
Yoshihiko Suhara and
Aaron Feng and
Behzad Golshan and
Alon Halevy and
Wang-Chiew Tan Koko: a system for scalable semantic
querying of text . . . . . . . . . . . . 2018--2021
Jing Wang and
Zichen Liu and
Shuai Ma and
Nikos Ntarmos and
Peter Triantafillou GC: a graph caching system for
subgraph/supergraph queries . . . . . . 2022--2025
Matteo Lissandrini and
Davide Mottin and
Yannis Velegrakis and
Themis Palpanas X$^2$Q: your personal example-based
graph explorer . . . . . . . . . . . . . 2026--2029
Camille Chanial and
Rédouane Dziri and
Helena Galhardas and
Julien Leblay and
Minh-Huong Le Nguyen and
Ioana Manolescu Connectionlens: finding connections
across heterogeneous data sources . . . 2030--2033
Pierre Senellart and
Louis Jachiet and
Silviu Maniu and
Yann Ramusat ProvSQL: provenance and probability
management in postgreSQL . . . . . . . . 2034--2037
Zechao Shang and
Will Brackenbury and
Aaron J. Elmore and
Michael J. Franklin CYADB: a database that covers your ask 2038--2041
Yash Govind and
Erik Paulson and
Palaniappan Nagarajan and
Paul Suganthan G. C. and
AnHai Doan and
Youngchoon Park and
Glenn M. Fung and
Devin Conathan and
Marshall Carter and
Mingju Sun Cloudmatcher: a hands-off cloud/crowd
service for entity matching . . . . . . 2042--2045
Philipp M. Grulich and
Faisal Nawab Collaborative edge and cloud neural
networks for real-time video processing 2046--2049
Ashvin Agrawal and
Avrilia Floratou Dhalion in action: automatic management
of streaming applications . . . . . . . 2050--2053
Bojan Karlas and
Ji Liu and
Wentao Wu and
Ce Zhang Ease.ml in action: towards multi-tenant
declarative learning services . . . . . 2054--2057
Antonio Cavalcante Araujo Neto and
Mario A. Nascimento and
Joerg Sander and
Ricardo J. G. B. Campello MustaCHE: a multiple clustering
hierarchies explorer . . . . . . . . . . 2058--2061
Babak Salimi and
Corey Cole and
Peter Li and
Johannes Gehrke and
Dan Suciu HypDB: a demonstration of detecting,
explaining and resolving bias in OLAP
queries . . . . . . . . . . . . . . . . 2062--2065
Jose Picado and
Arash Termehchy and
Sudhanshu Pathak Learning efficiently over heterogeneous
databases . . . . . . . . . . . . . . . 2066--2069
Walter dos Santos and
Gustavo P. Avelar and
Manoel Horta Ribeiro and
Dorgival Guedes and
Wagner Meira Scalable and efficient data analytics
and mining with lemonade . . . . . . . . 2070--2073
Immanuel Trummer and
Samuel Moseley and
Deepak Maram and
Saehan Jo and
Joseph Antonakakis SkinnerDB: regret-bounded query
evaluation via reinforcement learning 2074--2077
Hoang Vo and
Yanhui Liang and
Jun Kong and
Fusheng Wang iSPEED: a scalable and distributed
in-memory based spatial query system for
large and structurally complex 3D data 2078--2081
Vítor Silva and
Daniel de Oliveira and
Patrick Valduriez and
Marta Mattoso DfAnalyzer: runtime dataflow analysis of
scientific applications using provenance 2082--2085
Nick Hynes and
David Dao and
David Yan and
Raymond Cheng and
Dawn Song A demonstration of Sterling: a
privacy-preserving data marketplace . . 2086--2089
Yang Cao and
Li Xiong and
Masatoshi Yoshikawa and
Yonghui Xiao and
Si Zhang ConTPL: controlling temporal privacy
leakage in differentially private
continuous data release . . . . . . . . 2090--2093
Xin Luna Dong and
Theodoros Rekatsinas Data integration and machine learning: a
natural synergy . . . . . . . . . . . . 2094--2097
Sujaya Maiyya and
Victor Zakhary and
Divyakant Agrawal and
Amr El Abbadi Database and distributed computing
fundamentals for scalable,
fault-tolerant, and consistent
maintenance of blockchains . . . . . . . 2098--2101
Christos Faloutsos and
Jan Gasthaus and
Tim Januschowski and
Yuyang Wang Forecasting big time series: old and new 2102--2105
Alin Deutsch and
Yannis Papakonstantinou Graph data models, query languages and
programming paradigms . . . . . . . . . 2106--2109
Sylvie Cazalens and
Julien Leblay and
Philippe Lamarre and
Ioana Manolescu and
Xavier Tannier Computational fact checking: a content
management perspective . . . . . . . . . 2110--2113
Antonio L. Furtado and
Nivio Ziviani Information and data management at
PUC-Rio and UFMG . . . . . . . . . . . . 2114--2129
Renée J. Miller Open data integration . . . . . . . . . 2130--2139
Michael Cafarella and
Alon Halevy and
Hongrae Lee and
Jayant Madhavan and
Cong Yu and
Daisy Zhe Wang and
Eugene Wu Ten years of WebTables . . . . . . . . . 2140--2149
Tim Kraska Northstar: an interactive data science
system . . . . . . . . . . . . . . . . . 2150--2164
Julia Stoyanovich and
Bill Howe and
HV Jagadish and
Gerome Miklau Panel: a debate on data and algorithmic
ethics . . . . . . . . . . . . . . . . . 2165--2167
Anthony Thomas and
Arun Kumar A comparative evaluation of systems for
scalable linear algebra-based analytics 2168--2182
Srinivas Karthik and
Jayant R. Haritsa and
Sreyash Kenkre and
Vinayaka Pandit A concave path to low-overhead robust
query processing . . . . . . . . . . . . 2183--2195
Yuhao Wen and
Xiaodan Zhu and
Sudeepa Roy and
Jun Yang Interactive summarization and
exploration of top aggregate query
answers . . . . . . . . . . . . . . . . 2196--2208
Timo Kersten and
Viktor Leis and
Alfons Kemper and
Thomas Neumann and
Andrew Pavlo and
Peter Boncz Everything you always wanted to know
about compiled and vectorized queries
but were afraid to ask . . . . . . . . . 2209--2222
Junyang Gao and
Pankaj K. Agarwal and
Jun Yang Durable top-$k$ queries on temporal data 2223--2235
Michele Linardi and
Themis Palpanas Scalable, variable-length similarity
search in data series: the ULISSE
approach . . . . . . . . . . . . . . . . 2236--2248
Caetano Sauer and
Goetz Graefe and
Theo Härder FineLine: log-structured transactional
storage and recovery . . . . . . . . . . 2249--2262
Protiva Rahman and
Courtney Hebert and
Arnab Nandi ICARUS: minimizing human effort in
iterative data completion . . . . . . . 2263--2276
Sunghwan Kim and
Taesung Lee and
Seung-won Hwang and
Sameh Elnikety List intersection for web search:
algorithms, cost models, and
optimizations . . . . . . . . . . . . . 1--13
Michael Whittaker and
Joseph M. Hellerstein Interactive checks for coordination
avoidance . . . . . . . . . . . . . . . 14--27
Jianbin Qin and
Chuan Xiao Pigeonring: a principle for faster
thresholded similarity search . . . . . 28--42
Ahmet Erdem Sariyüce and
C. Seshadhri and
Ali Pinar Local algorithms for hierarchical dense
subgraph discovery . . . . . . . . . . . 43--56
Jingru Yang and
Ju Fan and
Zhewei Wei and
Guoliang Li and
Tongyu Liu and
Xiaoyong Du Cost-effective data annotation using
game-based crowdsourcing . . . . . . . . 57--70
Enhui Huang and
Liping Peng and
Luciano Di Palma and
Ahmed Abdelkafi and
Anna Liu and
Yanlei Diao Optimization for active learning-based
interactive database exploration . . . . 71--84
Tobias Bleifuß and
Leon Bornemann and
Theodore Johnson and
Dmitri V. Kalashnikov and
Felix Naumann and
Divesh Srivastava Exploring change: a new dimension of
data analytics . . . . . . . . . . . . . 85--98
Bishwamittra Ghosh and
Mohammed Eunus Ali and
Farhana M. Choudhury and
Sajid Hasan Apon and
Timos Sellis and
Jianxin Li The flexible socio spatial group queries 99--111
Karima Echihabi and
Kostas Zoumpatianos and
Themis Palpanas and
Houda Benbrahim The Lernaean Hydra of data series
similarity search: an experimental
evaluation of the state of the art . . . 112--127
Wei Wang and
Jinyang Gao and
Meihui Zhang and
Sheng Wang and
Gang Chen and
Teck Khim Ng and
Beng Chin Ooi and
Jie Shao and
Moaz Reyad Rafiki: machine learning as an analytics
service system . . . . . . . . . . . . . 128--140
Pavle Suboti\'c and
Herbert Jordan and
Lijun Chang and
Alan Fekete and
Bernhard Scholz Automatic index selection for
large-scale datalog computation . . . . 141--153
Shuang Song and
Xu Liu and
Qinzhe Wu and
Andreas Gerstlauer and
Tao Li and
Lizy K. John Start late, finish early: a distributed
graph processing system with redundancy
reduction . . . . . . . . . . . . . . . 154--168
Bailu Ding and
Lucja Kot and
Johannes Gehrke Improving optimistic concurrency control
through transaction batching and
operation reordering . . . . . . . . . . 169--182
Ting Xie and
Varun Chandola and
Oliver Kennedy Query log compression for workload
analytics . . . . . . . . . . . . . . . 183--196
Mohammed Eunus Ali and
Shadman Saqib Eusuf and
Kaysar Abdullah and
Farhana M. Choudhury and
J. Shane Culpepper and
Timos Sellis The maximum trajectory coverage query in
spatial databases . . . . . . . . . . . 197--209
Chenggang Wu and
Alekh Jindal and
Saeed Amizadeh and
Hiren Patel and
Wangchao Le and
Shi Qiao and
Sriram Rao Towards a learning optimizer for shared
clouds . . . . . . . . . . . . . . . . . 210--222
Paroma Varma and
Christopher Ré Snuba: automating weak supervision to
label training data . . . . . . . . . . 223--236
Abolfazl Asudeh and
H. V. Jagadish and
Gerome Miklau and
Julia Stoyanovich On obtaining stable rankings . . . . . . 237--250
Shuping Ji and
Hans-Arno Jacobsen PS-tree-based efficient Boolean
expression matching for high-dimensional
and dense workloads . . . . . . . . . . 251--264
Yizhou Yan and
Lei Cao and
Samuel Madden and
Elke A. Rundensteiner SWIFT: mining representative patterns
from large event streams . . . . . . . . 265--277
Paul Suganthan G. C. and
Adel Ardalan and
AnHai Doan and
Aditya Akella Smurf: self-service string matching
using random forests . . . . . . . . . . 278--291
Feilong Liu and
Ario Salmasi and
Spyros Blanas and
Anastasios Sidiropoulos Chasing similarity: distribution-aware
aggregation scheduling . . . . . . . . . 292--306
Johes Bater and
Xi He and
William Ehrich and
Ashwin Machanavajjhala and
Jennie Rogers Shrinkwrap: efficient SQL query
processing in differentially private
data federations . . . . . . . . . . . . 307--320
Gurbinder Gill and
Roshan Dathathri and
Loc Hoang and
Keshav Pingali A study of partitioning policies for
graph analytics on large-scale
distributed platforms . . . . . . . . . 321--334
K. Ashwin Kumar and
Petros Efstathopoulos Utility-driven graph summarization . . . 335--347
Kaan Kara and
Ken Eguro and
Ce Zhang and
Gustavo Alonso ColumnML: column-store machine learning
with on-the-fly data transformation . . 348--361
Yanying Li and
Haipei Sun and
Boxiang Dong and
Hui (Wendy) Wang Cost-efficient data acquisition on
online data marketplaces for correlation
analysis . . . . . . . . . . . . . . . . 362--375
Mohamad Dolatshah and
Mathew Teoh and
Jiannan Wang and
Jian Pei Cleaning crowdsourced labels using
oracles for statistical classification 376--389
Matteo Lissandrini and
Martin Brugnara and
Yannis Velegrakis Beyond macrobenchmarks:
microbenchmark-based graph database
evaluation . . . . . . . . . . . . . . . 390--403
Valter Balegas and
Sérgio Duarte and
Carla Ferreira and
Rodrigo Rodrigues and
Nuno Preguiça IPA: invariant-preserving applications
for weakly consistent replicated
databases . . . . . . . . . . . . . . . 404--418
Firas Abuzaid and
Peter Kraft and
Sahaana Suri and
Edward Gan and
Eric Xu and
Atul Shenoy and
Asvin Ananthanarayan and
John Sheu and
Erik Meijer and
Xi Wu and
Jeff Naughton and
Peter Bailis and
Matei Zaharia DIFF: a relational interface for
large-scale data explanation . . . . . . 419--432
Ran Ben Basat and
Roy Friedman and
Rana Shahout Stream frequency over interval queries 433--445
Doris Xin and
Stephen Macke and
Litian Ma and
Jialin Liu and
Shuchen Song and
Aditya Parameswaran HELIX: holistic optimization for
accelerating iterative machine learning 446--460
Cong Fu and
Chao Xiang and
Changxu Wang and
Deng Cai Fast approximate nearest neighbor search
with the navigating spreading-out graph 461--474
Qi Wang and
Torsten Suel Document reordering for faster
intersection . . . . . . . . . . . . . . 475--487
Xiaofei Zhang and
M. Tamer Özsu Correlation constraint shortest path
over large multi-relation graphs . . . . 488--501
Harald Lang and
Thomas Neumann and
Alfons Kemper and
Peter Boncz Performance-optimal filtering: Bloom
overtakes Cuckoo at high throughput . . 502--515
Steffen Zeuch and
Bonaventura Del Monte and
Jeyhun Karimov and
Clemens Lutz and
Manuel Renz and
Jonas Traub and
Sebastian Breß and
Tilmann Rabl and
Volker Markl Analyzing efficient stream processing on
modern hardware . . . . . . . . . . . . 516--530
Chen Luo and
Michael J. Carey Efficient data ingestion and query
processing for LSM-based storage systems 531--543
Periklis Chrysogelos and
Manos Karpathiotakis and
Raja Appuswamy and
Anastasia Ailamaki HetExchange: encapsulating heterogeneous
CPU--GPU parallelism in JIT compiled
engines . . . . . . . . . . . . . . . . 544--556
Paolo Atzeni and
Luigi Bellomarini and
Paolo Papotti and
Riccardo Torlone Meta-mappings for schema mapping reuse 557--569
Lijie Xu and
Tian Guo and
Wensheng Dou and
Wei Wang and
Jun Wei An experimental evaluation of garbage
collectors on big data applications . . 570--583
Jinwei Guo and
Peng Cai and
Jiahao Wang and
Weining Qian and
Aoying Zhou Adaptive optimistic concurrency control
for heterogeneous workloads . . . . . . 584--596
Yu-Shan Lin and
Shao-Kan Pi and
Meng-Kai Liao and
Ching Tsai and
Aaron Elmore and
Shan-Hung Wu MgCrab: transaction crabbing for live
migration in deterministic database
systems . . . . . . . . . . . . . . . . 597--610
Sujaya Maiyya and
Faisal Nawab and
Divyakant Agrawal and
Amr El Abbadi Unifying consensus and atomic commitment
for effective cloud data management . . 611--623
Chenggang Wu and
Vikram Sreekanti and
Joseph M. Hellerstein Autoscaling tiered cloud storage in Anna 624--638
Anton Dignös and
Boris Glavic and
Xing Niu and
Michael Böhlen and
Johann Gamper Snapshot semantics for temporal multiset
relations . . . . . . . . . . . . . . . 639--652
Selasi Kwashie and
Lin Liu and
Jixue Liu and
Markus Stumptner and
Jiuyong Li and
Lujing Yang Certus: an effective entity resolution
approach with graph differential
dependencies (GDDs) . . . . . . . . . . 653--666
Kai Han and
Fei Gui and
Xiaokui Xiao and
Jing Tang and
Yuntian He and
Zongmai Cao and
He Huang Efficient and effective algorithms for
clustering uncertain graphs . . . . . . 667--680
Jia Zou and
Arun Iyengar and
Chris Jermaine Pangea: monolithic distributed storage
for data analytics . . . . . . . . . . . 681--694
Zhiwei Fan and
Jianqiao Zhu and
Zuyu Zhang and
Aws Albarghouthi and
Paraschos Koutris and
Jignesh M. Patel Scaling-up in-memory datalog processing:
observations and techniques . . . . . . 695--708
Aaron Archer and
Kevin Aydin and
Mohammad Hossein Bateni and
Vahab Mirrokni and
Aaron Schild and
Ray Yang and
Richard Zhuang Cache-aware load balancing of data
center applications . . . . . . . . . . 709--723
Michael Borkowski and
Christoph Hochreiner and
Stefan Schulte Minimizing cost by reducing scaling
operations in distributed stream
processing . . . . . . . . . . . . . . . 724--737
Yinjun Wu and
Abdussalam Alawini and
Daniel Deutch and
Tova Milo and
Susan Davidson ProvCite: provenance-based data citation 738--751
Wenfei Fan and
Ping Lu and
Chao Tian and
Jingren Zhou Deducing certain fixes to graphs . . . . 752--765
Matteo Ceccarello and
Andrea Pietracaprina and
Geppino Pucci Solving $k$-center clustering (with
outliers) in MapReduce and streaming,
almost as accurately as sequentially . . 766--778
Xiaolan Wang and
Alexandra Meliou Explain$3$D: explaining disagreements in
disjoint datasets . . . . . . . . . . . 779--792
Youjip Won and
Sundoo Kim and
Juseong Yun and
Dam Quang Tuan and
Jiwon Seo DASH: database shadowing for mobile DBMS 793--806
Zeke Wang and
Kaan Kara and
Hantian Zhang and
Gustavo Alonso and
Onur Mutlu and
Ce Zhang Accelerating generalized linear models
with MLWeaving: a one-size-fits-all
system for any-precision learning . . . 807--821
Dimitrije Jankov and
Shangyu Luo and
Binhang Yuan and
Zhuhua Cai and
Jia Zou and
Chris Jermaine and
Zekai J. Gao Declarative recursive computation on an
RDBMS: or, why you should use a database
for distributed machine learning . . . . 822--835
Shahram Ghandeharizadeh and
Hieu Nguyen Design, implementation, and evaluation
of write-back policy with cache
augmented data stores . . . . . . . . . 836--849
Thanh Tam Nguyen and
Matthias Weidlich and
Hongzhi Yin and
Bolong Zheng and
Quoc Viet Hung Nguyen and
Bela Stantic User guidance for efficient fact
checking . . . . . . . . . . . . . . . . 850--863
Xiangyu Ke and
Arijit Khan and
Leroy Lim Hong Quan An in-depth comparison of $s$--$t$
reliability algorithms over uncertain
graphs . . . . . . . . . . . . . . . . . 864--876
Wenfei Fan and
Chunming Hu and
Muyang Liu and
Ping Lu and
Qiang Yin and
Jingren Zhou Dynamic scaling for parallel graph
computations . . . . . . . . . . . . . . 877--890
Dongsheng Li and
Yiming Zhang and
Jinyan Wang and
Kian-Lee Tan TopoX: topology refactorization for
efficient graph partitioning and
processing . . . . . . . . . . . . . . . 891--905
Dmitrii Avdiukhin and
Sergey Pupyrev and
Grigory Yaroslavtsev Multi-dimensional balanced graph
partitioning via projected gradient
descent . . . . . . . . . . . . . . . . 906--919
Lei Cao and
Yizhou Yan and
Samuel Madden and
Elke A. Rundensteiner and
Mathan Gopalsamy Efficient discovery of sequence outlier
patterns . . . . . . . . . . . . . . . . 920--932
Dmytro Bogatov and
George Kollios and
Leonid Reyzin A comparative evaluation of
order-revealing encryption schemes and
secure range-query protocols . . . . . . 933--947
Faisal Orakzai and
Toon Calders and
Torben Bach Pedersen $ k / 2$-hop: fast mining of convoy
patterns with effective pruning . . . . 948--960
Ji Sun and
Zeyuan Shang and
Guoliang Li and
Dong Deng and
Zhifeng Bao Balance-aware distributed string
similarity-based query processing system 961--974
Pingcheng Ruan and
Gang Chen and
Tien Tuan Anh Dinh and
Qian Lin and
Beng Chin Ooi and
Meihui Zhang Fine-grained, secure and efficient data
provenance on blockchain systems . . . . 975--988
Dalsu Choi and
Chang-Sup Park and
Yon Dohn Chung Progressive top-$k$ subarray query
processing in array databases . . . . . 989--1001
Moritz Hoffmann and
Andrea Lattuada and
Frank McSherry Megaphone: latency-conscious state
migration for distributed streaming
dataflows . . . . . . . . . . . . . . . 1002--1015
Nguyen Thanh Tam and
Matthias Weidlich and
Bolong Zheng and
Hongzhi Yin and
Nguyen Quoc Viet Hung and
Bela Stantic From anomaly detection to rumour
detection using data streams of social
platforms . . . . . . . . . . . . . . . 1016--1029
Peeyush Gupta and
Yin Li and
Sharad Mehrotra and
Nisha Panwar and
Shantanu Sharma and
Sumaya Almanee Obscure: information-theoretic oblivious
and verifiable aggregation queries . . . 1030--1043
Anshuman Dutt and
Chi Wang and
Azade Nazi and
Srikanth Kandula and
Vivek Narasayya and
Surajit Chaudhuri Selectivity estimation for range
predicates using lightweight models . . 1044--1057
Ye Yuan and
Xiang Lian and
Guoren Wang and
Yuliang Ma and
Yishu Wang Constrained shortest path query in a
large time-dependent graph . . . . . . . 1058--1070
Lingyang Chu and
Zhefeng Wang and
Jian Pei and
Yanyan Zhang and
Yu Yang and
Enhong Chen Finding theme communities from database
networks . . . . . . . . . . . . . . . . 1071--1084
James J. Pan and
Guoliang Li and
Juntao Hu Ridesharing: simulator, benchmark, and
evaluation . . . . . . . . . . . . . . . 1085--1098
Longbin Lai and
Zhu Qing and
Zhengyi Yang and
Xin Jin and
Zhengmin Lai and
Ran Wang and
Kongzhang Hao and
Xuemin Lin and
Lu Qin and
Wenjie Zhang and
Ying Zhang and
Zhengping Qian and
Jingren Zhou Distributed subgraph matching on timely
dataflow . . . . . . . . . . . . . . . . 1099--1112
Shi Qiao and
Adrian Nicoara and
Jin Sun and
Marc Friedman and
Hiren Patel and
Jaliya Ekanayake Hyper dimension shuffle: efficient data
repartition at petabyte scale in SCOPE 1113--1125
Graham Cormode and
Tejas Kulkarni and
Divesh Srivastava Answering range queries under local
differential privacy . . . . . . . . . . 1126--1138
Kai Wang and
Xuemin Lin and
Lu Qin and
Wenjie Zhang and
Ying Zhang Vertex priority based butterfly counting
for large-scale bipartite networks . . . 1139--1152
Yang Cao and
Wenfei Fan and
Tengfei Yuan Block as a value for SQL over NoSQL . . 1153--1166
Kanat Tangwongsan and
Martin Hirzel and
Scott Schneider Optimal and general out-of-order
sliding-window aggregation . . . . . . . 1167--1180
Bo Tang and
Kyriakos Mouratidis and
Man Lung Yiu and
Zhenyu Chen Creating top ranking options in the
continuous option and preference space 1181--1194
Hanchao Ma and
Morteza Alipourlangouri and
Yinghui Wu and
Fei Chiang and
Jiaxing Pi Ontology-based entity matching in
attributed graphs . . . . . . . . . . . 1195--1207
Lu Chen and
Yunjun Gao and
Ziquan Fang and
Xiaoye Miao and
Christian S. Jensen and
Chenjuan Guo Real-time distributed co-movement
pattern detection on streaming
trajectories . . . . . . . . . . . . . . 1208--1220
Jian Tan and
Tieying Zhang and
Feifei Li and
Jie Chen and
Qixing Zheng and
Ping Zhang and
Honglin Qiao and
Yue Shi and
Wei Cao and
Rui Zhang iBTune: individualized buffer tuning for
large-scale cloud databases . . . . . . 1221--1234
Michael Whittaker and
Nick Edmonds and
Sandeep Tata and
James B. Wendt and
Marc Najork Online template induction for
machine-generated emails . . . . . . . . 1235--1248
Yong Wang and
Guoliang Li and
Nan Tang Querying shortest paths on time
dependent road networks . . . . . . . . 1249--1261
Anna Fariha and
Alexandra Meliou Example-driven query intent discovery:
abductive reasoning using semantic
similarity . . . . . . . . . . . . . . . 1262--1275
Qi Zhou and
Joy Arulraj and
Shamkant Navathe and
William Harris and
Dong Xu Automated verification of query
equivalence using satisfiability modulo
theories . . . . . . . . . . . . . . . . 1276--1288
Pengfei Xu and
Jiaheng Lu Towards a unified framework for string
similarity joins . . . . . . . . . . . . 1289--1302
Susik Yoon and
Jae-Gil Lee and
Byung Suk Lee NETS: extremely fast outlier detection
from a data stream via set-based
processing . . . . . . . . . . . . . . . 1303--1315
Yi Lu and
Xiangyao Yu and
Samuel Madden STAR: scaling transactions through
asymmetric replication . . . . . . . . . 1316--1329
Yuliang Li and
Aaron Feng and
Jinfeng Li and
Saran Mumick and
Alon Halevy and
Vivian Li and
Wang-Chiew Tan Subjective databases . . . . . . . . . . 1330--1343
Xuguang Ren and
Junhu Wang and
Wook-Shin Han and
Jeffrey Xu Yu Fast and robust distributed subgraph
enumeration . . . . . . . . . . . . . . 1344--1356
Fangeheng Fu and
Jiawei Jiang and
Yingxia Shao and
Bin Cui An experimental evaluation of large
scale GBDT systems . . . . . . . . . . . 1357--1370
Ios Kotsogiannis and
Yuchao Tao and
Xi He and
Maryam Fanaeepour and
Ashwin Machanavajjhala and
Michael Hay and
Gerome Miklau PrivateSQL: a differentially private SQL
query engine . . . . . . . . . . . . . . 1371--1384
Mohammad Javad Amiri and
Divyakant Agrawal and
Amr El Abbadi CAPER: a cross-application permissioned
blockchain . . . . . . . . . . . . . . . 1385--1398
Alexandros Koliousis and
Pijika Watcharapichat and
Matthias Weidlich and
Luo Mai and
Paolo Costa and
Peter Pietzuch Crossbow: scaling deep learning with
small batch sizes on multi-GPU servers 1399--1412
Kaiyu Feng and
Gao Cong and
Christian S. Jensen and
Tao Guo Finding attribute-aware similar regions
for data analysis . . . . . . . . . . . 1414--1426
Dixin Tang and
Zechao Shang and
Aaron J. Elmore and
Sanjay Krishnan and
Michael J. Franklin Intermittent query processing . . . . . 1427--1441
Mihai Budiu and
Parikshit Gopalan and
Lalith Suresh and
Udi Wieder and
Han Kruiger and
Marcos K. Aguilera Hillview: a trillion-cell spreadsheet
for big data . . . . . . . . . . . . . . 1442--1457
Ziheng Wei and
Sebastian Link Embedded functional dependencies and
data-completeness tailored database
design . . . . . . . . . . . . . . . . . 1458--1470
Hua Fan and
Wojciech Golab Ocean Vista: gossip-based visibility
control for speedy geo-distributed
transactions . . . . . . . . . . . . . . 1471--1484
Xikui Wang and
Michael J. Carey An IDEA: an ingestion framework for data
enrichment in asterixDB . . . . . . . . 1485--1498
Alexey Karyakin and
Kenneth Salem DimmStore: memory power optimization for
database systems . . . . . . . . . . . . 1499--1512
Cong Yan and
Alvin Cheung Generating application-specific data
layouts for in-memory databases . . . . 1513--1525
Rihan Hai and
Christoph Quix Rewriting of plain SO tgds into nested
tgds . . . . . . . . . . . . . . . . . . 1526--1538
Senthil Nathan and
Chander Govindarajan and
Adarsh Saraf and
Manish Sethi and
Praveen Jayachandran Blockchain meets database: design and
implementation of a blockchain
relational database . . . . . . . . . . 1539--1552
Andreas Kunft and
Asterios Katsifodimos and
Sebastian Schelter and
Sebastian Breß and
Tilmann Rabl and
Volker Markl An intermediate representation for
optimizing machine learning pipelines 1553--1567
Yuanwei Fang and
Chen Zou and
Andrew A. Chien Accelerating raw data analysis with the
ACCORDA software and hardware
architecture . . . . . . . . . . . . . . 1568--1582
A. B. Siddique and
Ahmed Eldawy and
Vagelis Hristidis Comparing synopsis techniques for
approximate spatial data analysis . . . 1583--1596
Muhammad El-Hindi and
Carsten Binnig and
Arvind Arasu and
Donald Kossmann and
Ravi Ramamurthy BlockchainDB: a shared database on
blockchains . . . . . . . . . . . . . . 1597--1609
Ruoxi Jia and
David Dao and
Boxin Wang and
Frances Ann Hubis and
Nezihe Merve Gurel and
Bo Li and
Ce Zhang and
Costas Spanos and
Dawn Song Efficient task-specific data valuation
for nearest neighbor algorithms . . . . 1610--1623
Hemant Saxena and
Lukasz Golab and
Ihab F. Ilyas Distributed implementations of
dependency discovery algorithms . . . . 1624--1636
Erfan Zamanian and
Xiangyao Yu and
Michael Stonebraker and
Tim Kraska Rethinking database high availability
with RDMA networks . . . . . . . . . . . 1637--1650
Marco Bressan and
Stefano Leucci and
Alessandro Panconesi Motivo: fast motif counting via succinct
color coding and adaptive sampling . . . 1651--1663
Rishabh Poddar and
Tobias Boelter and
Raluca Ada Popa Arx: an encrypted database using
semantically secure encryption . . . . . 1664--1678
Junyang Gao and
Xian Li and
Yifan Ethan Xu and
Bunyamin Sisman and
Xin Luna Dong and
Jun Yang Efficient knowledge graph accuracy
evaluation . . . . . . . . . . . . . . . 1679--1691
Amine Mhedhbi and
Semih Salihoglu Optimizing subgraph queries by combining
binary and worst-case optimal joins . . 1692--1704
Ryan Marcus and
Parimarjan Negi and
Hongzi Mao and
Chi Zhang and
Mohammad Alizadeh and
Tim Kraska and
Olga Papaemmanouil and
Nesime Tatbul Neo: a learned query optimizer . . . . . 1705--1718
Yixiang Fang and
Kaiqiang Yu and
Reynold Cheng and
Laks V. S. Lakshmanan and
Xuemin Lin Efficient algorithms for densest
subgraph discovery . . . . . . . . . . . 1719--1732
Ryan Marcus and
Olga Papaemmanouil Plan-structured deep neural network
models for query performance prediction 1733--1746
Kun Ren and
Dennis Li and
Daniel J. Abadi SLOG: serializable, low-latency,
geo-replicated transactions . . . . . . 1747--1761
John Paparrizos and
Michael J. Franklin GRAIL: efficient time-series
representation learning . . . . . . . . 1762--1777
Guilherme Damasio and
Spencer Bryson and
Vincent Corvinelli and
Parke Godfrey and
Piotr Mierzejewski and
Jaroslaw Szlichta and
Calisto Zuzarte GALO: guided automated learning for
re-optimization . . . . . . . . . . . . 1778--1781
Yuanyuan Tian and
Wen Sun and
Sui Jun Tong and
En Liang Xu and
Mir Hamid Pirahesh and
Wei Zhao Synergistic graph and SQL analytics
inside IBM Db2 . . . . . . . . . . . . . 1782--1785
Xiaoou Ding and
Hongzhi Wang and
Jiaxuan Su and
Zijue Li and
Jianzhong Li and
Hong Gao Cleanits: a data cleaning system for
industrial time series . . . . . . . . . 1786--1789
Yipeng Zhang and
Zhifeng Bao and
Songsong Mo and
Yuchen Li and
Yanghao Zhou ITAA: an intelligent trajectory-driven
outdoor advertising deployment assistant 1790--1793
Kun Qian and
Lucian Popa and
Prithviraj Sen SystemER: a human-in-the-loop system for
explainable entity resolution . . . . . 1794--1797
Viet-Phi Huynh and
Paolo Papotti Buckle: evaluating fact checking
algorithms built on knowledge bases . . 1798--1801
Peng Gao and
Xusheng Xiao and
Zhichun Li and
Kangkook Jee and
Fengyuan Xu and
Sanjeev R. Kulkarni and
Prateek Mittal A query system for efficiently
investigating complex attack behaviors
for enterprise security . . . . . . . . 1802--1805
Zhengjie Miao and
Qitian Zeng and
Chenjie Li and
Boris Glavic and
Oliver Kennedy and
Sudeepa Roy CAPE: explaining outliers by
counterbalancing . . . . . . . . . . . . 1806--1809
Karthik Ramachandra and
Kwanghyun Park BlackMagic: automatic inlining of scalar
UDFs into SQL queries with Froid . . . . 1810--1813
Lukas Berg and
Tobias Ziegler and
Carsten Binnig and
Uwe Röhm ProgressiveDB: progressive data
analytics as a middleware . . . . . . . 1814--1817
Kaan Kara and
Zeke Wang and
Ce Zhang and
Gustavo Alonso doppioDB 2.0: hardware techniques for
improved integration of machine learning
into databases . . . . . . . . . . . . . 1818--1821
Cicero A. L. Pahins and
Behrooz Omidvar-Tehrani and
Sihem Amer-Yahia and
Valérie Siroux and
Jean-Louis Pepin and
Jean-Christian Borel and
João L. D. Comba COVIZ: a system for visual formation and
exploration of patient cohorts . . . . . 1822--1825
Martin Franke and
Ziad Sehili and
Erhard Rahm PRIMAT: a toolbox for fast
privacy-preserving matching . . . . . . 1826--1829
Ryan Marcus and
Chi Zhang and
Shuai Yu and
Geoffrey Kao and
Olga Papaemmanouil NashDB: fragmentation, replication, and
provisioning using economic methods . . 1830--1833
Ibrahim Sabek and
Mashaal Musleh and
Mohamed F. Mokbel Flash in action: scalable spatial data
analysis using Markov logic networks . . 1834--1837
Lucas Kuhring and
Zsolt István I can't believe it's not (only)
software!: bionic distributed storage
for Parquet files . . . . . . . . . . . 1838--1841
Hyewon Choi and
Erkang Zhu and
Arsala Bangash and
Renée J. Miller VISE: vehicle image search engine with
traffic camera . . . . . . . . . . . . . 1842--1845
Stephan Goldberg and
Tova Milo and
Slava Novgorodov and
Kathy Razmadze WiClean: a system for fixing Wikipedia
interlinks using revision history
patterns . . . . . . . . . . . . . . . . 1846--1849
Abhishek Roy and
Alekh Jindal and
Hiren Patel and
Ashit Gosalia and
Subru Krishnan and
Carlo Curino SparkCruise: handsfree computation reuse
in Spark . . . . . . . . . . . . . . . . 1850--1853
Sandeep Singh Sandha and
Wellington Cabrera and
Mohammed Al-Kateb and
Sanjay Nair and
Mani Srivastava In-database distributed machine
learning: demonstration using Teradata
SQL engine . . . . . . . . . . . . . . . 1854--1857
Zhao Li and
Xia Chen and
Xuming Pan and
Pengcheng Zou and
Yuchen Li and
Guoxian Yu SHOAL: large-scale hierarchical taxonomy
via graph-based query coalition in
e-commerce . . . . . . . . . . . . . . . 1858--1861
Min Xu and
Tianhao Wang and
Bolin Ding and
Jingren Zhou and
Cheng Hong and
Zhicong Huang DPSAaS: multi-dimensional data sharing
and analytics as services under local
differential privacy . . . . . . . . . . 1862--1865
Yang Cao and
Yonghui Xiao and
Li Xiong and
Liquan Bai and
Masatoshi Yoshikawa PriSTE: protecting spatiotemporal event
privacy in continuous location-based
services . . . . . . . . . . . . . . . . 1866--1869
Daniel Deutch and
Evgeny Marants and
Yuval Moskovitch Datalignment: ontology schema alignment
through datalog containment . . . . . . 1870--1873
Congcong Ge and
Yunjun Gao and
Xiaoye Miao and
Lu Chen and
Christian S. Jensen and
Ziyuan Zhu IHCS: an integrated hybrid cleaning
system . . . . . . . . . . . . . . . . . 1874--1877
Constantinos Costa and
Xiaoyu Ge and
Panos K. Chrysanthis CAPRIO: graph-based integration of
indoor and outdoor data for path
discovery . . . . . . . . . . . . . . . 1878--1881
Yingjun Wu and
Jia Yu and
Yuanyuan Tian and
Richard Sidle and
Ronald Barber HERMIT in action: succinct secondary
indexing mechanism via correlation
exploration . . . . . . . . . . . . . . 1882--1885
Julien Loudet and
Iulian Sandu-Popa and
Luc Bouganim DISPERS: securing highly distributed
queries on personal data management
systems . . . . . . . . . . . . . . . . 1886--1889
Adil Akhter and
Marios Fragkoulis and
Asterios Katsifodimos Stateful functions as a service in
action . . . . . . . . . . . . . . . . . 1890--1893
Allen Ordookhanians and
Xin Li and
Supun Nakandala and
Arun Kumar Demonstration of Krypton: optimized CNN
inference for occlusion-based deep CNN
explanations . . . . . . . . . . . . . . 1894--1897
Zhengjie Miao and
Andrew Lee and
Sudeepa Roy LensXPlain: visualizing and explaining
contributing subsets for aggregate query
answers . . . . . . . . . . . . . . . . 1898--1901
Yi Zhang and
Zachary G. Ives Juneau: data lake management for Jupyter 1902--1905
Sona Hasani and
Faezeh Ghaderi and
Shohedul Hasan and
Saravanan Thirumuruganathan and
Abolfazl Asudeh and
Nick Koudas and
Gautam Das ApproxML: efficient approximate ad-hoc
ML models through materialization and
reuse . . . . . . . . . . . . . . . . . 1906--1909
Grégory Essertel and
Ruby Y. Tahboub and
Fei Wang and
James Decker and
Tiark Rompf Flare & Lantern: efficiently swapping
horses midstream . . . . . . . . . . . . 1910--1913
Ruben Martins and
Jia Chen and
Yanju Chen and
Yu Feng and
Isil Dillig Trinity: an extensible synthesis
framework for data science . . . . . . . 1914--1917
Zhiqi Huang and
Ryan McKenna and
George Bissias and
Gerome Miklau and
Michael Hay and
Ashwin Machanavajjhala PSynDB: accurate and accessible private
data generation . . . . . . . . . . . . 1918--1921
Badrish Chandramouli and
Dong Xie and
Yinan Li and
Donald Kossmann FishStore: fast ingestion and indexing
of raw data . . . . . . . . . . . . . . 1922--1925
Yanlei Diao and
Pawe\l Guzewicz and
Ioana Manolescu and
Mirjana Mazuran Spade: a modular framework for
analytical exploration of RDF graphs . . 1926--1929
Joseph Vinish D'silva and
Florestan De Moor and
Bettina Kemme Making an RDBMS data scientist friendly:
advanced in-database interactive
analytics with visualization support . . 1930--1933
Khaled Zaouk and
Fei Song and
Chenghao Lyu and
Arnab Sinha and
Yanlei Diao and
Prashant Shenoy UDAO: a next-generation unified data
analytics optimizer . . . . . . . . . . 1934--1937
Saehan Jo and
Immanuel Trummer and
Weicheng Yu and
Xuezhi Wang and
Cong Yu and
Daniel Liu and
Niyati Mehta AggChecker: a fact-checking system for
text summaries of relational data sets 1938--1941
Hanzhang Wang and
Phuong Nguyen and
Jun Li and
Selcuk Kopru and
Gene Zhang and
Sanjeev Katariya and
Sami Ben-Romdhane GRANO: interactive graph-based root
cause analysis for cloud-native
distributed data platform . . . . . . . 1942--1945
Davide Frey and
Marc X. Makkes and
Pierre-Louis Roman and
François Ta\"\iani and
Spyros Voulgaris Dietcoin: hardening Bitcoin transaction
verification process for mobile devices 1946--1949
Samriddhi Singla and
Ahmed Eldawy and
Rami Alghamdi and
Mohamed F. Mokbel Raptor: large scale analysis of big
raster and vector data . . . . . . . . . 1950--1953
El Kindi Rezig and
Lei Cao and
Michael Stonebraker and
Giovanni Simonini and
Wenbo Tao and
Samuel Madden and
Mourad Ouzzani and
Nan Tang and
Ahmed K. Elmagarmid Data Civilizer 2.0: a holistic framework
for data preparation and analytics . . . 1954--1957
Leonhard F. Spiegelberg and
Tim Kraska Tuplex: robust, efficient analytics when
Python rules . . . . . . . . . . . . . . 1958--1961
Cedric Renggli and
Frances Ann Hubis and
Bojan Karlas and
Kevin Schawinski and
Wentao Wu and
Ce Zhang Ease.ml/ci and Ease.ml/meter in action:
towards data management for statistical
generalization . . . . . . . . . . . . . 1962--1965
Xueran Han and
Jun Chen and
Jiaheng Lu and
Yueguo Chen and
Xiaoyong Du PivotE: revealing and visualizing the
underlying entity structures for
exploration . . . . . . . . . . . . . . 1966--1969
Jiaheng Lu and
Yuxing Chen and
Herodotos Herodotou and
Shivnath Babu Speedup your analytics: automatic
parameter tuning for databases and big
data systems . . . . . . . . . . . . . . 1970--1973
Yu Meng and
Jiaxin Huang and
Jingbo Shang and
Jiawei Han TextCube: automated construction and
multidimensional exploration . . . . . . 1974--1977
Sihem Amer-Yahia and
Senjuti Basu Roy The ever evolving online labor market:
overview, challenges and opportunities 1978--1981
Ibrahim Sabek and
Mohamed F. Mokbel Machine learning meets big spatial data 1982--1985
Fatemeh Nargesian and
Erkang Zhu and
Renée J. Miller and
Ken Q. Pu and
Patricia C. Arocena Data lake management: challenges and
opportunities . . . . . . . . . . . . . 1986--1989
Laks V. S. Lakshmanan and
Michael Simpson and
Saravanan Thirumuruganathan Combating fake news: a data management
and mining perspective . . . . . . . . . 1990--1993
Nicolas Anciaux and
Luc Bouganim and
Philippe Pucheral and
Iulian Sandu Popa and
Guillaume Scerri Personal database security and trusted
execution environments: a tutorial at
the crossroads . . . . . . . . . . . . . 1994--1997
Stephan Kessler and
Jens Hoff and
Johann-Christoph Freytag SAP HANA goes private: from privacy
research to privacy aware enterprise
analytics . . . . . . . . . . . . . . . 1998--2009
Guilherme Damasio and
Vincent Corvinelli and
Parke Godfrey and
Piotr Mierzejewski and
Alex Mihaylov and
Jaroslaw Szlichta and
Calisto Zuzarte Guided automated learning for query
workload re-optimization . . . . . . . . 2010--2021
Biswapesh Chattopadhyay and
Priyam Dutta and
Weiran Liu and
Ott Tinn and
Andrew Mccormick and
Aniket Mokashi and
Paul Harvey and
Hector Gonzalez and
David Lomax and
Sagar Mittal and
Roee Ebenstein and
Nikita Mikhaylin and
Hung-ching Lee and
Xiaoyan Zhao and
Tony Xu and
Luis Perez and
Farhad Shahmohammadi and
Tran Bui and
Neil McKay and
Selcuk Aya and
Vera Lychagina and
Brett Elliott Procella: unifying serving and
analytical data at YouTube . . . . . . . 2022--2034
Wei Lu and
Zhanhao Zhao and
Xiaoyu Wang and
Haixiang Li and
Zhenmiao Zhang and
Zhiyu Shui and
Sheng Ye and
Anqun Pan and
Xiaoyong Du A lightweight and efficient temporal
database management system in TDSQL . . 2035--2046
Reza Sherkat and
Colin Florendo and
Mihnea Andrei and
Rolando Blanco and
Adrian Dragusanu and
Amit Pathak and
Pushkar Khadilkar and
Neeraj Kulkarni and
Christian Lemke and
Sebastian Seifert and
Sarika Iyer and
Sasikanth Gottapu and
Robert Schulze and
Chaitanya Gottipati and
Nirvik Basak and
Yanhong Wang and
Vivek Kandiyanallur and
Santosh Pendap and
Dheren Gala and
Rajesh Almeida and
Prasanta Ghosh Native store extension for SAP HANA . . 2047--2058
Chaoqun Zhan and
Maomeng Su and
Chuangxian Wei and
Xiaoqiang Peng and
Liang Lin and
Sheng Wang and
Zhe Chen and
Feifei Li and
Yue Pan and
Fang Zheng and
Chengliang Chai AnalyticDB: real-time OLAP database
system at Alibaba cloud . . . . . . . . 2059--2070
William Schultz and
Tess Avitabile and
Alyson Cabral Tunable consistency in MongoDB . . . . . 2071--2081
Shaosheng Cao and
XinXing Yang and
Cen Chen and
Jun Zhou and
Xiaolong Li and
Yuan Qi TitAnt: online real-time transaction
fraud detection in Ant Financial . . . . 2082--2093
Rong Zhu and
Kun Zhao and
Hongxia Yang and
Wei Lin and
Chang Zhou and
Baole Ai and
Yong Li and
Jingren Zhou AliGraph: a comprehensive graph neural
network platform . . . . . . . . . . . . 2094--2105
Zhimin Chen and
Yue Wang and
Vivek Narasayya and
Surajit Chaudhuri Customizable and scalable fuzzy join for
big data . . . . . . . . . . . . . . . . 2106--2117
Guoliang Li and
Xuanhe Zhou and
Shifu Li and
Bo Gao QTune: a query-aware database tuning
system with deep reinforcement learning 2118--2130
Srikanth Kandula and
Kukjin Lee and
Surajit Chaudhuri and
Marc Friedman Experiences with approximating queries
in Microsoft's production big-data
clusters . . . . . . . . . . . . . . . . 2131--2142
Panagiotis Antonopoulos and
Peter Byrne and
Wayne Chen and
Cristian Diaconu and
Raghavendra Thallam Kodandaramaih and
Hanuma Kodavalla and
Prashanth Purnananda and
Adrian-Leonard Radu and
Chaitanya Sreenivas Ravella and
Girish Mittur Venkataramanappa Constant time recovery in Azure SQL
database . . . . . . . . . . . . . . . . 2143--2154
Yuzhen Huang and
Yingjie Shi and
Zheng Zhong and
Yihui Feng and
James Cheng and
Jiwei Li and
Haochuan Fan and
Chao Li and
Tao Guan and
Jingren Zhou Yugong: geo-distributed data and job
placement at scale . . . . . . . . . . . 2155--2169
Junjay Tan and
Thanaa Ghanem and
Matthew Perron and
Xiangyao Yu and
Michael Stonebraker and
David DeWitt and
Marco Serafini and
Ashraf Aboulnaga and
Tim Kraska Choosing a cloud DBMS: architectures and
tradeoffs . . . . . . . . . . . . . . . 2170--2182
Jingtian Zhang and
Sai Wu and
Zeyuan Tan and
Gang Chen and
Zhushi Cheng and
Wei Cao and
Yusong Gao and
Xiaojie Feng S3: a scalable in-memory skip-list index
for key--value store . . . . . . . . . . 2183--2194
Charles Masson and
Jee E. Rim and
Homin K. Lee DDSketch: a fast and fully-mergeable
quantile sketch with relative-error
guarantees . . . . . . . . . . . . . . . 2195--2205
Qiang Long and
Wei Wang and
Jinfu Deng and
Song Liu and
Wenhao Huang and
Fangying Chen and
Sifan Liu A distributed system for large-scale
$n$-gram language models at Tencent . . 2206--2217
Kayhan Dursun and
Carsten Binnig and
Ugur Cetintemel and
Garret Swart and
Weiwei Gong A morsel-driven query execution engine
for heterogeneous multi-cores . . . . . 2218--2229
Lei Cao and
Wenbo Tao and
Sungtae An and
Jing Jin and
Yizhou Yan and
Xiaoyu Liu and
Wendong Ge and
Adam Sah and
Leilani Battle and
Jimeng Sun and
Remco Chang and
Brandon Westover and
Samuel Madden and
Michael Stonebraker Smile: a system to support machine
learning on EEG data at scale . . . . . 2230--2241
Alastair Green and
Paolo Guagliardo and
Leonid Libkin and
Tobias Lindaaker and
Victor Marsault and
Stefan Plantikow and
Martin Schuster and
Petra Selmer and
Hannes Voigt Updating graph databases with Cypher . . 2242--2254
Asya Kamsky Adapting TPC-C benchmark to measure
performance of multi-document
transactions in MongoDB . . . . . . . . 2254--2262
Feifei Li Cloud-native database systems at
Alibaba: opportunities and challenges 2263--2272
Alexander Boehm In-memory for the masses: enabling
cost-efficient deployments of in-memory
data management platforms for business
applications . . . . . . . . . . . . . . 2273--2275
Murtadha Al Hubail and
Ali Alsuliman and
Michael Blow and
Michael Carey and
Dmitry Lychagin and
Ian Maxon and
Till Westmann Couchbase analytics: NoETL for scalable
NoSQL data analysis . . . . . . . . . . 2275--2286
Adrian Colyer Performance in the spotlight . . . . . . 2287--2289
Azza Abouzied and
Daniel J. Abadi and
Kamil Bajda-Pawlikowski and
Avi Silberschatz Integration of large-scale data
processing systems and traditional
parallel database technology . . . . . . 2290--2299
Brian F. Cooper and
P. P. S. Narayan and
Raghu Ramakrishnan and
Utkarsh Srivastava and
Adam Silberstein and
Philip Bohannon and
Hans-Arno Jacobsen and
Nick Puz and
Daniel Weaver and
Ramana Yerneni PNUTS to Sherpa: lessons from Yahoo!'s
cloud database . . . . . . . . . . . . . 2300--2307
Wang-Chiew Tan What I probably did right and what I
think I could have done better . . . . . 2308--2308
Aditya Parameswaran Enabling data science for the majority 2309--2322
Theodoras Rekatsinas and
Sudeepa Roy and
Manasi Vartak and
Ce Zhang and
Neoklis Polyzotis Opportunities for data management
research in the era of horizontal AI/ML 2323--2323
Claude Barthels and
Ingo Müller and
Konstantin Taranov and
Gustavo Alonso and
Torsten Hoefler Strong consistency is not hard to get:
two-phase locking and two-phase commit
on thousands of cores . . . . . . . . . 2325--2338
Ziheng Wei and
Uwe Leck and
Sebastian Link Discovery and ranking of embedded
uniqueness constraints . . . . . . . . . 2339--2352
Lingyang Chu and
Yanyan Zhang and
Yu Yang and
Lanjun Wang and
Jian Pei Online density bursting subgraph
detection from temporal graphs . . . . . 2353--2365
Pedro Holanda and
Mark Raasveldt and
Stefan Manegold and
Hannes Mühleisen Progressive indexes: indexing for
interactive data analysis . . . . . . . 2366--2378
Masatoshi Hanai and
Toyotaro Suzumura and
Wen Jun Tan and
Elvis Liu and
Georgios Theodoropoulos and
Wentong Cai Distributed edge partitioning for
trillion-edge graphs . . . . . . . . . . 2379--2392
Manos Athanassoulis and
Kenneth S. Bògh and
Stratos Idreos Optimal column layout for hybrid
workloads . . . . . . . . . . . . . . . 2393--2407
Stavros Sintos and
Pankaj K. Agarwal and
Jun Yang Selecting data to clean for fact
checking: minimizing uncertainty vs.
maximizing surprise . . . . . . . . . . 2408--2421
Shuchi Chawla and
Shaleen Deep and
Paraschos Koutrisw and
Yifeng Teng Revenue maximization for query pricing 1--14
Jieming Shi and
Renchi Yang and
Tianyuan Jin and
Xiaokui Xiao and
Yin Yang Realtime top-$k$ Personalized PageRank
over large graphs on GPUs . . . . . . . 15--28
Sheng Wang and
Zhifeng Bao and
J. Shane Culpepper and
Timos Sellis and
Xiaolin Qin Fast large-scale trajectory clustering 29--42
Herodotos Herodotou and
Elena Kakoulli Automating distributed tiered storage
management in cluster computing . . . . 43--56
Jinho Jung and
Hong Hu and
Joy Arulraj and
Taesoo Kim and
Woonhak Kang APOLLO: automatic detection and
diagnosis of performance regressions in
database systems . . . . . . . . . . . . 57--70
Muhsen Owaida and
Gustavo Alonso and
Laura Fogliarini and
Anthony Hock-Koon and
Pierre-Etienne Melet Lowering the latency of data processing
pipelines through FPGA based hardware
acceleration . . . . . . . . . . . . . . 71--85
Shaofeng Cai and
Gang Chen and
Beng Chin Ooi and
Jinyang Gao Model slicing for supporting complex
analytics with elastic inference cost
and resource constraints . . . . . . . . 86--99
Maurice Herlihy and
Barbara Liskov and
Liuba Shrira Cross-chain deals and adversarial
commerce . . . . . . . . . . . . . . . . 100--113
Vasileios Zois and
Vassilis J. Tsotras and
Walid A. Najjar Efficient main-memory top-$k$ selection
for multicore architectures . . . . . . 114--127
Jan Böttcher and
Viktor Leis and
Thomas Neumann and
Alfons Kemper Scalable garbage collection for
in-memory MVCC systems . . . . . . . . . 128--141
Bohua Yang and
Dong Wen and
Lu Qin and
Ying Zhang and
Xubo Wang and
Xuemin Lin Fully dynamic depth-first search in
directed graphs . . . . . . . . . . . . 142--154
Chenhao Ma and
Reynold Cheng and
Laks V. S. Lakshmanan and
Tobias Grubenmann and
Yixiang Fang and
Xiaodong Li LINC: a motif counting algorithm for
uncertain graphs . . . . . . . . . . . . 155--168
Saba Eskandarian and
Matei Zaharia ObliDB: oblivious query processing for
secure databases . . . . . . . . . . . . 169--183
Chang Ge and
Ihab F. Ilyas and
Florian Kerschbaum Secure multi-party functional dependency
discovery . . . . . . . . . . . . . . . 184--196
Minmei Wang and
Mingxun Zhou and
Shouqian Shi and
Chen Qian Vacuum filters: more space-efficient and
faster replacement for Bloom and cuckoo
filters . . . . . . . . . . . . . . . . 197--210
Yihan Sun and
Guy E. Blelloch and
Wan Shen Lim and
Andrew Pavlo On supporting efficient snapshot
isolation for hybrid workloads with
multi-versioned indexes . . . . . . . . 211--225
Zhuhe Fang and
Beilei Zheng and
Chuliang Weng Interleaved multi-vectorizing . . . . . 226--238
Suraj Shetiya and
Abolfazl Asudeh and
Sadia Ahmed and
Gautam Das A unified optimization algorithm for
solving ``regret-minimizing
representative'' problems . . . . . . . 239--251
Srikanth Kandula and
Laurel Orr and
Surajit Chaudhuri Pushing data-induced predicates through
joins in big-data clusters . . . . . . . 252--265
Eduardo H. M. Pena and
Eduardo C. de Almeida and
Felix Naumann Discovery of approximate (and exact)
denial constraints . . . . . . . . . . . 266--278
Zongheng Yang and
Eric Liang and
Amog Kamsetty and
Chenggang Wu and
Yan Duan and
Xi Chen and
Pieter Abbeel and
Joseph M. Hellerstein and
Sanjay Krishnan and
Ion Stoica Deep unsupervised cardinality estimation 279--292
Zeyu Ding and
Yuxin Wang and
Danfeng Zhang and
Daniel Kifer Free gap information from the
differentially private sparse vector and
noisy max mechanisms . . . . . . . . . . 293--306
Ji Sun and
Guoliang Li An end-to-end learning-based cost
estimator . . . . . . . . . . . . . . . 307--319
Yuxiang Zeng and
Yongxin Tong and
Lei Chen Last-mile delivery made practical: an
efficient route planning framework with
theoretical guarantees . . . . . . . . . 320--333
Tiago R. Kepe and
Eduardo C. de Almeida and
Marco A. Z. Alves Database processing-in-memory: an
experimental study . . . . . . . . . . . 334--347
Jyoti Leeka and
Kaushik Rajan Incorporating super-operators in
big-data query optimizers . . . . . . . 348--361
Conggai Li and
Fan Zhang and
Ying Zhang and
Lu Qin and
Wenjie Zhang and
Xuemin Lin Efficient progressive minimum $k$-core
search . . . . . . . . . . . . . . . . . 362--375
Hang Zhu and
Zhihao Bai and
Jialin Li and
Ellis Michael and
Dan R. K. Ports and
Ion Stoica and
Xin Jin Harmonia: near-linear scalability for
replicated storage with in-network
conflict detection . . . . . . . . . . . 376--389
Brett Walenz and
Stavros Sintos and
Sudeepa Roy and
Jun Yang Learning to sample: counting with
complex queries . . . . . . . . . . . . 390--402
Karima Echihabi and
Kostas Zoumpatianos and
Themis Palpanas and
Houda Benbrahim Return of the Lernaean Hydra:
experimental evaluation of data series
approximate similarity search . . . . . 403--420
Xinjing Zhou and
Lidan Shou and
Ke Chen and
Wei Hu and
Gang Chen DPTree: differential indexing for
persistent memory . . . . . . . . . . . 421--434
Jeyhun Karimov and
Tilmann Rabl and
Volker Markl AJoin: ad-hoc stream joins at scale . . 435--448
Chen Luo and
Michael J. Carey On performance stability in LSM-based
storage systems . . . . . . . . . . . . 449--462
You Peng and
Ying Zhang and
Xuemin Lin and
Wenjie Zhang and
Lu Qin and
Jingren Zhou Towards bridging theory and practice:
hop-constrained $s$--$t$ simple path
enumeration . . . . . . . . . . . . . . 463--476
Yuhao Zhang and
Arun Kumar Panorama: a data system for unbounded
vocabulary querying over video . . . . . 477--491
Kartik Lakhotia and
Rajgopal Kannan and
Qing Dong and
Viktor Prasanna Planting trees for scalable and
efficient canonical hub labeling . . . . 492--505
Preethi Lahoti and
Krishna P. Gummadi and
Gerhard Weikum Operationalizing individual fairness
with pairwise fair representations . . . 506--518
Aarati Kakaraparthy and
Jignesh M. Patel and
Kwanghyun Park and
Brian P. Kroth Optimizing databases by learning hidden
parameters of solid state drives . . . . 519--532
Daniel Kang and
Peter Bailis and
Matei Zaharia BlazeIt: optimizing declarative
aggregation and limit queries for neural
network-based video analytics . . . . . 533--546
Dawei Huang and
Dong Young Yoon and
Seth Pettie and
Barzan Mozafari Joins on samples: a theoretical guide
for practitioners . . . . . . . . . . . 547--560
Georgios Karagiannis and
Immanuel Trummer and
Saehan Jo and
Shubham Khandelwal and
Xuezhi Wang and
Cong Yu Mining an ``anti-knowledge base'' from
Wikipedia updates with applications to
fact checking and beyond . . . . . . . . 561--573
Lucas Lersch and
Xiangpeng Hao and
Ismail Oukid and
Tianzheng Wang and
Thomas Willhalm Evaluating persistent memory range
indexes . . . . . . . . . . . . . . . . 574--587
Jonathan Goldstein and
Ahmed Abdelhamid and
Mike Barnett and
Sebastian Burckhardt and
Badrish Chandramouli and
Darren Gehring and
Niel Lebeck and
Christopher Meiklejohn and
Umar Farooq Minhas and
Ryan Newton and
Rahee Ghosh Peshawaria and
Tal Zaccai and
Irene Zhang A.M.B.R.O.S.I.A: providing performant
virtual resiliency for distributed
applications . . . . . . . . . . . . . . 588--601
Dian Ouyang and
Long Yuan and
Lu Qin and
Lijun Chang and
Ying Zhang and
Xuemin Lin Efficient shortest path index
maintenance on dynamic road networks
with theoretical guarantees . . . . . . 602--615
Elias Stehle and
Hans-Arno Jacobsen ParPaRaw: massively parallel parsing of
delimiter-separated raw data . . . . . . 616--628
Yihe Huang and
William Qian and
Eddie Kohler and
Barbara Liskov and
Liuba Shrira Opportunities for optimism in contended
main-memory multicore transactions . . . 629--642
Bolong Zheng and
Xi Zhao and
Lianggui Weng and
Nguyen Quoc Viet Hung and
Hang Liu and
Christian S. Jensen PM-LSH: a fast and accurate LSH
framework for high-dimensional
approximate NN search . . . . . . . . . 643--655
Yahui Sun and
Jun Luo and
Theodoros Lappas and
Xiaokui Xiao and
Bin Cui Hunting multiple bumps in graphs . . . . 656--669
Renchi Yang and
Jieming Shi and
Xiaokui Xiao and
Yin Yang and
Sourav S. Bhowmick Homogeneous network embedding for
massive graphs via reweighted
personalized PageRank . . . . . . . . . 670--683
Abdulhakim Qahtan and
Nan Tang and
Mourad Ouzzani and
Yang Cao and
Michael Stonebraker Pattern functional dependencies for data
cleaning . . . . . . . . . . . . . . . . 684--697
Joyce Jiyoung Whang and
Rundong Du and
Sangwon Jung and
Geon Lee and
Barry Drake and
Qingqing Liu and
Seonggoo Kang and
Haesun Park MEGA: multi-view semi-supervised
clustering of hypergraphs . . . . . . . 698--711
Ioannis Koumarelas and
Thorsten Papenbrock and
Felix Naumann MDedup: duplicate detection with
matching dependencies . . . . . . . . . 712--725
Van-Dang Tran and
Hiroyuki Kato and
Zhenjiang Hu Programmable view update strategies on
relations . . . . . . . . . . . . . . . 726--739
Avinash Kumar and
Zuozhi Wang and
Shengquan Ni and
Chen Li Amber: a debuggable dataflow system
based on the actor model . . . . . . . . 740--753
Filippo Schiavio and
Daniele Bonetta and
Walter Binder Dynamic speculative optimizations for
SQL compilation in Apache Spark . . . . 754--767
Mourad Khayati and
Alberto Lerner and
Zakhar Tymchenko and
Philippe Cudré-Mauroux Mind the gap: an experimental evaluation
of imputation of missing values
techniques in time series . . . . . . . 768--782
Mohammad Hasanzadeh Mofrad and
Rami Melhem and
Yousuf Ahmad and
Mohammad Hammoud Graphite: a NUMA-aware HPC system for
graph analytics based on a new MPI * X
parallelism model . . . . . . . . . . . 783--797
Naama Boer and
Daniel Deutch and
Nave Frost and
Tova Milo Personal insights for altering decisions
of tree-based ensembles over time . . . 798--811
You Peng and
Ying Zhang and
Xuemin Lin and
Lu Qin and
Wenjie Zhang Answering billion-scale
label-constrained reachability queries
within microsecond . . . . . . . . . . . 812--825
Ruihong Huang and
Shaoxu Song and
Yunsu Lee and
Jungho Park and
Soo-Hyung Kim and
Sungmin Yi Effective and efficient retrieval of
structured entities . . . . . . . . . . 826--839
Utku Sirin and
Anastasia Ailamaki Micro-architectural analysis of OLAP:
limitations and opportunities . . . . . 840--853
Yixiang Fang and
Yixing Yang and
Wenjie Zhang and
Xuemin Lin and
Xin Cao Effective and efficient community search
over large heterogeneous information
networks . . . . . . . . . . . . . . . . 854--867
Suyash Gupta and
Sajjad Rahnama and
Jelle Hellings and
Mohammad Sadoghi ResilientDB: global scale resilient
blockchain fabric . . . . . . . . . . . 868--883
Henning Funke and
Jens Teubner Data-parallel query processing on
non-uniform data . . . . . . . . . . . . 884--897
Zonghao Feng and
Qiong Luo Evaluating memory-hard proof-of-work
algorithms on three processors . . . . . 898--911
Seokki Lee and
Bertram Ludäscher and
Boris Glavic Approximate summaries for why and
why-not provenance . . . . . . . . . . . 912--924
Hao Jiang and
Chunwei Liu and
Qi Jin and
John Paparrizos and
Aaron J. Elmore PIDS: attribute decomposition for
improved compression and query
performance in columnar storage . . . . 925--938
Abolfazl Asudeh and
H. V. Jagadish and
You (Will) Wu and
Cong Yu On detecting cherry-picked trendlines 939--952
Masayo Ota and
Heiko Müller and
Juliana Freire and
Divesh Srivastava Data-driven domain discovery for
structured datasets . . . . . . . . . . 953--965
Jieming Shi and
Tianyuan Jin and
Renchi Yang and
Xiaokui Xiao and
Yin Yang Realtime index-free single source
SimRank processing on web-scale graphs 966--978
Jiachuan Wang and
Peng Cheng and
Libin Zheng and
Chao Feng and
Lei Chen and
Xuemin Lin and
Zheng Wang Demand-aware route planning for shared
mobility services . . . . . . . . . . . 979--991
Benjamin Hilprecht and
Andreas Schmidt and
Moritz Kulessa and
Alejandro Molina and
Kristian Kersting and
Carsten Binnig DeepDB: learn from data, not from
queries! . . . . . . . . . . . . . . . . 992--1005
Yuepeng Wang and
Rushi Shah and
Abby Criswell and
Rong Pan and
Isil Dillig Data migration using datalog program
synthesis . . . . . . . . . . . . . . . 1006--1019
Xiaowei Zhu and
Guanyu Feng and
Marco Serafini and
Xiaosong Ma and
Jiping Yu and
Lei Xie and
Ashraf Aboulnaga and
Wenguang Chen LiveGraph: a transactional graph storage
system with purely sequential adjacency
list scans . . . . . . . . . . . . . . . 1020--1034
Xueling Lin and
Haoyang Li and
Hao Xin and
Zijian Li and
Lei Chen KBPearl: a knowledge base population
system supported by joint entity and
relation linking . . . . . . . . . . . . 1035--1049
Tianyi Li and
Ruikai Huang and
Lu Chen and
Christian S. Jensen and
Torben Bach Pedersen Compression of uncertain trajectories in
road networks . . . . . . . . . . . . . 1050--1063
Supreeth Shastri and
Vinay Banakar and
Melissa Wasserman and
Arun Kumar and
Vijay Chidambaram Understanding and benchmarking the
impact of GDPR on database systems . . . 1064--1077
Jihang Liu and
Shimin Chen and
Lujun Wang LB+Trees: optimizing persistent index
performance on $3$DXPoint memory . . . . 1078--1090
Lucas Lersch and
Ivan Schreter and
Ismail Oukid and
Wolfgang Lehner Enabling low tail latency on multicore
key-value stores . . . . . . . . . . . . 1091--1104
Chunbin Lin and
Etienne Boursier and
Yannis Papakonstantinou Plato: approximate analytics over
compressed time series with tight
deterministic error guarantees . . . . . 1105--1118
Prasun Gera and
Hyojong Kim and
Piyush Sao and
Hyesoon Kim and
David Bader Traversing large graphs on GPUs with
unified memory . . . . . . . . . . . . . 1119--1133
Haoyue Ping and
Julia Stoyanovich and
Benny Kimelfeld Supporting hard queries over
probabilistic preferences . . . . . . . 1134--1146
Baotong Lu and
Xiangpeng Hao and
Tianzheng Wang and
Eric Lo Dash: scalable hashing on persistent
memory . . . . . . . . . . . . . . . . . 1147--1161
Paolo Ferragina and
Giorgio Vinciguerra The PGM-index: a fully-dynamic
compressed learned index with provable
worst-case bounds . . . . . . . . . . . 1162--1175
Minghua Ma and
Zheng Yin and
Shenglin Zhang and
Sheng Wang and
Christopher Zheng and
Xinhao Jiang and
Hanwen Hu and
Cheng Luo and
Yilin Li and
Nengjun Qiu and
Feifei Li and
Changcheng Chen and
Dan Pei Diagnosing root causes of intermittent
slow queries in cloud databases . . . . 1176--1189
Xuhao Chen and
Roshan Dathathri and
Gurbinder Gill and
Keshav Pingali Pangolin: an efficient and flexible
graph mining system on CPU and GPU . . . 1190--1205
Markus Dreseler and
Martin Boissier and
Tilmann Rabl and
Matthias Uflacker Quantifying TPC-H choke points and their
optimizations . . . . . . . . . . . . . 1206--1220
Yuanbing Li and
Xian Wu and
Yifei Jin and
Jian Li and
Guoliang Li Efficient algorithms for crowd-aided
categorization . . . . . . . . . . . . . 1221--1233
Shaowei Wang and
Yuqiu Qian and
Jiachun Du and
Wei Yang and
Liusheng Huang and
Hongli Xu Set-valued data publication with local
privacy: tight error bounds and
efficient mechanisms . . . . . . . . . . 1234--1247
Leonidas Fegaras and
Hasanuzzaman Noor Translation of array-based loops to
distributed data-parallel programs . . . 1248--1260
Wenfei Fan and
Muyang Liu and
Chao Tian and
Ruiqi Xu and
Jingren Zhou Incrementalization of graph partitioning
algorithms . . . . . . . . . . . . . . . 1261--1274
Shao-Heng Ko and
Hsu-Chao Lai and
Hong-Han Shuai and
Wang-Chien Lee and
Philip S. Yu and
De-Nian Yang Optimizing item and subgroup
configurations for social-aware VR
shopping . . . . . . . . . . . . . . . . 1275--1289
Savvas Savvides and
Darshika Khandelwal and
Patrick Eugster Efficient confidentiality-preserving
data analytics over symmetrically
encrypted datasets . . . . . . . . . . . 1290--1303
Gurbinder Gill and
Roshan Dathathri and
Loc Hoang and
Ramesh Peri and
Keshav Pingali Single machine graph analytics on
massive datasets using Intel Optane DC
Persistent Memory . . . . . . . . . . . 1304--1318
Victor Zakhary and
Divyakant Agrawal and
Amr El Abbadi Atomic commitment across blockchains . . 1319--1331
Ajit Mathew and
Changwoo Min HydraList: a scalable in-memory index
using asynchronous updates and partial
replication . . . . . . . . . . . . . . 1332--1345
A. Jesse Jiryu Davis and
Max Hirschhorn and
Judah Schvimer Extreme modelling in practice . . . . . 1346--1358
Bingqing Lyu and
Lu Qin and
Xuemin Lin and
Ying Zhang and
Zhengping Qian and
Jingren Zhou Maximum biclique search at billion scale 1359--1372
Nadiia Chepurko and
Ryan Marcus and
Emanuel Zgraggen and
Raul Castro Fernandez and
Tim Kraska and
David Karger ARDA: automatic relational data
augmentation for machine learning . . . 1373--1387
Wail Y. Alkowaileet and
Sattam Alsubaiee and
Michael J. Carey An LSM-based tuple compaction framework
for Apache AsterixDB . . . . . . . . . . 1388--1400
Roee Shraga and
Avigdor Gal and
Haggai Roitman ADnEV: cross-domain schema matching
using deep similarity matrix adjustment
and evaluation . . . . . . . . . . . . . 1401--1415
Xuanhe Zhou and
Ji Sun and
Guoliang Li and
Jianhua Feng Query performance prediction for
concurrent queries using graph embedding 1416--1428
Ahmed Alquraan and
Alex Kogan and
Virendra J. Marathe and
Samer Al-Kiswany Scalable, near-zero loss disaster
recovery for distributed data stores . . 1429--1442
Kejing Lu and
Hongya Wang and
Wei Wang and
Mineichi Kudo VHP: approximate nearest neighbor search
via virtual hypersphere partitioning . . 1443--1455
Hyunjoon Kim and
Seunghwan Min and
Kunsoo Park and
Xuemin Lin and
Seok-Hee Hong and
Wook-Shin Han IDAR: fast supergraph search using DAG
integration . . . . . . . . . . . . . . 1456--1468
Mariia Seleznova and
Behrooz Omidvar-Tehrani and
Sihem Amer-Yahia and
Eric Simon Guided exploration of user groups . . . 1469--1482
Long Gong and
Huayi Wang and
Mitsunori Ogihara and
Jun Xu iDEC: indexable distance estimating
codes for approximate nearest neighbor
search . . . . . . . . . . . . . . . . . 1483--1497
Song Bian and
Qintian Guo and
Sibo Wang and
Jeffrey Xu Yu Efficient algorithms for budgeted
influence maximization on massive social
networks . . . . . . . . . . . . . . . . 1498--1510
Arneish Prateek and
Arijit Khan and
Akshit Goyal and
Sayan Ranu Mining Top-$k$ pairs of correlated
subgraphs in a large network . . . . . . 1511--1524
Yehonatan Buchnik and
Roy Friedman FireLedger: a high throughput blockchain
consensus protocol . . . . . . . . . . . 1525--1539
Kefei Wang and
Jian Liu and
Feng Chen Put an elephant into a fridge:
optimizing cache efficiency for
in-memory key--value stores . . . . . . 1540--1554
Simon Aagaard Pedersen and
Bin Yang and
Christian S. Jensen Anytime stochastic routing with hybrid
learning . . . . . . . . . . . . . . . . 1555--1567
Qizhen Zhang and
Yifan Cai and
Xinyi Chen and
Sebastian Angel and
Ang Chen and
Vincent Liu and
Boon Thau Loo Understanding the effect of data center
resource disaggregation on production
DBMSs . . . . . . . . . . . . . . . . . 1568--1581
Nikolaos Tziavelis and
Deepak Ajwani and
Wolfgang Gatterbauer and
Mirek Riedewald and
Xiaofeng Yang Optimal algorithms for ranked
enumeration of answers to full
conjunctive queries . . . . . . . . . . 1582--1597
Laxman Dhulipala and
Charles McGuffey and
Hongbo Kang and
Yan Gu and
Guy E. Blelloch and
Phillip B. Gibbons and
Julian Shun Sage: parallel semi-asymmetric graph
algorithms for NVRAMs . . . . . . . . . 1598--1613
Yuqing Zhu and
Jing Tang and
Xueyan Tang Pricing influential nodes in online
social networks . . . . . . . . . . . . 1614--1627
Bintao Sun and
Maximilien Danisch and
T-H. Hubert Chan and
Mauro Sozio KClist++: a simple algorithm for finding
$k$-clique densest subgraphs in large
graphs . . . . . . . . . . . . . . . . . 1628--1640
Kevin Wellenzohn and
Michael H. Böhlen and
Sven Helmer Dynamic interleaving of content and
structure for robust indexing of
semi-structured hierarchical data . . . 1641--1653
Shubhangi Agarwal and
Sourav Dutta and
Arnab Bhattacharya ChiSeL: graph similarity search using
chi-squared statistics in large
probabilistic graphs . . . . . . . . . . 1654--1668
Zijing Tan and
Ai Ran and
Shuai Ma and
Sheng Qin Fast incremental discovery of pointwise
order dependencies . . . . . . . . . . . 1669--1681
Ester Livshits and
Alireza Heidari and
Ihab F. Ilyas and
Benny Kimelfeld Approximate denial constraints . . . . . 1682--1695
Robin Rehrmann and
Carsten Binnig and
Alexander Böhm and
Kihong Kim and
Wolfgang Lehner Sharing opportunities for OLTP workloads
in different isolation levels . . . . . 1696--1708
Stefan Neumann and
Pauli Miettinen Biclustering and Boolean matrix
factorization in data streams . . . . . 1709--1722
Xun Jian and
Yue Wang and
Lei Chen Effective and efficient relational
community detection and search in large
dynamic heterogeneous information
networks . . . . . . . . . . . . . . . . 1723--1736
Hyeonji Kim and
Byeong-Hoon So and
Wook-Shin Han and
Hongrae Lee Natural language to SQL: where are we
today? . . . . . . . . . . . . . . . . . 1737--1750
Yulin Che and
Zhuohang Lai and
Shixuan Sun and
Yue Wang and
Qiong Luo Accelerating truss decomposition on
heterogeneous processors . . . . . . . . 1751--1764
Rohan Mukherjee and
Swarat Chaudhuri and
Chris Jermaine Searching a database of source codes
using contextualized code search . . . . 1765--1778
Yan Li and
Tingjian Ge and
Cindy Chen Data stream event prediction based on
timing knowledge and state transitions 1779--1792
Frank McSherry and
Andrea Lattuada and
Malte Schwarzkopf and
Timothy Roscoe Shared arrangements: practical
inter-query sharing for streaming
dataflows . . . . . . . . . . . . . . . 1793--1806
Peeyush Gupta and
Michael J. Carey and
Sharad Mehrotra and
oberto Yus SmartBench: a benchmark for data
management in smart spaces . . . . . . . 1807--1820
Paul Boniol and
Themis Palpanas Series2Graph: graph-based subsequence
anomaly detection for time series . . . 1821--1834
Dan Zhang and
Madelon Hulsebos and
Yoshihiko Suhara and
Çagatay Demiralp and
Jinfeng Li and
Wang-Chiew Tan Sato: contextual semantic type detection
in tables . . . . . . . . . . . . . . . 1835--1848
Qijian He and
Wei Yang and
Bingren Chen and
Yangyang Geng and
Liusheng Huang TransNet: training privacy-preserving
neural network over transformed layer 1849--1862
Wenfei Fan and
Ruochun Jin and
Muyang Liu and
Ping Lu and
Chao Tian and
Jingren Zhou Capturing associations in graphs . . . . 1863--1876
Alexander Renz-Wieland and
Rainer Gemulla and
Steffen Zeuch and
Volker Markl Dynamic parameter allocation in
parameter servers . . . . . . . . . . . 1877--1890
Michael Freitag and
Maximilian Bandle and
Tobias Schmidt and
Alfons Kemper and
Thomas Neumann Adopting worst-case optimal joins in
relational database systems . . . . . . 1891--1904
Ryan McKenna and
Raj Kumar Maity and
Arya Mazumdar and
Gerome Miklau A workload-adaptive mechanism for linear
queries under local differential privacy 1905--1918
Yisu Remy Wang and
Shana Hutchison and
Jonathan Leang and
Bill Howe and
Dan Suciu SPORES: sum-product optimization via
relational equality saturation for large
scale linear algebra . . . . . . . . . . 1919--1932
Raul Castro Fernandez and
Pranav Subramaniam and
Michael J. Franklin Data market platforms: trading data
assets to solve data problems . . . . . 1933--1947
Mohammad Mahdavi and
Ziawasch Abedjan Baran: effective error correction via a
unified context representation and
transfer learning . . . . . . . . . . . 1948--1961
Ju Fan and
Junyou Chen and
Tongyu Liu and
Yuwei Shen and
Guoliang Li and
Xiaoyong Du Relational data synthesis using
generative adversarial networks: a
design space exploration . . . . . . . . 1962--1975
Lei Yang and
Hong Wu and
Tieying Zhang and
Xuntao Cheng and
Feifei Li and
Lei Zou and
Yujie Wang and
Rongyao Chen and
Jianying Wang and
Gui Huang Leaper: a learned prefetcher for cache
invalidation in LSM-tree based storage
engines . . . . . . . . . . . . . . . . 1976--1989
Daniel Kang and
Edward Gan and
Peter Bailis and
Tatsunori Hashimoto and
Matei Zaharia Approximate selection with guarantees
using proxies . . . . . . . . . . . . . 1990--2003
Minji Kang and
Soyee Choi and
Gihwan Oh and
Sang-Won Lee 2R: efficiently isolating cold pages in
flash storages . . . . . . . . . . . . . 2004--2017
Bahar Ghadiri Bashardoost and
Renée J. Miller and
Kelly Lyons and
Fatemeh Nargesian Knowledge translation . . . . . . . . . 2018--2032
Devin Petersohn and
Stephen Macke and
Doris Xin and
William Ma and
Doris Lee and
Xiangxi Mo and
Joseph E. Gonzalez and
Joseph M. Hellerstein and
Anthony D. Joseph and
Aditya Parameswaran Towards scalable dataframe systems . . . 2033--2046
Yi Lu and
Xiangyao Yu and
Lei Cao and
Samuel Madden Aria: a fast and practical deterministic
OLTP database . . . . . . . . . . . . . 2047--2060
Dongjing Miao and
Zhipeng Cai and
Jianzhong Li and
Xiangyu Gao and
Xianmin Liu The computation of optimal subset
repairs . . . . . . . . . . . . . . . . 2061--2074
Christina Christodoulakis and
Eric B. Munson and
Moshe Gabel and
Angela Demke Brown and
Renée J. Miller Pytheas: pattern-based table discovery
in CSV files . . . . . . . . . . . . . . 2075--2089
Yuncheng Wu and
Shaofeng Cai and
Xiaokui Xiao and
Gang Chen and
Beng Chin Ooi Privacy preserving vertical federated
learning for tree-based models . . . . . 2090--2103
Ahmed Al-Baghdadi and
Xiang Lian Topic-based community search over
spatial-social networks . . . . . . . . 2104--2117
Manuel Fritz and
Michael Behringer and
Holger Schwarz LOG-Means: efficiently estimating the
number of clusters in large datasets . . 2118--2131
Simeon Krastnikov and
Florian Kerschbaum and
Douglas Stebila Efficient oblivious database joins . . . 2132--2145
Ousmane Issa and
Angela Bonifati and
Farouk Toumani Evaluating top-$k$ queries with
inconsistency degrees . . . . . . . . . 2146--2158
Supun Nakandala and
Yuhao Zhang and
Arun Kumar Cerebro: a data system for optimized
deep learning model selection . . . . . 2159--2173
Edward Gan and
Peter Bailis and
Moses Charikar CoopStore: optimizing precomputed
summaries for aggregation . . . . . . . 2174--2187
Satoshi Koide and
Chuan Xiao and
Yoshiharu Ishikawa Fast subtrajectory similarity search in
road networks under weighted edit
distance constraints . . . . . . . . . . 2188--2201
Yu Liu and
Lei Zou and
Qian Ge and
Zhewei Wei SimTab: accuracy-guaranteed SimRank
queries through tighter confidence
bounds and multi-armed bandits . . . . . 2202--2214
Anshuman Dutt and
Chi Wang and
Vivek Narasayya and
Surajit Chaudhuri Efficiently approximating selectivity
functions using low overhead regression
models . . . . . . . . . . . . . . . . . 2215--2228
Yin Lin and
Yifan Guan and
Abolfazl Asudeh and
H. V. Jagadish Identifying insufficient data coverage
in databases with multiple relations . . 2229--2242
Lingxiao Li and
Muhammad Aamir Cheema and
Mohammed Eunus Ali and
Hua Lu and
David Taniar Continuously monitoring alternative
shortest paths on road networks . . . . 2243--2255
Geon Lee and
Jihoon Ko and
Kijung Shin Hypergraph motifs: concepts, algorithms,
and discoveries . . . . . . . . . . . . 2256--2269
Johann Birnick and
Thomas Bläsius and
Tobias Friedrich and
Felix Naumann and
Thorsten Papenbrock and
Martin Schirneck Hitting set enumeration with partial
information for unique column
combination discovery . . . . . . . . . 2270--2283
Yue Chen and
Zhida Chen and
Gao Cong and
Ahmed R. Mahmood and
Walid G. Aref SSTD: a distributed system on streaming
spatio-textual data . . . . . . . . . . 2284--2296
Haneen Mohammed and
Ziyun Wei and
Eugene Wu and
Ravi Netravali Continuous prefetch for interactive data
applications . . . . . . . . . . . . . . 2297--2311
Zheng Wang and
Cheng Long and
Gao Cong and
Yiding Liu Efficient and effective similar
subtrajectory search with deep
reinforcement learning . . . . . . . . . 2312--2325
Zequn Sun and
Qingheng Zhang and
Wei Hu and
Chengming Wang and
Muhao Chen and
Farahnaz Akrami and
Chengkai Li A benchmarking study of embedding-based
entity alignment for knowledge graphs 2326--2340
Jianzhong Qi and
Guanli Liu and
Christian S. Jensen and
Lars Kulik Effectively learning spatial indices . . 2341--2354
Qiyu Liu and
Libin Zheng and
Yanyan Shen and
Lei Chen Stable learned bloom filters for data
streams . . . . . . . . . . . . . . . . 2355--2367
Zhongjun Jin and
Yeye He and
Surajit Chauduri Auto-transform: learning-to-transform by
patterns . . . . . . . . . . . . . . . . 2368--2381
Jan Kossmann and
Stefan Halfpap and
Marcel Jankrift and
Rainer Schlosser Magic mirror in my hand, which is the
best in the land?: an experimental
evaluation of index selection algorithms 2382--2395
Patrick Damme and
Annett Ungethüm and
Johannes Pietrzyk and
Alexander Krause and
Dirk Habich and
Wolfgang Lehner MorphStore: analytical query engine with
a holistic compression-enabled
processing model . . . . . . . . . . . . 2396--2410
Panos Parchas and
Yonatan Naamad and
Peter Van Bouwel and
Christos Faloutsos and
Michalis Petropoulos Fast and effective distribution-key
recommendation for Amazon Redshift . . . 2411--2423
Primal Pappachan and
Roberto Yus and
Sharad Mehrotra and
Johann-Christoph Freytag Sieve: a middleware approach to scalable
access control for database management
systems . . . . . . . . . . . . . . . . 2424--2437
Vikram Sreekanti and
Chenggang Wu and
Xiayue Charles Lin and
Johann Schleier-Smith and
Joseph E. Gonzalez and
Joseph M. Hellerstein and
Alexey Tumanov Cloudburst: stateful
functions-as-a-service . . . . . . . . . 2438--2452
Abhijit Suprem and
Joy Arulraj and
Calton Pu and
Joao Ferreira ODIN: automated drift detection and
recovery in video analytics . . . . . . 2453--2465
Chengzhi Piao and
Weiguo Zheng and
Yu Rong and
Hong Cheng Maximizing the reduction ability for
near-maximum independent set computation 2466--2478
Zhao Chen and
Peng Cheng and
Lei Chen and
Xuemin Lin and
Cyrus Shahabi Fair task assignment in spatial
crowdsourcing . . . . . . . . . . . . . 2479--2492
Hao Zhang and
Jeffrey Xu Yu and
Yikai Zhang and
Kangfei Zhao and
Hong Cheng Distributed subgraph counting: a general
approach . . . . . . . . . . . . . . . . 2493--2507
Georgios Karagiannis and
Mohammed Saeed and
Paolo Papotti and
Immanuel Trummer Scrutinizer: a mixed-initiative approach
to large-scale, data-driven claim
verification . . . . . . . . . . . . . . 2508--2521
Evgeny Krivosheev and
Siarhei Bykau and
Fabio Casati and
Sunil Prabhakar Detecting and preventing confused labels
in crowdsourced data . . . . . . . . . . 2522--2535
Rong-Hua Li and
Sen Gao and
Lu Qin and
Guoren Wang and
Weihua Yang and
Jeffrey Xu Yu Ordering heuristics for $k$-clique
listing . . . . . . . . . . . . . . . . 2536--2548
Jinfeng Li and
Yuliang Li and
Xiaolan Wang and
Wang-Chiew Tan Deep or simple models for semantic
tagging?: it depends on your data . . . 2549--2562
Muhammad Bilal and
Marco Serafini and
Marco Canini and
Rodrigo Rodrigues Do the best cloud configurations grow on
trees?: an experimental evaluation of
black box algorithms for optimizing
cloud workloads . . . . . . . . . . . . 2563--2575
Alexander Zhou and
Yue Wang and
Lei Chen Finding large diverse communities on
networks: the edge maximum $ k*$-partite
clique . . . . . . . . . . . . . . . . . 2576--2589
Albert van der Linde and
João Leitão and
Nuno Preguiça Practical client-side replication: weak
consistency semantics for insecure
settings . . . . . . . . . . . . . . . . 2590--2605
Kexin Rong and
Yao Lu and
Peter Bailis and
Srikanth Kandula and
Philip Levis Approximate partition selection for
big-data workloads using summary
statistics . . . . . . . . . . . . . . . 2606--2619
Christian Winter and
Tobias Schmidt and
Thomas Neumann and
Alfons Kemper Meet me halfway: split maintenance of
continuous views . . . . . . . . . . . . 2620--2633
Youmin Chen and
Youyou Lu and
Kedong Fang and
Qing Wang and
Jiwu Shu uTree: a persistent B+-tree with low
tail latency . . . . . . . . . . . . . . 2634--2648
Peter Boncz and
Thomas Neumann and
Viktor Leis FSST: fast random access string
compression . . . . . . . . . . . . . . 2649--2661
Lukas Vogel and
Viktor Leis and
Alexander van Renen and
Thomas Neumann and
Satoshi Imamura and
Alfons Kemper Mosaic: a budget-conscious storage
engine for relational database systems 2662--2675
Jovan Blanusa and
Radu Stoica and
Paolo Ienne and
Kubilay Atasu Manycore clique enumeration with fast
set intersections . . . . . . . . . . . 2676--2690
Johes Bater and
Yongjoo Park and
Xi He and
Xiao Wang and
Jennie Rogers SAQE: practical privacy-preserving
approximate query processing for data
federations . . . . . . . . . . . . . . 2691--2705
Caitlin Kuhlman and
Elke Rundensteiner Rank aggregation algorithms for fair
consensus . . . . . . . . . . . . . . . 2706--2719
Brad Glasbergen and
Michael Abebe and
Khuzaima Daudjee and
Amit Levi Sentinel: universal analysis and insight
for data systems . . . . . . . . . . . . 2720--2733
Jingzhi Fang and
Yanyan Shen and
Yue Wang and
Lei Chen Optimizing DNN computation graph using
graph substitutions . . . . . . . . . . 2734--2746
Jaydeep Sen and
Chuan Lei and
Abdul Quamar and
Fatma Özcan and
Vasilis Efthymiou and
Ayushi Dalmia and
Greg Stager and
Ashish Mittal and
Diptikalyan Saha and
Karthik Sankaranarayanan ATHENA++: natural language querying for
complex nested SQL queries . . . . . . . 2747--2759
Min Xu and
Bolin Ding and
Tianhao Wang and
Jingren Zhou Collecting and analyzing data jointly
from multiple services under local
differential privacy . . . . . . . . . . 2760--2772
Yifan Gan and
Xueyuan Ren and
Drew Ripberger and
Spyros Blanas and
Yang Wang IsoDiff: debugging anomalies caused by
weak isolation . . . . . . . . . . . . . 2773--2786
Marina Barsky and
Jonathan Gabor and
Mariano P. Consens and
Alex Thomo Suffix rank: a new scalable algorithm
for indexing large string collections 2787--2800
Jayant R. Haritsa Robust query processing: mission
possible . . . . . . . . . . . . . . . . 3425--3428
Steven Euijong Whang and
Jae-Gil Lee Data collection and quality challenges
for deep learning . . . . . . . . . . . 3429--3432
Douglas Burdick and
Marina Danilevsky and
Alexandre V Evfimievski and
Yannis Katsis and
Nancy Wang Table extraction and understanding for
scientific and enterprise applications 3433--3436
Jianbin Qin and
Wei Wang and
Chuan Xiao and
Ying Zhang Similarity query processing for
high-dimensional data . . . . . . . . . 3437--3440
Suyash Gupta and
Jelle Hellings and
Sajjad Rahnama and
Mohammad Sadoghi Building high throughput permissioned
blockchain fabrics: challenges and
opportunities . . . . . . . . . . . . . 3441--3444
Abolfazl Asudeh and
H. V. Jagadish Fairly evaluating and scoring items in a
data set . . . . . . . . . . . . . . . . 3445--3448
Meihui Zhang and
Zhongle Xie and
Cong Yue and
Ziyue Zhong Spitz: a verifiable database system . . 3449--3460
Sergey Melnik and
Andrey Gubarev and
Jing Jing Long and
Geoffrey Romer and
Shiva Shivakumar and
Matt Tolton and
Theo Vassilakis and
Hossein Ahmadi and
Dan Delorey and
Slava Min and
Mosha Pasumansky and
Jeff Shute Dremel: a decade of interactive SQL
analysis at web scale . . . . . . . . . 3461--3472
Anastasia Ailamaki JIT works: decide when all data is known
(VLDB women in database research award
talk) . . . . . . . . . . . . . . . . . 3473--3473
Julia Stoyanovich and
Bill Howe and
H. V. Jagadish Responsible data management . . . . . . 3474--3488
Kazuo Goda and
Yuto Hayamizu and
Hiroyuki Yamada and
Masaru Kitsuregawa Out-of-order execution of database
queries . . . . . . . . . . . . . . . . 3489--3501
Dan Olteanu The relational data borg is learning . . 3502--3515
Peter Bailis and
Juliana Freire and
Magda Balazinska and
Raghu Ramakrishnan and
Joseph M. Hellerstein and
Xin Luna Dong and
Michael Stonebraker Winds from Seattle: database research
directions . . . . . . . . . . . . . . . 3516--3516
Yuxiang Zeng and
Yongxin Tong and
Yuguang Song and
Lei Chen The simpler the better: an indexing
approach for shared-route planning
queries . . . . . . . . . . . . . . . . 3517--3530
Takayuki Tanabe and
Takashi Hoshino and
Hideyuki Kawashima and
Osamu Tatebe An analysis of concurrency control
protocols for in-memory databases with
CCBench . . . . . . . . . . . . . . . . 3531--3544
Tianhao Wang and
Bolin Ding and
Min Xu and
Zhicong Huang and
Cheng Hong and
Jingren Zhou and
Ninghui Li and
Somesh Jha Improving utility and security of the
shuffler-based differential privacy . . 3545--3558
Andreas Kipf and
Damian Chromejko and
Alexander Hall and
Peter Boncz and
David G. Andersen Cuckoo index: a lightweight secondary
index structure . . . . . . . . . . . . 3559--3572
Michael Abebe and
Brad Glasbergen and
Khuzaima Daudjee MorphoSys: automatic physical design
metamorphosis for distributed database
systems . . . . . . . . . . . . . . . . 3573--3587
Soheil Behnezhad and
Laxman Dhulipala and
Hossein Esfandiari and
Jakub Lacki and
Vahab Mirrokni and
Warren Schudy Parallel graph algorithms in constant
adaptive rounds: theory meets practice 3588--3602
Runhui Wang and
Dong Deng DeltaPQ: lossless product quantization
code compression for high dimensional
similarity search . . . . . . . . . . . 3603--3616
Ryan Marcus and
Andreas Kipf and
Alexander van Renen and
Mihail Stoian and
Sanchit Misra and
Alfons Kemper and
Thomas Neumann and
Tim Kraska Benchmarking learned indexes . . . . . . 1--13
Zuozhi Wang and
Kai Zeng and
Botong Huang and
Wei Chen and
Xiaozong Cui and
Bo Wang and
Ji Liu and
Liya Fan and
Dachuan Qu and
Zhenyu Hou and
Tao Guan and
Chen Li and
Jingren Zhou Tempura: a general cost-based optimizer
framework for incremental data
processing . . . . . . . . . . . . . . . 14--27
Geon Heo and
Yuji Roh and
Seonghyeon Hwang and
Dayun Lee and
Steven Euijong Whang Inspector gadget: a data
programming-based labeling system for
industrial images . . . . . . . . . . . 28--36
Renchi Yang and
Jieming Shi and
Xiaokui Xiao and
Yin Yang and
Juncheng Liu and
Sourav S. Bhowmick Scaling attributed network embedding to
massive graphs . . . . . . . . . . . . . 37--49
Yuliang Li and
Jinfeng Li and
Yoshihiko Suhara and
AnHai Doan and
Wang-Chiew Tan Deep entity matching with pre-trained
language models . . . . . . . . . . . . 50--60
Zongheng Yang and
Amog Kamsetty and
Sifei Luan and
Eric Liang and
Yan Duan and
Xi Chen and
Ion Stoica NeuroCard: one cardinality estimator for
all tables . . . . . . . . . . . . . . . 61--73
Jialin Ding and
Vikram Nathan and
Mohammad Alizadeh and
Tim Kraska Tsunami: a learned multi-dimensional
index for correlated data and skewed
workloads . . . . . . . . . . . . . . . 74--86
Daniel Kang and
Ankit Mathur and
Teja Veeramacheneni and
Peter Bailis and
Matei Zaharia Jointly optimizing preprocessing and
inference for DNN-based visual analytics 87--100
Prashanth Menon and
Amadou Ngom and
Lin Ma and
Todd C. Mowry and
Andrew Pavlo Permutable compiled queries: dynamically
adapting compiled queries without
recompiling . . . . . . . . . . . . . . 101--113
Seung Won Min and
Vikram Sharma Mailthody and
Zaid Qureshi and
Jinjun Xiong and
Eiman Ebrahimi and
Wen-mei Hwu EMOGI: efficient memory-access for
out-of-memory graph-traversal in GPUs 114--127
Yinda Zhang and
Jinyang Li and
Yutian Lei and
Tong Yang and
Zhetao Li and
Gong Zhang and
Bin Cui On-off sketch: a fast and accurate
sketch on persistence . . . . . . . . . 128--140
Luan Tran and
Min Y. Mun and
Cyrus Shahabi Real-time distance-based outlier
detection in data streams . . . . . . . 141--153
Olga Poppe and
Tayo Amuneke and
Dalitso Banda and
Aritra De and
Ari Green and
Manon Knoertzer and
Ehi Nosakhare and
Karthik Rajendran and
Deepak Shankargouda and
Meina Wang and
Alan Au and
Carlo Curino and
Qun Guo and
Alekh Jindal and
Ajay Kalhan and
Morgan Oslake and
Sonia Parchani and
Vijay Ramani and
Raj Sellappan and
Saikat Sen and
Sheetal Shrotri and
Soundararajan Srinivasan and
Ping Xia and
Shize Xu and
Alicia Yang and
Yiwen Zhu Seagull: an infrastructure for load
prediction and optimized resource
allocation . . . . . . . . . . . . . . . 154--162
Sheng Wang and
Yuan Sun and
Zhifeng Bao On the efficiency of K-means clustering:
evaluation, optimization, and algorithm
selection . . . . . . . . . . . . . . . 163--175
Shixuan Sun and
Xibo Sun and
Yulin Che and
Qiong Luo and
Bingsheng He RapidMatch: a holistic approach to
subgraph query processing . . . . . . . 176--188
Yu Xia and
Xiangyao Yu and
Andrew Pavlo and
Srinivas Devadas Taurus: lightweight parallel logging for
in-memory database management systems 189--201
Johns Paul and
Bingsheng He and
Shengliang Lu and
Chiew Tong Lau Improving execution efficiency of
just-in-time compilation based query
processing on GPUs . . . . . . . . . . . 202--214
Shuang Wang and
Hakan Ferhatosmanoglu PPQ-trajectory: spatio-temporal
quantization for querying in large
trajectory repositories . . . . . . . . 215--227
Xiao Hu and
Shouzhuo Sun and
Shweta Patwa and
Debmalya Panigrahi and
Sudeepa Roy Aggregated deletion propagation for
counting conjunctive query answers . . . 228--240
Chen Luo and
Michael J. Carey Breaking down memory walls: adaptive
memory management in LSM-based storage
systems . . . . . . . . . . . . . . . . 241--254
Bojan Karlas and
Peng Li and
Renzhi Wu and
Nezihe Merve Gürel and
Xu Chu and
Wentao Wu and
Ce Zhang Nearest neighbor classifiers over
incomplete information: from certain
answers to certain predictions . . . . . 255--267
Kyle Kingsbury and
Peter Alvaro Elle: inferring isolation anomalies from
experimental observations . . . . . . . 268--280
Martin Kiefer and
Ilias Poulakis and
Sebastian Breß and
Volker Markl Scotch: generating FPGA-accelerators for
sketching at line rate . . . . . . . . . 281--293
Mourad Khayati and
Ines Arous and
Zakhar Tymchenko and
Philippe Cudré-Mauroux ORBITS: online recovery of missing
values in multiple time series streams 294--306
Xiang Deng and
Huan Sun and
Alyssa Lees and
You Wu and
Cong Yu TURL: table understanding through
representation learning . . . . . . . . 307--319
Long Guo and
Lifeng Hua and
Rongfei Jia and
Fei Fang and
Binqiang Zhao and
Bin Cui EdgeDIPN: a unified deep intent
prediction network deployed at the edge 320--328
Yiming Lin and
Daokun Jiang and
Roberto Yus and
Georgios Bouloukakis and
Andrew Chio and
Sharad Mehrotra and
Nalini Venkatasubramanian Locater: cleaning wifi connectivity
datasets for semantic localization . . . 329--341
Hao Liu and
Jindong Han and
Yanjie Fu and
Jingbo Zhou and
Xinjiang Lu and
Hui Xiong Multi-modal transportation
recommendation with unified route
representation learning . . . . . . . . 342--350
Yue Wang and
Ruiqi Xu and
Zonghao Feng and
Yulin Che and
Lei Chen and
Qiong Luo and
Rui Mao Disk: a distributed framework for
single-source SimRank with accuracy
guarantee . . . . . . . . . . . . . . . 351--363
Diego Didona and
Nikolas Ioannou and
Radu Stoica and
Kornilios Kourtis Toward a better understanding and
evaluation of tree structures on flash
SSDs . . . . . . . . . . . . . . . . . . 364--377
Jianyu Yang and
Tianhao Wang and
Ninghui Li and
Xiang Cheng and
Sen Su Answering multi-dimensional range
queries under local differential privacy 378--390
Dimitris Palyvos-Giannas and
Bastian Havers and
Marina Papatriantafilou and
Vincenzo Gulisano Ananke: a streaming framework for live
forward provenance . . . . . . . . . . . 391--403
Kartik Lakhotia and
Rajgopal Kannan and
Viktor Prasanna and
Cesar A. F. De Rose Receipt: refine coarse-grained
independent tasks for parallel tip
decomposition of bipartite graphs . . . 404--417
Shaleen Deep and
Anja Gruenheid and
Paraschos Koutris and
Jeffrey Naughton and
Stratis Viglas Comprehensive and efficient workload
compression . . . . . . . . . . . . . . 418--430
Yongjun He and
Jiacheng Lu and
Tianzheng Wang CoroBase: coroutine-oriented main-memory
database engine . . . . . . . . . . . . 431--444
Jaclyn Smith and
Michael Benedikt and
Milos Nikolic and
Amir Shaikhha Scalable querying of nested data . . . . 445--457
Long Gong and
Ziheng Liu and
Liang Liu and
Jun Xu and
Mitsunori Ogihara and
Tong Yang Space- and computationally-efficient set
reconciliation via parity bitmap sketch
(PBS) . . . . . . . . . . . . . . . . . 458--470
Suraj Shetiya and
Saravanan Thirumuruganathan and
Nick Koudas and
Gautam Das Astrid: accurate selectivity estimation
for string predicates using deep
learning . . . . . . . . . . . . . . . . 471--484
Nan Zheng and
Zachary G. Ives Compact, tamper-resistant archival of
fine-grained provenance . . . . . . . . 485--497
Ingo Müller and
Ghislain Fourny and
Stefan Irimescu and
Can Berker Cikis and
Gustavo Alonso Rumble: data independence for large
messy data sets . . . . . . . . . . . . 498--506
Adriane Chapman and
Paolo Missier and
Giulia Simonelli and
Riccardo Torlone Capturing and querying fine-grained
provenance of preprocessing pipelines in
data science . . . . . . . . . . . . . . 507--520
Victor A. E. Farias and
Felipe T. Brito and
Cheryl Flynn and
Javam C. Machado and
Subhabrata Majumdar and
Divesh Srivastava Local dampening: differential privacy
for non-numeric queries via local
sensitivity . . . . . . . . . . . . . . 521--533
Tianyu Li and
Matthew Butrovich and
Amadou Ngom and
Wan Shen Lim and
Wes McKinney and
Andrew Pavlo Mainlining databases: supporting fast
transactional workloads on universal
columnar data file formats . . . . . . . 534--546
Shengliang Lu and
Bingsheng He and
Yuchen Li and
Hao Fu Accelerating exact constrained shortest
paths on GPUs . . . . . . . . . . . . . 547--559
Songsong Mo and
Zhifeng Bao and
Ping Zhang and
Zhiyong Peng Towards an efficient weighted random
walk domination . . . . . . . . . . . . 560--572
Guimu Guo and
Da Yan and
M. Tamer Özsu and
Zhe Jiang and
Jalal Khalil Scalable mining of maximal
quasi-cliques: an algorithm-system
codesign approach . . . . . . . . . . . 573--585
Eleftherios Kokoris-Kogias and
Enis Ceyhun Alp and
Linus Gasser and
Philipp Jovanovic and
Ewa Syta and
Bryan Ford CALYPSO: private data management for
decentralized ledgers . . . . . . . . . 586--599
Kyle Deeds and
Brian Hentschel and
Stratos Idreos Stacked filters: learning to filter by
structure . . . . . . . . . . . . . . . 600--612
Prithu Banerjee and
Wei Chen and
Laks V. S. Lakshmanan Maximizing social welfare in a
competitive diffusion model . . . . . . 613--625
Shashank Gugnani and
Arjun Kashyap and
Xiaoyi Lu Understanding the idiosyncrasies of real
persistent memory . . . . . . . . . . . 626--639
Abraham Gale and
Amélie Marian Explaining monotonic ranking functions 640--652
Laxman Dhulipala and
Changwan Hong and
Julian Shun ConnectIt: a framework for static and
incremental parallel graph connectivity
algorithms . . . . . . . . . . . . . . . 653--667
Wissam Mammar Kouadri and
Mourad Ouziri and
Salima Benbernou and
Karima Echihabi and
Themis Palpanas and
Iheb Ben Amor Quality of sentiment analysis tools: the
reasons of inconsistency . . . . . . . . 668--681
Rolando Garcia and
Eric Liu and
Vikram Sreekanti and
Bobby Yan and
Anusha Dandamudi and
Joseph E. Gonzalez and
Joseph M. Hellerstein and
Koushik Sen Hindsight logging for model training . . 682--693
Lin Jiang and
Junqiao Qiu and
Zhijia Zhao Scalable structural index construction
for JSON analytics . . . . . . . . . . . 694--707
Ran Rui and
Hao Li and
Yi-Cheng Tu Efficient join algorithms for large
database tables in a multi-GPU
environment . . . . . . . . . . . . . . 708--720
Shuyuan Yan and
Bolin Ding and
Wei Guo and
Jingren Zhou and
Zhewei Wei and
Xiaowei Jiang and
Sheng Xu FlashP: an analytical pipeline for
real-time forecasting of time-series
relational data . . . . . . . . . . . . 721--729
Chi Thang Duong and
Trung Dung Hoang and
Hongzhi Yin and
Matthias Weidlich and
Quoc Viet Hung Nguyen and
Karl Aberer Efficient streaming subgraph isomorphism
with graph neural networks . . . . . . . 730--742
Yi Lu and
Xiangyao Yu and
Lei Cao and
Samuel Madden Epoch-based commit and replication in
distributed OLTP databases . . . . . . . 743--756
Zhe Lin and
Fan Zhang and
Xuemin Lin and
Wenjie Zhang and
Zhihong Tian Hierarchical core maintenance on large
dynamic graphs . . . . . . . . . . . . . 757--770
Jayashree Mohan and
Amar Phanishayee and
Ashish Raniwala and
Vijay Chidambaram Analyzing and mitigating data stalls in
DNN training . . . . . . . . . . . . . . 771--784
Daokun Hu and
Zhiwen Chen and
Jianbing Wu and
Jianhua Sun and
Hao Chen Persistent memory hash indexes: an
experimental evaluation . . . . . . . . 785--798
Cheng Chen and
Jun Yang and
Mian Lu and
Taize Wang and
Zhao Zheng and
Yuqiang Chen and
Wenyuan Dai and
Bingsheng He and
Weng-Fai Wong and
Guoan Wu and
Yuping Zhao and
Andy Rudoff Optimizing in-memory database engine for
AI-powered on-line decision augmentation
using persistent memory . . . . . . . . 799--812
Arif Usta and
Akifhan Karakayali and
Özgür Ulusoy DBTagger: multi-task learning for
keyword mapping in NLIDBs using
bi-directional recurrent neural networks 813--821
Ritesh Sarkhel and
Arnab Nandi Improving information extraction from
visually rich documents using visual
span representations . . . . . . . . . . 822--834
Gang Liu and
Leying Chen and
Shimin Chen Zen: a high-throughput log-free OLTP
engine for non-volatile main memory . . 835--848
Tianxi Ji and
Pan Li and
Emre Yilmaz and
Erman Ayday and
Yanfang (Fanny) Ye and
Jinyuan Sun Differentially private binary- and
matrix-valued data query: an XOR
mechanism . . . . . . . . . . . . . . . 849--862
Supun Nakandala and
Yuhao Zhang and
Arun Kumar Errata for ``Cerebro: a data system for
optimized deep learning model
selection'' . . . . . . . . . . . . . . 863--863
Lujia Yin and
Yiming Zhang and
Zhaoning Zhang and
Yuxing Peng and
Peng Zhao ParaX: boosting deep learning for big
data analytics on many-core CPUs . . . . 864--877
Walter Cai and
Philip A. Bernstein and
Wentao Wu and
Badrish Chandramouli Optimization of threshold functions over
streams . . . . . . . . . . . . . . . . 878--889
Xuliang Zhu and
Xin Huang and
Byron Choi and
Jiaxin Jiang and
Zhaonian Zou and
Jianliang Xu Budget constrained interactive search
for multiple targets . . . . . . . . . . 890--902
Yangjun Chen and
Hoang Hai Nguyen On the string matching with $k$
differences in DNA databases . . . . . . 903--915
Yasuhiro Fujiwara and
Sekitoshi Kanai and
Yasutoshi Ida and
Atsutoshi Kumagai and
Naonori Ueda Fast algorithm for anchor graph hashing 916--928
Wangda Zhang and
Junyoung Kim and
Kenneth A. Ross and
Eric Sedlar and
Lukas Stadler Adaptive code generation for
data-intensive analytics . . . . . . . . 929--942
Efthymia Tsamoura and
David Carral and
Enrico Malizia and
Jacopo Urbani Materializing knowledge bases via
trigger graphs . . . . . . . . . . . . . 943--956
Jinfei Liu and
Jian Lou and
Junxu Liu and
Li Xiong and
Jian Pei and
Jimeng Sun Dealer: an end-to-end model marketplace
with differential privacy . . . . . . . 957--969
Sajjadur Rahman and
Mangesh Bendre and
Yuyang Liu and
Shichu Zhu and
Zhaoyuan Su and
Karrie Karahalios and
Aditya G. Parameswaran NOAH: interactive spreadsheet
exploration with dynamic hierarchical
overviews . . . . . . . . . . . . . . . 970--983
Yixing Yang and
Yixiang Fang and
Maria E. Orlowska and
Wenjie Zhang and
Xuemin Lin Efficient bi-triangle counting for large
bipartite networks . . . . . . . . . . . 984--996
Sandeep Tata and
Navneet Potti and
James B. Wendt and
Lauro Beltrão Costa and
Marc Najork and
Beliz Gunel Glean: structured extractions from
templatic documents . . . . . . . . . . 997--1005
Jun Gao and
Jiazun Chen and
Zhao Li and
Ji Zhang ICS-GNN: lightweight interactive
community search via graph neural
network . . . . . . . . . . . . . . . . 1006--1018
Yuanyuan Sun and
Sheng Wang and
Huorong Li and
Feifei Li Building enclave-native storage engines
for practical encrypted databases . . . 1019--1032
James Thorne and
Majid Yazdani and
Marzieh Saeidi and
Fabrizio Silvestri and
Sebastian Riedel and
Alon Halevy From natural language processing to
neural databases . . . . . . . . . . . . 1033--1039
Haibo Wang and
Chaoyi Ma and
Olufemi O. Odegbile and
Shigang Chen and
Jih-Kwon Peir Randomized error removal for online
spread estimation in data streaming . . 1040--1052
Dean De Leo and
Peter Boncz Teseo and the analysis of structural
dynamic graphs . . . . . . . . . . . . . 1053--1066
Tim Gubner and
Peter Boncz Charting the design space of query
execution using VOILA . . . . . . . . . 1067--1079
Zhiqi Wang and
Jin Xue and
Zili Shao Heracles: an efficient storage model and
data flushing for performance monitoring
timeseries . . . . . . . . . . . . . . . 1080--1092
Stephen Macke and
Hongpu Gong and
Doris Jung-Lin Lee and
Andrew Head and
Doris Xin and
Aditya Parameswaran Fine-grained lineage for safer notebook
interactions . . . . . . . . . . . . . . 1093--1101
Anton Tsitsulin and
Marina Munkhoeva and
Davide Mottin and
Panagiotis Karras and
Ivan Oseledets and
Emmanuel Müller FREDE: anytime graph embeddings . . . . 1102--1110
Xiaodong Li and
Reynold Cheng and
Kevin Chen-Chuan Chang and
Caihua Shan and
Chenhao Ma and
Hongtai Cao On analyzing graphs with motif-paths . . 1111--1123
Dimitris Tsaras and
George Trimponias and
Lefteris Ntaflos and
Dimitris Papadias Collective influence maximization for
multiple competing products with an
awareness-to-influence model . . . . . . 1124--1136
Yahui Sun and
Xiaokui Xiao and
Bin Cui and
Saman Halgamuge and
Theodoros Lappas and
Jun Luo Finding group Steiner trees in graphs
with both vertex and edge weights . . . 1137--1149
Tenindra Abeywickrama and
Victor Liang and
Kian-Lee Tan Optimizing bipartite matching in
real-world applications by incremental
cost computation . . . . . . . . . . . . 1150--1158
Immanuel Trummer The case for NLP-enhanced database
tuning: towards tuning tools that ``read
the manual'' . . . . . . . . . . . . . . 1159--1165
Sujaya Maiyya and
Faisal Nawab and
Divyakant Agrawal and
Amr El Abbadi Errata for ``Unifying consensus and
atomic commitment for effective cloud
data management'' . . . . . . . . . . . 1166--1166
Zsolt István and
Soujanya Ponnapalli and
Vijay Chidambaram Software-defined data protection: low
overhead policy compliance at the
storage layer is within reach! . . . . . 1167--1174
Tianyi Li and
Lu Chen and
Christian S. Jensen and
Torben Bach Pedersen TRACE: real-time compression of
streaming trajectories in road networks 1175--1187
Arkaprava Saha and
Ruben Brokkelkamp and
Yllka Velaj and
Arijit Khan and
Francesco Bonchi Shortest paths and centrality in
uncertain networks . . . . . . . . . . . 1188--1201
Tongyu Liu and
Ju Fan and
Yinqing Luo and
Nan Tang and
Guoliang Li and
Xiaoyong Du Adaptive data augmentation for
supervised learning over missing data 1202--1214
Fuheng Zhao and
Sujaya Maiyya and
Ryan Wiener and
Divyakant Agrawal and
Amr El Abbadi KLL$^\pm $ approximate quantile sketches
over dynamic datasets . . . . . . . . . 1215--1227
Dimitrije Jankov and
Binhang Yuan and
Shangyu Luo and
Chris Jermaine Distributed numerical and machine
learning computations via two-phase
execution of aggregated join trees . . . 1228--1240
Dana Van Aken and
Dongsheng Yang and
Sebastien Brillard and
Ari Fiorino and
Bohan Zhang and
Christian Bilien and
Andrew Pavlo An inquiry into machine learning-based
automatic configuration tuning services
on real-world database management
systems . . . . . . . . . . . . . . . . 1241--1253
Nan Tang and
Ju Fan and
Fangyi Li and
Jianhong Tu and
Xiaoyong Du and
Guoliang Li and
Sam Madden and
Mourad Ouzzani RPT: relational pre-trained transformer
is almost all you need towards
democratizing data preparation . . . . . 1254--1261
Jia Zou and
Amitabh Das and
Pratik Barhate and
Arun Iyengar and
Binhang Yuan and
Dimitrije Jankov and
Chris Jermaine Lachesis: automatic partitioning for
UDF-centric analytics . . . . . . . . . 1262--1275
Jiacheng Wu and
Yong Zhang and
Shimin Chen and
Jin Wang and
Yu Chen and
Chunxiao Xing Updatable learned index with precise
positions . . . . . . . . . . . . . . . 1276--1288
Ziquan Fang and
Lu Pan and
Lu Chen and
Yuntao Du and
Yunjun Gao MDTP: a multi-source deep traffic
prediction framework over
spatio-temporal trajectory data . . . . 1289--1297
Seunghwan Min and
Sung Gwan Park and
Kunsoo Park and
Dora Giammarresi and
Giuseppe F. Italiano and
Wook-Shin Han Symmetric continuous subgraph matching
with bidirectional dynamic programming 1298--1310
Tomoya Suzuki and
Kazuhiro Hiwada and
Hirotsugu Kajihara and
Shintaro Sano and
Shuou Nomura and
Tatsuo Shiozawa Approaching DRAM performance by using
microsecond-latency flash memory for
small-sized random read accesses: a new
access method and its graph applications 1311--1324
Abdelghny Orogat and
Isabelle Liu and
Ahmed El-Roby CBench: towards better evaluation of
question answering over knowledge graphs 1325--1337
Binhang Yuan and
Dimitrije Jankov and
Jia Zou and
Yuxin Tang and
Daniel Bourgeois and
Chris Jermaine Tensor relational algebra for
distributed machine learning system
design . . . . . . . . . . . . . . . . . 1338--1350
Wenfei Fan and
Chao Tian and
Yanghao Wang and
Qiang Yin Parallel discrepancy detection and
incremental detection . . . . . . . . . 1351--1364
Tiantian Liu and
Huan Li and
Hua Lu and
Muhammad Aamir Cheema and
Lidan Shou Towards crowd-aware indoor path planning 1365--1377
Surabhi Gupta and
Karthik Ramachandra Procedural extensions of SQL:
understanding their usage in the wild 1378--1391
Sagar Bharadwaj and
Praveen Gupta and
Ranjita Bhagwan and
Saikat Guha Discovering related data at scale . . . 1392--1400
Stefano Cereda and
Stefano Valladares and
Paolo Cremonesi and
Stefano Doni CGPTuner: a contextual Gaussian process
bandit approach for the automatic tuning
of IT configurations under varying
workload conditions . . . . . . . . . . 1401--1413
Filippo Schiavio and
Daniele Bonetta and
Walter Binder Language-agnostic integrated queries in
a managed polyglot runtime . . . . . . . 1414--1426
Chinmay Kulkarni and
Badrish Chandramouli and
Ryan Stutsman Achieving high throughput and elasticity
in a larger-than-memory store . . . . . 1427--1440
Kai Yao and
Lijun Chang Efficient size-bounded community search
over large networks . . . . . . . . . . 1441--1453
Jianwen Zhao and
Yufei Tao Minimum vertex augmentation . . . . . . 1454--1466
Kevin P. Gaffney and
Robert Claus and
Jignesh M. Patel Database isolation by scheduling . . . . 1467--1480
Jong-Hyeok Park and
Soyee Choi and
Gihwan Oh and
Sang-Won Lee SaS: SSD as SQL database system . . . . 1481--1488
Rong Zhu and
Ziniu Wu and
Yuxing Han and
Kai Zeng and
Andreas Pfadler and
Zhengping Qian and
Jingren Zhou and
Bin Cui FLAT: fast, lightweight and accurate
method for cardinality estimation . . . 1489--1502
Tsz Nam Chan and
Zhe Li and
Leong Hou U. and
Jianliang Xu and
Reynold Cheng Fast augmentation algorithms for network
kernel density visualization . . . . . . 1503--1516
Jiawei Wang and
Cheng Li and
Kai Ma and
Jingze Huo and
Feng Yan and
Xinyu Feng and
Yinlong Xu AUTOGR: automated geo-replication with
fast system performance and preserved
application semantics . . . . . . . . . 1517--1530
Qing Liu and
Xuliang Zhu and
Xin Huang and
Jianliang Xu Local algorithms for
distance-generalized core decomposition
over large dynamic graphs . . . . . . . 1531--1543
Lawrence Benson and
Hendrik Makait and
Tilmann Rabl Viper: an efficient hybrid PMem-DRAM
key-value store . . . . . . . . . . . . 1544--1556
Sepanta Zeighami and
Cyrus Shahabi and
John Krumm Estimating spread of contact-based
contagions in a population through
sub-sampling . . . . . . . . . . . . . . 1557--1569
Herodotos Herodotou and
Elena Kakoulli Trident: task scheduling over tiered
storage systems in big data platforms 1570--1582
Zicun Cong and
Lingyang Chu and
Yu Yang and
Jian Pei Comprehensible counterfactual
explanation on Kolmogorov--Smirnov test 1583--1596
Hongkuan Zhou and
Ajitesh Srivastava and
Hanqing Zeng and
Rajgopal Kannan and
Viktor Prasanna Accelerating large scale real-time GNN
inference using channel pruning . . . . 1597--1605
Viktor Leis and
Maximilian Kuschewski Towards cost-optimal query processing in
the cloud . . . . . . . . . . . . . . . 1606--1612
Shufeng Gong and
Chao Tian and
Qiang Yin and
Wenyuan Yu and
Yanfeng Zhang and
Liang Geng and
Song Yu and
Ge Yu and
Jingren Zhou Automating incremental graph processing
with flexible memoization . . . . . . . 1613--1625
Theo Jepsen and
Alberto Lerner and
Fernando Pedone and
Robert Soulé and
Philippe Cudré-Mauroux In-network support for transaction
triaging . . . . . . . . . . . . . . . . 1626--1639
Xiaoying Wang and
Changbo Qu and
Weiyuan Wu and
Jiannan Wang and
Qingqing Zhou Are we ready for learned cardinality
estimation? . . . . . . . . . . . . . . 1640--1654
Jakub Lemiesz On the algebra of data sketches . . . . 1655--1667
Guanhao Hou and
Xingguang Chen and
Sibo Wang and
Zhewei Wei Massively parallel algorithms for
Personalized Pagerank . . . . . . . . . 1668--1680
Maximilian Schleich and
Zixuan Geng and
Yihong Zhang and
Dan Suciu GeCo: quality counterfactual
explanations in real time . . . . . . . 1681--1693
Ricardo Salazar and
Felix Neutatz and
Ziawasch Abedjan Automated feature engineering for
algorithmic fairness . . . . . . . . . . 1694--1702
Raghavendra Addanki and
Sainyam Galhotra and
Barna Saha How to design robust algorithms using
noisy comparison Oracle . . . . . . . . 1703--1716
Paul Boniol and
John Paparrizos and
Themis Palpanas and
Michael J. Franklin SAND: streaming subsequence anomaly
detection . . . . . . . . . . . . . . . 1717--1729
Yingtai Xiao and
Zeyu Ding and
Yuxin Wang and
Danfeng Zhang and
Daniel Kifer Optimizing fitness-for-use of
differentially private linear queries 1730--1742
Xinle Cao and
Jian Liu and
Hao Lu and
Kui Ren Cryptanalysis of an encrypted database
in SIGMOD '14 . . . . . . . . . . . . . 1743--1755
Tianyuan Jin and
Yu Yang and
Renchi Yang and
Jieming Shi and
Keke Huang and
Xiaokui Xiao Unconstrained submodular maximization
with modular costs: tight approximation
and application to profit maximization 1756--1768
Yuhao Zhang and
Frank McQuillan and
Nandish Jayaram and
Nikhil Kak and
Ekta Khanna and
Orhan Kislal and
Domino Valdano and
Arun Kumar Distributed deep learning on data
systems: a comparative analysis of
approaches . . . . . . . . . . . . . . . 1769--1782
Siyuan Sheng and
Qun Huang and
Sa Wang and
Yungang Bao PR-sketch: monitoring per-key
aggregation of streaming data with
nearly full accuracy . . . . . . . . . . 1783--1796
Dimitrios Koutsoukos and
Supun Nakandala and
Konstantinos Karanasos and
Karla Saur and
Gustavo Alonso and
Matteo Interlandi Tensors: an abstraction for general data
processing . . . . . . . . . . . . . . . 1797--1804
David Pujol and
Yikai Wu and
Brandon Fain and
Ashwin Machanavajjhala Budget sharing for multi-analyst
differential privacy . . . . . . . . . . 1805--1817
Rudi Poepsel-Lemaitre and
Martin Kiefer and
Joscha von Hein and
Jorge-Arnulfo Quiané-Ruiz and
Volker Markl In the land of data streams where
synopses are missing, one framework to
bring them all . . . . . . . . . . . . . 1818--1831
Yifan Li and
Xiaohui Yu and
Nick Koudas Data acquisition for improving machine
learning models . . . . . . . . . . . . 1832--1844
Xiaoshuang Chen and
Kai Wang and
Xuemin Lin and
Wenjie Zhang and
Lu Qin and
Ying Zhang Efficiently answering reachability and
path queries on temporal bipartite
graphs . . . . . . . . . . . . . . . . . 1845--1858
Paolo Ciaccia and
Davide Martinenghi and
Riccardo Torlone Preference queries over taxonomic
domains . . . . . . . . . . . . . . . . 1859--1871
Baoyue Yan and
Xuntao Cheng and
Bo Jiang and
Shibin Chen and
Canfang Shang and
Jianying Wang and
Gui Huang and
Xinjun Yang and
Wei Cao and
Feifei Li Revisiting the design of LSM-tree Based
OLTP storage engine with persistent
memory . . . . . . . . . . . . . . . . . 1872--1885
Chang Ge and
Shubhankar Mohapatra and
Xi He and
Ihab F. Ilyas Kamino: constraint-aware differentially
private data synthesis . . . . . . . . . 1886--1899
Yingqiang Zhang and
Chaoyi Ruan and
Cheng Li and
Xinjun Yang and
Wei Cao and
Feifei Li and
Bo Wang and
Jing Fang and
Yuhui Wang and
Jingze Huo and
Chao Bi Towards cost-effective and elastic cloud
database deployment via memory
disaggregation . . . . . . . . . . . . . 1900--1912
Ralph Peeters and
Christian Bizer Dual-objective fine-tuning of BERT for
entity matching . . . . . . . . . . . . 1913--1921
Maciej Besta and
Zur Vonarburg-Shmaria and
Yannick Schaffner and
Leonardo Schwarz and
Grzegorz Kwasniewski and
Lukas Gianinazzi and
Jakub Beranek and
Kacper Janda and
Tobias Holenstein and
Sebastian Leisinger and
Peter Tatkowski and
Esref Ozdemir and
Adrian Balla and
Marcin Copik and
Philipp Lindenberger and
Marek Konieczny and
Onur Mutlu and
Torsten Hoefler GraphMineSuite: enabling
high-performance and programmable graph
mining algorithms with set algebra . . . 1922--1935
Keita Takenouchi and
Takashi Ishio and
Joji Okada and
Yuji Sakata PATSQL: efficient synthesis of SQL
queries from example tables with quick
inference of projected columns . . . . . 1937--1949
Jie Liu and
Wenqian Dong and
Qingqing Zhou and
Dong Li Fauce: fast and accurate deep ensembles
with uncertainty for cardinality
estimation . . . . . . . . . . . . . . . 1950--1963
Mengzhao Wang and
Xiaoliang Xu and
Qiang Yue and
Yuxiang Wang A comprehensive survey and experimental
comparison of graph-based approximate
nearest neighbor search . . . . . . . . 1964--1978
Zifeng Yuan and
Huey Eng Chua and
Sourav S. Bhowmick and
Zekun Ye and
Wook-Shin Han and
Byron Choi Towards plug-and-play visual graph query
interfaces: data-driven selection of
canned patterns for large networks . . . 1979--1991
Shixuan Sun and
Yuhang Chen and
Shengliang Lu and
Bingsheng He and
Yuchen Li ThunderRW: an in-memory graph random
walk engine . . . . . . . . . . . . . . 1992--2005
Zheng Dong and
Xin Huang and
Guorui Yuan and
Hengshu Zhu and
Hui Xiong Butterfly-core community search over
labeled graphs . . . . . . . . . . . . . 2006--2018
Parimarjan Negi and
Ryan Marcus and
Andreas Kipf and
Hongzi Mao and
Nesime Tatbul and
Tim Kraska and
Mohammad Alizadeh Flow-loss: learning cardinality
estimates that matter . . . . . . . . . 2019--2032
Michael Yu and
Dong Wen and
Lu Qin and
Ying Zhang and
Wenjie Zhang and
Xuemin Lin On querying historical $k$-cores . . . . 2033--2045
Graham Cormode and
Samuel Maddock and
Carsten Maple Frequency estimation under local
differential privacy . . . . . . . . . . 2046--2058
Fatjon Zogaj and
José Pablo Cambronero and
Martin C. Rinard and
Jürgen Cito Doing more with less: characterizing
dataset downsampling for AutoML . . . . 2059--2072
Yifan Li and
Xiaohui Yu and
Nick Koudas LES 3: learning-based exact set
similarity search . . . . . . . . . . . 2073--2086
Seung Won Min and
Kun Wu and
Sitao Huang and
Mert Hidayetoglu and
Jinjun Xiong and
Eiman Ebrahimi and
Deming Chen and
Wen-mei Hwu Large graph convolutional network
training with GPU-oriented data
communication architecture . . . . . . . 2087--2100
Yifei Yang and
Matt Youill and
Matthew Woicik and
Yizhou Liu and
Xiangyao Yu and
Marco Serafini and
Ashraf Aboulnaga and
Michael Stonebraker FlexPushdownDB: hybrid pushdown and
caching in a cloud DBMS . . . . . . . . 2101--2113
Zhiwei Chen and
Shaoxu Song and
Ziheng Wei and
Jingyun Fang and
Jiang Long Approximating median absolute deviation
with bounded error . . . . . . . . . . . 2114--2126
Mengxuan Zhang and
Lei Li and
Xiaofang Zhou An experimental evaluation and guideline
for path finding in weighted dynamic
network . . . . . . . . . . . . . . . . 2127--2140
Brecht Vandevoort and
Bas Ketsman and
Christoph Koch and
Frank Neven Robustness against read committed for
transaction templates . . . . . . . . . 2141--2153
Huayi Zhang and
Lei Cao and
Samuel Madden and
Elke Rundensteiner LANCET: labeling complex data at scale 2154--2166
Yang Li and
Yu Shen and
Wentao Zhang and
Jiawei Jiang and
Bolin Ding and
Yaliang Li and
Jingren Zhou and
Zhi Yang and
Wentao Wu and
Ce Zhang and
Bin Cui VolcanoML: speeding up end-to-end AutoML
via scalable search space decomposition 2167--2176
Peng Cheng and
Jiabao Jin and
Lei Chen and
Xuemin Lin and
Libin Zheng A queueing-theoretic framework for
vehicle dispatching in dynamic
car-hailing . . . . . . . . . . . . . . 2177--2189
Kuntai Cai and
Xiaoyu Lei and
Jianxin Wei and
Xiaokui Xiao Data synthesis via differentially
private Markov random fields . . . . . . 2190--2202
Michael Whittaker and
Ailidani Ailijiang and
Aleksey Charapko and
Murat Demirbas and
Neil Giridharan and
Joseph M. Hellerstein and
Heidi Howard and
Ion Stoica and
Adriana Szekeres Scaling replicated state machines with
compartmentalization . . . . . . . . . . 2203--2215
Subhadeep Sarkar and
Dimitris Staratzis and
Ziehen Zhu and
Manos Athanassoulis Constructing and analyzing the LSM
compaction design space . . . . . . . . 2216--2229
Jelle Hellings and
Mohammad Sadoghi ByShard: sharding in a Byzantine
environment . . . . . . . . . . . . . . 2230--2243
Otmar Ertl SetSketch: filling the gap between
MinHash and HyperLogLog . . . . . . . . 2244--2257
Ergute Bao and
Yin Yang and
Xiaokui Xiao and
Bolin Ding CGM: an enhanced mechanism for streaming
data collection with local differential
privacy . . . . . . . . . . . . . . . . 2258--2270
Dean De Leo and
Per Fuchs and
Peter Boncz Errata for ``Teseo and the analysis of
structural dynamic graphs'': (PVLDB \bf
14(6):1053--1066) . . . . . . . . . . . 2271--2272
Mashaal Musleh and
Sofiane Abbar and
Rade Stanojevic and
Mohamed Mokbel QARTA: an ML-based system for accurate
map services . . . . . . . . . . . . . . 2273--2282
Teddy Cunningham and
Graham Cormode and
Hakan Ferhatosmanoglu and
Divesh Srivastava Real-world trajectory sharing with local
differential privacy . . . . . . . . . . 2283--2295
Phanwadee Sinthong and
Michael J. Carey PolyFrame: a retargetable query-based
approach to scaling dataframes . . . . . 2296--2304
Jessica Shi and
Laxman Dhulipala and
David Eisenstat and
Jakub Lacki and
Vahab Mirrokni Scalable community detection via
parallel correlation clustering . . . . 2305--2313
Cheng Xu and
Ce Zhang and
Jianliang Xu and
Jian Pei SlimChain: scaling blockchain
transactions through off-chain storage
and parallel processing . . . . . . . . 2314--2326
Side Li and
Arun Kumar Towards an optimized GROUP by
abstraction for large-scale machine
learning . . . . . . . . . . . . . . . . 2327--2340
Daniel Kang and
John Guibas and
Peter Bailis and
Tatsunori Hashimoto and
Yi Sun and
Matei Zaharia Accelerating approximate aggregation
queries with expensive predicates . . . 2341--2354
Tobias Schmidt and
Maximilian Bandle and
Jana Giceva A four-dimensional analysis of
partitioned approximate filters . . . . 2355--2368
Monica Chiosa and
Thomas B. Preußer and
Gustavo Alonso SKT: a one-pass multi-sketch data
analytics accelerator . . . . . . . . . 2369--2382
Philipp Fent and
Thomas Neumann A practical approach to groupjoin and
nested aggregates . . . . . . . . . . . 2383--2396
Ziyun Wei and
Immanuel Trummer and
Connor Anderson Robust voice querying with MUVE:
optimally visualizing results of
phonetically similar queries . . . . . . 2397--2409
Yinjun Wu and
James Weimer and
Susan B. Davidson CHEF: a cheap and fast pipeline for
iteratively cleaning label uncertainties 2410--2418
Tarique Siddiqui and
Surajit Chaudhuri and
Vivek Narasayya COMPARE: accelerating groupwise
comparison in relational databases for
data analytics . . . . . . . . . . . . . 2419--2431
Dominik Durner and
Badrish Chandramouli and
Yinan Li Crystal: a unified cache storage system
for analytical databases . . . . . . . . 2432--2444
Valerio Cetorelli and
Paolo Atzeni and
Valter Crescenzi and
Franco Milicchio The smallest extraction problem . . . . 2445--2458
Saravanan Thirumuruganathan and
Han Li and
Nan Tang and
Mourad Ouzzani and
Yash Govind and
Derek Paulsen and
Glenn Fung and
AnHai Doan Deep learning for blocking in entity
matching: a design space exploration . . 2459--2472
Wentao Zhang and
Zhi Yang and
Yexin Wang and
Yu Shen and
Yang Li and
Liang Wang and
Bin Cui GRAIN: improving data efficiency of \em
graph neural networks via diversified
\em influence maximization . . . . . . . 2473--2482
Maximilian Bandle and
Jana Giceva Database technology for the masses:
sub-operators as first-class entities 2483--2490
Pranjal Gupta and
Amine Mhedhbi and
Semih Salihoglu Columnar storage and list-based
processing for graph database management
systems . . . . . . . . . . . . . . . . 2491--2504
Yiwen Zhu and
Matteo Interlandi and
Abhishek Roy and
Krishnadhan Das and
Hiren Patel and
Malay Bag and
Hitesh Sharma and
Alekh Jindal Phoebe: a learning-based checkpoint
optimizer . . . . . . . . . . . . . . . 2505--2518
Fatemeh Nargesian and
Abolfazl Asudeh and
H. V. Jagadish Tailoring data source distributions for
fairness-aware data integration . . . . 2519--2532
Parikshit Bansal and
Prathamesh Deshpande and
Sunita Sarawagi Missing value imputation on
multidimensional time series . . . . . . 2533--2545
El Kindi Rezig and
Mourad Ouzzani and
Walid G. Aref and
Ahmed K. Elmagarmid and
Ahmed R. Mahmood and
Michael Stonebraker Horizon: scalable dependency-driven data
cleaning . . . . . . . . . . . . . . . . 2546--2554
Ted Shaowang and
Nilesh Jain and
Dennis D. Matthews and
Sanjay Krishnan Declarative data serving: the future of
machine learning inference on the edge 2555--2562
Junwen Yang and
Yeye He and
Surajit Chaudhuri Auto-pipeline: synthesizing complex data
pipelines by-target using reinforcement
learning and search . . . . . . . . . . 2563--2575
Brandon Lockhart and
Jinglin Peng and
Weiyuan Wu and
Jiannan Wang and
Eugene Wu Explaining inference queries with
Bayesian optimization . . . . . . . . . 2576--2585
Chunwei Liu and
Hao Jiang and
John Paparrizos and
Aaron J. Elmore Decomposed bounded floats for fast
compression and queries . . . . . . . . 2586--2598
Nikolaos Tziavelis and
Wolfgang Gatterbauer and
Mirek Riedewald Beyond equi-joins: ranking, enumeration
and factorization . . . . . . . . . . . 2599--2612
Vincent Jacob and
Fei Song and
Arnaud Stiegler and
Bijan Rad and
Yanlei Diao and
Nesime Tatbul Exathlon: a benchmark for explainable
anomaly detection over time series . . . 2613--2626
Michael Kuchnik and
George Amvrosiadis and
Virginia Smith Progressive compressed records: taking a
byte out of deep learning data . . . . . 2627--2641
Abdulrahman Alsaudi and
Yasser Altowim and
Sharad Mehrotra and
Yaming Yu TQEL: framework for query-driven linking
of top-$k$ entities in social media
blogs . . . . . . . . . . . . . . . . . 2642--2654
Tsz Nam Chan and
Pak Lon Ip and
Leong Hou U. and
Weng Hou Tong and
Shivansh Mittal and
Ye Li and
Reynold Cheng KDV-explorer: a near real-time kernel
density visualization system for spatial
analysis . . . . . . . . . . . . . . . . 2655--2658
Zhebin Zhang and
Dajie Dong and
Yuhang Ma and
Yilong Ying and
Dawei Jiang and
Ke Chen and
Lidan Shou and
Gang Chen Refiner: a reliable incentive-driven
federated learning system powered by
blockchain . . . . . . . . . . . . . . . 2659--2662
Valter Uotila and
Jiaheng Lu and
Dieter Gawlick and
Zhen Hua Liu and
Souripriya Das and
Gregory Pogossiants MultiCategory: multi-model query
processing meets category theory and
functional programming . . . . . . . . . 2663--2666
Qichen Wang and
Chaoqi Zhang and
Danish Alsayed and
Ke Yi and
Bin Wu and
Feifei Li and
Chaoqun Zhan Cquirrel: continuous query processing
over acyclic relational schemas . . . . 2667--2670
Yuetian Mao and
Shuai Yuan and
Nan Cui and
Tianjiao Du and
Beijun Shen and
Yuting Chen DeFiHap: detecting and fixing HiveQL
anti-patterns . . . . . . . . . . . . . 2671--2674
Ahmed Helal and
Mossad Helali and
Khaled Ammar and
Essam Mansour A demonstration of KGLac: a data
discovery and enrichment platform for
data science . . . . . . . . . . . . . . 2675--2678
Pierre Faure-Giovagnoli and
Marie Le Guilly and
Jean-Marc Petit and
Vasile-Marian Scuturici ADESIT: visualize the limits of your
data in a machine learning process . . . 2679--2682
Yinzhao Yan and
Raymond Chi-Wing Wong Path advisor: a multi-functional campus
map tool for shortest path . . . . . . . 2683--2686
Liangde Li and
Supun Nakandala and
Arun Kumar Intermittent human-in-the-loop model
selection using Cerebro: a demonstration 2687--2690
Henning Funke and
Jens Teubner Low-latency compilation of SQL queries
to machine code . . . . . . . . . . . . 2691--2694
Sven Groppe and
Rico Klinckenberg and
Benjamin Warnke Sound of databases: sonification of a
semantic web database engine . . . . . . 2695--2698
Zihao Chen and
Zhizhen Xu and
Chen Xu and
Juan Soto and
Volker Markl and
Weining Qian and
Aoying Zhou HyMAC: a hybrid matrix computation
system . . . . . . . . . . . . . . . . . 2699--2702
Jingbo Xu and
Zhanning Bai and
Wenfei Fan and
Longbin Lai and
Xue Li and
Zhao Li and
Zhengping Qian and
Lei Wang and
Lei Wang and
Yanyan Wang and
Wenyuan Yu and
Jingren Zhou GraphScope: a one-stop large graph
processing system . . . . . . . . . . . 2703--2706
Alexander Renz-Wieland and
Tobias Drobisch and
Zoi Kaoudi and
Rainer Gemulla and
Volker Markl Just move it!: dynamic parameter
allocation in action . . . . . . . . . . 2707--2710
Abdelghny Orogat and
Ahmed El-Roby CBench: demonstrating comprehensive
evaluation of question answering systems
over knowledge graphs through deep
analysis of benchmarks . . . . . . . . . 2711--2714
Lucas Woltmann and
Dominik Olwig and
Claudio Hartmann and
Dirk Habich and
Wolfgang Lehner PostCENN: postgreSQL with machine
learning models for cardinality
estimation . . . . . . . . . . . . . . . 2715--2718
Jinyang Li and
Yuval Moskovitch and
H. V. Jagadish DENOUNCER: detection of unfairness in
classifiers . . . . . . . . . . . . . . 2719--2722
Sofiane Abbar and
Rade Stanojevic and
Mashaal Musleh and
Mohamed ElShrif and
Mohamed Mokbel A demonstration of QARTA: an ML-based
system for accurate map services . . . . 2723--2726
Jaclyn Smith and
Michael Benedikt and
Brandon Moore and
Milos Nikolic TraNCE: transforming nested collections
efficiently . . . . . . . . . . . . . . 2727--2730
Ralf Diestelkämper and
Seokki Lee and
Boris Glavic and
Melanie Herschel Debugging missing answers for spark
queries over nested data with Breadcrumb 2731--2734
Renzhi Wu and
Prem Sakala and
Peng Li and
Xu Chu and
Yeye He Demonstration of Panda: a weakly
supervised entity matching system . . . 2735--2738
Jiabin Liu and
Fu Zhu and
Chengliang Chai and
Yuyu Luo and
Nan Tang Automatic data acquisition for deep
learning . . . . . . . . . . . . . . . . 2739--2742
Xuanhe Zhou and
Lianyuan Jin and
Ji Sun and
Xinyang Zhao and
Xiang Yu and
Jianhua Feng and
Shifu Li and
Tianqing Wang and
Kun Li and
Luyang Liu DBMind: a self-driving platform in
openGauss . . . . . . . . . . . . . . . 2743--2746
Qiongqiong Lin and
Jiayao Zhang and
Jinfei Liu and
Kui Ren and
Jian Lou and
Junxu Liu and
Li Xiong and
Jian Pei and
Jimeng Sun Demonstration of dealer: an end-to-end
model marketplace with differential
privacy . . . . . . . . . . . . . . . . 2747--2750
Tianyu Mu and
Hongzhi Wang and
Shenghe Zheng and
Shaoqing Zhang and
Cheng Liang and
Haoyun Tang Assassin: an automatic classification
system based on algorithm selection . . 2751--2754
Lei Cao and
Dongqing Xiao and
Yizhou Yan and
Samuel Madden and
Guoliang Li ATLANTIC: making database differentially
private and faster with accuracy
guarantee . . . . . . . . . . . . . . . 2755--2758
Anze Xie and
Anders Carlsson and
Jason Mohoney and
Roger Waleffe and
Shanan Peters and
Theodoros Rekatsinas and
Shivaram Venkataraman Demo of Marius: a system for large-scale
graph embeddings . . . . . . . . . . . . 2759--2762
Heiko Müller and
Sonia Castelo and
Munaf Qazi and
Juliana Freire From papers to practice: the
\pkgopenclean open-source data cleaning
library . . . . . . . . . . . . . . . . 2763--2766
Yongming Ge and
Vanessa Lin and
Maureen Daum and
Brandon Haynes and
Alvin Cheung and
Magdalena Balazinska Demonstration of apperception: a
database management system for
geospatial video data . . . . . . . . . 2767--2770
Mary Karatzoglidi and
Paraskevas Kerasiotis and
Verena Kantere Automated energy consumption forecasting
with EnForce . . . . . . . . . . . . . . 2771--2774
Myung-Hwan Jang and
Yong-Yeon Jo and
Sang-Wook Kim RealGraph web: a graph analysis platform
on the web . . . . . . . . . . . . . . . 2775--2778
Arthita Ghosh and
Deven Bansod and
Arpit Narechania and
Prashanth Dintyala and
Su Timurturkan and
Joy Arulraj Interactive demonstration of SQLCheck 2779--2782
Yiming Lin and
Pramod Khargonekar and
Sharad Mehrotra and
Nalini Venkatasubramanian T-cove: an exposure tracing system based
on cleaning wi-fi events on
organizational premises . . . . . . . . 2783--2786
Paul Y. Wang and
Sainyam Galhotra and
Romila Pradhan and
Babak Salimi Demonstration of generating explanations
for black-box algorithms using Lewis . . 2787--2790
Sonia Castelo and
Rémi Rampin and
Aécio Santos and
Aline Bessa and
Fernando Chirigati and
Juliana Freire Auctus: a dataset search engine for data
discovery and augmentation . . . . . . . 2791--2794
Mohammed Suhail Rehman and
Silu Huang and
Aaron J. Elmore A demonstration of RELIC: a system for
retrospective lineage inference of data
workflows . . . . . . . . . . . . . . . 2795--2798
Zhihao Chen and
Haizhen Zhuo and
Quanqing Xu and
Xiaodong Qi and
Chengyu Zhu and
Zhao Zhang and
Cheqing Jin and
Aoying Zhou and
Ying Yan and
Hui Zhang SChain: a scalable consortium blockchain
exploiting intra- and inter-block
concurrency . . . . . . . . . . . . . . 2799--2802
Chrysovalantis Anastasiou and
Constantinos Costa and
Panos K. Chrysanthis and
Cyrus Shahabi EPICGen: an experimental platform for
indoor congestion generation and
forecasting . . . . . . . . . . . . . . 2803--2806
Hiba Arnaout and
Simon Razniewski and
Gerhard Weikum and
Jeff Z. Pan Wikinegata: a knowledge base with
interesting negative statements . . . . 2807--2810
Jinwei Zhu and
Kun Cheng and
Jiayang Liu and
Liang Guo Full encryption: an end to end
encryption mechanism in GaussDB . . . . 2811--2814
Antonis Mandamadiotis and
Stavroula Eleftherakis and
Apostolos Glenis and
Dimitrios Skoutas and
Yannis Stavrakas and
Georgia Koutrika DatAgent: the imminent age of
intelligent data assistants . . . . . . 2815--2818
El Kindi Rezig and
Anshul Bhandari and
Anna Fariha and
Benjamin Price and
Allan Vanterpool and
Vijay Gadepally and
Michael Stonebraker DICE: data discovery by example . . . . 2819--2822
Felix Schuhknecht and
Aaron Priesterroth and
Justus Henneberg and
Reza Salkhordeh AnyOLAP: analytical processing of
arbitrary data-intensive applications
without ETL . . . . . . . . . . . . . . 2823--2826
Vincent Jacob and
Fei Song and
Arnaud Stiegler and
Bijan Rad and
Yanlei Diao and
Nesime Tatbul A demonstration of the Exathlon
benchmarking platform for explainable
anomaly detection . . . . . . . . . . . 2827--2830
Amir Shaikhha and
Maximilian Schleich and
Dan Olteanu An intermediate representation for
hybrid database and machine learning
workloads . . . . . . . . . . . . . . . 2831--2834
Eliana Pastor and
Andrew Gavgavian and
Elena Baralis and
Luca de Alfaro How divergent is your data? . . . . . . 2835--2838
Auday Berro and
Mohammad-Ali Yaghub Zade Fard and
Marcos Baez and
Boualem Benatallah and
Khalid Benabdeslem An extensible and reusable pipeline for
automated utterance paraphrases . . . . 2839--2842
Kaustubh Beedkar and
David Brekardin and
Jorge-Anulfo Quiané-Ruiz and
Volker Markl Compliant geo-distributed data
processing in action . . . . . . . . . . 2843--2846
Piyush Yadav and
Dhaval Salwala and
Felipe Arruda Pontes and
Praneet Dhingra and
Edward Curry Query-driven video event processing for
the Internet of Multimedia Things . . . 2847--2850
Nikolaos Koutroumanis and
Nikolaos Kousathanas and
Christos Doulkeridis and
Akrivi Vlachou A demonstration of NoDA: unified access
to NoSQL stores . . . . . . . . . . . . 2851--2854
Rathijit Sen and
Abhishek Roy and
Alekh Jindal and
Rui Fang and
Jeff Zheng and
Xiaolei Liu and
Ruiping Li AutoExecutor: predictive parallelism for
Spark SQL queries . . . . . . . . . . . 2855--2858
Jiaxiang Liu and
Karl Knopf and
Yiqing Tan and
Bolin Ding and
Xi He Catch a blowfish alive: a demonstration
of policy-aware differential privacy for
interactive data exploration . . . . . . 2859--2862
Paul Ouellette and
Aidan Sciortino and
Fatemeh Nargesian and
Bahar Ghadiri Bashardoost and
Erkang Zhu and
Ken Q. Pu and
Renée J. Miller RONIN: data lake exploration . . . . . . 2863--2866
Paul Boniol and
John Paparrizos and
Themis Palpanas and
Michael J. Franklin SAND in action: subsequence anomaly
detection for streams . . . . . . . . . 2867--2870
Christos Koutras and
Kyriakos Psarakis and
George Siachamis and
Andra Ionescu and
Marios Fragkoulis and
Angela Bonifati and
Asterios Katsifodimos Valentine in action: matching tabular
data at scale . . . . . . . . . . . . . 2871--2874
Sheng Guan and
Hanchao Ma and
Sutanay Choudhury and
Yinghui Wu GEDet: detecting erroneous nodes with a
few examples . . . . . . . . . . . . . . 2875--2878
Wenfei Fan and
Tao He and
Longbin Lai and
Xue Li and
Yong Li and
Zhao Li and
Zhengping Qian and
Chao Tian and
Lei Wang and
Jingbo Xu and
Youyang Yao and
Qiang Yin and
Wenyuan Yu and
Jingren Zhou and
Diwen Zhu and
Rong Zhu GraphScope: a unified engine for big
graph processing . . . . . . . . . . . . 2879--2892
Zeyuan Shang and
Emanuel Zgraggen and
Benedetto Buratti and
Philipp Eichmann and
Navid Karimeddiny and
Charlie Meyer and
Wesley Runnels and
Tim Kraska Davos: a system for interactive
data-driven decision making . . . . . . 2893--2905
An Qin and
Mengbai Xiao and
Yongwei Wu and
Xinjie Huang and
Xiaodong Zhang Mixer: efficiently understanding and
retrieving visual content at web-scale 2906--2917
David Justo and
Shaoqing Yi and
Lukas Stadler and
Nadia Polikarpova and
Arun Kumar Towards a polyglot framework for
factorized ML . . . . . . . . . . . . . 2918--2931
Niv Dayan and
Moshe Twitto and
Yuval Rochman and
Uri Beitler and
Itai Ben Zion and
Edward Bortnikov and
Shmuel Dashevsky and
Ofer Frishman and
Evgeni Ginzburg and
Igal Maly and
Avraham (Poza) Meir and
Mark Mokryn and
Iddo Naiss and
Noam Rabinovich The end of Moore's law and the rise of
the data processor . . . . . . . . . . . 2932--2944
Derek G. Murray and
Jirí Simsa and
Ana Klimovic and
Ihor Indyk \pkgtf.data: a machine learning data
processing framework . . . . . . . . . . 2945--2958
Mohamed Eltabakh and
Anantha Subramanian and
Awny Al-Omari and
Mohammed Al-Kateb and
Sanjay Nair and
Mahbub Hasan and
Wellington Cabrera and
Charles Zhang and
Amit Kishore and
Snigdha Prasad Not black-box anymore!: enabling
analytics-aware optimizations in
Teradata Vantage . . . . . . . . . . . . 2959--2971
Yingda Chen and
Jiamang Wang and
Yifeng Lu and
Ying Han and
Zhiqiang Lv and
Xuebin Min and
Hua Cai and
Wei Zhang and
Haochuan Fan and
Chao Li and
Tao Guan and
Wei Lin and
Yangqing Jia and
Jingren Zhou Fangorn: adaptive execution framework
for heterogeneous workloads on shared
clusters . . . . . . . . . . . . . . . . 2972--2985
Ankur Agiwal and
Kevin Lai and
Gokul Nath Babu Manoharan and
Indrajit Roy and
Jagan Sankaranarayanan and
Hao Zhang and
Tao Zou and
Min Chen and
Zongchang (Jim) Chen and
Ming Dai and
Thanh Do and
Haoyu Gao and
Haoyan Geng and
Raman Grover and
Bo Huang and
Yanlai Huang and
Zhi (Adam) Li and
Jianyi Liang and
Tao Lin and
Li Liu and
Yao Liu and
Xi Mao and
Yalan (Maya) Meng and
Prashant Mishra and
Jay Patel and
Rajesh S. R. and
Vijayshankar Raman and
Sourashis Roy and
Mayank Singh Shishodia and
Tianhang Sun and
Ye (Justin) Tang and
Junichi Tatemura and
Sagar Trehan and
Ramkumar Vadali and
Prasanna Venkatasubramanian and
Gensheng Zhang and
Kefei Zhang and
Yupu Zhang and
Zeleng Zhuang and
Goetz Graefe and
Divyakant Agrawal and
Jeff Naughton and
Sujata Kosalge and
Hakan Hacìgümü\cs Napa: powering scalable data warehousing
with robust query performance at Google 2986--2997
Rubao Lee and
Minghong Zhou and
Chi Li and
Shenggang Hu and
Jianping Teng and
Dongyang Li and
Xiaodong Zhang The art of balance: a RateupDBTM
experience of building a CPU/GPU hybrid
database product . . . . . . . . . . . . 2999--3013
Audrey Cheng and
Xiao Shi and
Lu Pan and
Anthony Simpson and
Neil Wheaton and
Shilpa Lawande and
Nathan Bronson and
Peter Bailis and
Natacha Crooks and
Ion Stoica RAMP-TAO: layering atomic transactions
on Facebook's online TAO data store . . 3014--3027
Guoliang Li and
Xuanhe Zhou and
Ji Sun and
Xiang Yu and
Yue Han and
Lianyuan Jin and
Wenbo Li and
Tianqing Wang and
Shifu Li openGauss: an autonomous database system 3028--3042
Rahul Potharaju and
Terry Kim and
Eunjin Song and
Wentao Wu and
Lev Novik and
Apoorve Dave and
Andrew Fogarty and
Pouria Pirzadeh and
Vidip Acharya and
Gurleen Dhody and
Jiying Li and
Sinduja Ramanujam and
Nicolas Bruno and
César A. Galindo-Legaria and
Vivek Narasayya and
Surajit Chaudhuri and
Anil K. Nori and
Tomas Talius and
Raghu Ramakrishnan Hyperspace: the indexing subsystem of
Azure Synapse . . . . . . . . . . . . . 3043--3055
Bolong Zheng and
Lei Bi and
Juan Cao and
Hua Chai and
Jun Fang and
Lu Chen and
Yunjun Gao and
Xiaofang Zhou and
Christian S. Jensen SpeakNav: voice-based route description
language understanding for
template-driven path search . . . . . . 3056--3068
Ana Sofia Gomes and
João Oliveirinha and
Pedro Cardoso and
Pedro Bizarro Railgun: managing large streaming
windows under MAD requirements . . . . . 3069--3082
Pavan Edara and
Mosha Pasumansky Big metadata: when metadata is big data 3083--3095
Josh Stoddard and
Adam Mustafa and
Naveen Goela Tanium Reveal: a federated search engine
for querying unstructured file data on
large enterprise networks . . . . . . . 3096--3109
Can Gencer and
Marko Topolnik and
Viliam \vDurina and
Emin Demirci and
Ensar B. Kahveci and
Ali Gürbüz and
Ond\vrej Luká\vs and
József Bartók and
Grzegorz Gierlach and
Franti\vsek Hartman and
Ufuk Yìlmaz and
Mehmet Do\ugan and
Mohamed Mandouh and
Marios Fragkoulis and
Asterios Katsifodimos Hazelcast Jet: low-latency stream
processing at the 99.99-th percentile 3110--3121
Abhishek Roy and
Alekh Jindal and
Priyanka Gomatam and
Xiating Ouyang and
Ashit Gosalia and
Nishkam Ravi and
Swinky Mann and
Prakhar Jain SparkCruise: workload optimization in
managed spark clusters at Microsoft . . 3122--3134
Tyler Akidau and
Edmon Begoli and
Slava Chernyak and
Fabian Hueske and
Kathryn Knight and
Kenneth Knowles and
Daniel Mills and
Dan Sotolongo Watermarks in stream processing systems:
semantics and comparative analysis of
Apache Flink and Google cloud dataflow 3135--3147
Conor Power and
Hiren Patel and
Alekh Jindal and
Jyoti Leeka and
Bob Jenkins and
Michael Rys and
Ed Triou and
Dexin Zhu and
Lucky Katahanas and
Chakrapani Bhat Talapady and
Joshua Rowe and
Fan Zhang and
Rich Draves and
Marc Friedman and
Ivan Santa Maria Filho and
Amrish Kumar The Cosmos big data platform at
Microsoft: over a decade of progress and
a decade to look forward . . . . . . . . 3148--3161
Ippokratis Pandis The evolution of Amazon Redshift . . . . 3162--3174
Simon Razniewski and
Hiba Arnaout and
Shrestha Ghosh and
Fabian Suchanek On the limits of machine knowledge:
completeness, recall and negation in
web-scale knowledge bases . . . . . . . 3175--3177
Laurel Orr and
Atindriyo Sanyal and
Xiao Ling and
Karan Goel and
Megan Leszczynski Managing ML pipelines: feature stores
and the coming wave of embedding
ecosystems . . . . . . . . . . . . . . . 3178--3181
Yuliang Li and
Xiaolan Wang and
Zhengjie Miao and
Wang-Chiew Tan Data augmentation for ML-driven data
preparation and integration . . . . . . 3182--3185
Ramon Antonio Rodriges Zalipynis Array DBMS: past, present, and (near)
future . . . . . . . . . . . . . . . . . 3186--3189
Guoliang Li and
Xuanhe Zhou and
Lei Cao Machine learning for databases . . . . . 3190--3193
Saeed Kargar and
Faisal Nawab Extending the lifetime of NVM:
challenges and opportunities . . . . . . 3194--3197
Karima Echihabi and
Kostas Zoumpatianos and
Themis Palpanas New trends in high-D vector similarity
search: AI-driven, progressive, and
distributed . . . . . . . . . . . . . . 3198--3201
Alekh Jindal and
Matteo Interlandi Machine learning for cloud data systems:
the progress so far and the path forward 3202--3205
Susan B. Davidson It's not just cookies and tea . . . . . 3206--3206
Thomas Neumann Evolution of a compiling query engine 3207--3210
Andrew Pavlo and
Matthew Butrovich and
Lin Ma and
Prashanth Menon and
Wan Shen Lim and
Dana Van Aken and
William Zhang Make your database system dream of
electric sheep: towards self-driving
operation . . . . . . . . . . . . . . . 3211--3221
Tim Kraska Towards instance-optimized data systems 3222--3232
Gerhard Weikum Knowledge graphs 2021: a data odyssey 3233--3238
Zachary G. Ives and
Rachel Pottinger and
Arun Kumar and
Johannes Gehrke and
Jana Giceva The future of data(base) education: is
the ``cow book'' dead? . . . . . . . . . 3239--3239
Luis Remis and
Chaunté W. Lacewell Using VDMS to index and search 100M
images . . . . . . . . . . . . . . . . . 3240--3252
Jian Liu and
Kefei Wang and
Feng Chen TSCache: an efficient flash-based
caching scheme for time-series data
workloads . . . . . . . . . . . . . . . 3253--3266
Huayi Wang and
Jingfan Meng and
Long Gong and
Jun Xu and
Mitsunori Ogihara MP-RW-LSH: an efficient multi-probe LSH
solution to ANNS-L$_1$ . . . . . . . . . 3267--3280
Theofilos Mailis and
Yannis Kotidis and
Stamatis Christoforidis and
Evgeny Kharlamov and
Yannis Ioannidis View selection over knowledge graphs in
triple stores . . . . . . . . . . . . . 3281--3294
Dongjie Li and
Siyi Lv and
Yanyu Huang and
Yijing Liu and
Tong Li and
Zheli Liu and
Liang Guo Frequency-hiding order-preserving
encryption with small client storage . . 3295--3307
Dimitrios Koutsoukos and
Ingo Müller and
Renato Marroquín and
Ana Klimovic and
Gustavo Alonso Modularis: modular relational analytics
over heterogeneous distributed platforms 3308--3321
Yunkai Lou and
Chaokun Wang and
Tiankai Gu and
Hao Feng and
Jun Chen and
Jeffrey Xu Yu Time-topology analysis . . . . . . . . . 3322--3334
Daniel Bernau and
Günther Eibl and
Philip W. Grassal and
Hannah Keller and
Florian Kerschbaum Quantifying identifiability to choose
and audit $ \epsilon $ in differentially
private deep learning . . . . . . . . . 3335--3347
Rodrigo Laigner and
Yongluan Zhou and
Marcos Antonio Vaz Salles and
Yijian Liu and
Marcos Kalinowski Data management in microservices: state
of the practice, challenges, and
research directions . . . . . . . . . . 3348--3361
Remmelt Ammerlaan and
Gilbert Antonius and
Marc Friedman and
H. M. Sajjad Hossain and
Alekh Jindal and
Peter Orenberg and
Hiren Patel and
Shi Qiao and
Vijay Ramani and
Lucas Rosenblatt and
Abhishek Roy and
Irene Shaffer and
Soundarajan Srinivasan and
Markus Weimer PerfGuard: deploying ML-for-systems
without performance regressions, almost! 3362--3375
Bailu Ding and
Surajit Chaudhuri and
Johannes Gehrke and
Vivek Narasayya DSB: a decision support benchmark for
workload-driven and traditional database
systems . . . . . . . . . . . . . . . . 3376--3388
Daniel Hernández and
Luis Galárraga and
Katja Hose Computing how-provenance for SPARQL
queries via query rewriting . . . . . . 3389--3401
Junxiong Wang and
Immanuel Trummer and
Debabrota Basu UDO: universal database optimization
using reinforcement learning . . . . . . 3402--3414
Anja Feldmann Internet traffic analysis at scale . . . 3415--3415
Danai Koutra The power of summarization in graph
mining and learning: smaller data,
faster methods, more interpretability 3416--3416
Nigam Shah Summarizing patients like mine via an
on-demand consultation service . . . . . 3417--3417
Joaquin Vanschoren Towards scalable online machine learning
collaborations with OpenML . . . . . . . 3418--3418
Manasi Vartak From ML models to intelligent
applications: the rise of MLOps . . . . 3419--3419
Matei Zaharia Designing production-friendly machine
learning . . . . . . . . . . . . . . . . 3420--3420
Kang Zhao and
Liuyihan Song and
Yingya Zhang and
Pan Pan and
Yinghui Xu and
Rong Jin ANN softmax: acceleration of extreme
classification training . . . . . . . . 1--10
Gyeong-In Yu and
Saeed Amizadeh and
Sehoon Kim and
Artidoro Pagnoni and
Ce Zhang and
Byung-Gon Chun and
Markus Weimer and
Matteo Interlandi WindTunnel: towards differentiable ML
pipelines beyond a single model . . . . 11--20
Athinagoras Skiadopoulos and
Qian Li and
Peter Kraft and
Kostis Kaffes and
Daniel Hong and
Shana Mathew and
David Bestor and
Michael Cafarella and
Vijay Gadepally and
Goetz Graefe and
Jeremy Kepner and
Christos Kozyrakis and
Tim Kraska and
Michael Stonebraker and
Lalith Suresh and
Matei Zaharia DBOS: a DBMS-oriented operating system 21--30
Arjit Jain and
Sunita Sarawagi and
Prithviraj Sen Deep indexed active learning for
matching heterogeneous entity
representations . . . . . . . . . . . . 31--45
Xuanhe Zhou and
Guoliang Li and
Chengliang Chai and
Jianhua Feng A learned query rewrite system using
Monte Carlo tree search . . . . . . . . 46--58
Yin Lin and
Brit Youngmann and
Yuval Moskovitch and
H. V. Jagadish and
Tova Milo On detecting cherry-picked
generalizations . . . . . . . . . . . . 59--71
Jiayi Wang and
Chengliang Chai and
Jiabin Liu and
Guoliang Li FACE: a normalizing flow based
cardinality estimator . . . . . . . . . 72--84
Ji Sun and
Jintao Zhang and
Zhaoyan Sun and
Guoliang Li and
Nan Tang Learned cardinality estimation: a design
space exploration and a comparative
evaluation . . . . . . . . . . . . . . . 85--97
Dong He and
Maureen Daum and
Walter Cai and
Magdalena Balazinska DeepEverest: accelerating declarative
top-$K$ queries for deep neural network
interpretation . . . . . . . . . . . . . 98--111
Subarna Chatterjee and
Meena Jagadeesan and
Wilson Qin and
Stratos Idreos Cosine: a cloud-cost optimized
self-designing key--value storage engine 112--126
Muhammad Adnan and
Yassaman Ebrahimzadeh Maboud and
Divya Mahajan and
Prashant J. Nair Accelerating recommendation system
training by leveraging popular choices 127--140
Jianye Yang and
Yun Peng and
Wenjie Zhang $ (p, q) $-biclique counting and
enumeration for large sparse bipartite
graphs . . . . . . . . . . . . . . . . . 141--153
Dan Graur and
Ingo Müller and
Mason Proffitt and
Ghislain Fourny and
Gordon T. Watts and
Gustavo Alonso Evaluating query languages and systems
for high-energy physics data . . . . . . 154--168
Kongzhang Hao and
Long Yuan and
Wenjie Zhang Distributed hop-constrained $s$--$t$
simple path enumeration at billion scale 169--182
Jingzhi Fang and
Yanyan Shen and
Yue Wang and
Lei Chen ETO: accelerating optimization of DNN
operators by high-performance tensor
program reuse . . . . . . . . . . . . . 183--195
Philipp Marian Grulich and
Steffen Zeuch and
Volker Markl Babelfish: efficient execution of
polyglot queries . . . . . . . . . . . . 196--210
Alexander Zhou and
Yue Wang and
Lei Chen Butterfly counting on uncertain
bipartite graphs . . . . . . . . . . . . 211--223
Yue Cui and
Kai Zheng and
Dingshan Cui and
Jiandong Xie and
Liwei Deng and
Feiteng Huang and
Xiaofang Zhou METRO: a generic graph neural network
framework for multivariate time series
forecasting . . . . . . . . . . . . . . 224--236
Congcong Ge and
Xiaoze Liu and
Lu Chen and
Yunjun Gao and
Baihua Zheng LargeEA: aligning entities for
large-scale knowledge graphs . . . . . . 237--245
Kejing Lu and
Mineichi Kudo and
Chuan Xiao and
Yoshiharu Ishikawa HVS: hierarchical graph structure based
on Voronoi diagrams for solving
approximate nearest neighbor search . . 246--258
Arif Arman and
Dmitri Loguinov Origami: a high-performance mergesort
framework . . . . . . . . . . . . . . . 259--271
Renzhi Wu and
Bolin Ding and
Xu Chu and
Zhewei Wei and
Xiening Dai and
Tao Guan and
Jingren Zhou Learning to be a statistician: learned
estimator for number of distinct values 272--284
Shangdi Yu and
Yiqiu Wang and
Yan Gu and
Laxman Dhulipala and
Julian Shun ParChain: a framework for parallel
hierarchical agglomerative clustering
using nearest-neighbor chain . . . . . . 285--298
Komal Chauhan and
Kartik Jain and
Sayan Ranu and
Srikanta Bedathur and
Amitabha Bagchi Answering regular path queries through
exemplars . . . . . . . . . . . . . . . 299--311
Xupeng Miao and
Hailin Zhang and
Yining Shi and
Xiaonan Nie and
Zhi Yang and
Yangyu Tao and
Bin Cui HET: scaling out huge embedding model
training via cache-enabled distributed
framework . . . . . . . . . . . . . . . 312--320
Pengfei Li and
Yu Hua and
Jingnan Jia and
Pengfei Zuo FINEdex: a fine-grained learned index
scheme for scalable and concurrent
memory systems . . . . . . . . . . . . . 321--334
Jiyang Bai and
Peixiang Zhao TaGSim: type-aware graph similarity
learning and computation . . . . . . . . 335--347
Yuqing Zhu and
Jing Tang and
Xueyan Tang and
Lei Chen Analysis of influence contribution in
social advertising . . . . . . . . . . . 348--360
Georgios Theodorakis and
Fotios Kounelis and
Peter Pietzuch and
Holger Pirk Scabbard: single-node fault-tolerant
stream processing . . . . . . . . . . . 361--374
George Konstantinidis and
Jet Holt and
Adriane Chapman Enabling personal consent in databases 375--387
Yejia Liu and
Weiyuan Wu and
Lampros Flokas and
Jiannan Wang and
Eugene Wu Enabling SQL-based training data
debugging for federated learning . . . . 388--400
Kapil Vaidya and
Anshuman Dutt and
Vivek Narasayya and
Surajit Chaudhuri Leveraging query logs and machine
learning for parametric query
optimization . . . . . . . . . . . . . . 401--413
Yao Lu and
Srikanth Kandula and
Arnd Christian König and
Surajit Chaudhuri Pre-training summarization models of
structured datasets for cardinality
estimation . . . . . . . . . . . . . . . 414--426
Susie Xi Rao and
Shuai Zhang and
Zhichao Han and
Zitao Zhang and
Wei Min and
Zhiyao Chen and
Yinan Shan and
Yang Zhao and
Ce Zhang xFraud: explainable fraud transaction
detection . . . . . . . . . . . . . . . 427--436
Ye Yuan and
Delong Ma and
Zhenyu Wen and
Zhiwei Zhang and
Guoren Wang Subgraph matching over graph federation 437--450
Xing Niu and
Boris Glavic and
Ziyu Liu and
Pengyuan Li and
Dieter Gawlick and
Vasudha Krishnaswamy and
Zhen Hua Liu and
Danica Porobic Provenance-based data skipping . . . . . 451--464
Di Jin and
Bunyamin Sisman and
Hao Wei and
Xin Luna Dong and
Danai Koutra Deep transfer learning for multi-source
entity linkage via domain adaptation . . 465--477
Lu Xing and
Eric Lee and
Tong An and
Bo-Cheng Chu and
Ahmed Mahmood and
Ahmed M. Aly and
Jianguo Wang and
Walid G. Aref An experimental evaluation and
investigation of waves of misery in
$r$-trees . . . . . . . . . . . . . . . 478--490
Yongyi Liu and
Ahmed R. Mahmood and
Amr Magdy and
Sergio Rey PRUC: P-regions with user-defined
constraint . . . . . . . . . . . . . . . 491--503
Yile Chen and
Xiucheng Li and
Gao Cong and
Cheng Long and
Zhifeng Bao and
Shang Liu and
Wanli Gu and
Fuzheng Zhang Points-of-interest relationship
inference with spatial-enriched graph
neural networks . . . . . . . . . . . . 504--512
Tsz Nam Chan and
Pak Lon Ip and
Leong Hou U. and
Byron Choi and
Jianliang Xu SAFE: a share-and-aggregate bandwidth
exploration framework for kernel density
visualization . . . . . . . . . . . . . 513--526
Jens Dittrich and
Joris Nix and
Christian Schön The next 50 years in database indexing
or: the case for automatically generated
index structures . . . . . . . . . . . . 527--540
Koral Chapnik and
Ilya Kolchinsky and
Assaf Schuster DARLING: data-aware load shedding in
complex event processing systems . . . . 541--554
Danyang Zhuo and
Kaiyuan Zhang and
Zhuohan Li and
Siyuan Zhuang and
Stephanie Wang and
Ang Chen and
Ion Stoica Rearchitecting in-memory object stores
for low latency . . . . . . . . . . . . 555--568
Pingchuan Ma and
Shuai Wang MT-teql: evaluating and augmenting
neural NLIDB on real-world linguistic
and schema variations . . . . . . . . . 569--582
Jessica Shi and
Laxman Dhulipala and
Julian Shun Theoretically and practically efficient
parallel nucleus decomposition . . . . . 583--596
Baotong Lu and
Jialin Ding and
Eric Lo and
Umar Farooq Minhas and
Tianzheng Wang APEX: a high-performance learned index
on persistent memory . . . . . . . . . . 597--610
David Campos and
Tung Kieu and
Chenjuan Guo and
Feiteng Huang and
Kai Zheng and
Bin Yang and
Christian S. Jensen Unsupervised time series outlier
detection with diversity-driven
convolutional ensembles . . . . . . . . 611--623
Xiaoye Miao and
Yangyang Wu and
Lu Chen and
Yunjun Gao and
Jun Wang and
Jianwei Yin Efficient and effective data imputation
with influence functions . . . . . . . . 624--632
Adrian Kochsiek and
Rainer Gemulla Parallel training of knowledge graph
embedding models: a comparison of
techniques . . . . . . . . . . . . . . . 633--645
Gerardo Vitagliano and
Lan Jiang and
Felix Naumann Detecting layout templates in complex
multiregion files . . . . . . . . . . . 646--658
Kajetan Maliszewski and
Jorge-Arnulfo Quiané-Ruiz and
Jonas Traub and
Volker Markl What is the price for joining securely?:
benchmarking equi-joins in trusted
execution environments . . . . . . . . . 659--672
Van Long Ho and
Nguyen Ho and
Torben Bach Pedersen Efficient temporal pattern mining in big
time series using mutual information . . 673--685
Junhua Zhang and
Long Yuan and
Wentao Li and
Lu Qin and
Ying Zhang Efficient label-constrained shortest
path queries on road networks: a tree
decomposition approach . . . . . . . . . 686--698
Sahaana Suri and
Ihab F. Ilyas and
Christopher Ré and
Theodoros Rekatsinas Ember: no-code context enrichment via
similarity-based keyless joins . . . . . 699--712
Tin Vu and
Ahmed Eldawy and
Vagelis Hristidis and
Vassilis Tsotras Incremental partitioning for efficient
spatial data analytics . . . . . . . . . 713--726
Doris Jung-Lin Lee and
Dixin Tang and
Kunal Agarwal and
Thyne Boonmark and
Caitlyn Chen and
Jake Kang and
Ujjaini Mukhopadhyay and
Jerry Song and
Micah Yong and
Marti A. Hearst and
Aditya G. Parameswaran Lux: always-on visualization
recommendations for exploratory
dataframe workflows . . . . . . . . . . 727--738
Devin Petersohn and
Dixin Tang and
Rehan Durrani and
Areg Melik-Adamyan and
Joseph E. Gonzalez and
Anthony D. Joseph and
Aditya G. Parameswaran Flexible rule-based decomposition and
metadata independence in modin: a
parallel dataframe system . . . . . . . 739--751
Yuxing Han and
Ziniu Wu and
Peizhi Wu and
Rong Zhu and
Jingyi Yang and
Liang Wei Tan and
Kai Zeng and
Gao Cong and
Yanzhao Qin and
Andreas Pfadler and
Zhengping Qian and
Jingren Zhou and
Jiangneng Li and
Bin Cui Cardinality estimation in DBMS: a
comprehensive benchmark evaluation . . . 752--765
Qizhen Zhang and
Philip A. Bernstein and
Daniel S. Berger and
Badrish Chandramouli Redy: remote dynamic memory cache . . . 766--779
Martin Boissier Robust and budget-constrained encoding
configurations for in-memory database
systems . . . . . . . . . . . . . . . . 780--793
Shulong Tan and
Weijie Zhao and
Ping Li Fast neural ranking on bipartite graph
indices . . . . . . . . . . . . . . . . 794--803
Shaoduo Gan and
Jiawei Jiang and
Binhang Yuan and
Ce Zhang and
Xiangru Lian and
Rui Wang and
Jianbin Chang and
Chengjun Liu and
Hongmei Shi and
Shengzhuo Zhang and
Xianghong Li and
Tengxu Sun and
Sen Yang and
Ji Liu Bagua: scaling up distributed learning
with system relaxations . . . . . . . . 804--813
Tsz Nam Chan and
Pak Lon Ip and
Leong Hou U. and
Byron Choi and
Jianliang Xu SWS: a complexity-optimized solution for
spatial-temporal kernel density
visualization . . . . . . . . . . . . . 814--827
Junxu Liu and
Jian Lou and
Li Xiong and
Jinfei Liu and
Xiaofeng Meng Projected federated averaging with
heterogeneous differential privacy . . . 828--840
Daniel Haimovich and
Dima Karamshuk and
Thomas J. Leeper and
Evgeniy Riabenko and
Milan Vojnovic Popularity prediction for social media
over arbitrary time horizons . . . . . . 841--849
Ishita Doshi and
Dhritiman Das and
Ashish Bhutani and
Rajeev Kumar and
Rushi Bhatt and
Niranjan Balasubramanian LANNS: a web-scale approximate nearest
neighbor lookup system . . . . . . . . . 850--858
Eduardo H. M. Pena and
Eduardo C. de Almeida and
Felix Naumann Fast detection of denial constraint
violations . . . . . . . . . . . . . . . 859--871
Bowen Yu and
Guanyu Feng and
Huanqi Cao and
Xiaohan Li and
Zhenbo Sun and
Haojie Wang and
Xiaowei Zhu and
Weimin Zheng and
Wenguang Chen Chukonu: a fully-featured
high-performance big data framework that
integrates a native compute engine into
Spark . . . . . . . . . . . . . . . . . 872--885
Sian Jin and
Chengming Zhang and
Xintong Jiang and
Yunhe Feng and
Hui Guan and
Guanpeng Li and
Shuaiwen Leon Song and
Dingwen Tao COMET: a novel memory-efficient deep
learning training framework by using
error-bounded lossy compression . . . . 886--899
Zitao Li and
Bolin Ding and
Ce Zhang and
Ninghui Li and
Jingren Zhou Federated matrix factorization with
privacy guarantee . . . . . . . . . . . 900--913
Chi Thang Duong and
Trung Dung Hoang and
Hongzhi Yin and
Matthias Weidlich and
Quoc Viet Hung Nguyen and
Karl Aberer Scalable robust graph embedding with
Spark . . . . . . . . . . . . . . . . . 914--922
Debjyoti Paul and
Jie Cao and
Feifei Li and
Vivek Srikumar Database workload characterization with
query plan encoders . . . . . . . . . . 923--935
Abhishek Modi and
Kaushik Rajan and
Srinivas Thimmaiah and
Prakhar Jain and
Swinky Mann and
Ayushi Agarwal and
Ajith Shetty and
Shahid K. I. and
Ashit Gosalia and
Partho Sarthi New query optimization techniques in the
Spark engine of Azure synapse . . . . . 936--948
Phanwadee Sinthong and
Dhaval Patel and
Nianjun Zhou and
Shrey Shrivastava and
Arun Iyengar and
Anuradha Bhamidipaty DQDF: data-quality-aware dataframes . . 949--957
Archita Agarwal and
Marilyn George and
Aaron Jeyaraj and
Malte Schwarzkopf Retrofitting GDPR compliance onto legacy
databases . . . . . . . . . . . . . . . 958--970
Xinle Wu and
Dalin Zhang and
Chenjuan Guo and
Chaoyang He and
Bin Yang and
Christian S. Jensen AutoCTS: automated correlated time
series forecasting . . . . . . . . . . . 971--983
Sivaprasad Sudhir and
Michael Cafarella and
Samuel Madden Replicated layout for in-memory database
systems . . . . . . . . . . . . . . . . 984--997
Anupam Sanghi and
Shadab Ahmed and
Jayant R. Haritsa Projection-compliant database generation 998--1010
Guodong Jin and
Semih Salihoglu Making RDBMSs efficient on graph
workloads through predefined joins . . . 1011--1023
Shaleen Deep and
Xiao Hu and
Paraschos Koutris Ranked enumeration of join queries with
projections . . . . . . . . . . . . . . 1024--1037
Ahnjae Shin and
Joo Seong Jeong and
Do Yoon Kim and
Soyoung Jung and
Byung-Gon Chun Hippo: sharing computations in
hyper-parameter optimization . . . . . . 1038--1052
Arik Rinberg and
Tomer Solomon and
Roee Shlomo and
Guy Khazma and
Gal Lushi and
Idit Keidar and
Paula Ta-Shma DSON: JSON CRDT using delta-mutations
for document stores . . . . . . . . . . 1053--1065
Sepanta Zeighami and
Ritesh Ahuja and
Gabriel Ghinita and
Cyrus Shahabi A neural database for differentially
private spatial range queries . . . . . 1066--1078
Marcel Maltry and
Jens Dittrich A critical analysis of recursive model
indexes . . . . . . . . . . . . . . . . 1079--1091
Zerui Ge and
Dumitrel Loghin and
Beng Chin Ooi and
Pingcheng Ruan and
Tianwen Wang Hybrid blockchain database systems:
design and performance . . . . . . . . . 1092--1104
Angela Bonifati and
Stefania Dumbrava and
George Fletcher and
Jan Hidders and
Matthias Hofer and
Wim Martens and
Filip Murlak and
Joshua Shinavier and
S\lawek Staworko and
Dominik Tomaszuk Threshold queries in theory and in the
wild . . . . . . . . . . . . . . . . . . 1105--1118
Moritz Sichert and
Thomas Neumann User-defined operators: efficiently
integrating custom algorithms into
modern databases . . . . . . . . . . . . 1119--1131
Yue Wang and
Vivek Narasayya and
Yeye He and
Surajit Chaudhuri PACk: an efficient partition-based
distributed agglomerative hierarchical
clustering algorithm for deduplication 1132--1145
Lijun Chang and
Zhiyi Wang A near-optimal approach to edge
connectivity-based hierarchical graph
decomposition . . . . . . . . . . . . . 1146--1158
Yongxin Tong and
Xuchen Pan and
Yuxiang Zeng and
Yexuan Shi and
Chunbo Xue and
Zimu Zhou and
Xiaofei Zhang and
Lei Chen and
Yi Xu and
Ke Xu and
Weifeng Lv Hu-Fu: efficient and secure spatial
queries over data federation . . . . . . 1159--1172
Per Fuchs and
Domagoj Margan and
Jana Giceva Sortledton: a universal, transactional
graph data structure . . . . . . . . . . 1173--1186
Bowen Zhang and
Shengan Zheng and
Zhenlin Qi and
Linpeng Huang NBTree: a lock-free PM-friendly
persistent B+-tree for eADR-enabled PM
systems . . . . . . . . . . . . . . . . 1187--1200
Shreshth Tuli and
Giuliano Casale and
Nicholas R. Jennings TranAD: deep transformer networks for
anomaly detection in multivariate time
series data . . . . . . . . . . . . . . 1201--1214
Fuheng Zhao and
Divyakant Agrawal and
Amr El Abbadi and
Ahmed Metwally SpaceSaving$ \pm $: an optimal algorithm
for frequency estimation and frequent
items in the bounded-deletion model . . 1215--1227
Chenguang Zheng and
Hongzhi Chen and
Yuxuan Cheng and
Zhezheng Song and
Yifan Wu and
Changji Li and
James Cheng and
Hao Yang and
Shuai Zhang ByteGNN: efficient graph neural network
training at large scale . . . . . . . . 1228--1242
Yuli Jiang and
Yu Rong and
Hong Cheng and
Xin Huang and
Kangfei Zhao and
Junzhou Huang Query driven-graph neural networks for
community search: from non-attributed,
attributed, to interactive attributed 1243--1255
Yang Li and
Yu Shen and
Huaijun Jiang and
Wentao Zhang and
Jixiang Li and
Ji Liu and
Ce Zhang and
Bin Cui Hyper-tune: towards efficient
hyper-parameter tuning at scale . . . . 1256--1265
Koen Minartz and
Jens E. d'Hondt and
Odysseas Papapetrou Multivariate correlations discovery in
static and streaming data . . . . . . . 1266--1278
Olga Poppe and
Qun Guo and
Willis Lang and
Pankaj Arora and
Morgan Oslake and
Shize Xu and
Ajay Kalhan Moneyball: proactive auto-scaling in
Microsoft Azure SQL database serverless 1279--1287
Kewei Cheng and
Xian Li and
Yifan Ethan Xu and
Xin Luna Dong and
Yizhou Sun PGE: robust product graph embedding
learning for error detection . . . . . . 1288--1296
Naga Nithin Manne and
Shilvi Satpati and
Tanu Malik and
Amitabha Bagchi and
Ashish Gehani and
Amitabh Chaudhary CHEX: multiversion replay with ordered
checkpoints . . . . . . . . . . . . . . 1297--1310
Tomer Even and
Guy Even and
Adam Morrison Prefix filter: practically and
theoretically better than bloom . . . . 1311--1323
Hiroyuki Yamada and
Jun Nemoto Scalar DL: scalable and practical
Byzantine fault detection for
transactional database systems . . . . . 1324--1336
Gyuyeong Kim and
Wonjun Lee In-network leaderless replication for
distributed data stores . . . . . . . . 1337--1349
Xin Sun and
Xin Huang and
Di Jin Fast algorithms for core maximization on
large graphs . . . . . . . . . . . . . . 1350--1362
Shuye Pan and
Peng Wang and
Chen Wang and
Wei Wang and
Jianmin Wang NLC: search correlated window pairs on
long time series . . . . . . . . . . . . 1363--1375
Hanzhi Wang and
Zhewei Wei and
Junhao Gan and
Ye Yuan and
Xiaoyong Du and
Ji-Rong Wen Edge-based local push for personalized
PageRank . . . . . . . . . . . . . . . . 1376--1389
Harry Kai-Ho Chan and
Huan Li and
Xiao Li and
Hua Lu Continuous social distance monitoring in
indoor space . . . . . . . . . . . . . . 1390--1402
Xibo Sun and
Shixuan Sun and
Qiong Luo and
Bingsheng He An in-depth study of continuous subgraph
matching . . . . . . . . . . . . . . . . 1403--1416
Abhinav Mishra and
Ram Sriharsha and
Sichen Zhong OnlineSTL: scaling time series
decomposition by $ 100 \times $ . . . . 1417--1425
Haoyu Li and
Qizhi Chen and
Yixin Zhang and
Tong Yang and
Bin Cui Stingy sketch: a sketch framework for
accurate and fast frequency estimation 1426--1438
Yang Wang and
Miao Yu and
Yujie Hui and
Fang Zhou and
Yuyang Huang and
Rui Zhu and
Xueyuan Ren and
Tianxi Li and
Xiaoyi Lu A study of database performance
sensitivity to experiment settings . . . 1439--1452
Zemin Chao and
Hong Gao and
Yinan An and
Jianzhong Li The inherent time complexity and an
efficient algorithm for subsequence
matching problem . . . . . . . . . . . . 1453--1465
Chengliang Chai and
Jiabin Liu and
Nan Tang and
Guoliang Li and
Yuyu Luo Selective data acquisition in the wild
for model charging . . . . . . . . . . . 1466--1478
Wenfei Fan and
Wenzhi Fu and
Ruochun Jin and
Ping Lu and
Chao Tian Discovering association rules from big
graphs . . . . . . . . . . . . . . . . . 1479--1492
Xiaolin Han and
Reynold Cheng and
Chenhao Ma and
Tobias Grubenmann DeepTEA: effective and efficient online
time-dependent trajectory outlier
detection . . . . . . . . . . . . . . . 1493--1505
Giovanni Simonini and
Luca Zecchini and
Sonia Bergamaschi and
Felix Naumann Entity resolution on-demand . . . . . . 1506--1518
Afnan Alhazmi and
Tom Blount and
George Konstantinidis ForBackBench: a benchmark for chasing
vs. query-rewriting . . . . . . . . . . 1519--1532
Jeremy Chen and
Yuqing Huang and
Mushi Wang and
Semih Salihoglu and
Ken Salem Accurate summary-based cardinality
estimation through the lens of
cardinality estimation graphs . . . . . 1533--1545
Xuankun Liao and
Qing Liu and
Jiaxin Jiang and
Xin Huang and
Jianliang Xu and
Byron Choi Distributed D-core decomposition over
large directed graphs . . . . . . . . . 1546--1558
Lu Chen and
Chengfei Liu and
Rui Zhou and
Jiajie Xu and
Jianxin Li Efficient maximal biclique enumeration
for large sparse bipartite graphs . . . 1559--1571
Hongkuan Zhou and
Da Zheng and
Israt Nisa and
Vasileios Ioannidis and
Xiang Song and
George Karypis TGL: a general framework for temporal
GNN training on billion-scale graphs . . 1572--1580
Binhang Yuan and
Cameron R. Wolfe and
Chen Dun and
Yuxin Tang and
Anastasios Kyrillidis and
Chris Jermaine Distributed learning of fully connected
neural networks using independent subnet
training . . . . . . . . . . . . . . . . 1581--1590
Sebastian Burckhardt and
Badrish Chandramouli and
Chris Gillum and
David Justo and
Konstantinos Kallas and
Connor McMahon and
Christopher S. Meiklejohn and
Xiangfeng Zhu Netherite: efficient execution of
serverless workflows . . . . . . . . . . 1591--1604
Andy Huynh and
Harshal A. Chaudhari and
Evimaria Terzi and
Manos Athanassoulis Endure: a robust tuning paradigm for LSM
trees under workload uncertainty . . . . 1605--1618
Hongzheng Li and
Yingxia Shao and
Junping Du and
Bin Cui and
Lei Chen An I/O-efficient disk-based graph system
for scalable second-order random walk of
large graphs . . . . . . . . . . . . . . 1619--1631
Kapil Vaidya and
Subarna Chatterjee and
Eric Knorr and
Michael Mitzenmacher and
Stratos Idreos and
Tim Kraska SNARF: a learning-enhanced range filter 1632--1644
Xin Chen and
You Peng and
Sibo Wang and
Jeffrey Xu Yu DLCR: efficient indexing for
label-constrained reachability queries
on large dynamic graphs . . . . . . . . 1645--1657
Yue Zhao and
Gao Cong and
Jiachen Shi and
Chunyan Miao QueryFormer: a tree transformer model
for query plan representation . . . . . 1658--1670
Leon Lee and
Siphrey Xie and
Yunus Ma and
Shimin Chen Index checkpoints for instant recovery
in in-memory database systems . . . . . 1671--1683
Mahdi Esmailoghli and
Jorge-Arnulfo Quiané-Ruiz and
Ziawasch Abedjan MATE: multi-attribute table extraction 1684--1696
John Paparrizos and
Yuhao Kang and
Paul Boniol and
Ruey S. Tsay and
Themis Palpanas and
Michael J. Franklin TSB-UAD: an end-to-end benchmark suite
for univariate time-series anomaly
detection . . . . . . . . . . . . . . . 1697--1711
Manuel Leone and
Stefano Huber and
Akhil Arora and
Alberto García-Durán and
Robert West A critical re-evaluation of neural
methods for entity alignment . . . . . . 1712--1725
Matteo Paganelli and
Francesco Del Buono and
Andrea Baraldi and
Francesco Guerra Analyzing how BERT performs entity
matching . . . . . . . . . . . . . . . . 1726--1738
Balaji Arun and
Binoy Ravindran Scalable Byzantine fault tolerance via
partial decentralization . . . . . . . . 1739--1752
Huan Li and
Lanjing Yi and
Bo Tang and
Hua Lu and
Christian S. Jensen Efficient and error-bounded
spatiotemporal quantile monitoring in
edge computing environments . . . . . . 1753--1765
Fumiyuki Kato and
Tsubasa Takahashi and
Shun Takagi and
Yang Cao and
Seng Pei Liew and
Masatoshi Yoshikawa HDPView: differentially private
materialized view for exploring high
dimensional relational data . . . . . . 1766--1778
Sebastian Schmidl and
Phillip Wenig and
Thorsten Papenbrock Anomaly detection in time series: a
comprehensive evaluation . . . . . . . . 1779--1797
Brit Youngmann and
Sihem Amer-Yahia and
Aurelien Personnaz Guided exploration of data summaries . . 1798--1807
Xinyi Zhang and
Zhuo Chang and
Yang Li and
Hong Wu and
Jian Tan and
Feifei Li and
Bin Cui Facilitating database tuning with
hyper-parameter optimization: a
comprehensive experimental evaluation 1808--1821
Zuan Wang and
Xiaofeng Ding and
Hai Jin and
Pan Zhou Efficient secure and verifiable
location-based skyline queries over
encrypted data . . . . . . . . . . . . . 1822--1834
Zhuoyue Zhao and
Dong Xie and
Feifei Li AB-tree: index for concurrent random
sampling and updates . . . . . . . . . . 1835--1847
Chenguang Fang and
Shaoxu Song and
Yinan Mei On repairing timestamps for regular
interval time series . . . . . . . . . . 1848--1860
Wenfei Fan and
Ruochun Jin and
Ping Lu and
Chao Tian and
Ruiqi Xu Towards event prediction in temporal
graphs . . . . . . . . . . . . . . . . . 1861--1874
Yihuai Liang and
Yan Li and
Byeong-Seok Shin Decentralized crowdsourcing for human
intelligence tasks with efficient
on-chain cost . . . . . . . . . . . . . 1875--1888
Yue Wang and
Ruiqi Xu and
Xun Jian and
Alexander Zhou and
Lei Chen Towards distributed bitruss
decomposition on bipartite graphs . . . 1889--1901
Luca Gagliardelli and
George Papadakis and
Giovanni Simonini and
Sonia Bergamaschi and
Themis Palpanas Generalized supervised meta-blocking . . 1902--1910
Mijin An and
Soojun Im and
Dawoon Jung and
Sang-Won Lee Your read is our priority in flash
storage . . . . . . . . . . . . . . . . 1911--1923
Arindam Bhattacharya and
Chathur Gudesa and
Amitabha Bagchi and
Srikanta Bedathur New wine in an old bottle: data-aware
hash functions for Bloom filters . . . . 1924--1936
Jingshu Peng and
Zhao Chen and
Yingxia Shao and
Yanyan Shen and
Lei Chen and
Jiannong Cao Sancus: staleness-aware
communication-avoiding full-graph
decentralized training in large-scale
graph neural networks . . . . . . . . . 1937--1950
Marco Bucchi and
Alejandro Grez and
Andrés Quintana and
Cristian Riveros and
Stijn Vansummeren CORE: a complex event recognition engine 1951--1964
Audrey Cheng and
Xiao Shi and
Aaron Kabcenell and
Shilpa Lawande and
Hamza Qadeer and
Jason Chan and
Harrison Tin and
Ryan Zhao and
Peter Bailis and
Mahesh Balakrishnan and
Nathan Bronson and
Natacha Crooks and
Ion Stoica TAOBench: an end-to-end benchmark for
social network workloads . . . . . . . . 1965--1977
Aarati Kakaraparthy and
Jignesh M. Patel and
Brian P. Kroth and
Kwanghyun Park VIP hashing: adapting to skew in
popularity of data on the fly . . . . . 1978--1990
Tobias Vinçon and
Christian Knödler and
Leonardo Solis-Vasquez and
Arthur Bernhardt and
Sajjad Tamimi and
Lukas Weber and
Florian Stock and
Andreas Koch and
Ilia Petrov Near-data processing in database systems
on native computational storage under
HTAP workloads . . . . . . . . . . . . . 1991--2004
Karima Echihabi and
Panagiota Fatourou and
Kostas Zoumpatianos and
Themis Palpanas and
Houda Benbrahim Hercules against data series similarity
search . . . . . . . . . . . . . . . . . 2005--2018
Tarique Siddiqui and
Wentao Wu and
Vivek Narasayya and
Surajit Chaudhuri DISTILL: low-overhead data-driven
techniques for filtering and costing
indexes for scalable index tuning . . . 2019--2031
Zhihui Yang and
Zuozhi Wang and
Yicong Huang and
Yao Lu and
Chen Li and
X. Sean Wang Optimizing machine learning inference
queries with correlative proxy models 2032--2044
Li Su and
Xiaoming Qin and
Zichao Zhang and
Rui Yang and
Le Xu and
Indranil Gupta and
Wenyuan Yu and
Kai Zeng and
Jingren Zhou Banyan: a scoped dataflow engine for
graph query service . . . . . . . . . . 2045--2057
Ziyue Huang and
Yuan Qiu and
Ke Yi and
Graham Cormode Frequency estimation under multiparty
differential privacy: one-shot and
streaming . . . . . . . . . . . . . . . 2058--2070
Ahsan Ali and
Riccardo Pinciroli and
Feng Yan and
Evgenia Smirni Optimizing inference serving on
serverless platforms . . . . . . . . . . 2071--2084
Wail Y. Alkowaileet and
Michael J. Carey Columnar formats for schemaless
LSM-based document stores . . . . . . . 2085--2097
Yu-Xuan Qiu and
Dong Wen and
Lu Qin and
Wentao Li and
Rong-Hua Li and
Ying Zhang Efficient shortest path counting on
large road networks . . . . . . . . . . 2098--2110
Fangcheng Fu and
Xupeng Miao and
Jiawei Jiang and
Huanran Xue and
Bin Cui Towards communication-efficient vertical
federated learning training via
cache-enabled local updates . . . . . . 2111--2120
Yifan Zhu and
Lu Chen and
Yunjun Gao and
Baihua Zheng and
Pengfei Wang DESIRE: an efficient dynamic
cluster-based forest indexing for
similarity search in multi-metric spaces 2121--2133
Junghoon Kim and
Kaiyu Feng and
Gao Cong and
Diwen Zhu and
Wenyuan Yu and
Chunyan Miao ABC: attributed bipartite co-clustering 2134--2147
Jinzhao Xiao and
Yuxiang Huang and
Changyu Hu and
Shaoxu Song and
Xiangdong Huang and
Jianmin Wang Time series data encoding for efficient
storage: a comparative analysis in
Apache IoTDB . . . . . . . . . . . . . . 2148--2160
Teng Zhang and
Jian Tan and
Xin Cai and
Jianying Wang and
Feifei Li and
Jianling Sun SA-LSM: optimize data layout for
LSM-tree based storage using survival
analysis . . . . . . . . . . . . . . . . 2161--2174
Paolo Ferragina and
Giovanni Manzini and
Travis Gagie and
Dominik Köppl and
Gonzalo Navarro and
Manuel Striani and
Francesco Tosoni Improving matrix-vector multiplication
via lossless grammar-compressed matrices 2175--2187
Shangyu Wu and
Yufei Cui and
Jinghuan Yu and
Xuan Sun and
Tei-Wei Kuo and
Chun Jason Xue NFL: robust learned index via
distribution transformation . . . . . . 2188--2200
Hamidreza Zare and
Viveck Ramesh Cadambe and
Bhuvan Urgaonkar and
Nader Alfares and
Praneet Soni and
Chetan Sharma and
Arif A. Merchant LEGOStore: a linearizable
geo-distributed store combining
replication and erasure coding . . . . . 2201--2215
Michael Simpson and
Farnoosh Hashemi and
Laks V. S. Lakshmanan Misinformation mitigation under
differential propagation rates and
temporal penalties . . . . . . . . . . . 2216--2229
Lixi Zhou and
Jiaqing Chen and
Amitabh Das and
Hong Min and
Lei Yu and
Ming Zhao and
Jia Zou Serving deep learning models with
deduplication from relational databases 2230--2243
Zichun Huang and
Shimin Chen Density-optimized intersection-free
mapping and matrix multiplication for
join-project operations . . . . . . . . 2244--2256
Shiva Jahangiri and
Michael J. Carey and
Johann-Christoph Freytag Design trade-offs for a robust dynamic
hybrid hash join . . . . . . . . . . . . 2257--2269
Yannis Foufoulas and
Alkis Simitsis and
Lefteris Stamatogiannakis and
Yannis Ioannidis YeSQL: ``you extend SQL'' with rich and
highly performant user-defined functions
in relational databases . . . . . . . . 2270--2283
Shqiponja Ahmetaj and
Bianca Löhnert and
Magdalena Ortiz and
Mantas Simkus Magic shapes for SHACL validation . . . 2284--2296
Arpit Merchant and
Aristides Gionis and
Michael Mathioudakis Succinct graph representations as
distance oracles: an experimental
evaluation . . . . . . . . . . . . . . . 2297--2306
Yangqin Jiang and
Yixiang Fang and
Chenhao Ma and
Xin Cao and
Chunshan Li Effective community search over large
star-schema heterogeneous information
networks . . . . . . . . . . . . . . . . 2307--2320
Kai Ming Ting and
Zongyou Liu and
Hang Zhang and
Ye Zhu A new distributional treatment for time
series and an anomaly detection
investigation . . . . . . . . . . . . . 2321--2333
Benjamin Denham and
Edmund M-K. Lai and
Roopak Sinha and
M. Asif Naeem Witan: unsupervised labelling function
generation for assisted data programming 2334--2347
Ergute Bao and
Yizheng Zhu and
Xiaokui Xiao and
Yin Yang and
Beng Chin Ooi and
Benjamin Hong Meng Tan and
Khin Mi Mi Aung Skellam mixture mechanism: a novel
approach to federated learning with
differential privacy . . . . . . . . . . 2348--2360
Benjamin Hilprecht and
Carsten Binnig Zero-shot cost models for out-of-the-box
learned cost prediction . . . . . . . . 2361--2374
Dalsu Choi and
Hyunsik Yoon and
Hyubjin Lee and
Yon Dohn Chung Waffle: in-memory grid index for moving
objects with reinforcement
learning-based configuration tuning
system . . . . . . . . . . . . . . . . . 2375--2388
Michael Jungmair and
André Kohn and
Jana Giceva Designing an open framework for query
optimization and compilation . . . . . . 2389--2401
Lam-Duy Nguyen and
Sang-Won Lee and
Beomseok Nam In-page shadowing and two-version
timestamp ordering for mobile DBMSs . . 2402--2414
Shixuan Sun and
Xibo Sun and
Bingsheng He and
Qiong Luo RapidFlow: an efficient approach to
continuous subgraph matching . . . . . . 2415--2427
Mossad Helali and
Essam Mansour and
Ibrahim Abdelaziz and
Julian Dolby and
Kavitha Srinivas A scalable AutoML approach based on
graph neural networks . . . . . . . . . 2428--2436
Primal Pappachan and
Shufan Zhang and
Xi He and
Sharad Mehrotra Don't be a tattle-tale: preventing
leakages through data dependencies on
access control protected data . . . . . 2437--2449
Qingyu Xu and
Feng Zhang and
Zhiming Yao and
Lv Lu and
Xiaoyong Du and
Dong Deng and
Bingsheng He Efficient load-balanced butterfly
counting on GPU . . . . . . . . . . . . 2450--2462
Lawrence Benson and
Leon Papke and
Tilmann Rabl PerMA-bench: benchmarking persistent
memory access . . . . . . . . . . . . . 2463--2476
Yuliang He and
Duo Lu and
Kaisong Huang and
Tianzheng Wang Evaluating persistent memory range
indexes: part two . . . . . . . . . . . 2477--2490
Bobbi W. Yogatama and
Weiwei Gong and
Xiangyao Yu Orchestrating data placement and query
execution in heterogeneous CPU-GPU DBMS 2491--2503
Weicheng Wang and
Raymond Chi-Wing Wong Interactive mining with ordered and
unordered attributes . . . . . . . . . . 2504--2516
Wenzhe Yang and
Sheng Wang and
Yuan Sun and
Zhiyong Peng Fast dataset search with earth mover's
distance . . . . . . . . . . . . . . . . 2517--2529
João L. M. Pereira and
João Casanova and
Helena Galhardas and
Dennis Shasha AcX: system, techniques, and experiments
for acronym expansion . . . . . . . . . 2530--2544
Hongzhi Chen and
Changji Li and
Chenguang Zheng and
Chenghuan Huang and
Juncheng Fang and
James Cheng and
Jian Zhang G-tran: a high performance distributed
graph database with a decentralized
architecture . . . . . . . . . . . . . . 2545--2558
Arnd Christian König and
Yi Shan and
Tobias Ziegler and
Aarati Kakaraparthy and
Willis Lang and
Justin Moeller and
Ajay Kalhan and
Vivek Narasayya Tenant placement in over-subscribed
database-as-a-service clusters . . . . . 2559--2571
Yue Chen and
Kaiyu Feng and
Gao Cong and
Han Mao Kiah Example-based spatial pattern matching 2572--2584
Zeshun Peng and
Yanfeng Zhang and
Qian Xu and
Haixu Liu and
Yuxiao Gao and
Xiaohua Li and
Ge Yu NeuChain: a fast permissioned blockchain
system with deterministic ordering . . . 2585--2598
Ryan McKenna and
Brett Mullins and
Daniel Sheldon and
Gerome Miklau AIM: an adaptive and iterative mechanism
for differentially private synthetic
data . . . . . . . . . . . . . . . . . . 2599--2612
Etienne Toussaint and
Paolo Guagliardo and
Leonid Libkin and
Juan Sequeda Troubles with nulls, views from the
users . . . . . . . . . . . . . . . . . 2613--2625
Yeonhong Park and
Sunhong Min and
Jae W. Lee Ginex: SSD-enabled billion-scale graph
neural network training on a single
machine via provably optimal in-memory
caching . . . . . . . . . . . . . . . . 2626--2639
Junhua Zhang and
Wentao Li and
Long Yuan and
Lu Qin and
Ying Zhang and
Lijun Chang Shortest-path queries on complex
networks: experiments, analyses, and
improvement . . . . . . . . . . . . . . 2640--2652
Sameera Ghayyur and
Dhrubajyoti Ghosh and
Xi He and
Sharad Mehrotra MIDE: accuracy aware minimally invasive
data exploration for decision support 2653--2665
Dhrubajyoti Ghosh and
Peeyush Gupta and
Sharad Mehrotra and
Roberto Yus and
Yasser Altowim JENNER: just-in-time enrichment in query
processing . . . . . . . . . . . . . . . 2666--2678
Jiaoyi Zhang and
Yihan Gao CARMI: a cache-aware learned index with
a cost-based construction algorithm . . 2679--2691
Ian P. Swift and
Sana Ebrahimi and
Azade Nova and
Abolfazl Asudeh Maximizing fair content spread via edge
suggestion in social networks . . . . . 2692--2705
Axel Hertzschuch and
Claudio Hartmann and
Dirk Habich and
Wolfgang Lehner Turbo-charging SPJ query plans with
learned physical join operator
selections . . . . . . . . . . . . . . . 2706--2718
Chenhao Ma and
Reynold Cheng and
Laks V. S. Lakshmanan and
Xiaolin Han Finding locally densest subgraphs: a
convex programming approach . . . . . . 2719--2732
Zezhi Shao and
Zhao Zhang and
Wei Wei and
Fei Wang and
Yongjun Xu and
Xin Cao and
Christian S. Jensen Decoupled dynamic spatial-temporal graph
neural network for traffic forecasting 2733--2746
Youjie Li and
Amar Phanishayee and
Derek Murray and
Jakub Tarnawski and
Nam Sung Kim Harmony: overcoming the hurdles of GPU
memory capacity to train massive DNN
models on commodity servers . . . . . . 2747--2760
Xuan Luo and
Jian Pei and
Zicun Cong and
Cheng Xu On Shapley value in data assemblage
under independent utility . . . . . . . 2761--2773
John Paparrizos and
Paul Boniol and
Themis Palpanas and
Ruey S. Tsay and
Aaron Elmore and
Michael J. Franklin Volume under the surface: a new accuracy
evaluation measure for time-series
anomaly detection . . . . . . . . . . . 2774--2787
Haoteng Yin and
Muhan Zhang and
Yanbang Wang and
Jianguo Wang and
Pan Li Algorithm and system co-design for
efficient subgraph-based graph
representation learning . . . . . . . . 2788--2796
Michael Freitag and
Alfons Kemper and
Thomas Neumann Memory-optimized multi-version
concurrency control for disk-based
database systems . . . . . . . . . . . . 2797--2810
Dong He and
Supun C. Nakandala and
Dalitso Banda and
Rathijit Sen and
Karla Saur and
Kwanghyun Park and
Carlo Curino and
Jesús Camacho-Rodríguez and
Konstantinos Karanasos and
Matteo Interlandi Query processing on tensor computation
runtimes . . . . . . . . . . . . . . . . 2811--2825
Yifu Tang and
Jianxin Li and
Nur Al Hasan Haldar and
Ziyu Guan and
Jiajie Xu and
Chengfei Liu Reliable community search in dynamic
networks . . . . . . . . . . . . . . . . 2826--2838
Mohammad Javad Amiri and
Boon Thau Loo and
Divyakant Agrawal and
Amr El Abbadi Qanaat: a scalable multi-enterprise
permissioned blockchain system with
confidentiality guarantees . . . . . . . 2839--2852
Tsz Nam Chan and
Leong Hou U. and
Yun Peng and
Byron Choi and
Jianliang Xu Fast network $k$-function-based spatial
analysis . . . . . . . . . . . . . . . . 2853--2866
Robert Lasch and
Thomas Legler and
Norman May and
Bernhard Scheirle and
Kai-Uwe Sattler Cost modelling for optimal data
placement in heterogeneous main memory 2867--2880
Junru Li and
Youyou Lu and
Yiming Zhang and
Qing Wang and
Zhuo Cheng and
Keji Huang and
Jiwu Shu SwitchTx: scalable in-network
coordination for distributed transaction
processing . . . . . . . . . . . . . . . 2881--2894
Lukas Vogel and
Alexander van Renen and
Satoshi Imamura and
Jana Giceva and
Thomas Neumann and
Alfons Kemper Plush: a write-optimized persistent
log-structured hash-table . . . . . . . 2895--2907
Fangyuan Zhang and
Sibo Wang Effective indexing for dynamic
structural graph clustering . . . . . . 2908--2920
Immanuel Trummer CodexDB: synthesizing code for query
processing from natural language
instructions using GPT-3 codex . . . . . 2921--2928
Arnab Phani and
Lukas Erlbacher and
Matthias Boehm UPLIFT: parallelization strategies for
feature transformations in machine
learning workloads . . . . . . . . . . . 2929--2938
Xinjing Zhou and
Xiangyao Yu and
Goetz Graefe and
Michael Stonebraker Lotus: scalable multi-partition
transactions on single-threaded
partitioned databases . . . . . . . . . 2939--2952
Konstantinos Kanellis and
Cong Ding and
Brian Kroth and
Andreas Müller and
Carlo Curino and
Shivaram Venkataraman LlamaTune: sample-efficient DBMS
configuration tuning . . . . . . . . . . 2953--2965
Christian Winter and
Jana Giceva and
Thomas Neumann and
Alfons Kemper On-demand state separation for cloud
data warehousing . . . . . . . . . . . . 2966--2979
Immanuel Trummer BABOONS: black-box optimization of data
summaries in natural language . . . . . 2980--2993
Xiaoying Wang and
Weiyuan Wu and
Jinze Wu and
Yizhou Chen and
Nick Zrymiak and
Changbo Qu and
Lampros Flokas and
George Chow and
Jiannan Wang and
Tianzheng Wang and
Eugene Wu and
Qingqing Zhou ConnectorX: accelerating data loading
from databases to dataframes . . . . . . 2994--3003
Chaichon Wongkham and
Baotong Lu and
Chris Liu and
Zhicong Zhong and
Eric Lo and
Tianzheng Wang Are updatable learned indexes ready? . . 3004--3017
Maximilian Reif and
Thomas Neumann A scalable and generic approach to range
joins . . . . . . . . . . . . . . . . . 3018--3030
Ellen Hohma and
Christian M. M. Frey and
Anna Beer and
Thomas Seidl SCAR: spectral clustering accelerated
and robustified . . . . . . . . . . . . 3031--3044
Michael Benedikt and
Maxime Buron and
Stefano Germano and
Kevin Kappelmann and
Boris Motik Rewriting the infinite chase . . . . . . 3045--3057
Panagiotis Liakos and
Katia Papakonstantinopoulou and
Yannis Kotidis Chimp: efficient lossless floating point
compression for time series databases 3058--3070
Niv Dayan and
Tamar Weiss and
Shmuel Dashevsky and
Michael Pan and
Edward Bortnikov and
Moshe Twitto Spooky: granulating LSM-tree compactions
correctly . . . . . . . . . . . . . . . 3071--3084
Kai Yao and
Lijun Chang and
Jeffrey Xu Yu Identifying similar-bicliques in
bipartite graphs . . . . . . . . . . . . 3085--3097
Chenghao Lyu and
Qi Fan and
Fei Song and
Arnab Sinha and
Yanlei Diao and
Wei Chen and
Li Ma and
Yihui Feng and
Yaliang Li and
Kai Zeng and
Jingren Zhou Fine-grained modeling and optimization
for intelligent resource management in
big data processing . . . . . . . . . . 3098--3111
Ziyi Liu and
Lei Li and
Mengxuan Zhang and
Wen Hua and
Xiaofang Zhou FHL-cube: multi-constraint shortest path
querying with flexible combination of
constraints . . . . . . . . . . . . . . 3112--3125
Michael Abebe and
Horatiu Lazu and
Khuzaima Daudjee Tiresias: enabling predictive autonomous
storage and indexing . . . . . . . . . . 3126--3136
Abolfazl Asudeh and
Fatemeh Nargesian Towards distribution-aware query
answering in data markets . . . . . . . 3137--3144
Suyong Kwon and
Woohwan Jung and
Kyuseok Shim Cardinality estimation of approximate
substring queries using deep learning 3145--3157
Karla Saur and
Tara Mirmira and
Konstantinos Karanasos and
Jesús Camacho-Rodríguez Containerized execution of UDFs: an
experimental evaluation . . . . . . . . 3158--3171
Siyuan Xia and
Zhiru Zhu and
Chris Zhu and
Jinjin Zhao and
Kyle Chard and
Aaron J. Elmore and
Ian Foster and
Michael Franklin and
Sanjay Krishnan and
Raul Castro Fernandez Data station: delegated, trustworthy,
and auditable computation to enable
data-sharing consortia with a data
escrow . . . . . . . . . . . . . . . . . 3172--3185
Khaled Ammar and
Siddhartha Sahu and
Semih Salihoglu and
M. Tamer Özsu Optimizing differentially-maintained
recursive queries on dynamic graphs . . 3186--3198
Zihan Luo and
Lei Li and
Mengxuan Zhang and
Wen Hua and
Yehong Xu and
Xiaofang Zhou Diversified top-$k$ route planning in
road network . . . . . . . . . . . . . . 3199--3212
Xiangmin Zhou and
Lei Chen Migrating social event recommendation
over microblogs . . . . . . . . . . . . 3213--3225
Yueting Chen and
Nick Koudas and
Xiaohui Yu and
Ziqiang Yu Spatial and temporal constrained ranked
retrieval over videos . . . . . . . . . 3226--3239
Ningyi Liao and
Dingheng Mo and
Siqiang Luo and
Xiang Li and
Pengcheng Yin SCARA: scalable graph neural networks
with feature-oriented optimization . . . 3240--3248
Antonis Manousis and
Zhuo Cheng and
Ran Ben Basat and
Zaoxing Liu and
Vyas Sekar Enabling efficient and general
subpopulation analytics in
multidimensional data streams . . . . . 3249--3262
Qing Chen and
Oded Lachish and
Sven Helmer and
Michael H. Böhlen Dynamic spanning trees for connectivity
queries on fully-dynamic undirected
graphs . . . . . . . . . . . . . . . . . 3263--3276
Monica Chiosa and
Fabio Maschi and
Ingo Müller and
Gustavo Alonso and
Norman May Hardware acceleration of compression and
encryption in SAP HANA . . . . . . . . . 3277--3291
Martin Graf and
Lukas Laskowski and
Florian Papsdorf and
Florian Sold and
Roland Gremmelspacher and
Felix Naumann and
Fabian Panse Frost: a platform for benchmarking and
exploring data matching results . . . . 3292--3305
Changji Li and
Hongzhi Chen and
Shuai Zhang and
Yingqian Hu and
Chao Chen and
Zhenjie Zhang and
Meng Li and
Xiangchen Li and
Dongqing Han and
Xiaohui Chen and
Xudong Wang and
Huiming Zhu and
Xuwei Fu and
Tingwei Wu and
Hongfei Tan and
Hengtian Ding and
Mengjin Liu and
Kangcheng Wang and
Ting Ye and
Lei Li and
Xin Li and
Yu Wang and
Chenguang Zheng and
Hao Yang and
James Cheng ByteGraph: a high-performance
distributed graph database in ByteDance 3306--3318
Prakash Das and
Shivangi Srivastava and
Valentin Moskovich and
Anmol Chaturvedi and
Anant Mittal and
Yongqin Xiao and
Mosharaf Chowdhury CDI-E: an elastic cloud service for data
engineering . . . . . . . . . . . . . . 3319--3331
Sheng Wang and
Yiran Li and
Huorong Li and
Feifei Li and
Chengjin Tian and
Le Su and
Yanshan Zhang and
Yubing Ma and
Lie Yan and
Yuanyuan Sun and
Xuntao Cheng and
Xiaolong Xie and
Yu Zou Operon: an encrypted database for
ownership-preserving data management . . 3332--3345
Caixin Gong and
Chengjin Tian and
Zhengheng Wang and
Sheng Wang and
Xiyu Wang and
Qiulei Fu and
Wu Qin and
Long Qian and
Rui Chen and
Jiang Qi and
Ruo Wang and
Guoyun Zhu and
Chenghu Yang and
Wei Zhang and
Feifei Li Tair-PMem: a fully durable non-volatile
memory database . . . . . . . . . . . . 3346--3358
Branimir Lambov Trie memtables in Cassandra . . . . . . 3359--3371
Pedro Pedreira and
Orri Erling and
Masha Basmanova and
Kevin Wilfong and
Laith Sakka and
Krishna Pai and
Wei He and
Biswapesh Chattopadhyay Velox: meta's unified execution engine 3372--3384
Zhenkun Yang and
Chuanhui Yang and
Fusheng Han and
Mingqiang Zhuang and
Bing Yang and
Zhifeng Yang and
Xiaojun Cheng and
Yuzhong Zhao and
Wenhui Shi and
Huafeng Xi and
Huang Yu and
Bin Liu and
Yi Pan and
Boxue Yin and
Junquan Chen and
Quanqing Xu OceanBase: a 707 million tpmC
distributed relational database system 3385--3397
Hai Lan and
Jiong Xie and
Zhifeng Bao and
Feifei Li and
Wei Tian and
Fang Wang and
Sheng Wang and
Ailin Zhang VRE: a versatile, robust, and economical
trajectory data system . . . . . . . . . 3398--3410
Jianjun Chen and
Yonghua Ding and
Ye Liu and
Fangshi Li and
Li Zhang and
Mingyi Zhang and
Kui Wei and
Lixun Cao and
Dan Zou and
Yang Liu and
Lei Zhang and
Rui Shi and
Wei Ding and
Kai Wu and
Shangyu Luo and
Jason Sun and
Yuming Liang ByteHTAP: Bytedance's HTAP system with
high data freshness and strong data
consistency . . . . . . . . . . . . . . 3411--3424
Wolfram Wingerath and
Benjamin Wollmer and
Markus Bestehorn and
Stephan Succo and
Sophie Ferrlein and
Florian Bücklers and
Jörn Domnik and
Fabian Panse and
Erik Witt and
Anil Sener and
Felix Gessert and
Norbert Ritter Beaconnect: continuous web performance
A/B testing at scale . . . . . . . . . . 3425--3431
Zongzhi Chen and
Xinjun Yang and
Feifei Li and
Xuntao Cheng and
Qingda Hu and
Zheyu Miao and
Rongbiao Xie and
Xiaofei Wu and
Kang Wang and
Zhao Song and
Haiqing Sun and
Zechao Zhuang and
Yuming Yang and
Jie Xu and
Liang Yin and
Wenchao Zhou and
Sheng Wang CloudJump: optimizing cloud databases
for cloud storages . . . . . . . . . . . 3432--3444
Kaiping Zheng and
Shaofeng Cai and
Horng Ruey Chua and
Melanie Herschel and
Meihui Zhang and
Beng Chin Ooi DyHealth: making neural networks dynamic
for effective healthcare analytics . . . 3445--3458
Andrey Mishchenko and
Dominique Danco and
Abhilash Jindal and
Adrian Blue Blueprint: a constraint-solving approach
for document extraction . . . . . . . . 3459--3471
Muzhi Yu and
Zhaoxiang Lin and
Jinan Sun and
Runyun Zhou and
Guoqiang Jiang and
Hua Huang and
Shikun Zhang TencentCLS: the cloud log service with
high query performances . . . . . . . . 3472--3482
Jiong Xie and
Zhen Chen and
Jianwei Liu and
Fang Wang and
Feifei Li and
Zhida Chen and
Yinpei Liu and
Songlu Cai and
Zhenhua Fan and
Fei Xiao and
Yue Chen Ganos: a multidimensional, dynamic, and
scene-oriented cloud-native spatial
database engine . . . . . . . . . . . . 3483--3495
Sarath Lakshman and
Apaar Gupta and
Rohan Suri and
Scott Lashley and
John Liang and
Srinath Duvuru and
Ravi Mayuram Magma: a high data density storage
engine used in Couchbase . . . . . . . . 3496--3508
Joyce Cahoon and
Wenjing Wang and
Yiwen Zhu and
Katherine Lin and
Sean Liu and
Raymond Truong and
Neetu Singh and
Chengcheng Wan and
Alexandra Ciortea and
Sreraman Narasimhan and
Subru Krishnan Doppler: automated SKU recommendation in
migrating SQL workloads to the cloud . . 3509--3521
Stavros Harizopoulos and
Taylor Hopper and
Morton Mo and
Shyam Sundar Chandrasekaran and
Tongguang Chen and
Yan Cui and
Nandini Ganesh and
Gary Helmling and
Hieu Pham and
Sebastian Wong Meta's next-generation realtime
monitoring and analytics platform . . . 3522--3534
Kevin P. Gaffney and
Martin Prammer and
Larry Brasfield and
D. Richard Hipp and
Dan Kennedy and
Jignesh M. Patel SQLite: past, present, and future . . . 3535--3547
Rentong Guo and
Xiaofan Luan and
Long Xiang and
Xiao Yan and
Xiaomeng Yi and
Jigao Luo and
Qianya Cheng and
Weizhi Xu and
Jiarui Luo and
Frank Liu and
Zhenshan Cao and
Yanliang Qiao and
Ting Wang and
Bo Tang and
Charles Xie Manu: a cloud native vector database
management system . . . . . . . . . . . 3548--3561
Sainyam Galhotra and
Udayan Khurana Automated relational data explanation
using external semantic knowledge . . . 3562--3565
Andrea Rossi and
Donatella Firmani and
Paolo Merialdo and
Tommaso Teofili Kelpie: an explainability framework for
embedding-based link prediction models 3566--3569
Yin Lin and
Brit Youngmann and
Yuval Moskovitch and
H. V. Jagadish and
Tova Milo OREO: detection of cherry-picked
generalizations . . . . . . . . . . . . 3570--3573
André Kohn and
Dominik Moritz and
Mark Raasveldt and
Hannes Mühleisen and
Thomas Neumann DuckDB-wasm: fast analytical processing
for the web . . . . . . . . . . . . . . 3574--3577
Yihai Xi and
Ning Wang and
Xinyu Chen and
Yiyi Zhang and
Zilong Wang and
Zhihong Xu and
Yue Wang EasyDR: a human-in-the-loop error
detection & repair platform for holistic
table cleaning . . . . . . . . . . . . . 3578--3581
Xuchen Pan and
Yongxin Tong and
Chunbo Xue and
Zimu Zhou and
Junping Du and
Yuxiang Zeng and
Yexuan Shi and
Xiaofei Zhang and
Lei Chen and
Yi Xu and
Ke Xu and
Weifeng Lv Hu-fu: a data federation system for
secure spatial queries . . . . . . . . . 3582--3585
Marius Gassen and
Benjamin Hättasch and
Benjamin Hilprecht and
Nadja Geisler and
Alexander Fraser and
Carsten Binnig Demonstrating CAT: synthesizing
data-aware conversational agents for
transactional databases . . . . . . . . 3586--3589
Aurélien Personnaz and
Brit Youngmann and
Sihem Amer-Yahia EDA4SUM: guided exploration of data
summaries . . . . . . . . . . . . . . . 3590--3593
Chenjie Li and
Juseung Lee and
Zhengjie Miao and
Boris Glavic and
Sudeepa Roy CaJaDE: explaining query results by
augmenting provenance with context . . . 3594--3597
Yuki Asada and
Victor Fu and
Apurva Gandhi and
Advitya Gemawat and
Lihao Zhang and
Dong He and
Vivek Gupta and
Ehi Nosakhare and
Dalitso Banda and
Rathijit Sen and
Matteo Interlandi Share the tensor tea: how databases can
leverage the machine learning ecosystem 3598--3601
Jess Tan and
Desmond Yeo and
Rachael Neoh and
Huey-Eng Chua and
Sourav S Bhowmick MOCHA: a tool for visualizing impact of
operator choices in query execution
plans for database education . . . . . . 3602--3605
Tsz Nam Chan and
Pak Lon Ip and
Kaiyan Zhao and
Leong Hou U and
Byron Choi and
Jianliang Xu LIBKDV: a versatile kernel density
visualization library for geospatial
analytics . . . . . . . . . . . . . . . 3606--3609
Arul Ajmani and
Aayush Shah and
Alexander Shraer and
Adam Storm and
Rebecca Taft and
Oliver Tan and
Nathan VanBenschoten A demonstration of multi-region
CockroachDB . . . . . . . . . . . . . . 3610--3613
Adriane Chapman and
Luca Lauro and
Paolo Missier and
Riccardo Torlone DPDS: assisting data science with data
provenance . . . . . . . . . . . . . . . 3614--3617
Vargha Dadvar and
Lukasz Golab and
Divesh Srivastava POEM: pattern-oriented explanations of
CNN models . . . . . . . . . . . . . . . 3618--3621
Ramon Antonio Rodriges Zalipynis and
Nikita Terlych WebArrayDB: a geospatial array DBMS in
your web browser . . . . . . . . . . . . 3622--3625
Hai Lan and
Yuanjia Zhang and
Zhifeng Bao and
Yu Dong and
Dongxu Huang and
Liu Tang and
Jian Zhang AutoDI: towards an automatic plan
regression analysis . . . . . . . . . . 3626--3629
Susan B. Davidson and
Shay Gershtein and
Tova Milo and
Slava Novgorodov and
May Shoshan PHOcus: efficiently archiving photos . . 3630--3633
Kai Huang and
Qingqing Ye and
Jing Zhao and
Xi Zhao and
Haibo Hu and
Xiaofang Zhou VINCENT: towards efficient exploratory
subgraph search in graph databases . . . 3634--3637
Osnat Drien and
Matanya Freiman and
Yael Amsterdamer ActivePDB: active probabilistic
databases . . . . . . . . . . . . . . . 3638--3641
Tommaso Teofili and
Donatella Firmani and
Nick Koudas and
Paolo Merialdo and
Divesh Srivastava CERTEM: explaining and debugging
black-box entity resolution systems with
CERTA . . . . . . . . . . . . . . . . . 3642--3645
Ahmet Kerem Aksoy and
Pavel Dushev and
Eleni Tzirita Zacharatou and
Holmer Hemsen and
Marcela Charfuelan and
Jorge-Arnulfo Quiané-Ruiz and
Begüm Demir and
Volker Markl Satellite image search in AgoraEO . . . 3646--3649
Li Yan and
Nerissa Xu and
Guozhong Li and
Sourav S Bhowmick and
Byron Choi and
Jianliang Xu SENSOR: data-driven construction of
sketch-based visual query interfaces for
time series data . . . . . . . . . . . . 3650--3653
Angela Bonifati and
Stefania Dumbrava and
Emile Martinez and
Fatemeh Ghasemi and
Malo Jaffré and
Pacôme Luton and
Thomas Pickles DiscoPG: property graph schema discovery
and exploration . . . . . . . . . . . . 3654--3657
Wissam Maamar-Kouadri and
Salima Benbernou and
Mourad Ouziri and
Themis Palpanas and
Iheb Ben Amor SA-Q: observing, evaluating, and
enhancing the quality of the results of
sentiment analysis tools . . . . . . . . 3658--3661
Abdelghny Orogat and
Ahmed El-Roby SmartBench: demonstrating automatic
generation of comprehensive benchmarks
for question answering over knowledge
graphs . . . . . . . . . . . . . . . . . 3662--3665
Jianhong Tu and
Xiaoyue Han and
Ju Fan and
Nan Tang and
Chengliang Chai and
Guoliang Li and
Xiaoyong Du DADER: hands-off entity resolution with
domain adaptation . . . . . . . . . . . 3666--3669
James Gale and
Max Seiden and
Deepanshu Utkarsh and
Jason Frantz and
Rob Woollen and
Ça\ugatay Demiralp Sigma workbook: a spreadsheet for cloud
data warehouses . . . . . . . . . . . . 3670--3673
Zihao Chen and
Zhizhen Xu and
Baokun Han and
Chen Xu and
Weining Qian and
Aoying Zhou ReMac: a matrix computation system with
redundancy elimination . . . . . . . . . 3674--3677
Phillip Wenig and
Sebastian Schmidl and
Thorsten Papenbrock TimeEval: a benchmarking toolkit for
time series anomaly detection algorithms 3678--3681
Alberto Lerner and
Matthias Jasny and
Theo Jepsen and
Carsten Binnig and
Philippe Cudré-Mauroux DBMS annihilator: a high-performance
database workload generator in action 3682--3685
Zhiyu Liang and
Hongzhi Wang FedTSC: a secure federated learning
system for interpretable time series
classification . . . . . . . . . . . . . 3686--3689
Qingshun Wu and
Yafei Li and
Huiling Li and
Di Zhang and
Guanglei Zhu AMRAS: a visual analysis system for
spatial crowdsourcing . . . . . . . . . 3690--3693
Hani Al-Sayeh and
Muhammad Attahir Jibril and
Muhammad Waleed Bin Saeed and
Kai-Uwe Sattler SparkCAD: caching anomalies detector for
Spark applications . . . . . . . . . . . 3694--3697
Wilco v. Leeuwen and
Thomas Mulder and
Bram van de Wall and
George Fletcher and
Nikolay Yakovets AvantGraph query processing engine . . . 3698--3701
Paul Boniol and
John Paparrizos and
Yuhao Kang and
Themis Palpanas and
Ruey S. Tsay and
Aaron J. Elmore and
Michael J. Franklin Theseus: navigating the labyrinth of
time-series anomaly detection . . . . . 3702--3705
Dennis Hofmann and
Peter VanNostrand and
Huayi Zhang and
Yizhou Yan and
Lei Cao and
Samuel Madden and
Elke Rundensteiner A demonstration of AutoOD: a self-tuning
anomaly detection system . . . . . . . . 3706--3709
Sunny Gakhar and
Joyce Cahoon and
Wangchao Le and
Xiangnan Li and
Kaushik Ravichandran and
Hiren Patel and
Marc Friedman and
Brandon Haynes and
Shi Qiao and
Alekh Jindal and
Jyoti Leeka Pipemizer: an optimizer for analytics
data pipelines . . . . . . . . . . . . . 3710--3713
Sergey Redyuk and
Zoi Kaoudi and
Sebastian Schelter and
Volker Markl DORIAN in action: assisted design of
data science pipelines . . . . . . . . . 3714--3717
Yuntian He and
Yue Zhang and
Saket Gurukar and
Srinivasan Parthasarathy WebMILE: democratizing network
representation learning at scale . . . . 3718--3721
Nadja Geisler and
Benjamin Hättasch and
Carsten Binnig Demonstrating quest: a query-driven
framework to explain classification
models on tabular data . . . . . . . . . 3722--3725
Drew Ripberger and
Yifan Gan and
Xueyuan Ren and
Spyros Blanas and
Yang Wang IsoBugView: interactively debugging
isolation bugs in database applications 3726--3729
Yannis Foufoulas and
Alkis Simitsis and
Yannis Ioannidis YeSQL: rich user-defined functions
without the overhead . . . . . . . . . . 3730--3733
Zhihui Yang and
Yicong Huang and
Zuozhi Wang and
Feng Gao and
Yao Lu and
Chen Li and
X. Sean Wang Demonstration of accelerating machine
learning inference queries with
correlative proxy models . . . . . . . . 3734--3737
Xiaozhen Liu and
Zuozhi Wang and
Shengquan Ni and
Sadeem Alsudais and
Yicong Huang and
Avinash Kumar and
Chen Li Demonstration of collaborative and
interactive workflow-based data
analytics in Texera . . . . . . . . . . 3738--3741
Ramon Antonio Rodriges Zalipynis SimDB in action: road traffic
simulations completely inside array DBMS 3742--3745
Gilbert Badaro and
Paolo Papotti Transformers for tabular data
representation: a tutorial on models and
applications . . . . . . . . . . . . . . 3746--3749
Felix Kiehn and
Mareike Schmidt and
Daniel Glake and
Fabian Panse and
Wolfram Wingerath and
Benjamin Wollmer and
Martin Poppinga and
Norbert Ritter Polyglot data management: state of the
art & open challenges . . . . . . . . . . 3750--3753
Abdul Wasay and
Nesime Tatbul and
Justin Gottschlich Machine programming: turning data into
programmer productivity . . . . . . . . 3754--3757
Guoliang Li and
Haowen Dong and
Chao Zhang Cloud databases: new techniques,
challenges, and opportunities . . . . . 3758--3761
Amine Mhedhbi and
Semih Salihoglu Modern techniques for querying
graph-structured relations: foundations,
system implementations, and open
challenges . . . . . . . . . . . . . . . 3762--3765
Yixiang Fang and
Wensheng Luo and
Chenhao Ma Densest subgraph discovery on large
graphs: applications, challenges, and
techniques . . . . . . . . . . . . . . . 3766--3769
Immanuel Trummer From BERT to GPT-3 codex: harnessing the
potential of very large language models
for data management . . . . . . . . . . 3770--3773
Kaisong Huang and
Yuliang He and
Tianzheng Wang The past, present and future of indexing
on persistent memory . . . . . . . . . . 3774--3777
Zoi Kaoudi and
Jorge-Arnulfo Quiané-Ruiz Unified data analytics: state-of-the-art
and open problems . . . . . . . . . . . 3778--3781
Wenfei Fan Big graphs: challenges and opportunities 3782--3797
Sihem Amer-Yahia Towards AI-powered data-driven education 3798--3806
Yizhou Sun and
Jiawei Han and
Xifeng Yan and
Philip S. Yu and
Tianyi Wu Heterogeneous information networks: the
past, the present, and the future . . . 3807--3811
Sudeepa Roy Toward interpretable and actionable data
analysis with explanations and causality 3812--3820
Fatma Özcan Reflections on my data management
research journey (VLDB women in database
research award talk) . . . . . . . . . . 3821--3822
C. Mohan Panel: startups founded by database
researchers . . . . . . . . . . . . . . 3823--3825
Magdalena Balazinska and
Surajit Chaudhuri and
AnHai Doan and
Joseph M. Hellerstein and
Hanuma Kodavalla and
Ippokratis Pandis and
Matei Zaharia Cloud data systems: what are the
opportunities for the database research
community? . . . . . . . . . . . . . . . 3826--3827
Sachin Basil John and
Christoph Koch High-Dimensional Data Cubes . . . . . . 3828--3840
Matteo Ceccarello and
Johann Gamper Fast and Scalable Mining of Time Series
Motifs with Probabilistic Guarantees . . 3841--3853
Daniel Deutch and
Amir Gilad and
Tova Milo and
Amit Mualem and
Amit Somech FEDEX: an Explainability Framework for
Data Exploration Steps . . . . . . . . . 3854--3868
Maria Xekalaki and
Juan Fumero and
Athanasios Stratikopoulos and
Katerina Doka and
Christos Katsakioris and
Constantinos Bitsakos and
Nectarios Koziris and
Christos Kotselidis Enabling Transparent Acceleration of Big
Data Frameworks Using Heterogeneous
Hardware . . . . . . . . . . . . . . . . 3869--3882
Adriano Fazzone and
Tommaso Lanciano and
Riccardo Denni and
Charalampos E. Tsourakakis and
Francesco Bonchi Discovering Polarization Niches via
Dense Subgraphs with Attractors and
Repulsers . . . . . . . . . . . . . . . 3883--3896
Eunjae Lee and
Sam H. Noh and
Jiwon Seo Sage: a System for Uncertain Network
Analysis . . . . . . . . . . . . . . . . 3897--3910
Hongchao Qin and
Rong-Hua Li and
Ye Yuan and
Guoren Wang and
Lu Qin and
Zhiwei Zhang Mining Bursting Core in Large Temporal
Graphs . . . . . . . . . . . . . . . . . 3911--3923
Xiang Yu and
Chengliang Chai and
Guoliang Li and
Jiabin Liu Cost-Based or Learning-Based?: a Hybrid
Query Optimizer for Query Plan Selection 3924--3936
Jingfan Meng and
Huayi Wang and
Jun Xu and
Mitsunori Ogihara ONe Index for All Kernels (ONIAK): a
Zero Re-Indexing LSH Solution to
ANNS-ALT (After Linear Transformation) 3937--3949
Jiachen Shi and
Gao Cong and
Xiao-Li Li Learned Index Benefits: Machine Learning
Based Index Performance Estimation . . . 3950--3962
Jiachuan Wang and
Peng Cheng and
Libin Zheng and
Lei Chen and
Wenjie Zhang Online Ridesharing with Meeting Points 3963--3975
Luigi Bellomarini and
Davide Benedetto and
Matteo Brandetti and
Emanuel Sallinger Exploiting the Power of
Equality-Generating Dependencies in
Ontological Reasoning . . . . . . . . . 3976--3988
Anders Aamand and
Debarati Das and
Evangelos Kipouridis and
Jakob B. T. Knudsen and
Peter M. R. Rasmussen and
Mikkel Thorup No Repetition: Fast and Reliable
Sampling with Highly Concentrated
Hashing . . . . . . . . . . . . . . . . 3989--4001
Lyes Attouche and
Mohamed-Amine Baazizi and
Dario Colazzo and
Giorgio Ghelli and
Carlo Sartiani and
Stefanie Scherzinger Witness Generation for JSON Schema . . . 4002--4014
Shreya Shankar and
Aditya G. Parameswaran Towards Observability for Production
Machine Learning Pipelines . . . . . . . 4015--4022
Sekwon Lee and
Soujanya Ponnapalli and
Sharad Singhal and
Marcos K. Aguilera and
Kimberly Keeton and
Vijay Chidambaram DINOMO: an Elastic, Scalable,
High-Performance Key-Value Store for
Disaggregated Persistent Memory . . . . 4023--4037
Shreya Shankar and
Stephen Macke and
Sarah Chasins and
Andrew Head and
Aditya Parameswaran Bolt-on, Compact, and Rapid Program
Slicing for Notebooks . . . . . . . . . 4038--4047
Weijie Sun and
Zihuan Xu and
Lei Chen Fairness Matters: a Tit-for-Tat Strategy
Against Selfish Mining . . . . . . . . . 4048--4061
Jialin Ding and
Ryan Marcus and
Andreas Kipf and
Vikram Nathan and
Aniruddha Nrusimha and
Kapil Vaidya and
Alexander van Renen and
Tim Kraska SageDB: an Instance-Optimized Data
Analytics System . . . . . . . . . . . . 4062--4078
Keven Richly and
Rainer Schlosser and
Martin Boissier Budget-Conscious Fine-Grained
Configuration Optimization for
Spatio-Temporal Applications . . . . . . 4079--4092
Cheng-Yu Hsieh and
Jieyu Zhang and
Alexander Ratner Nemo: Guiding and Contextualizing Weak
Supervision for Interactive Data
Programming . . . . . . . . . . . . . . 4093--4105
Jeffrey Helt and
Abhinav Sharma and
Daniel J. Abadi and
Wyatt Lloyd and
Jose M. Faleiro C5: cloned concurrency control that
always keeps up . . . . . . . . . . . . 1--14
Ruihong Wang and
Jianguo Wang and
Stratos Idreos and
M. Tamer Özsu and
Walid G. Aref The case for distributed shared-memory
databases with RDMA-enabled memory
disaggregation . . . . . . . . . . . . . 15--22
Chenyuan Wu and
Mohammad Javad Amiri and
Jared Asch and
Heena Nagda and
Qizhen Zhang and
Boon Thau Loo FlexChain: an elastic disaggregated
blockchain . . . . . . . . . . . . . . . 23--36
Zhen Zhang and
Shuai Zheng and
Yida Wang and
Justin Chiu and
George Karypis and
Trishul Chilimbi and
Mu Li and
Xin Jin MiCS: near-linear scaling for training
gigantic model on public cloud . . . . . 37--50
Yi Yang and
Yurong Cheng and
Ye Yuan and
Guoren Wang and
Lei Chen and
Yongjiao Sun Privacy-preserving cooperative online
matching over spatial crowdsourcing
platforms . . . . . . . . . . . . . . . 51--63
Jiayi Wang and
Chengliang Chai and
Nan Tang and
Jiabin Liu and
Guoliang Li Coresets over multiple tables for
feature-rich and data-efficient machine
learning . . . . . . . . . . . . . . . . 64--76
Zihao Zhang and
Huiqi Hu and
Xuan Zhou and
Jiang Wang Starry: multi-master transaction
processing on semi-leader architecture 77--89
Konstantinos Skitsas and
Ioannis G. Papageorgiou and
Mohammad Sadegh Talebi and
Verena Kantere and
Michael N. Katehakis and
Panagiotis Karras SIFTER: space-efficient value iteration
for finite-horizon MDPs . . . . . . . . 90--98
Geoffrey X. Yu and
Markos Markakis and
Andreas Kipf and
Per-Åke Larson and
Umar Farooq Minhas and
Tim Kraska TreeLine: an update-in-place key-value
store for modern storage . . . . . . . . 99--112
Yuchao Tao and
Amir Gilad and
Ashwin Machanavajjhala and
Sudeepa Roy DPXPlain: privately explaining aggregate
query answers . . . . . . . . . . . . . 113--126
Lijun Chang and
Mouyi Xu and
Darren Strash Efficient maximum $k$-plex computation
over large sparse graphs . . . . . . . . 127--139
Tianxun Hu and
Tianzheng Wang and
Qingqing Zhou Online schema evolution is (almost) free
for snapshot databases . . . . . . . . . 140--153
Yifan Wang and
Haodi Ma and
Daisy Zhe Wang LIDER: an efficient high-dimensional
learned index for large-scale dense
passage retrieval . . . . . . . . . . . 154--166
Sina Shaham and
Gabriel Ghinita and
Cyrus Shahabi Models and mechanisms for spatial data
fairness . . . . . . . . . . . . . . . . 167--179
Shixun Huang and
Wenqing Lin and
Zhifeng Bao and
Jiachen Sun Influence maximization in real-world
closed social networks . . . . . . . . . 180--192
Angela Bonifati and
Francesco Del Buono and
Francesco Guerra and
Donato Tiano Time2Feat: learning interpretable
representations for multivariate time
series clustering . . . . . . . . . . . 193--201
Xiaochen Li and
Yuke Hu and
Weiran Liu and
Hanwen Feng and
Li Peng and
Yuan Hong and
Kui Ren and
Zhan Qin OpBoost: a vertical federated tree
boosting framework based on
order-preserving desensitization . . . . 202--215
R. Malinga Perera and
Bastian Oetomo and
Benjamin I. P. Rubinstein and
Renata Borovica-Gajic HMAB: self-driving hierarchy of bandits
for integrated physical database design
tuning . . . . . . . . . . . . . . . . . 216--229
Dimitris Palyvos-Giannas and
Katerina Tzompanaki and
Marina Papatriantafilou and
Vincenzo Gulisano Erebus: explaining the outputs of data
streaming queries . . . . . . . . . . . 230--242
Zhou Zhang and
Zhaole Chu and
Peiquan Jin and
Yongping Luo and
Xike Xie and
Shouhong Wan and
Yun Luo and
Xufei Wu and
Peng Zou and
Chunyang Zheng and
Guoan Wu and
Andy Rudoff PLIN: a persistent learned index for
non-volatile memory with high
performance and instant recovery . . . . 243--255
Zuozhi Wang and
Shengquan Ni and
Avinash Kumar and
Chen Li Fries: fast and consistent runtime
reconfiguration in dataflow systems with
transactional guarantees . . . . . . . . 256--268
Renjie Xiao and
Zijing Tan and
Haojin Wang and
Shuai Ma Fast approximate denial constraint
discovery . . . . . . . . . . . . . . . 269--281
Haoyu Wang and
Shaoxu Song Frequency domain data encoding in Apache
IoTDB . . . . . . . . . . . . . . . . . 282--290
Jiping Zheng and
Yuan Ma and
Wei Ma and
Yanhao Wang and
Xiaoyang Wang Happiness maximizing sets under group
fairness constraints . . . . . . . . . . 291--303
Cedric Renggli and
Xiaozhe Yao and
Luka Kolar and
Luka Rimanic and
Ana Klimovic and
Ce Zhang SHiFT: an efficient, flexible search
engine for transfer learning . . . . . . 304--316
Md. Mouinul Islam and
Dong Wei and
Baruch Schieber and
Senjuti Basu Roy Satisfying complex top-$k$ fairness
constraints by preference substitutions 317--329
Nikolai Karpov and
Qin Zhang SyncSignature: a simple, efficient,
parallelizable framework for tree
similarity joins . . . . . . . . . . . . 330--342
Shuang Yang and
Yahui Sun and
Jiesong Liu and
Xiaokui Xiao and
Rong-Hua Li and
Zhewei Wei Approximating probabilistic group
Steiner trees in graphs . . . . . . . . 343--355
Serafeim Papadias and
Zoi Kaoudi and
Jorge-Arnulfo Quiané-Ruiz and
Volker Markl Space-efficient random walks on
streaming graphs . . . . . . . . . . . . 356--368
Pengfei Wang and
Xiaocan Zeng and
Lu Chen and
Fan Ye and
Yuren Mao and
Junhao Zhu and
Yunjun Gao PromptEM: prompt-tuning for low-resource
generalized entity matching . . . . . . 369--378
Zhihan Guo and
Xinyu Zeng and
Kan Wu and
Wuh-Chwen Hwang and
Ziwei Ren and
Xiangyao Yu and
Mahesh Balakrishnan and
Philip A. Bernstein Cornus: atomic commit for a cloud DBMS
with storage disaggregation . . . . . . 379--392
Haitao Yuan and
Guoliang Li and
Zhifeng Bao Route Travel Time Estimation on a Road
Network Revisited: Heterogeneity,
Proximity, Periodicity and Dynamicity 393--405
Yongji Wu and
Matthew Lentz and
Danyang Zhuo and
Yao Lu Serving and Optimizing Machine Learning
Workflows on Heterogeneous
Infrastructures . . . . . . . . . . . . 406--419
Zixuan Geng and
Maximilian Schleich and
Dan Suciu Computing Rule-Based Explanations by
Leveraging Counterfactuals . . . . . . . 420--432
Jinfeng Peng and
Derong Shen and
Nan Tang and
Tieying Liu and
Yue Kou and
Tiezheng Nie and
Hang Cui and
Ge Yu Self-Supervised and Interpretable Data
Cleaning with Sequence Generative
Adversarial Networks . . . . . . . . . . 433--446
Francisco Romero and
Johann Hauswald and
Aditi Partap and
Daniel Kang and
Matei Zaharia and
Christos Kozyrakis Optimizing Video Analytics with
Declarative Model Relationships . . . . 447--460
Jiaxin Jiang and
Yuan Li and
Bingsheng He and
Bryan Hooi and
Jia Chen and
Johan Kok Zhi Kang Spade: a Real-Time Fraud Detection
Framework on Evolving Graphs . . . . . . 461--469
Xupeng Miao and
Yujie Wang and
Youhe Jiang and
Chunan Shi and
Xiaonan Nie and
Hailin Zhang and
Bin Cui Galvatron: Efficient Transformer
Training over Multiple GPUs Using
Automatic Parallelism . . . . . . . . . 470--479
Qitong Wang and
Stephen Whitmarsh and
Vincent Navarro and
Themis Palpanas iEDeaL: a Deep Learning Framework for
Detecting Highly Imbalanced Interictal
Epileptiform Discharges . . . . . . . . 480--490
Eleni Zapridou and
Ioannis Mytilinis and
Anastasia Ailamaki Dalton: Learned Partitioning for
Distributed Data Streams . . . . . . . . 491--504
Ali Behrouz and
Farnoosh Hashemi and
Laks V. S. Lakshmanan FirmTruss Community Search in Multilayer
Networks . . . . . . . . . . . . . . . . 505--518
Tianyang Xu and
Zhao Lu and
Yuanyuan Zhu Efficient Triangle-Connected Truss
Community Search in Dynamic Graphs . . . 519--531
Ibrahim Sabek and
Kapil Vaidya and
Dominik Horn and
Andreas Kipf and
Michael Mitzenmacher and
Tim Kraska Can Learned Models Replace Hash
Functions? . . . . . . . . . . . . . . . 532--545
Yue Zhao and
George H. Chen and
Zhihao Jia TOD: GPU-Accelerated Outlier Detection
via Tensor Operations . . . . . . . . . 546--560
Chaohong Ma and
Xiaohui Yu and
Yifan Li and
Xiaofeng Meng and
Aishan Maoliniyazi FILM: a Fully Learned Index for
Larger-Than-Memory Databases . . . . . . 561--573
Miti Mazmudar and
Thomas Humphries and
Jiaxiang Liu and
Matthew Rafuse and
Xi He Cache Me If You Can: Accuracy-Aware
Inference Engine for Differentially
Private Data Exploration . . . . . . . . 574--586
Francesca Falzon and
Evangelia Anna Markatou and
Zachary Espiritu and
Roberto Tamassia Range Search over Encrypted
Multi-Attribute Data . . . . . . . . . . 587--600
Xuanle Ren and
Le Su and
Zhen Gu and
Sheng Wang and
Feifei Li and
Yuan Xie and
Song Bian and
Chao Li and
Fan Zhang HEDA: Multi-Attribute Unbounded
Aggregation over Homomorphically
Encrypted Database . . . . . . . . . . . 601--614
Chih-Ya Shen and
Shao-Heng Ko and
Guang-Siang Lee and
Wang-Chien Lee and
De-Nian Yang Density Personalized Group Query . . . . 615--628
Jinkun Geng and
Anirudh Sivaraman and
Balaji Prabhakar and
Mendel Rosenblum Nezha: Deployable and High-Performance
Consensus Using Synchronized Clocks . . 629--642
Ishtiyaque Ahmad and
Divyakant Agrawal and
Amr El Abbadi and
Trinabh Gupta Pantheon: Private Retrieval from Public
Key--Value Store . . . . . . . . . . . . 643--656
Francesco Da Dalt and
Simon Scherrer and
Adrian Perrig Bayesian Sketches for Volume Estimation
in Data Streams . . . . . . . . . . . . 657--669
Moin Hussain Moti and
Panagiotis Simatis and
Dimitris Papadias Waffle: a Workload-Aware and
Query-Sensitive Framework for Disk-Based
Spatial Indexing . . . . . . . . . . . . 670--683
Eduardo H. M. Pena and
Fabio Porto and
Felix Naumann Fast Algorithms for Denial Constraint
Discovery . . . . . . . . . . . . . . . 684--696
Pu Jiao and
Sheng Di and
Hanqi Guo and
Kai Zhao and
Jiannan Tian and
Dingwen Tao and
Xin Liang and
Franck Cappello Toward Quantity-of-Interest Preserving
Lossy Compression for Scientific Data 697--710
Gunduz Vehbi Demirci and
Aparajita Haldar and
Hakan Ferhatosmanoglu Scalable Graph Convolutional Network
Training on Distributed-Memory Systems 711--724
Patrick Schäfer and
Ulf Leser Motiflets: Simple and Accurate Detection
of Motifs in Time Series . . . . . . . . 725--737
Avanika Narayan and
Ines Chami and
Laurel Orr and
Christopher Ré Can Foundation Models Wrangle Your Data? 738--746
Bogyeong Kim and
Kyoseung Koo and
Undraa Enkhbat and
Sohyun Kim and
Juhun Kim and
Bongki Moon M2Bench: a Database Benchmark for
Multi-Model Analytic Workloads . . . . . 747--759
Nirvik Baruah and
Peter Kraft and
Fiodar Kazhamiaka and
Peter Bailis and
Matei Zaharia Parallelism-Optimizing Data Placement
for Faster Data-Parallel Computations 760--771
Teddy Lazebnik and
Amit Somech and
Abraham Itzhak Weinberg SubStrat: a Subset-Based Optimization
Strategy for Faster AutoML . . . . . . . 772--780
Saket Gurukar and
Nikil Pancha and
Andrew Zhai and
Eric Kim and
Samson Hu and
Srinivasan Parthasarathy and
Charles Rosenberg and
Jure Leskovec MultiBiSage: a Web-Scale Recommendation
System Using Multiple Bipartite Graphs
at Pinterest . . . . . . . . . . . . . . 781--789
Alexandros Zeakis and
Dimitrios Skoutas and
Dimitris Sacharidis and
Odysseas Papapetrou and
Manolis Koubarakis TokenJoin: Efficient Filtering for Set
Similarity Join with Maximum Weighted
Bipartite Matching . . . . . . . . . . . 790--802
Moe Kayali and
Dan Suciu Quasi-Stable Coloring for Graph
Compression: Approximating Max-Flow,
Linear Programs, and Centrality . . . . 803--815
David Pujol and
Albert Sun and
Brandon Fain and
Ashwin Machanavajjhala Multi-Analyst Differential Privacy for
Online Query Answering . . . . . . . . . 816--828
Tim Gubner and
Peter Boncz Excalibur: a Virtual Machine for
Adaptive Fine-grained JIT-Compiled Query
Execution based on VOILA . . . . . . . . 829--841
Lianke Qin and
Rajesh Jayaram and
Elaine Shi and
Zhao Song and
Danyang Zhuo and
Shumo Chu Adore: Differentially Oblivious
Relational Database Operators . . . . . 842--855
Shadaj Laddad and
Conor Power and
Mae Milano and
Alvin Cheung and
Natacha Crooks and
Joseph M. Hellerstein Keep CALM and CRDT On . . . . . . . . . 856--863
Kejing Lu and
Yoshiharu Ishikawa and
Chuan Xiao MQH: Locality Sensitive Hashing on
Multi-level Quantization Errors for
Point-to-Hyperplane Distances . . . . . 864--876
Gábor Szárnyas and
Jack Waudby and
Benjamin A. Steer and
Dávid Szakállas and
Altan Birler and
Mingxi Wu and
Yuchen Zhang and
Peter Boncz The LDBC Social Network Benchmark:
Business Intelligence Workload . . . . . 877--890
Shuai An and
Yang Cao Making Cache Monotonic and Consistent 891--904
Ziyun Wei and
Immanuel Trummer SkinnerMT: Parallelizing for Efficiency
and Robustness in Adaptive Query
Processing on Multicore Platforms . . . 905--917
Dujian Ding and
Sihem Amer-Yahia and
Laks Lakshmanan On Efficient Approximate Queries over
Machine Learning Models . . . . . . . . 918--931
Aamod Khatiwada and
Roee Shraga and
Wolfgang Gatterbauer and
Renée J. Miller Integrating Data Lake Tables . . . . . . 932--945
Hongbo Kang and
Yiwei Zhao and
Guy E. Blelloch and
Laxman Dhulipala and
Yan Gu and
Charles McGuffey and
Phillip B. Gibbons PIM-Tree: a Skew-Resistant Index for
Processing-in-Memory . . . . . . . . . . 946--958
Zhijia Chen and
Weiyi Meng and
Eduard Dragut Web Record Extraction with Invariants 959--972
Yong Wang and
Guoliang Li and
Kaiyu Li and
Haitao Yuan A Deep Generative Model for Trajectory
Modeling and Utilization . . . . . . . . 973--985
Zihuan Xu and
Lei Chen L2chain: Towards High-performance,
Confidential and Secure Layer-2
Blockchain Solution for Decentralized
Applications . . . . . . . . . . . . . . 986--999
Mingxuan Li and
Yazhe Wang and
Shuai Ma and
Chao Liu and
Dongdong Huo and
Yu Wang and
Zhen Xu Auto-Tuning with Reinforcement Learning
for Permissioned Blockchain Systems . . 1000--1012
Minhui Xie and
Youyou Lu and
Qing Wang and
Yangyang Feng and
Jiaqiang Liu and
Kai Ren and
Jiwu Shu PetPS: Supporting Huge Embedding Models
with Persistent Memory . . . . . . . . . 1013--1022
Kashif Rabbani and
Matteo Lissandrini and
Katja Hose Extraction of Validating Shapes from
Very Large Knowledge Graphs . . . . . . 1023--1032
Pu Pang and
Gang Deng and
Kaihao Bai and
Quan Chen and
Shixuan Sun and
Bo Liu and
Yu Xu and
Hongbo Yao and
Zhengheng Wang and
Xiyu Wang and
Zheng Liu and
Zhuo Song and
Yong Yang and
Tao Ma and
Minyi Guo Async-Fork: Mitigating Query Latency
Spikes Incurred by the Fork-based
Snapshot Mechanism from the OS Level . . 1033--1045
Qichen Wang and
Xiao Hu and
Binyang Dai and
Ke Yi Change Propagation Without Joins . . . . 1046--1058
Yuexiang Xie and
Zhen Wang and
Dawei Gao and
Daoyuan Chen and
Liuyi Yao and
Weirui Kuang and
Yaliang Li and
Bolin Ding and
Jingren Zhou FederatedScope: a Flexible Federated
Learning Platform for Heterogeneity . . 1059--1072
Boyang Li and
Yurong Cheng and
Ye Yuan and
Yi Yang and
QianQian Jin and
Guoren Wang ACTA: Autonomy and Coordination Task
Assignment in Spatial Crowdsourcing
Platforms . . . . . . . . . . . . . . . 1073--1085
Taegeon Um and
Byungsoo Oh and
Byeongchan Seo and
Minhyeok Kweun and
Goeun Kim and
Woo-Yeon Lee FastFlow: Accelerating Deep Learning
Model Training with Smart Offloading of
Input Data Pipeline . . . . . . . . . . 1086--1099
Xi Zhao and
Bolong Zheng and
Xiaomeng Yi and
Xiaofan Luan and
Charles Xie and
Xiaofang Zhou and
Christian S. Jensen FARGO: Fast Maximum Inner Product Search
via Global Multi-Probing . . . . . . . . 1100--1112
Martin Kiefer and
Ilias Poulakis and
Eleni Tzirita Zacharatou and
Volker Markl Optimistic Data Parallelism for
FPGA-Accelerated Sketching . . . . . . . 1113--1125
Héber H. Arcolezi and
Sébastien Gambs and
Jean-François Couchot and
Catuscia Palamidessi On the Risks of Collecting
Multidimensional Data Under Local
Differential Privacy . . . . . . . . . . 1126--1139
Manos Chatzakis and
Panagiota Fatourou and
Eleftherios Kosmas and
Themis Palpanas and
Botao Peng Odyssey: a Journey in the Land of
Distributed Data Series Similarity
Search . . . . . . . . . . . . . . . . . 1140--1153
Lanting Fang and
Kaiyu Feng and
Jie Gui and
Shanshan Feng and
Aiqun Hu Anonymous Edge Representation for
Inductive Anomaly Detection in Dynamic
Bipartite Graph . . . . . . . . . . . . 1154--1167
Junyong Yang and
Ming Zhong and
Yuanyuan Zhu and
Tieyun Qian and
Mengchi Liu and
Jeffrey Xu Yu Scalable Time-Range $k$-Core Query on
Temporal Graphs . . . . . . . . . . . . 1168--1180
Erkang Zhu and
Silu Huang and
Surajit Chaudhuri High-Performance Row Pattern Recognition
Using Joins . . . . . . . . . . . . . . 1181--1195
Kelin Luo and
Alexandre M. Florio and
Syamantak Das and
Xiangyu Guo A Hierarchical Grouping Algorithm for
the Multi-Vehicle Dial-a-Ride Problem 1195--1207
Xiaoxuan Liu and
Shuxian Wang and
Mengzhu Sun and
Sicheng Pan and
Ge Li and
Siddharth Jha and
Cong Yan and
Junwen Yang and
Shan Lu and
Alvin Cheung Leveraging Application Data Constraints
to Optimize Database-Backed Web
Applications . . . . . . . . . . . . . . 1208--1221
Ferdinand Gruber and
Maximilian Bandle and
Alexis Engelke and
Thomas Neumann and
Jana Giceva Bringing Compiling Databases to RISC
Architectures . . . . . . . . . . . . . 1222--1234
Hokeun Cha and
Xiangpeng Hao and
Tianzheng Wang and
Huanchen Zhang and
Aditya Akella and
Xiangyao Yu B$^{\rm link}$-hash: an Adaptive Hybrid
Index for In-Memory Time-Series
Databases . . . . . . . . . . . . . . . 1235--1248
Wentao Huang and
Yunhong Ji and
Xuan Zhou and
Bingsheng He and
Kian-Lee Tan A Design Space Exploration and
Evaluation for Main-Memory Hash Joins in
Storage Class Memory . . . . . . . . . . 1249--1263
Kaile Huang and
Si Liu and
Zhenge Chen and
Hengfeng Wei and
David Basin and
Haixiang Li and
Anqun Pan Efficient Black-Box Checking of Snapshot
Isolation in Databases . . . . . . . . . 1264--1276
Zitao Li and
Tianhao Wang and
Ninghui Li Differentially Private Vertical
Federated Clustering . . . . . . . . . . 1277--1290
Fuheng Zhao and
Punnal Ismail Khan and
Divyakant Agrawal and
Amr El Abbadi and
Arpit Gupta and
Zaoxing Liu Panakos: Chasing the Tails for
Multidimensional Data Streams . . . . . 1291--1304
Jonathan Fürst and
Mauricio Fadel Argerich and
Bin Cheng VersaMatch: Ontology Matching with Weak
Supervision . . . . . . . . . . . . . . 1305--1318
Yushi Sun and
Hao Xin and
Lei Chen RECA: Related Tables Enhanced Column
Semantic Type Annotation Framework . . . 1319--1331
Yiming Li and
Yanyan Shen and
Lei Chen and
Mingxuan Yuan Zebra: When Temporal Graph Neural
Networks Meet Temporal Personalized
PageRank . . . . . . . . . . . . . . . . 1332--1345
Su Feng and
Boris Glavic and
Oliver Kennedy Efficient Approximation of Certain and
Possible Answers for Ranking and Window
Queries over Uncertain Data . . . . . . 1346--1358
Cong Yue and
Tien Tuan Anh Dinh and
Zhongle Xie and
Meihui Zhang and
Gang Chen and
Beng Chin Ooi and
Xiaokui Xiao GlassDB: an Efficient Verifiable Ledger
Database System Through Transparency . . 1359--1371
Qian Zhang and
Jingyao Li and
Hongyao Zhao and
Quanqing Xu and
Wei Lu and
Jinliang Xiao and
Fusheng Han and
Chuanhui Yang and
Xiaoyong Du Efficient Distributed Transaction
Processing in Heterogeneous Networks . . 1372--1385
Zhiguo Jiang and
Hanhua Chen and
Hai Jin Auxo: a Scalable and Efficient Graph
Stream Summarization Structure . . . . . 1386--1398
Xiao He and
Ye Li and
Jian Tan and
Bin Wu and
Feifei Li OneShotSTL: One-Shot Seasonal-Trend
Decomposition For Online Time Series
Anomaly Detection And Forecasting . . . 1399--1412
Alexander van Renen and
Viktor Leis Cloud Analytics Benchmark . . . . . . . 1413--1425
Adnan Alhomssi and
Viktor Leis Scalable and Robust Snapshot Isolation
for High-Performance Storage Engines . . 1426--1438
Xiang Li and
Fabing Li and
Mingyu Gao Flare: a Fast, Secure, and
Memory-Efficient Distributed Analytics
Framework . . . . . . . . . . . . . . . 1439--1452
Mijin An and
Jonghyeok Park and
Tianzheng Wang and
Beomseok Nam and
Sang-Won Lee NV-SQL: Boosting OLTP Performance with
Non-Volatile DIMMs . . . . . . . . . . . 1453--1465
Rong Zhu and
Wei Chen and
Bolin Ding and
Xingguang Chen and
Andreas Pfadler and
Ziniu Wu and
Jingren Zhou Lero: a Learning-to-Rank Query Optimizer 1466--1479
Kitaek Lee and
Insoon Jo and
Jaechan Ahn and
Hyuk Lee and
Hwang Lee and
Woong Sul and
Hyungsoo Jung Deploying Computational Storage for HTAP
DBMSs Takes More Than Just Computation
Offloading . . . . . . . . . . . . . . . 1480--1493
Dixin Tang and
Alan Fekete and
Indranil Gupta and
Aditya G. Parameswaran Transactional Panorama: a Conceptual
Framework for User Perception in
Analytical Visual Interfaces . . . . . . 1494--1506
Derek Paulsen and
Yash Govind and
AnHai Doan Sparkly: a Simple yet Surprisingly
Strong TF/IDF Blocker for Entity
Matching . . . . . . . . . . . . . . . . 1507--1519
Parimarjan Negi and
Ziniu Wu and
Andreas Kipf and
Nesime Tatbul and
Ryan Marcus and
Sam Madden and
Tim Kraska and
Mohammad Alizadeh Robust Query Driven Cardinality
Estimation under Changing Workloads . . 1520--1533
Han Fu and
Chang Liu and
Bin Wu and
Feifei Li and
Jian Tan and
Jianling Sun CatSQL: Towards Real World Natural
Language to SQL Applications . . . . . . 1534--1547
Ilias Azizi and
Karima Echihabi and
Themis Palpanas ELPIS: Graph-Based Similarity Search for
Scalable Data Science . . . . . . . . . 1548--1559
Carl Duffy and
Jaehoon Shim and
Sang-Hoon Kim and
Jin-Soo Kim Dotori: a Key--Value SSD Based KV Store 1560--1572
David Pujol and
Amir Gilad and
Ashwin Machanavajjhala PreFair: Privately Generating
Justifiably Fair Synthetic Data . . . . 1573--1586
Roee Shraga and
Renée J. Miller Explaining Dataset Changes for Semantic
Data Versioning with Explain-Da-V . . . 1587--1600
Mihai Budiu and
Tej Chajed and
Frank McSherry and
Leonid Ryzhyk and
Val Tannen DBSP: Automatic Incremental View
Maintenance for Rich Query Languages . . 1601--1614
Ling Liang and
Jilan Lin and
Zheng Qu and
Ishtiyaque Ahmad and
Fengbin Tu and
Trinabh Gupta and
Yufei Ding and
Yuan Xie SPG: Structure-Private Graph Database
via SqueezePIR . . . . . . . . . . . . . 1615--1628
Jingyuan Zhang and
Ao Wang and
Xiaolong Ma and
Benjamin Carver and
Nicholas John Newman and
Ali Anwar and
Lukas Rupprecht and
Vasily Tarasov and
Dimitrios Skourtis and
Feng Yan and
Yue Cheng InfiniStore: Elastic Serverless Cloud
Storage . . . . . . . . . . . . . . . . 1629--1642
Peng Fang and
Arijit Khan and
Siqiang Luo and
Fang Wang and
Dan Feng and
Zhenli Li and
Wei Yin and
Yuchao Cao Distributed Graph Embedding with
Information-Oriented Random Walks . . . 1643--1656
Shuyuan Zheng and
Yang Cao and
Masatoshi Yoshikawa Secure Shapley Value for Cross-Silo
Federated Learning . . . . . . . . . . . 1657--1670
Xiang Li and
Nuozhou Sun and
Yunqian Luo and
Mingyu Gao SODA: a Set of Fast Oblivious Algorithms
in Distributed Secure Data Analytics . . 1671--1684
Zicong Hong and
Song Guo and
Enyuan Zhou and
Wuhui Chen and
Huawei Huang and
Albert Zomaya GriDB: Scaling Blockchain Database via
Sharding and Off-Chain Cross-Shard
Mechanism . . . . . . . . . . . . . . . 1685--1698
Xun Jian and
Zhiyuan Li and
Lei Chen SUFF: Accelerating Subgraph Matching
with Historical Data . . . . . . . . . . 1699--1711
Haochen He and
Erci Xu and
Shanshan Li and
Zhouyang Jia and
Si Zheng and
Yue Yu and
Jun Ma and
Xiangke Liao When Database Meets New Storage Devices:
Understanding and Exposing Performance
Mismatches via Configurations . . . . . 1712--1725
Grace Fan and
Jin Wang and
Yuliang Li and
Dan Zhang and
Renée J. Miller Semantics-Aware Dataset Discovery from
Data Lakes with Contextualized
Column-Based Representation Learning . . 1726--1739
Kasper Overgaard Mortensen and
Fatemeh Zardbani and
Mohammad Ahsanul Haque and
Steinn Ymir Agustsson and
Davide Mottin and
Philip Hofmann and
Panagiotis Karras Marigold: Efficient $k$-Means Clustering
in High Dimensions . . . . . . . . . . . 1740--1748
Ibrahim Sabek and
Tim Kraska The Case for Learned In-Memory Joins . . 1749--1762
Ruiyuan Li and
Zheng Li and
Yi Wu and
Chao Chen and
Yu Zheng Elf: Erasing-Based Lossless
Floating-Point Compression . . . . . . . 1763--1776
Tianyi Chen and
Jun Gao and
Hedui Chen and
Yaofeng Tu LOGER: a Learned Optimizer Towards
Generating Efficient and Robust Query
Execution Plans . . . . . . . . . . . . 1777--1789
Wim Martens and
Matthias Niewerth and
Tina Popp and
Carlos Rojas and
Stijn Vansummeren and
Domagoj Vrgoc Representing Paths in Graph Database
Pattern Matching . . . . . . . . . . . . 1790--1803
Xiling Li and
Chenkai Weng and
Yongxin Xu and
Xiao Wang and
Jennie Rogers ZKSQL: Verifiable and Efficient Query
Evaluation with Zero-Knowledge Proofs 1804--1816
Chengzhi Piao and
Tingyang Xu and
Xiangguo Sun and
Yu Rong and
Kangfei Zhao and
Hong Cheng Computing Graph Edit Distance via Neural
Graph Matching . . . . . . . . . . . . . 1817--1829
Christine Schäler and
Thomas Hütter and
Martin Schäler Benchmarking the Utility of $w$-Event
Differential Privacy Mechanisms --- When
Baselines Become Mighty Competitors . . 1830--1842
Eriq Augustine and
Lise Getoor Collective Grounding: Applying Database
Techniques to Grounding Templated Models 1843--1855
Jan Niklas Adams and
Cameron Pitsch and
Tobias Brockhoff and
Wil M. P. van der Aalst An Experimental Evaluation of Process
Concept Drift Detection . . . . . . . . 1856--1869
Gerardo Vitagliano and
Mazhar Hameed and
Lan Jiang and
Lucas Reisener and
Eugene Wu and
Felix Naumann Pollock: a Data Loading Benchmark . . . 1870--1882
Yingtai Xiao and
Guanhong Wang and
Danfeng Zhang and
Daniel Kifer Answering Private Linear Queries
Adaptively Using the Common Mechanism 1883--1896
Yuntao Du and
Yujia Hu and
Zhikun Zhang and
Ziquan Fang and
Lu Chen and
Baihua Zheng and
Yunjun Gao LDPTrace: Locally Differentially Private
Trajectory Synthesis . . . . . . . . . . 1897--1909
Xenophon Kitsios and
Panagiotis Liakos and
Katia Papakonstantinopoulou and
Yannis Kotidis Sim-Piece: Highly Accurate Piecewise
Linear Approximation through Similar
Segment Merging . . . . . . . . . . . . 1910--1922
Eugenio Marinelli and
Yiqing Yan and
Virginie Magnone and
Charlotte Dumargne and
Pascal Barbry and
Thomas Heinis and
Raja Appuswamy Towards Migration-Free ``Just-in-Case''
Data Archival for Future Cloud Data
Lakes Using Synthetic DNA . . . . . . . 1923--1929
Zhiyuan Dong and
Zhaoguo Wang and
Xiaodong Zhang and
Xian Xu and
Changgeng Zhao and
Haibo Chen and
Aurojit Panda and
Jinyang Li Fine-Grained Re-Execution for Efficient
Batched Commit of Distributed
Transactions . . . . . . . . . . . . . . 1930--1943
Wenfei Fan and
Resul Tugay and
Yaoshu Wang and
Min Xie and
Muhammad Asif Ali Learning and Deducing Temporal Orders 1944--1957
Xu Chen and
Zhen Wang and
Shuncheng Liu and
Yaliang Li and
Kai Zeng and
Bolin Ding and
Jingren Zhou and
Han Su and
Kai Zheng BASE: Bridging the Gap between Cost and
Latency for Query Optimization . . . . . 1958--1966
Jakub Lemiesz Efficient Framework for Operating on
Data Sketches . . . . . . . . . . . . . 1967--1978
Xi Zhao and
Yao Tian and
Kai Huang and
Bolong Zheng and
Xiaofang Zhou Towards Efficient Index Construction and
Approximate Nearest Neighbor Search in
High-Dimensional Spaces . . . . . . . . 1979--1991
Zhaoyan Sun and
Xuanhe Zhou and
Guoliang Li Learned Index: a Comprehensive
Experimental Evaluation . . . . . . . . 1992--2004
Yanping Zhang and
Johes Bater and
Kartik Nayak and
Ashwin Machanavajjhala Longshot: Indexing Growing Databases
Using MPC and Differential Privacy . . . 2005--2018
John Paparrizos and
Kaize Wu and
Aaron Elmore and
Christos Faloutsos and
Michael J. Franklin Accelerating Similarity Search for
Elastic Measures: a Study and New
Generalization of Lower Bounding
Distances . . . . . . . . . . . . . . . 2019--2032
Chenyuan Wu and
Bhavana Mehta and
Mohammad Javad Amiri and
Ryan Marcus and
Boon Thau Loo AdaChain: a Learned Adaptive Blockchain 2033--2046
Yingli Zhou and
Yixiang Fang and
Wensheng Luo and
Yunming Ye Influential Community Search over Large
Heterogeneous Information Networks . . . 2047--2060
Naheed Anjum Arafat and
Arijit Khan and
Arpit Kumar Rai and
Bishwamittra Ghosh Neighborhood-Based Hypergraph Core
Decomposition . . . . . . . . . . . . . 2061--2074
Janet Layne and
Justin Carpenter and
Edoardo Serra and
Francesco Gullo Temporal SIR-GN: Efficient and Effective
Structural Representation Learning for
Temporal Graphs . . . . . . . . . . . . 2075--2089
Gabriel Haas and
Viktor Leis What Modern NVMe Storage Can Do, and How
to Exploit it: High-Performance I/O for
High-Performance Storage Engines . . . . 2090--2102
Vinay Banakar and
Kan Wu and
Yuvraj Patel and
Kimberly Keeton and
Andrea C. Arpaci-Dusseau and
Remzi H. Arpaci-Dusseau WiscSort: External Sorting for
Byte-Addressable Storage . . . . . . . . 2103--2116
Lorraine A. K. Ayad and
Grigorios Loukides and
Solon P. Pissis Text Indexing for Long Patterns: Anchors
are All you Need . . . . . . . . . . . . 2117--2131
Azim Afroozeh and
Peter Boncz The FastLanes Compression Layout:
Decoding $ > 100 $ Billion Integers per
Second with Scalar Code . . . . . . . . 2132--2144
Cong Yue and
Meihui Zhang and
Changhao Zhu and
Gang Chen and
Dumitrel Loghin and
Beng Chin Ooi VeriBench: Analyzing the Performance of
Database Systems with Verifiability . . 2145--2157
Jiangneng Li and
Zheng Wang and
Gao Cong and
Cheng Long and
Han Mao Kiah and
Bin Cui Towards Designing and Learning Piecewise
Space-Filling Curves . . . . . . . . . . 2158--2171
Xiaoke Zhu and
Yang Liu and
Shuhao Liu and
Wenfei Fan MiniGraph: Querying Big Graphs with a
Single Machine . . . . . . . . . . . . . 2172--2185
Yunyoung Choi and
Kunsoo Park and
Hyunjoon Kim BICE: Exploring Compact Search Space by
Using Bipartite Matching and Cell-Wide
Verification . . . . . . . . . . . . . . 2186--2198
Anxin Tian and
Alexander Zhou and
Yue Wang and
Lei Chen Maximal D-Truss Search in Dynamic
Directed Graphs . . . . . . . . . . . . 2199--2211
Pengfei Li and
Hua Lu and
Rong Zhu and
Bolin Ding and
Long Yang and
Gang Pan DILI: a Distribution-Driven Learned
Index . . . . . . . . . . . . . . . . . 2212--2224
Alexandros Zeakis and
George Papadakis and
Dimitrios Skoutas and
Manolis Koubarakis Pre-Trained Embeddings for Entity
Resolution: an Experimental Analysis . . 2225--2238
Yanping Zheng and
Zhewei Wei and
Jiajun Liu Decoupled Graph Neural Networks for
Large Dynamic Graphs . . . . . . . . . . 2239--2247
Fatemeh Zardbani and
Nikos Mamoulis and
Stratos Idreos and
Panagiotis Karras Adaptive Indexing of Objects with
Spatial Extent . . . . . . . . . . . . . 2248--2260
Xu Chen and
Haitian Chen and
Zibo Liang and
Shuncheng Liu and
Jinghong Wang and
Kai Zeng and
Han Su and
Kai Zheng LEON: a New Framework for ML-Aided Query
Optimization . . . . . . . . . . . . . . 2261--2273
Nuno Faria and
José Pereira and
Ana Nunes Alonso and
Ricardo Vilaça and
Yunus Koning and
Niels Nes TiQuE: Improving the Transactional
Performance of Analytical Systems for
True Hybrid Workloads . . . . . . . . . 2274--2288
Jaeho Bang and
Gaurav Tarlok Kakkar and
Pramod Chunduri and
Subrata Mitra and
Joy Arulraj Seiden: Revisiting Query Processing in
Video Database Systems . . . . . . . . . 2289--2301
Ferdi Kossmann and
Ziniu Wu and
Eugenie Lai and
Nesime Tatbul and
Lei Cao and
Tim Kraska and
Sam Madden Extract-Transform-Load for Video Streams 2302--2315
Sivaprasad Sudhir and
Wenbo Tao and
Nikolay Laptev and
Cyrille Habis and
Michael Cafarella and
Samuel Madden Pando: Enhanced Data Skipping with
Logical Data Partitioning . . . . . . . 2316--2329
Kajetan Maliszewski and
Jorge-Arnulfo Quiané-Ruiz and
Volker Markl Cracking-Like Join for Trusted Execution
Environments . . . . . . . . . . . . . . 2330--2343
Umut Çalikyilmaz and
Sven Groppe and
Jinghua Groppe and
Tobias Winker and
Stefan Prestel and
Farida Shagieva and
Daanish Arya and
Florian Preis and
Le Gruenwald Opportunities for Quantum Acceleration
of Databases: Optimization of Queries
and Transaction Schedules . . . . . . . 2344--2353
Xupeng Miao and
Yining Shi and
Zhi Yang and
Bin Cui and
Zhihao Jia SDPipe: a Semi-Decentralized Framework
for Heterogeneity-Aware
Pipeline-parallel Training . . . . . . . 2354--2363
Bohyun Lee and
Mijin An and
Sang-Won Lee LRU-C: Parallelizing Database I/Os for
Flash SSDs . . . . . . . . . . . . . . . 2364--2376
Zixuan Chen and
Panagiotis Manolios and
Mirek Riedewald Why Not Yet: Fixing a Top-$k$ Ranking
that is Not Fair to Individuals . . . . 2377--2390
Shantanu Sharma and
Yin Li and
Sharad Mehrotra and
Nisha Panwar and
Komal Kumari and
Swagnik Roychoudhury Information-Theoretically Secure and
Highly Efficient Search and Row
Retrieval . . . . . . . . . . . . . . . 2391--2403
Fumiyuki Kato and
Yang Cao and
Masatoshi Yoshikawa Olive: Oblivious Federated Learning on
Trusted Execution Environment against
the Risk of Sparsification . . . . . . . 2404--2417
Chengyang Luo and
Qing Liu and
Yunjun Gao and
Lu Chen and
Ziheng Wei and
Congcong Ge Task: an Efficient Framework for Instant
Error-Tolerant Spatial Keyword Queries
on Road Networks . . . . . . . . . . . . 2418--2430
Besat Kassaie and
Frank Wm. Tompa Autonomously Computable Information
Extraction . . . . . . . . . . . . . . . 2431--2443
Dimitrios Koutsoukos and
Raghav Bhartia and
Michal Friedman and
Ana Klimovic and
Gustavo Alonso NVM: Is it Not Very Meaningful for
Databases? . . . . . . . . . . . . . . . 2444--2457
Yuyang Dong and
Chuan Xiao and
Takuma Nozawa and
Masafumi Enomoto and
Masafumi Oyamada DeepJoin: Joinable Table Discovery with
Pre-Trained Language Models . . . . . . 2458--2470
Yuncheng Wu and
Naili Xing and
Gang Chen and
Tien Tuan Anh Dinh and
Zhaojing Luo and
Beng Chin Ooi and
Xiaokui Xiao and
Meihui Zhang Falcon: a Privacy-Preserving and
Interpretable Vertical Federated
Learning System . . . . . . . . . . . . 2471--2484
Haotian Gao and
Cong Yue and
Tien Tuan Anh Dinh and
Zhiyong Huang and
Beng Chin Ooi Enabling Secure and Efficient Data
Analytics Pipeline Evolution with
Trusted Execution Environment . . . . . 2485--2498
Harish Doraiswamy and
Vikas Kalagi and
Karthik Ramachandra and
Jayant R. Haritsa A Case for Graphics-Driven Query
Processing . . . . . . . . . . . . . . . 2499--2511
Wei Tian and
Jieming Shi and
Siqiang Luo and
Hui Li and
Xike Xie and
Yuanhang Zou Effective and Efficient Route Planning
Using Historical Trajectories on Road
Networks . . . . . . . . . . . . . . . . 2512--2524
Konstantinos Lampropoulos and
Fatemeh Zardbani and
Nikos Mamoulis and
Panagiotis Karras Adaptive Indexing in High-Dimensional
Metric Spaces . . . . . . . . . . . . . 2525--2537
Sen Gao and
Hongchao Qin and
Rong-Hua Li and
Bingsheng He Parallel Colorful $h$-Star Core
Maintenance in Dynamic Graphs . . . . . 2538--2550
Jia Li and
Wenyue Zhao and
Nikos Ntarmos and
Yang Cao and
Peter Buneman MITra: a Framework for Multi-Instance
Graph Traversal . . . . . . . . . . . . 2551--2564
Jiazun Chen and
Yikuan Xia and
Jun Gao CommunityAF: an Example-Based Community
Search Method via Autoregressive Flow 2565--2577
Yiming Lin and
Yeye He and
Surajit Chaudhuri Auto-BI: Automatically Build BI-Models
Leveraging Local Join Prediction and
Global Schema Graph . . . . . . . . . . 2578--2590
Yuemin Zhang and
Qingqing Ye and
Rui Chen and
Haibo Hu and
Qilong Han Trajectory Data Collection with Local
Differential Privacy . . . . . . . . . . 2591--2604
Jian Gao and
Xin Cao and
Xin Yao and
Gong Zhang and
Wei Wang LMSFC: a Novel Multidimensional Index
Based on Learned Monotonic Space Filling
Curves . . . . . . . . . . . . . . . . . 2605--2617
Kexin Rong and
Mihai Budiu and
Athinagoras Skiadopoulos and
Lalith Suresh and
Amy Tai Scaling a Declarative Cluster Manager
Architecture with Query Optimization
Techniques . . . . . . . . . . . . . . . 2618--2631
Mukul Singh and
José Cambronero Sánchez and
Sumit Gulwani and
Vu Le and
Carina Negreanu and
Mohammad Raza and
Gust Verbruggen Cornet: Learning Table Formatting Rules
By Example . . . . . . . . . . . . . . . 2632--2644
Chaoji Zuo and
Dong Deng ARKGraph: All-Range Approximate
$K$-Nearest-Neighbor Graph . . . . . . . 2645--2658
Brit Youngmann and
Michael Cafarella and
Babak Salimi and
Anna Zeng Causal Data Integration . . . . . . . . 2659--2665
Michael Martini and
Daniel Schuster and
Wil M. P. van der Aalst Mining Frequent Infix Patterns from
Concurrency-Aware Process Execution
Variants . . . . . . . . . . . . . . . . 2666--2678
Pedro Pedreira and
Orri Erling and
Konstantinos Karanasos and
Scott Schneider and
Wes McKinney and
Satya R. Valluri and
Mohamed Zait and
Jacques Nadeau The Composable Data Management System
Manifesto . . . . . . . . . . . . . . . 2679--2685
Daniel Schmitt and
Daniel Kocher and
Nikolaus Augsten and
Willi Mann and
Alexander Miller A Two-Level Signature Scheme for Stable
Set Similarity Joins . . . . . . . . . . 2686--2698
Olivier Rodriguez and
Federico Ulliana and
Marie-Laure Mugnier Scalable Reasoning on Document Stores
via Instance-Aware Query Rewriting . . . 2699--2713
Enhao Zhang and
Maureen Daum and
Dong He and
Brandon Haynes and
Ranjay Krishna and
Magdalena Balazinska EQUI-VOCAL: Synthesizing Queries for
Compositional Video Events from Limited
User Interactions . . . . . . . . . . . 2714--2727
Yuhao Zhang and
Arun Kumar Lotan: Bridging the Gap between GNNs and
Scalable Graph Analytics Engines . . . . 2728--2741
Peter Kraft and
Qian Li and
Xinjing Zhou and
Peter Bailis and
Michael Stonebraker and
Matei Zaharia and
Xiangyao Yu Epoxy: ACID Transactions across Diverse
Data Stores . . . . . . . . . . . . . . 2742--2754
Maximilian Böther and
Lawrence Benson and
Ana Klimovic and
Tilmann Rabl Analyzing Vectorized Hash Tables across
CPU Architectures . . . . . . . . . . . 2755--2768
Dominik Durner and
Viktor Leis and
Thomas Neumann Exploiting Cloud Object Storage for
High-Performance Analytics . . . . . . . 2769--2782
Dimitrios Karapiperis and
Christos Tjortjis and
Vassilios S. Verykios A Randomized Blocking Structure for
Streaming Record Linkage . . . . . . . . 2783--2791
Cristian Riveros and
Nicolás Van Sint Jan and
Domagoj Vrgoc REmatch: a Novel Regex Engine for
Finding All Matches . . . . . . . . . . 2792--2804
Junxiong Wang and
Immanuel Trummer and
Ahmet Kara and
Dan Olteanu ADOPT: Adaptively Optimizing Attribute
Orders for Worst-Case Optimal Join
Algorithms via Reinforcement Learning 2805--2817
Zheng Hu and
Weiguo Zheng and
Xiang Lian Triangular Stability Maximization by
Influence Spread over Social Networks 2818--2831
Haoquan Guan and
Ziling Chen and
Shaoxu Song CORE-Sketch: On Exact Computation of
Median Absolute Deviation with Limited
Space . . . . . . . . . . . . . . . . . 2832--2844
Christian Lülf and
Denis Mayr Lima Martins and
Marcos Antonio Vaz Salles and
Yongluan Zhou and
Fabian Gieseke Fast Search-by-Classification for
Large-Scale Databases Using Index-Aware
Decision Trees and Random Forests . . . 2845--2857
Marco Calautti and
Mostafa Milani and
Andreas Pieris Semi-Oblivious Chase Termination for
Linear Existential Rules: an
Experimental Studya . . . . . . . . . . 2858--2870
Kukjin Lee and
Anshuman Dutt and
Vivek Narasayya and
Surajit Chaudhuri Analyzing the Impact of Cardinality
Estimation on Execution Plans in
Microsoft SQL Server . . . . . . . . . . 2871--2883
Jongsung Lee and
Donguk Kim and
Jae W. Lee WALTZ: Leveraging Zone Append to Tighten
the Tail Latency of LSM Tree on ZNS SSD 2884--2896
Matthew Russo and
Tatsunori Hashimoto and
Daniel Kang and
Yi Sun and
Matei Zaharia Accelerating Aggregation Queries on
Unstructured Streams of Data . . . . . . 2897--2910
Qiushi Bai and
Sadeem Alsudais and
Chen Li QueryBooster: Improving SQL Performance
Using Middleware Services for
Human-Centered Query Rewriting . . . . . 2911--2924
Jiongli Zhu and
Sainyam Galhotra and
Nazanin Sabri and
Babak Salimi Consistent Range Approximation for Fair
Predictive Modeling . . . . . . . . . . 2925--2938
Haoteng Yin and
Muhan Zhang and
Jianguo Wang and
Pan Li SUREL+: Moving from Walks to Sets for
Scalable Subgraph-Based Graph
Representation Learning . . . . . . . . 2939--2948
Hanzhi Wang and
Zhewei Wei Estimating Single-Node PageRank in $
\tilde {O}(\min d_t, \sqrt {m}) $ Time 2949--2961
Yunjia Zhang and
Yannis Chronis and
Jignesh M. Patel and
Theodoros Rekatsinas Simple Adaptive Query Processing vs.
Learned Query Optimizers: Observations
and Analysis . . . . . . . . . . . . . . 2962--2975
Helen Xu and
Amanda Li and
Brian Wheatman and
Manoj Marneni and
Prashant Pandey BP-Tree: Overcoming the Point-Range
Operation Tradeoff for In-Memory B-Trees 2976--2989
Ge Lv and
Chen Jason Zhang and
Lei Chen HENCE-X: Toward Heterogeneity-Agnostic
Multi-Level Explainability for Deep
Graph Networks . . . . . . . . . . . . . 2990--3003
Haitao Yuan and
Sai Wang and
Zhifeng Bao and
Shangguang Wang Automatic Road Extraction with
Multi-Source Data Revisited:
Completeness, Smoothness and
Discrimination . . . . . . . . . . . . . 3004--3017
Philipp Fent and
Guido Moerkotte and
Thomas Neumann Asymptotically Better Query Optimization
Using Indexed Algebra . . . . . . . . . 3018--3030
Philipp Skavantzos and
Sebastian Link Normalizing Property Graphs . . . . . . 3031--3043
Chunwei Liu and
Anna Pavlenko and
Matteo Interlandi and
Brandon Haynes A Deep Dive into Common Open Formats for
Analytical DBMSs . . . . . . . . . . . . 3044--3056
Zezhou Huang and
Jiaxiang Liu and
Daniel Gbenga Alabi and
Raul Castro Fernandez and
Eugene Wu Saibot: a Differentially Private Data
Search Platform . . . . . . . . . . . . 3057--3070
Zezhou Huang and
Rathijit Sen and
Jiaxiang Liu and
Eugene Wu JoinBoost: Grow Trees over Normalized
Data Using Only SQL . . . . . . . . . . 3071--3084
Qian Li and
Peter Kraft and
Michael Cafarella and
Çagatay Demiralp and
Goetz Graefe and
Christos Kozyrakis and
Michael Stonebraker and
Lalith Suresh and
Xiangyao Yu and
Matei Zaharia R$^3$: Record-Replay-Retroaction for
Database-Backed Applications . . . . . . 3085--3097
Ritesh Sarkhel and
Binxuan Huang and
Colin Lockard and
Prashant Shiralkar Self-Training for Label-Efficient
Information Extraction from
Semi-Structured Web-Pages . . . . . . . 3098--3110
Jiabao Jin and
Peng Cheng and
Lei Chen and
Xuemin Lin and
Wenjie Zhang Efficient Non-Learning Similar
Subtrajectory Search . . . . . . . . . . 3111--3123
Xinle Cao and
Jian Liu and
Yongsheng Shen and
Xiaohua Ye and
Kui Ren Frequency-Revealing Attacks against
Frequency-Hiding Order-Preserving
Encryption . . . . . . . . . . . . . . . 3124--3136
Tianyu Zhang and
Kaige Liu and
Jack Kosaian and
Juncheng Yang and
Rashmi Vinayak Efficient Fault Tolerance for
Recommendation Model Training via
Erasure Coding . . . . . . . . . . . . . 3137--3150
Rui Dong and
Jie Liu and
Yuxuan Zhu and
Cong Yan and
Barzan Mozafari and
Xinyu Wang SlabCity: Whole-Query Optimization Using
Program Synthesis . . . . . . . . . . . 3151--3164
Joseph Howie and
Venkatesh Srinivasan and
Alex Thomo Scaling Up Structural Clustering to
Large Probabilistic Graphs Using
Lyapunov Central Limit Theorem . . . . . 3165--3177
Lucas Rosenblatt and
Bernease Herman and
Anastasia Holovenko and
Wonkwon Lee and
Joshua Loftus and
Elizabeth McKinnie and
Taras Rumezhak and
Andrii Stadnik and
Bill Howe and
Julia Stoyanovich Epistemic Parity: Reproducibility as an
Evaluation Metric for Differential
Privacy . . . . . . . . . . . . . . . . 3178--3191
Vargha Dadvar and
Lukasz Golab and
Divesh Srivastava POEM: Pattern-Oriented Explanations of
Convolutional Neural Networks . . . . . 3192--3200
Dandan Liu and
Zhaonian Zou gCore: Exploring Cross-Layer
Cohesiveness in Multi-Layer Graphs . . . 3201--3213
Yulai Tong and
Jiazhen Liu and
Hua Wang and
Ke Zhou and
Rongfeng He and
Qin Zhang and
Cheng Wang Sieve: a Learned Data-Skipping Index for
Data Analytics . . . . . . . . . . . . . 3214--3226
Kanat Tangwongsan and
Martin Hirzel and
Scott Schneider Out-of-Order Sliding-Window Aggregation
with Efficient Bulk Evictions and
Insertions . . . . . . . . . . . . . . . 3227--3239
Siyuan Wu and
Leong Hou U. and
Panagiotis Karras $k$-Best Egalitarian Stable Marriages
for Task Assignment . . . . . . . . . . 3240--3252
Graham Cormode and
Igor L. Markov Federated Calibration and Evaluation of
Binary Classifiers . . . . . . . . . . . 3253--3265
Jonghyeok Park and
Soyee Choi and
Gihwan Oh and
Soojun Im and
Moon-Wook Oh and
Sang-Won Lee FlashAlloc: Dedicating Flash Blocks by
Objects . . . . . . . . . . . . . . . . 3266--3278
Nima Shahbazi and
Nikola Danevski and
Fatemeh Nargesian and
Abolfazl Asudeh and
Divesh Srivastava Through the Fairness Lens: Experimental
Analysis and Evaluation of Entity
Matching . . . . . . . . . . . . . . . . 3279--3292
Tim Kraska and
Tianyu Li and
Samuel Madden and
Markos Markakis and
Amadou Ngom and
Ziniu Wu and
Geoffrey X. Yu Check Out the Big Brain on BRAD:
Simplifying Cloud Data Processing with
Learned Automated Data Meshes . . . . . 3293--3301
Raul Castro Fernandez and
Aaron J. Elmore and
Michael J. Franklin and
Sanjay Krishnan and
Chenhao Tan How Large Language Models Will Disrupt
Data Management . . . . . . . . . . . . 3302--3309
Lucas Woltmann and
Jerome Thiessat and
Claudio Hartmann and
Dirk Habich and
Wolfgang Lehner FASTgres: Making Learned Query Optimizer
Hinting Effective . . . . . . . . . . . 3310--3322
Demian Vöhringer and
Viktor Leis Write-Aware Timestamp Tracking:
Effective and Efficient Page Replacement
for Modern Hardware . . . . . . . . . . 3323--3334
Matthew Butrovich and
Karthik Ramanathan and
John Rollinson and
Wan Shen Lim and
William Zhang and
Justine Sherry and
Andrew Pavlo Tigger: a Database Proxy That Bounces
with User-Bypass . . . . . . . . . . . . 3335--3348
Haocheng Xia and
Jinfei Liu and
Jian Lou and
Zhan Qin and
Kui Ren and
Yang Cao and
Li Xiong Equitable Data Valuation Meets the Right
to Be Forgotten in Model Markets . . . . 3349--3362
Abdelouahab Khelifati and
Mourad Khayati and
Anton Dignös and
Djellel Difallah and
Philippe Cudré-Mauroux TSM-Bench: Benchmarking Time Series
Database Systems for Monitoring
Applications . . . . . . . . . . . . . . 3363--3376
Mohamed Y. Eltabakh and
Mayuresh Kunjir and
Ahmed K. Elmagarmid and
Mohammad Shahmeer Ahmad Cross Modal Data Discovery over
Structured and Unstructured Data Lakes 3377--3390
Peng Li and
Yeye He and
Cong Yan and
Yue Wang and
Surajit Chaudhuri Auto-Tables: Synthesizing Multi-Step
Transformations to Relationalize Tables
without Using Examples . . . . . . . . . 3391--3403
Christopher Buss and
Jasmin Mousavi and
Mikhail Tokarev and
Arash Termehchy and
David Maier and
Stefan Lee Effective Entity Augmentation by
Querying External Data Sources . . . . . 3404--3417
Emmanouil Sylligardos and
Paul Boniol and
John Paparrizos and
Panos Trahanias and
Themis Palpanas Choose Wisely: an Extensive Evaluation
of Model Selection for Anomaly Detection
in Time Series . . . . . . . . . . . . . 3418--3432
Cecilia Aguerrebere and
Ishwar Singh Bhati and
Mark Hildebrand and
Mariano Tepper and
Theodore Willke Similarity Search in the Blink of an Eye
with Compressed Indices . . . . . . . . 3433--3446
Ge Lv and
Lei Chen On Data-Aware Global Explainability of
Graph Neural Networks . . . . . . . . . 3447--3460
Michael Jungmair and
Jana Giceva Declarative Sub-Operators for Universal
Data Processing . . . . . . . . . . . . 3461--3474
Junichi Tatemura and
Tao Zou and
Jagan Sankaranarayanan and
Yanlai Huang and
Jim Chen and
Yupu Zhang and
Kevin Lai and
Hao Zhang and
Gokul Nath Babu Manoharan and
Goetz Graefe and
Divyakant Agrawal and
Brad Adelberg and
Shilpa Kolhar and
Indrajit Roy Progressive Partitioning for
Parallelized Query Execution in Google's
Napa . . . . . . . . . . . . . . . . . . 3475--3487
Alex Depoutovitch and
Chong Chen and
Per-Ake Larson and
Jack Ng and
Shu Lin and
Guanzhu Xiong and
Paul Lee and
Emad Boctor and
Samiao Ren and
Lengdong Wu and
Yuchen Zhang and
Calvin Sun Taurus MM: Bringing Multi-Master to the
Cloud . . . . . . . . . . . . . . . . . 3488--3500
Yancan Mao and
Zhanghao Chen and
Yifan Zhang and
Meng Wang and
Yong Fang and
Guanghui Zhang and
Rui Shi and
Richard T. B. Ma StreamOps: Cloud-Native Runtime
Management for Streaming Services in
ByteDance . . . . . . . . . . . . . . . 3501--3514
Christoph Anneser and
Nesime Tatbul and
David Cohen and
Zhenggang Xu and
Prithviraj Pandian and
Nikolay Laptev and
Ryan Marcus AutoSteer: Learned Query Optimization
for Any SQL Database . . . . . . . . . . 3515--3527
Jianjun Chen and
Rui Shi and
Heng Chen and
Li Zhang and
Ruidong Li and
Wei Ding and
Liya Fan and
Hao Wang and
Mu Xiong and
Yuxiang Chen and
Benchao Dong and
Kuankuan Guo and
Yuanjin Lin and
Xiao Liu and
Haiyang Shi and
Peipei Wang and
Zikang Wang and
Yemeng Yang and
Junda Zhao and
Dongyan Zhou and
Zhikai Zuo and
Yuming Liang Krypton: Real-Time Serving and
Analytical SQL Engine at ByteDance . . . 3528--3542
Yuanhang Zou and
Zhihao Ding and
Jieming Shi and
Shuting Guo and
Chunchen Su and
Yafei Zhang EmbedX: a Versatile, Efficient and
Scalable Platform to Embed Both Graphs
and High-Dimensional Sparse Data . . . . 3543--3556
Mohit Saxena and
Benjamin Sowell and
Daiyan Alamgir and
Nitin Bahadur and
Bijay Bisht and
Santosh Chandrachood and
Chitti Keswani and
G. Krishnamoorthy and
Austin Lee and
Bohou Li and
Zach Mitchell and
Vaibhav Porwal and
Maheedhar Reddy Chappidi and
Brian Ross and
Noritaka Sekiyama and
Omer Zaki and
Linchi Zhang and
Mehul A. Shah The Story of AWS Glue . . . . . . . . . 3557--3569
Yang Li and
Huaijun Jiang and
Yu Shen and
Yide Fang and
Xiaofeng Yang and
Danqing Huang and
Xinyi Zhang and
Wentao Zhang and
Ce Zhang and
Peng Chen and
Bin Cui Towards General and Efficient Online
Tuning for Spark . . . . . . . . . . . . 3570--3583
Jiashu Zhang and
Wen Jiang and
Bo Tang and
Haoxiang Ma and
Lixun Cao and
Zhongbin Jiang and
Yuanyuan Nie and
Fan Wang and
Lei Zhang and
Yuming Liang CDSBen: Benchmarking the Performance of
Storage Services in Cloud-Native
Database System at ByteDance . . . . . . 3584--3596
Xuanhe Zhou and
Cheng Chen and
Kunyi Li and
Bingsheng He and
Mian Lu and
Qiaosheng Liu and
Wei Huang and
Guoliang Li and
Zhao Zheng and
Yuqiang Chen FEBench: a Benchmark for Real-Time
Relational Data Feature Extraction . . . 3597--3609
Fei Xiao and
Yuncheng Wu and
Meihui Zhang and
Gang Chen and
Beng Chin Ooi MINT: Detecting Fraudulent Behaviors
from Time-Series Relational Data . . . . 3610--3623
Shafi Ahmad and
Dillidorai Arumugam and
Srdan Bozovic and
Elnata Degefa and
Sailesh Duvvuri and
Steven Gott and
Nitish Gupta and
Joachim Hammer and
Nivedita Kaluskar and
Raghav Kaushik and
Rakesh Khanduja and
Prasad Mujumdar and
Gaurav Malhotra and
Pankaj Naik and
Nikolas Ogg and
Krishna Kumar Parthasarthy and
Raghu Ramakrishnan and
Vlad Rodriguez and
Rahul Sharma and
Jakub Szymaszek and
Andreas Wolter Microsoft Purview: a System for Central
Governance of Data . . . . . . . . . . . 3624--3635
Liang Lin and
Yuhan Li and
Bin Wu and
Huijun Mai and
Renjie Lou and
Jian Tan and
Feifei Li Anser: Adaptive Information Sharing
Framework of AnalyticDB . . . . . . . . 3636--3648
Christoph Brücke and
Philipp Härtling and
Rodrigo D Escobar Palacios and
Hamesh Patel and
Tilmann Rabl TPCx-AI --- An Industry Standard
Benchmark for Artificial Intelligence
and Machine Learning Systems . . . . . . 3649--3661
Fotis Psallidas and
Ashvin Agrawal and
Chandru Sugunan and
Khaled Ibrahim and
Konstantinos Karanasos and
Jesús Camacho-Rodr\'ìguez and
Avrilia Floratou and
Carlo Curino and
Raghu Ramakrishnan OneProvenance: Efficient Extraction of
Dynamic Coarse-Grained Provenance from
Database Query Event Logs . . . . . . . 3662--3675
V. Srinivasan and
Andrew Gooding and
Sunil Sayyaparaju and
Thomas Lopatic and
Kevin Porter and
Ashish Shinde and
B. Narendran Techniques and Efficiencies from
Building a Real-Time DBMS . . . . . . . 3676--3688
Jiaqi Wang and
Tianyi Li and
Anni Wang and
Xiaoze Liu and
Lu Chen and
Jie Chen and
Jianye Liu and
Junyang Wu and
Feifei Li and
Yunjun Gao Real-Time Workload Pattern Analysis for
Large-Scale Cloud Databases . . . . . . 3689--3701
Jiang Li and
Qi Xie and
Yan Ma and
Jian Ma and
Kunshang Ji and
Yizhong Zhang and
Chaojun Zhang and
Yixiu Chen and
Gangsheng Wu and
Jie Zhang and
Kaidi Yang and
Xinyi He and
Qiuyang Shen and
Yanting Tao and
Haiwei Zhao and
Penghui Jiao and
Chengfei Zhu and
David Qian and
Cheng Xu Big Data Analytic Toolkit: a
General-Purpose, Modular, and
Heterogeneous Acceleration Toolkit for
Data Analytical Engines . . . . . . . . 3702--3714
Chunhui Shen and
Qianyu Ouyang and
Feibo Li and
Zhipeng Liu and
Longcheng Zhu and
Yujie Zou and
Qing Su and
Tianhuan Yu and
Yi Yi and
Jianhong Hu and
Cen Zheng and
Bo Wen and
Hanbang Zheng and
Lunfan Xu and
Sicheng Pan and
Bin Wu and
Xiao He and
Ye Li and
Jian Tan and
Sheng Wang and
Dan Pei and
Wei Zhang and
Feifei Li Lindorm TSDB: a Cloud-Native Time-Series
Database for Large-Scale Monitoring
Systems . . . . . . . . . . . . . . . . 3715--3727
Zhifeng Yang and
Quanqing Xu and
Shanyan Gao and
Chuanhui Yang and
Guoping Wang and
Yuzhong Zhao and
Fanyu Kong and
Hao Liu and
Wanhong Wang and
Jinliang Xiao OceanBase Paetica: a Hybrid
Shared-Nothing/Shared-Everything
Database for Supporting Single Machine
and Distributed Cluster . . . . . . . . 3728--3740
Yuanyuan Yao and
Dimeng Li and
Hailiang Jie and
Hailiang Jie and
Tianyi Li and
Jie Chen and
Jiaqi Wang and
Feifei Li and
Yunjun Gao SimpleTS: an Efficient and Universal
Model Selection Framework for Time
Series Forecasting . . . . . . . . . . . 3741--3753
Xinjun Yang and
Yingqiang Zhang and
Hao Chen and
Chuan Sun and
Feifei Li and
Wenchao Zhou PolarDB-SCC: a Cloud-Native Database
Ensuring Low Latency for Strongly
Consistent Reads . . . . . . . . . . . . 3754--3767
Hiroyuki Yamada and
Toshihiro Suzuki and
Yuji Ito and
Jun Nemoto ScalarDB: Universal Transaction Manager
for Polystores . . . . . . . . . . . . . 3768--3780
Xiaonan Nie and
Yi Liu and
Fangcheng Fu and
Jinbao Xue and
Dian Jiao and
Xupeng Miao and
Yangyu Tao and
Bin Cui Angel-PTM: a Scalable and Economical
Large-Scale Pre-Training System in
Tencent . . . . . . . . . . . . . . . . 3781--3794
Ji You Li and
Jiachi Zhang and
Wenchao Zhou and
Yuhang Liu and
Shuai Zhang and
Zhuoming Xue and
Ding Xu and
Hua Fan and
Fangyuan Zhou and
Feifei Li Eigen: End-to-End Resource Optimization
for Large-Scale Databases on the Cloud 3795--3807
Zhicheng Pan and
Yihang Wang and
Yingying Zhang and
Sean Bin Yang and
Yunyao Cheng and
Peng Chen and
Chenjuan Guo and
Qingsong Wen and
Xiduo Tian and
Yunliang Dou and
Zhiqiang Zhou and
Chengcheng Yang and
Aoying Zhou and
Bin Yang MagicScaler: Uncertainty-Aware,
Predictive Autoscaling . . . . . . . . . 3808--3821
Anna Povzner and
Prince Mahajan and
Jason Gustafson and
Jun Rao and
Ismael Juma and
Feng Min and
Shriram Sridharan and
Nikhil Bhatia and
Gopi Attaluri and
Adithya Chandra and
Stanislav Kozlovski and
Rajini Sivaram and
Lucas Bradstreet and
Bob Barrett and
Dhruvil Shah and
David Jacot and
David Arthur and
Ron Dagostino and
Colin McCabe and
Manikumar Reddy Obili and
Kowshik Prakasam and
Jose Garcia Sancio and
Vikas Singh and
Alok Nikhil and
Kamal Gupta Kora: a Cloud-Native Event Streaming
Platform for Kafka . . . . . . . . . . . 3822--3834
Krishna Kantikiran Pasupuleti and
Jiakun Li and
Hong Su and
Mohamed Ziauddin Automatic SQL Error Mitigation in Oracle 3835--3847
Yanli Zhao and
Andrew Gu and
Rohan Varma and
Liang Luo and
Chien-Chin Huang and
Min Xu and
Less Wright and
Hamid Shojanazeri and
Myle Ott and
Sam Shleifer and
Alban Desmaison and
Can Balioglu and
Pritam Damania and
Bernard Nguyen and
Geeta Chauhan and
Yuchen Hao and
Ajit Mathews and
Shen Li PyTorch FSDP: Experiences on Scaling
Fully Sharded Data Parallel . . . . . . 3848--3860
Eamonn Keogh Time Series Data Mining: a Unifying View 3861--3863
Kai Siong Yow and
Ningyi Liao and
Siqiang Luo and
Reynold Cheng Machine Learning for Subgraph
Extraction: Methods, Applications and
Challenges . . . . . . . . . . . . . . . 3864--3867
Ishtiyaque Ahmad and
Divyakant Agrawal and
Amr El Abbadi and
Trinabh Gupta Private Information Retrieval in Large
Scale Public Data Repositories . . . . . 3868--3871
Jian Pei and
Raul Castro Fernandez and
Xiaohui Yu Data and AI Model Markets: Opportunities
for Data and Model Sharing, Discovery,
and Integration . . . . . . . . . . . . 3872--3873
Yannis Foufoulas and
Alkis Simitsis Efficient Execution of User-Defined
Functions in SQL Queries . . . . . . . . 3874--3877
George Katsogiannis-Meimarakis and
Mike Xydas and
Georgia Koutrika Natural Language Interfaces for
Databases with Deep Learning . . . . . . 3878--3881
Zhengtong Yan and
Valter Uotila and
Jiaheng Lu Join Order Selection with Deep
Reinforcement Learning: Fundamentals,
Techniques, and Challenges . . . . . . . 3882--3885
Ioana Manolescu and
Madhulika Mohanty Full-Power Graph Querying: State of the
Art and Challenges . . . . . . . . . . . 3886--3889
Wolfgang Gatterbauer A Tutorial on Visual Representations of
Relational Queries . . . . . . . . . . . 3890--3893
Alberto Lerner and
Carsten Binnig and
Philippe Cudré-Mauroux and
Rana Hussein and
Matthias Jasny and
Theo Jepsen and
Dan R. K. Ports and
Lasse Thostrup and
Tobias Ziegler Databases on Modern Networks: a Decade
of Research That Now Comes into Practice 3894--3897
Zuozhi Wang and
Chen Li Building a Collaborative Data Analytics
System: Opportunities and Challenges . . 3898--3901
Wenjia He and
Ibrahim Sabek and
Yuze Lou and
Michael Cafarella PAINE Demo: Optimizing Video Selection
Queries with Commonsense Knowledge . . . 3902--3905
Ziyang Xiao and
Dongxiang Zhang and
Zepeng Li and
Sai Wu and
Kian-Lee Tan and
Gang Chen DoveDB: a Declarative and Low-Latency
Video Database . . . . . . . . . . . . . 3906--3909
Dong June Lew and
Kihyun Yoo and
Kwang Woo Nam DeepVQL: Deep Video Queries on
PostgreSQL . . . . . . . . . . . . . . . 3910--3913
Xiu Tang and
Sai Wu and
Dongxiang Zhang and
Ziyue Wang and
Gongsheng Yuan and
Gang Chen A Demonstration of DLBD: Database Logic
Bug Detection System . . . . . . . . . . 3914--3917
Wangze Ni and
Pengze Chen and
Lei Chen PSFQ: a Blockchain-Based
Privacy-Preserving and Verifiable
Student Feedback Questionnaire Platform 3918--3921
Christoph Anneser and
Mario Petruccelli and
Nesime Tatbul and
David Cohen and
Zhenggang Xu and
Prithviraj Pandian and
Nikolay Laptev and
Ryan Marcus and
Alfons Kemper QO-Insight: Inspecting Steered Query
Optimizers . . . . . . . . . . . . . . . 3922--3925
Zhihong Shen and
Chuan Hu and
Zihao Zhao Lynx: a Graph Query Framework for
Multiple Heterogeneous Data Sources . . 3926--3929
Aljoscha Lepping and
Hoang Mi Pham and
Laura Mons and
Balint Rueb and
Philipp M. Grulich and
Ankit Chaudhary and
Steffen Zeuch and
Volker Markl Showcasing Data Management Challenges
for Future IoT Applications with
NebulaStream . . . . . . . . . . . . . . 3930--3933
Zilong Wang and
Qixiong Zeng and
Ning Wang and
Haowen Lu and
Yue Zhang CEDA: Learned Cardinality Estimation
with Domain Adaptation . . . . . . . . . 3934--3937
Ramon Antonio Rodriges Zalipynis FastMosaic in Action: a New Mosaic
Operator for Array DBMSs . . . . . . . . 3938--3941
Zhuo Ma and
Yilong Yang and
Bin Xiao and
Yang Liu and
Xinjing Liu and
Zhuoran Ma and
Tong Yang Sniffer: a Novel Model Type Detection
System against
Machine-Learning-as-a-Service Platforms 3942--3945
Xiang Wang and
Xin Wang and
Zhaozhuo Li and
Dong Han KGNav: a Knowledge Graph Navigational
Visual Query System . . . . . . . . . . 3946--3949
Ju Hyoung Mun and
Konstantinos Karatsenidis and
Tarikul Islam Papon and
Shahin Roozkhosh and
Denis Hoornaert and
Ulrich Drepper and
Ahmed Sanaullah and
Renato Mancuso and
Manos Athanassoulis On-the-Fly Data Transformation in Action 3950--3953
Dalsu Choi and
Hyunsik Yoon and
Hyubjin Lee and
Yon Dohn Chung Demonstrating Waffle: a Self-Driving
Grid Index . . . . . . . . . . . . . . . 3954--3957
Niels Bylois and
Frank Neven and
Stijn Vansummeren CM-Explorer: Dissecting Data Ingestion
Problems . . . . . . . . . . . . . . . . 3958--3961
Tingyu Wang and
Yuchao Tao and
Amir Gilad and
Ashwin Machanavajjhala and
Sudeepa Roy Explaining Differentially Private Query
Results with DPXPlain . . . . . . . . . 3962--3965
Fei Xiao and
Jiong Xie and
Zhida Chen and
Feifei Li and
Zhen Chen and
Jianwei Liu and
Yinpei Liu Ganos Aero: a Cloud-Native System for
Big Raster Data Management and
Processing . . . . . . . . . . . . . . . 3966--3969
Mahdi Ghorbani and
Amir Shaikhha Demonstration of OpenDBML, a Framework
for Democratizing In-Database Machine
Learning . . . . . . . . . . . . . . . . 3970--3973
Hussein Abdallah and
Waleed Afandi and
Essam Mansour Demonstration of SPARQL ML : an
Interfacing Language for Supporting
Graph Machine Learning for RDF Graphs 3974--3977
Enhao Zhang and
Maureen Daum and
Dong He and
Manasi Ganti and
Brandon Haynes and
Ranjay Krishna and
Magdalena Balazinska EQUI-VOCAL Demonstration: Synthesizing
Video Queries from User Interactions . . 3978--3981
Yuanhui Qiu and
Chenguang Fang and
Shaoxu Song and
Xiangdong Huang and
Chen Wang and
Jianmin Wang TsQuality: Measuring Time Series Data
Quality in Apache IoTDB . . . . . . . . 3982--3985
Congying Wang and
Nithin Sastry Tellapuri and
Sphoorthi Keshannagari and
Dylan Zinsley and
Zhuoyue Zhao and
Dong Xie Approximate Queries over Concurrent
Updates . . . . . . . . . . . . . . . . 3986--3989
Kristin Fritsch and
Stefanie Scherzinger Solving Hard Variants of Database Schema
Matching on Quantum Computers . . . . . 3990--3993
Angela Bonifati and
Francesco Del Buono and
Francesco Guerra and
Miki Lombardi and
Donato Tiano Interpretable Clustering of Multivariate
Time Series with Time2Feat . . . . . . . 3994--3997
Chaozu Zhang and
Qiaomu Shen and
Bo Tang DHive: Query Execution Performance
Analysis via Dataflow in Apache Hive . . 3998--4001
Stefan Grafberger and
Shubha Guha and
Paul Groth and
Sebastian Schelter mlwhatif: What If You Could Stop
Re-Implementing Your Machine Learning
Pipeline Analyses over and over? . . . . 4002--4005
Yuchen Peng and
Ke Chen and
Lidan Shou and
Dawei Jiang and
Gang Chen AQUA: Automatic Collaborative Query
Processing in Analytical Database . . . 4006--4009
Kai Huang and
Houdong Liang and
Chongchong Yao and
Xi Zhao and
Yue Cui and
Yao Tian and
Ruiyuan Zhang and
Xiaofang Zhou VisualNeo: Bridging the Gap between
Visual Query Interfaces and Graph Query
Engines . . . . . . . . . . . . . . . . 4010--4013
Luigi Bellomarini and
Marco Benedetti and
Andrea Gentili and
Davide Magnanimi and
Emanuel Sallinger KG-Roar: Interactive Datalog-Based
Reasoning on Virtual Knowledge Graphs 4014--4017
Nico Schäfer and
Damjan Gjurovski and
Angjela Davitkova and
Sebastian Michel To UDFs and Beyond: Demonstration of a
Fully Decomposed Data Processor for
General Data Wrangling Tasks . . . . . . 4018--4021
Yushi Liu and
Liwei Yuan and
Zhihao Chen and
Yekai Yu and
Zhao Zhang and
Cheqing Jin and
Ying Yan ChainDash: an Ad-Hoc Blockchain Data
Analytics System . . . . . . . . . . . . 4022--4025
Luca Zecchini and
Giovanni Simonini and
Sonia Bergamaschi and
Felix Naumann BrewER: Entity Resolution On-Demand . . 4026--4029
Fanchao Chen and
Dixin Tang and
Haotian Li and
Aditya G. Parameswaran Visualizing Spreadsheet Formula Graphs
Compactly . . . . . . . . . . . . . . . 4030--4033
Daniel ten Wolde and
Gábor Szárnyas and
Peter Boncz DuckPGQ: Bringing SQL/PGQ to DuckDB . . 4034--4037
Qiushi Bai and
Sadeem Alsudais and
Chen Li Demo of QueryBooster: Supporting
Middleware-Based SQL Query Rewriting as
a Service . . . . . . . . . . . . . . . 4038--4041
Weiyuan Wu and
Pei Wang and
Yi Xie and
Yejia Liu and
George Chow and
Jiannan Wang Web Connector: a Unified API Wrapper to
Simplify Web Data Collection . . . . . . 4042--4045
Dawei Gao and
Daoyuan Chen and
Zitao Li and
Yuexiang Xie and
Xuchen Pan and
Yaliang Li and
Bolin Ding and
Jingren Zhou FS-Real: a Real-World Cross-Device
Federated Learning Platform . . . . . . 4046--4049
Xintong Song and
Yusen Zhu and
Jianfei Wu and
Bai Liu and
Hongkang Wei ADOps: an Anomaly Detection Pipeline in
Structured Logs . . . . . . . . . . . . 4050--4053
Jonas Spenger and
Chengyang Huang and
Philipp Haller and
Paris Carbone Portals: a Showcase of Multi-Dataflow
Stateful Serverless . . . . . . . . . . 4054--4057
Mukul Singh and
José Cambronero Sanchez and
Sumit Gulwani and
Vu Le and
Carina Negreanu and
Gust Verbruggen Cornet: Learning Spreadsheet Formatting
Rules by Example . . . . . . . . . . . . 4058--4061
Chen Qian and
Shiwei Liang and
Zhaoyang Wang and
Yin Lou Fanglue: an Interactive System for
Decision Rule Crafting . . . . . . . . . 4062--4065
John Paparrizos and
Sai Prasanna Teja Reddy Odyssey: an Engine Enabling the
Time-Series Clustering Journey . . . . . 4066--4069
Jinyang Li and
Alon Silberstein and
Yuval Moskovitch and
Julia Stoyanovich and
H. V. Jagadish Erica: Query Refinement for Diversity
Constraint Satisfaction . . . . . . . . 4070--4073
Zui Chen and
Lei Cao and
Sam Madden Lingua Manga : a Generic Large Language
Model Centric System for Data Curation 4074--4077
Haralampos Gavriilidis and
Leonhard Rose and
Joel Ziegler and
Kaustubh Beedkar and
Jorge-Arnulfo Quiané-Ruiz and
Volker Markl XDB in Action: Decentralized
Cross-Database Query Processing for
Black-Box DBMSes . . . . . . . . . . . . 4078--4081
Gaurav Tarlok Kakkar and
Aryan Rajoria and
Myna Prasanna Kalluraya and
Ashmita Raju and
Jiashen Cao and
Kexin Rong and
Joy Arulraj Interactive Demonstration of EVA . . . . 4082--4085
Xiling Li and
Gefei Tan and
Xiao Wang and
Jennie Rogers and
Soamar Homsi RESCU-SQL: Oblivious Querying for the
Zero Trust Cloud . . . . . . . . . . . . 4086--4089
Ibrahim Abdelaziz and
Julian Dolby and
Udayan Khurana and
Horst Samulowitz and
Kavitha Srinivas DataRinse: Semantic Transforms for Data
Preparation Based on Code Mining . . . . 4090--4093
Junxiong Wang and
Mitchell Gray and
Immanuel Trummer and
Ahmet Kara and
Dan Olteanu Demonstrating ADOPT: Adaptively
Optimizing Attribute Orders for
Worst-Case Optimal Joins via
Reinforcement Learning . . . . . . . . . 4094--4097
Immanuel Trummer Demonstrating GPT-DB: Generating
Query-Specific and Customizable Code for
SQL Processing with GPT-4 . . . . . . . 4098--4101
Vicente Nejar de Almeida and
Eduardo Ribeiro and
Nassim Bouarour and
João Luiz Dihl Comba and
Sihem Amer-Yahia SHEVA: a Visual Analytics System for
Statistical Hypothesis Exploration . . . 4102--4105
Shi Qiao and
Alekh Jindal PikePlace: Generating Intelligence for
Marketplace Datasets . . . . . . . . . . 4106--4109
Xuanhe Zhou and
Guoliang Li and
Jianming Wu and
Jiesi Liu and
Zhaoyan Sun and
Xinning Zhang A Learned Query Rewrite System . . . . . 4110--4113
Alon Halevy and
Yejin Choi and
Avrilia Floratou and
Michael J. Franklin and
Natasha Noy and
Haixun Wang Will LLMs Reshape, Supercharge, or Kill
Data Science? (VLDB 2023 Panel) . . . . 4114--4115
Alvin Cheung and
Maaz Bin Safeer Ahmad and
Brandon Haynes and
Chanwut Kittivorawong and
Shadaj Laddad and
Xiaoxuan Liu and
Chenglong Wang and
Cong Yan Towards Auto-Generated Data Systems . . 4116--4129
Xin Luna Dong Generations of Knowledge Graphs: The
Crazy Ideas and the Business Impact . . 4130--4137
Joseph E. Gonzalez and
Yucheng Low The Story of GraphLab --- From Scaling
Machine Learning to Shaping Graph
Systems Research (VLDB 2023 Test-of-Time
Award Talk) . . . . . . . . . . . . . . 4138--4138
Yejin Choi Common Sense: The Dark Matter of
Language and Intelligence (VLDB 2023
Keynote) . . . . . . . . . . . . . . . . 4139--4139
Feifei Li Modernization of Databases in the Cloud
Era: Building Databases that Run Like
Legos . . . . . . . . . . . . . . . . . 4140--4151
Natasa Milic-Frayling On the Cusp: Computing Thrills and
Perils and Professional Awakening . . . 4152--4159
Lina Qiu and
Georgios Kellaris and
Nikos Mamoulis and
Kobbi Nissim and
George Kollios Doquet: Differentially Oblivious Range
and Join Queries with Private Data
Structures . . . . . . . . . . . . . . . 4160--4173
Monica Chiosa and
Thomas B. Preußer and
Michaela Blott and
Gustavo Alonso AMNES: Accelerating the Computation of
Data Correlation Using FPGAs . . . . . . 4174--4187
Maureen Daum and
Enhao Zhang and
Dong He and
Stephen Mussmann and
Brandon Haynes and
Ranjay Krishna and
Magdalena Balazinska VOCALExplore: Pay-as-You-Go Video Data
Exploration and Model Building . . . . . 4188--4201
Pankaj Arora and
Surajit Chaudhuri and
Sudipto Das and
Junfeng Dong and
Cyril George and
Ajay Kalhan and
Arnd Christian König and
Willis Lang and
Changsong Li and
Feng Li and
Jiaqi Liu and
Lukas M. Maas and
Akshay Mata and
Ishai Menache and
Justin Moeller and
Vivek Narasayya and
Matthaios Olma and
Morgan Oslake and
Elnaz Rezai and
Yi Shan and
Manoj Syamala and
Shize Xu and
Vasileios Zois Flexible Resource Allocation for
Relational Database-as-a-Service . . . . 4202--4215
Rong Gu and
Han Li and
Haipeng Dai and
Wenjie Huang and
Jie Xue and
Meng Li and
Jiaqi Zheng and
Haoran Cai and
Yihua Huang and
Guihai Chen ShadowAQP: Efficient Approximate
Group-by and Join Query via
Attribute-Oriented Sample Size
Allocation and Data Generation . . . . . 4216--4229
Rui Liu and
Kwanghyun Park and
Fotis Psallidas and
Xiaoyong Zhu and
Jinghui Mo and
Rathijit Sen and
Matteo Interlandi and
Konstantinos Karanasos and
Yuanyuan Tian and
Jesús Camacho-Rodríguez Optimizing Data Pipelines for Machine
Learning in Feature Stores . . . . . . . 4230--4239
Renzo Angles and
Georg Gottlob and
Aleksandar Pavlovi\'c and
Reinhard Pichler and
Emanuel Sallinger SparqLog: a System for Efficient
Evaluation of SPARQL 1.1 Queries via
Datalog . . . . . . . . . . . . . . . . 4240--4253
Arnd Christian König and
Yi Shan and
Karan Newatia and
Luke Marshall and
Vivek Narasayya Solver-In-The-Loop Cluster Resource
Management for Database-as-a-Service . . 4254--4267
Justus Henneberg and
Felix Schuhknecht RTIndeX: Exploiting Hardware-Accelerated
GPU Raytracing for Database Indexing . . 4268--4281
Jinqing Lian and
Xinyi Zhang and
Yingxia Shao and
Zenglin Pu and
Qingfeng Xiang and
Yawen Li and
Bin Cui ContTune: Continuous Tuning by
Conservative Bayesian Optimization for
Distributed Stream Data Processing
Systems . . . . . . . . . . . . . . . . 4282--4295
Dimitrios Melissourgos and
Haibo Wang and
Shigang Chen and
Chaoyi Ma and
Shiping Chen Single Update Sketch with Variable
Counter Structure . . . . . . . . . . . 4296--4309
Immanuel Trummer Can Large Language Models Predict Data
Correlations from Column Names? . . . . 4310--4323
Javad Ghareh Chamani and
Ioannis Demertzis and
Dimitrios Papadopoulos and
Charalampos Papamanthou and
Rasool Jalili GraphOS: Towards Oblivious Graph
Processing . . . . . . . . . . . . . . . 4324--4338
Kefei Wang and
Feng Chen Catalyst: Optimizing Cache Management
for Large In-memory Key-value Systems 4339--4352
Bolong Zheng and
Yongyong Gao and
Jingyi Wan and
Lingsen Yan and
Long Hu and
Bo Liu and
Yunjun Gao and
Xiaofang Zhou and
Christian S. Jensen DecLog: Decentralized Logging in
Non-Volatile Memory for Time Series
Database Systems . . . . . . . . . . . . 1--14
Fangyuan Zhang and
Mengxu Jiang and
Sibo Wang Efficient Dynamic Weighted Set Sampling
and Its Extension . . . . . . . . . . . 15--27
Yiming Lin and
Sharad Mehrotra ZIP: Lazy Imputation during Query
Processing . . . . . . . . . . . . . . . 28--40
Xunkai Li and
Zhengyu Wu and
Wentao Zhang and
Yinlin Zhu and
Rong-Hua Li and
Guoren Wang FedGTA: Topology-Aware Averaging for
Federated Graph Learning . . . . . . . . 41--50
Xueqin Chang and
Xiangyu Ke and
Lu Chen and
Congcong Ge and
Ziheng Wei and
Yunjun Gao Host Profit Maximization: Leveraging
Performance Incentives and User
Flexibility . . . . . . . . . . . . . . 51--64
Shweta Patwa and
Danyu Sun and
Amir Gilad and
Ashwin Machanavajjhala and
Sudeepa Roy DP-PQD: Privately Detecting Per-Query
Gaps in Synthetic Data Generated by
Black-Box Mechanisms . . . . . . . . . . 65--78
Ruidi Wei and
Florian Kerschbaum Cryptographically Secure Private Record
Linkage using Locality-Sensitive Hashing 79--91
Simran Arora and
Brandon Yang and
Sabri Eyuboglu and
Avanika Narayan and
Andrew Hojel and
Immanuel Trummer and
Christopher Ré Language Models Enable Simple Systems
for Generating Structured Views of
Heterogeneous Data Lakes . . . . . . . . 92--105
Jinyang Li and
Yuval Moskovitch and
Julia Stoyanovich and
H. V. Jagadish Query Refinement for Diversity
Constraint Satisfaction . . . . . . . . 106--118
Zhaoheng Li and
Pranav Gor and
Rahul Prabhu and
Hui Yu and
Yuzhou Mao and
Yongjoo Park ElasticNotebook: Enabling Live Migration
for Computational Notebooks . . . . . . 119--133
Kecheng Huang and
Zhaoyan Shen and
Zili Shao and
Tong Zhang and
Feng Chen Breathing New Life into an Old Tree:
Resolving Logging Dilemma of B$^+$-tree
on Modern Computational Storage Drives 134--147
Xinyu Zeng and
Yulong Hui and
Jiahong Shen and
Andrew Pavlo and
Wes McKinney and
Huanchen Zhang An Empirical Evaluation of Columnar
Storage Formats . . . . . . . . . . . . 148--161
Yichao Yuan and
Haojie Ye and
Sanketh Vedula and
Wynn Kaza and
Nishil Talati Everest: GPU-Accelerated System for
Mining Temporal Motifs . . . . . . . . . 162--174
Xueyi Wu and
Yuanyuan Xu and
Wenjie Zhang and
Ying Zhang Billion-Scale Bipartite Graph Embedding:
a Global-Local Induced Approach . . . . 175--183
Wangze Ni and
Pengze Chen and
Lei Chen and
Peng Cheng and
Chen Jason Zhang and
Xuemin Lin Utility-Aware Payment Channel Network
Rebalance . . . . . . . . . . . . . . . 184--196
Pengfei Li and
Wenqing Wei and
Rong Zhu and
Bolin Ding and
Jingren Zhou and
Hua Lu ALECE: an Attention-based Learned
Cardinality Estimator for SPJ Queries on
Dynamic Workloads . . . . . . . . . . . 197--210
Haojun Xia and
Zhen Zheng and
Yuchao Li and
Donglin Zhuang and
Zhongzhu Zhou and
Xiafei Qiu and
Yong Li and
Wei Lin and
Shuaiwen Leon Song Flash-LLM: Enabling Cost-Effective and
Highly-Efficient Large Generative Model
Inference with Unstructured Sparsity . . 211--224
Heidi Howard and
Fritz Alder and
Edward Ashton and
Amaury Chamayou and
Sylvan Clebsch and
Manuel Costa and
Antoine Delignat-Lavaud and
Cédric Fournet and
Andrew Jeffery and
Matthew Kerner and
Fotios Kounelis and
Markus A. Kuppe and
Julien Maffre and
Mark Russinovich and
Christoph M. Wintersteiger Confidential Consortium Framework:
Secure Multiparty Applications with
Confidentiality, Integrity, and High
Availability . . . . . . . . . . . . . . 225--240
Sijing Duan and
Feng Lyu and
Xin Zhu and
Yi Ding and
Haotian Wang and
Desheng Zhang and
Xue Liu and
Yaoxue Zhang and
Ju Ren VeLP: Vehicle Loading Plan Learning from
Human Behavior in Nationwide Logistics
System . . . . . . . . . . . . . . . . . 241--249
Aaditya Naik and
Aalok Thakkar and
Adam Stein and
Rajeev Alur and
Mayur Naik Relational Query Synthesis $ \bowtie $
Decision Tree Learning . . . . . . . . . 250--263
Feng Yao and
Qian Tao and
Wenyuan Yu and
Yanfeng Zhang and
Shufeng Gong and
Qiange Wang and
Ge Yu and
Jingren Zhou RAGraph: a Region-Aware Framework for
Geo-Distributed Graph Processing . . . . 264--277
Qiuru Lin and
Sai Wu and
Junbo Zhao and
Jian Dai and
Meng Shi and
Gang Chen and
Feifei Li SmartLite: a DBMS-Based Serving System
for DNN Inference in
Resource-Constrained Environments . . . 278--291
Shiwen Wu and
Qiyu Wu and
Honghua Dong and
Wen Hua and
Xiaofang Zhou Blocker and Matcher Can Mutually
Benefit: a Co-Learning Framework for
Low-Resource Entity Resolution . . . . . 292--304
Yihao Ang and
Qiang Huang and
Yifan Bao and
Anthony K. H. Tung and
Zhiyong Huang TSGBench: Time Series Generation
Benchmark . . . . . . . . . . . . . . . 305--318
Wieger R. Punter and
Odysseas Papapetrou and
Minos Garofalakis OmniSketch: Efficient Multi-Dimensional
High-Velocity Stream Analytics with
Arbitrary Predicates . . . . . . . . . . 319--331
Kai Hiu Chung and
Alexander Zhou and
Yue Wang and
Lei Chen Maximum Balanced $ (k,
\epsilon)$-Bitruss Detection in Signed
Bipartite Graph . . . . . . . . . . . . 332--344
Xiao Li and
Huan Li and
Hua Lu and
Christian S. Jensen and
Varun Pandey and
Volker Markl Missing Value Imputation for
Multi-Attribute Sensor Data Streams via
Message Propagation . . . . . . . . . . 345--358
Yuhang Chen and
Chaoyun Zhang and
Minghua Ma and
Yudong Liu and
Ruomeng Ding and
Bowen Li and
Shilin He and
Saravan Rajmohan and
Qingwei Lin and
Dongmei Zhang ImDiffusion: Imputed Diffusion Models
for Multivariate Time Series Anomaly
Detection . . . . . . . . . . . . . . . 359--372
Dajun Sun and
Wei Dong and
Ke Yi Confidence Intervals for Private Query
Processing . . . . . . . . . . . . . . . 373--385
Zhiyu Liang and
Jianfeng Zhang and
Chen Liang and
Hongzhi Wang and
Zheng Liang and
Lujia Pan A Shapelet-Based Framework for
Unsupervised Multivariate Time Series
Representation Learning . . . . . . . . 386--399
Letong Wang and
Xiangyun Ding and
Yan Gu and
Yihan Sun Fast and Space-Efficient Parallel
Algorithms for Influence Maximization 400--413
Yile Chen and
Gao Cong and
Cuauhtemoc Anda TERI: an Effective Framework for
Trajectory Recovery with Irregular Time
Intervals . . . . . . . . . . . . . . . 414--426
Yuhan Chen and
Haojie Ye and
Sanketh Vedula and
Alex Bronstein and
Ronald Dreslinski and
Trevor Mudge and
Nishil Talati Demystifying Graph Sparsification
Algorithms in Graph Properties
Preservation . . . . . . . . . . . . . . 427--440
Jiashen Cao and
Rathijit Sen and
Matteo Interlandi and
Joy Arulraj and
Hyesoon Kim GPU Database Systems Characterization
and Optimization . . . . . . . . . . . . 441--454
Chaoyi Chen and
Dechao Gao and
Yanfeng Zhang and
Qiange Wang and
Zhenbo Fu and
Xuecang Zhang and
Junhua Zhu and
Yu Gu and
Ge Yu NeutronStream: a Dynamic GNN Training
Framework with Sliding Window for Graph
Streams . . . . . . . . . . . . . . . . 455--468
Joshua Hildred and
Michael Abebe and
Khuzaima Daudjee Caerus: Low-Latency Distributed
Transactions for Geo-Replicated Systems 469--482
Aoqian Zhang and
Shuqing Deng and
Dongping Cui and
Ye Yuan and
Guoren Wang An Experimental Evaluation of Anomaly
Detection in Time Series . . . . . . . . 483--496
Mukul Singh and
José Cambronero and
Sumit Gulwani and
Vu Le and
Carina Negreanu and
Elnaz Nouri and
Mohammad Raza and
Gust Verbruggen FormaT5: Abstention and Examples for
Conditional Table Formatting with
Natural Language . . . . . . . . . . . . 497--510
Manuel Schönberger and
Immanuel Trummer and
Wolfgang Mauerer Quantum-Inspired Digital Annealing for
Join Ordering . . . . . . . . . . . . . 511--524
Mashaal Musleh and
Mohamed F. Mokbel Kamel: a Scalable BERT-Based System for
Trajectory Imputation . . . . . . . . . 525--538
Xinyi Zhang and
Hong Wu and
Yang Li and
Zhengju Tang and
Jian Tan and
Feifei Li and
Bin Cui An Efficient Transfer Learning Based
Configuration Adviser for Database
Tuning . . . . . . . . . . . . . . . . . 539--552
Adrien Petralia and
Philippe Charpentier and
Themis Palpanas ADF & TransApp: a Transformer-Based
Framework for Appliance Detection Using
Smart Meter Consumption Series . . . . . 553--562
Sarah Wooders and
Xiangxi Mo and
Amit Narang and
Kevin Lin and
Ion Stoica and
Joseph M. Hellerstein and
Natacha Crooks and
Joseph E. Gonzalez RALF: Accuracy-Aware Scheduling for
Feature Store Maintenance . . . . . . . 563--576
Kaisong Huang and
Tianzheng Wang and
Qingqing Zhou and
Qingzhong Meng The Art of Latency Hiding in Modern
Database Engines . . . . . . . . . . . . 577--590
Mohammad Matin Najafi and
Chenhao Ma and
Xiaodong Li and
Reynold Cheng and
Laks V. S. Lakshmanan MOSER: Scalable Network Motif Discovery
Using Serial Test . . . . . . . . . . . 591--603
Dongxiang Zhang and
Teng Ma and
Junnan Hu and
Yijun Bei and
Kian-Lee Tan and
Gang Chen Co-Movement Pattern Mining from Videos 604--616
Qian Ge and
Yu Liu and
Yinghao Zhao and
Yuetian Sun and
Lei Zou and
Yuxing Chen and
Anqun Pan Efficient and Accurate SimRank-Based
Similarity Joins: Experiments, Analysis,
and Improvement . . . . . . . . . . . . 617--629
Wentao Li and
Maolin Cai and
Min Gao and
Dong Wen and
Lu Qin and
Wei Wang Expanding Reverse Nearest Neighbors . . 630--642
Fuheng Zhao and
Divyakant Agrawal and
Amr El Abbadi and
Ahmed Metwally and
Claire Mathieu and
Michel de Rougemont Errata for ``SpaceSaving$ \pm $: an
Optimal Algorithm for Frequency
Estimation and Frequent Items in the
Bounded-Deletion Model'' . . . . . . . . 643--643
Tarique Siddiqui and
Vivek Narasayya and
Marius Dumitru and
Surajit Chaudhuri Cache-Efficient Top-$k$ Aggregation over
High Cardinality Large Datasets . . . . 644--656
Xinwei Cai and
Xiangyu Ke and
Kai Wang and
Lu Chen and
Tianming Zhang and
Qing Liu and
Yunjun Gao Efficient Temporal Butterfly Counting
and Enumeration on Temporal Bipartite
Graphs . . . . . . . . . . . . . . . . . 657--670
Tianxiong Zhong and
Zhiwei Zhang and
Guo Lu and
Ye Yuan and
Yu-Ping Wang and
Guoren Wang TVM: a Tile-based Video Management
Framework . . . . . . . . . . . . . . . 671--684
Yi Zhang and
Jan Deriu and
George Katsogiannis-Meimarakis and
Catherine Kosten and
Georgia Koutrika and
Kurt Stockinger ScienceBenchmark: a Complex Real-World
Benchmark for Evaluating Natural
Language to SQL Systems . . . . . . . . 685--698
Lu Chen and
Chengfei Liu and
Rui Zhou and
Kewen Liao and
Jiajie Xu and
Jianxin Li Densest Multipartite Subgraph Search in
Heterogeneous Information Networks . . . 699--711
Kabir Nagrecha and
Arun Kumar Saturn: an Optimized Data System for
Multi-Large-Model Deep Learning
Workloads . . . . . . . . . . . . . . . 712--725
Miao Cai and
Junru Shen and
Yifan Yuan and
Zhihao Qu and
Baoliu Ye BonsaiKV: Towards Fast, Scalable, and
Persistent Key--Value Stores with
Tiered, Heterogeneous Memory System . . 726--739
Silvan Reiner and
Michael Grossniklaus Sample-Efficient Cardinality Estimation
Using Geometric Deep Learning . . . . . 740--752
Kai Zhao and
Chenjuan Guo and
Yunyao Cheng and
Peng Han and
Miao Zhang and
Bin Yang Multiple Time Series Forecasting with
Dynamic Graph Modeling . . . . . . . . . 753--765
Yunyao Cheng and
Peng Chen and
Chenjuan Guo and
Kai Zhao and
Qingsong Wen and
Bin Yang and
Christian S. Jensen Weakly Guided Adaptation for Robust Time
Series Forecasting . . . . . . . . . . . 766--779
Rui Yang and
Evgenios M. Kornaropoulos and
Yue Cheng Algorithmic Complexity Attacks on
Dynamic Learned Indexes . . . . . . . . 780--793
Jiaqi Zhu and
Shaofeng Cai and
Fang Deng and
Beng Chin Ooi and
Wenqiao Zhang METER: a Dynamic Concept Adaptation
Framework for Online Anomaly Detection 794--807
Hailin Zhang and
Penghao Zhao and
Xupeng Miao and
Yingxia Shao and
Zirui Liu and
Tong Yang and
Bin Cui Experimental Analysis of Large-Scale
Learnable Vector Storage Compression . . 808--822
Yue Zhao and
Zhaodonghui Li and
Gao Cong A Comparative Study and Component
Analysis of Query Plan Representation
Techniques in ML4DB Studies . . . . . . 823--835
Zeyang Zhuang and
Penghui Li and
Pingchuan Ma and
Wei Meng and
Shuai Wang Testing Graph Database Systems via
Graph-Aware Metamorphic Relations . . . 836--848
Tianji Cong and
Madelon Hulsebos and
Zhenjie Sun and
Paul Groth and
H. V. Jagadish Observatory: Characterizing Embeddings
of Relational Tables . . . . . . . . . . 849--862
Taeyoon Kim and
ChanHo Park and
Mansur Mukimbekov and
Heelim Hong and
Minseok Kim and
Ze Jin and
Changdae Kim and
Ji-Yong Shin and
Myeongjae Jeon FusionFlow: Accelerating Data
Preprocessing for Machine Learning with
CPU--GPU Cooperation . . . . . . . . . . 863--876
Hubert Mohr-Daurat and
Xuan Sun and
Holger Pirk BOSS --- an Architecture for Database
Kernel Composition . . . . . . . . . . . 877--890
Xiangyu Zhi and
Xiao Yan and
Bo Tang and
Ziyao Yin and
Yanchao Zhu and
Minqi Zhou CoroGraph: Bridging Cache Efficiency and
Work Efficiency for Graph Algorithm
Execution . . . . . . . . . . . . . . . 891--903
Audrey Cheng and
Jack Waudby and
Hugo Firth and
Natacha Crooks and
Ion Stoica Mammoths are Slow: The Overlooked
Transactions of Graph Data . . . . . . . 904--911
Enyuan Zhou and
Song Guo and
Zicong Hong and
Christian S. Jensen and
Yang Xiao and
Dalin Zhang and
Jinwen Liang and
Qingqi Pei VeriDKG: a Verifiable SPARQL Query
Engine for Decentralized Knowledge
Graphs . . . . . . . . . . . . . . . . . 912--925
Lianggui Weng and
Rong Zhu and
Di Wu and
Bolin Ding and
Bolong Zheng and
Jingren Zhou Eraser: Eliminating Performance
Regression on Learned Query Optimizer 926--938
Chao Zhang and
Guoliang Li and
Tao Lv HyBench: a New Benchmark for HTAP
Databases . . . . . . . . . . . . . . . 939--951
Ki Hyun Tae and
Hantian Zhang and
Jaeyoung Park and
Kexin Rong and
Steven Euijong Whang Falcon: Fair Active Learning Using
Multi-Armed Bandits . . . . . . . . . . 952--965
Honghu Wu and
Xiangrong Zhu and
Wei Hu A Blockchain System for Clustered
Federated Learning with Peer-to-Peer
Knowledge Transfer . . . . . . . . . . . 966--979
Rong Zhu and
Lianggui Weng and
Wenqing Wei and
Di Wu and
Jiazhen Peng and
Yifan Wang and
Bolin Ding and
Defu Lian and
Bolong Zheng and
Jingren Zhou PilotScope: Steering Databases with
Machine Learning Drivers . . . . . . . . 980--993
Yishuai Li and
Yunfeng Zhu and
Chao Shi and
Guanhua Zhang and
Jianzhong Wang and
Xiaolu Zhang Timestamp as a Service, Not an Oracle 994--1006
Junjie Xing and
Xinyu Wang and
H. V. Jagadish Data-Driven Insight Synthesis for
Multi-Dimensional Data . . . . . . . . . 1007--1019
Naili Xing and
Shaofeng Cai and
Gang Chen and
Zhaojing Luo and
Beng Chin Ooi and
Jian Pei Database Native Model Selection:
Harnessing Deep Neural Networks in
Database Systems . . . . . . . . . . . . 1020--1033
Kaiyu Chen and
Dong Wen and
Wenjie Zhang and
Ying Zhang and
Xiaoyang Wang and
Xuemin Lin Querying Structural Diversity in
Streaming Graphs . . . . . . . . . . . . 1034--1046
Xiangyang Gou and
Xinyi Ye and
Lei Zou and
Jeffrey Xu Yu LM-SRPQ: Efficiently Answering Regular
Path Query in Streaming Graphs . . . . . 1047--1059
Shihong Gao and
Yiming Li and
Yanyan Shen and
Yingxia Shao and
Lei Chen ETC: Efficient Training of Temporal
Graph Neural Networks over Large-Scale
Dynamic Graphs . . . . . . . . . . . . . 1060--1072
Chenyuan Wu and
Mohammad Javad Amiri and
Haoyun Qin and
Bhavana Mehta and
Ryan Marcus and
Boon Thau Loo Towards Full Stack Adaptivity in
Permissioned Blockchains . . . . . . . . 1073--1080
Jindong Han and
Weijia Zhang and
Hao Liu and
Tao Tao and
Naiqiang Tan and
Hui Xiong BigST: Linear Complexity Spatio-Temporal
Graph Neural Network for Traffic
Forecasting on Large-Scale Road Networks 1081--1090
Xinhao Min and
Kai Lu and
Pengyu Liu and
Jiguang Wan and
Changsheng Xie and
Daohui Wang and
Ting Yao and
Huatao Wu SepHash: a Write-Optimized Hash Index On
Disaggregated Memory via Separate
Segment Structure . . . . . . . . . . . 1091--1104
Dahai Tang and
Jiali Wang and
Rong Chen and
Lei Wang and
Wenyuan Yu and
Jingren Zhou and
Kenli Li XGNN: Boosting Multi-GPU GNN Training
via Global GNN Memory Store . . . . . . 1105--1118
Youming Tao and
Cheng-Long Wang and
Miao Pan and
Dongxiao Yu and
Xiuzhen Cheng and
Di Wang Communication Efficient and Provable
Federated Unlearning . . . . . . . . . . 1119--1131
Dawei Gao and
Haibin Wang and
Yaliang Li and
Xiuyu Sun and
Yichen Qian and
Bolin Ding and
Jingren Zhou Text-to-SQL Empowered by Large Language
Models: a Benchmark Evaluation . . . . . 1132--1145
Anh L. Mai and
Pengyu Wang and
Azza Abouzied and
Matteo Brucato and
Peter J. Haas and
Alexandra Meliou Scaling Package Queries to a Billion
Tuples via Hierarchical Partitioning and
Customized Optimization . . . . . . . . 1146--1158
Yuhao Deng and
Chengliang Chai and
Lei Cao and
Nan Tang and
Jiayi Wang and
Ju Fan and
Ye Yuan and
Guoren Wang MisDetect: Iterative Mislabel Detection
using Early Loss . . . . . . . . . . . . 1159--1172
Wenfei Fan and
Muyang Liu and
Shuhao Liu and
Chao Tian Capturing More Associations by
Referencing External Graphs . . . . . . 1173--1186
Longlong Lin and
Pingpeng Yuan and
Rong-Hua Li and
Chunxue Zhu and
Hongchao Qin and
Hai Jin and
Tao Jia QTCS: Efficient Query-Centered Temporal
Community Search . . . . . . . . . . . . 1187--1199
Jie Fu and
Qingqing Ye and
Haibo Hu and
Zhili Chen and
Lulu Wang and
Kuncan Wang and
Xun Ran DPSUR: Accelerating Differentially
Private Stochastic Gradient Descent
Using Selective Update and Release . . . 1200--1213
Alexander Erben and
Ruben Mayer and
Hans-Arno Jacobsen How Can We Train Deep Learning Models
Across Clouds and Continents? An
Experimental Study . . . . . . . . . . . 1214--1226
Jeongmin Brian Park and
Vikram Sharma Mailthody and
Zaid Qureshi and
Wen-mei Hwu Accelerating Sampling and Aggregation
Operations in GNN Frameworks with GPU
Initiated Direct Storage Accesses . . . 1227--1240
Hao Yuan and
Yajiong Liu and
Yanfeng Zhang and
Xin Ai and
Qiange Wang and
Chaoyi Chen and
Yu Gu and
Ge Yu Comprehensive Evaluation of GNN Training
Systems: a Data Management Perspective 1241--1254
Tsz Nam Chan and
Rui Zang and
Bojian Zhu and
Leong Hou U. and
Dingming Wu and
Jianliang Xu LION: Fast and High-Resolution Network
Kernel Density Visualization . . . . . . 1255--1268
Zitao Li and
Bolin Ding and
Liuyi Yao and
Yaliang Li and
Xiaokui Xiao and
Jingren Zhou Performance-Based Pricing for Federated
Learning via Auction . . . . . . . . . . 1269--1282
Yiqun Diao and
Yutong Yang and
Qinbin Li and
Bingsheng He and
Mian Lu OEBench: Investigating Open Environment
Challenges in Real-World Relational Data
Streams . . . . . . . . . . . . . . . . 1283--1296
Jiadong Xie and
Zehua Chen and
Deming Chu and
Fan Zhang and
Xuemin Lin and
Zhihong Tian Influence Maximization via Vertex
Countering . . . . . . . . . . . . . . . 1297--1309
Tingting Wang and
Shixun Huang and
Zhifeng Bao and
J. Shane Culpepper and
Volkan Dedeoglu and
Reza Arablouei Optimizing Data Acquisition to Enhance
Machine Learning Performance . . . . . . 1310--1323
Xin Chen and
Jieming Shi and
You Peng and
Wenqing Lin and
Sibo Wang and
Wenjie Zhang Minimum Strongly Connected Subgraph
Collection in Dynamic Graphs . . . . . . 1324--1336
Junhao Zhu and
Yuren Mao and
Lu Chen and
Congcong Ge and
Ziheng Wei and
Yunjun Gao FusionQuery: On-demand Fusion Queries
over Multi-source Heterogeneous Data . . 1337--1349
David Justen and
Daniel Ritter and
Campbell Fraser and
Andrew Lamb and
Allison Lee and
Thomas Bodner and
Mhd Yamen Haddad and
Steffen Zeuch and
Volker Markl and
Matthias Boehm POLAR: Adaptive and Non-invasive Join
Order Selection via Plans of Least
Resistance . . . . . . . . . . . . . . . 1350--1363
Zhiyuan Li and
Xun Jian and
Yue Wang and
Yingxia Shao and
Lei Chen DAHA: Accelerating GNN Training with
Data and Hardware Aware Execution
Planning . . . . . . . . . . . . . . . . 1364--1376
Ziyi Lu and
Qiang Cao and
Hong Jiang and
Yuxing Chen and
Jie Yao and
Anqun Pan FluidKV: Seamlessly Bridging the Gap
between Indexing Performance and
Memory-Footprint on Ultra-Fast Storage 1377--1390
Vraj Shah and
Thomas Parashos and
Arun Kumar How Do Categorical Duplicates Affect ML?
A New Benchmark and Empirical Analyses 1391--1404
Pengjie Cui and
Haotian Liu and
Bo Tang and
Ye Yuan CGgraph: an Ultra-Fast Graph Processing
System on Modern Commodity CPU--GPU
Co-processor . . . . . . . . . . . . . . 1405--1417
Xinyu Chen and
Jiannan Tian and
Ian Beaver and
Cynthia Freeman and
Yan Yan and
Jianguo Wang and
Dingwen Tao FCBench: Cross-Domain Benchmarking of
Lossless Compression for Floating-Point
Data . . . . . . . . . . . . . . . . . . 1418--1431
Aaron Hurst and
Daniel E. Lucani and
Qi Zhang PairwiseHist: Fast, Accurate and
Space-Efficient Approximate Query
Processing with Data Compression . . . . 1432--1445
Huayi Zhang and
Binwei Yan and
Lei Cao and
Samuel Madden and
Elke Rundensteiner MetaStore: Analyzing Deep Learning
Meta-Data at Scale . . . . . . . . . . . 1446--1459
Yangming Lv and
Kai Zhang and
Ziming Wang and
Xiaodong Zhang and
Rubao Lee and
Zhenying He and
Yinan Jing and
X. Sean Wang RTScan: Efficient Scan with Ray Tracing
Cores . . . . . . . . . . . . . . . . . 1460--1472
Kezhao Huang and
Haitian Jiang and
Minjie Wang and
Guangxuan Xiao and
David Wipf and
Xiang Song and
Quan Gan and
Zengfeng Huang and
Jidong Zhai and
Zheng Zhang FreshGNN: Reducing Memory Access via
Stable Historical Embeddings for Graph
Neural Network Training . . . . . . . . 1473--1486
Ying Zheng and
Kian-Lee Tan Sorting on Byte-Addressable Storage: The
Resurgence of Tree Structure . . . . . . 1487--1500
Xenofon Chatziliadis and
Eleni Tzirita Zacharatou and
Alphan Eracar and
Steffen Zeuch and
Volker Markl Efficient Placement of Decomposable
Aggregation Functions for Stream
Processing over Large Geo-Distributed
Topologies . . . . . . . . . . . . . . . 1501--1514
Jiamin Hou and
Zhanhao Zhao and
Zhouyu Wang and
Wei Lu and
Guodong Jin and
Dong Wen and
Xiaoyong Du AeonG: an Efficient Built-in Temporal
Support in Graph Databases . . . . . . . 1515--1527
Tao Yu and
Zhaonian Zou and
Weihua Sun and
Yu Yan Refactoring Index Tuning Process with
Benefit Estimation . . . . . . . . . . . 1528--1541
Xunkai Li and
Meihao Liao and
Zhengyu Wu and
Daohan Su and
Wentao Zhang and
Rong-Hua Li and
Guoren Wang LightDiC: a Simple Yet Effective
Approach for Large-Scale Digraph
Representation Learning . . . . . . . . 1542--1551
Shulei Kuang and
Honghui Yang and
Zijing Tan and
Shuai Ma Efficient Differential Dependency
Discovery . . . . . . . . . . . . . . . 1552--1564
Claude Lehmann and
Pavel Sulimov and
Kurt Stockinger Is Your Learned Query Optimizer Behaving
As You Expect? A Machine Learning
Perspective . . . . . . . . . . . . . . 1565--1577
Zhuoxing Zhang and
Sebastian Link Mixed Covers of Keys and Functional
Dependencies for Maintaining the
Integrity of Data under Updates . . . . 1578--1590
Yuhao Deng and
Yu Wang and
Lei Cao and
Lianpeng Qiao and
Yuping Wang and
Jingzhe Xu and
Yizhou Yan and
Samuel Madden Outlier Summarization via Human
Interpretable Rules . . . . . . . . . . 1591--1604
Haitao Yuan and
Gao Cong and
Guoliang Li Nuhuo: an Effective Estimation Model for
Traffic Speed Histogram Imputation on a
Road Network . . . . . . . . . . . . . . 1605--1617
Deepak Ravikumar and
Alex Yeo and
Yiwen Zhu and
Aditya Lakra and
Harsha Nagulapalli and
Santhosh Ravindran and
Steve Suh and
Niharika Dutta and
Andrew Fogarty and
Yoonjae Park and
Sumeet Khushalani and
Arijit Tarafdar and
Kunal Parekh and
Subru Krishnan Intelligent Pooling: Proactive Resource
Provisioning in Large-scale Cloud
Service . . . . . . . . . . . . . . . . 1618--1627
Yutong Ye and
Xiang Lian and
Mingsong Chen Efficient Exact Subgraph Matching via
GNN-Based Path Dominance Embedding . . . 1628--1641
Zijia Wang and
Haoran Liu and
Chen Lin and
Zhifeng Bao and
Guoliang Li and
Tianqing Wang Leveraging Dynamic and Heterogeneous
Workload Knowledge to Boost the
Performance of Index Advisors . . . . . 1642--1654
Otmar Ertl UltraLogLog: a Practical and More
Space-Efficient Alternative to
HyperLogLog for Approximate Distinct
Counting . . . . . . . . . . . . . . . . 1655--1668
Zengyang Gong and
Yuxiang Zeng and
Lei Chen Real-Time Insertion Operator for Shared
Mobility on Time-Dependent Road Networks 1669--1682
Dayi Fan and
Rubao Lee and
Xiaodong Zhang X-TED: Massive Parallelization of Tree
Edit Distance . . . . . . . . . . . . . 1683--1696
Wonseok Shin and
Siwoo Song and
Kunsoo Park and
Wook-Shin Han Cardinality Estimation of Subgraph
Matching: a Filtering-Sampling Approach 1697--1709
Qi Liang and
Dian Ouyang and
Fan Zhang and
Jianye Yang and
Xuemin Lin and
Zhihong Tian Efficient Regular Simple Path Queries
under Transitive Restricted Expressions 1710--1722
Shuhan Zhong and
Sizhe Song and
Weipeng Zhuo and
Guanyao Li and
Yang Liu and
S.-H. Gary Chan A Multi-Scale Decomposition MLP-Mixer
for Time Series Analysis . . . . . . . . 1723--1736
Haocheng Xia and
Xiang Li and
Junyuan Pang and
Jinfei Liu and
Kui Ren and
Li Xiong P-Shapley: Shapley Values on
Probabilistic Classifiers . . . . . . . 1737--1750
Wenjia He and
Ibrahim Sabek and
Yuze Lou and
Michael Cafarella Optimizing Video Selection LIMIT Queries
with Commonsense Knowledge . . . . . . . 1751--1764
Nan Huo and
Reynold Cheng and
Ben Kao and
Wentao Ning and
Nur Al Hasan Haldar and
Xiaodong Li and
Jinyang Li and
Mohammad Matin Najafi and
Tian Li and
Ge Qu ZeroEA: a Zero-Training Entity Alignment
Framework via Pre-Trained Language Model 1765--1774
Xueli Liu and
Bowen Dong and
Wenzhi Fu and
Nannan Wu and
Xin Wang and
Wenjun Wang Extending Graph Rules with Oracles . . . 1775--1787
Junyi Mei and
Shixuan Sun and
Chao Li and
Cheng Xu and
Cheng Chen and
Yibo Liu and
Jing Wang and
Cheng Zhao and
Xiaofeng Hou and
Minyi Guo and
Bingsheng He and
Xiaoliang Cong FlowWalker: a Memory-Efficient and
High-Performance GPU-Based Dynamic Graph
Random Walk Framework . . . . . . . . . 1788--1801
Minsu Kim and
Jinwoo Hwang and
Guseul Heo and
Seiyeon Cho and
Divya Mahajan and
Jongse Park Accelerating String-Key Learned Index
Structures via Memoization-Based
Incremental Training . . . . . . . . . . 1802--1815
Xuankun Liao and
Qing Liu and
Xin Huang and
Jianliang Xu Truss-Based Community Search over
Streaming Directed Graphs . . . . . . . 1816--1829
Ricardo Salazar-Díaz and
Boris Glavic and
Tilmann Rabl InferDB: In-Database Machine Learning
Inference Using Indexes . . . . . . . . 1830--1842
Fei Wei and
Ergute Bao and
Xiaokui Xiao and
Yin Yang and
Bolin Ding AAA: an Adaptive Mechanism for Locally
Differentially Private Mean Estimation 1843--1855
Yangshen Deng and
Muxi Yan and
Bo Tang Accelerating Merkle Patricia Trie with
GPU . . . . . . . . . . . . . . . . . . 1856--1869
Shaowei Wang and
Yun Peng and
Jin Li and
Zikai Wen and
Zhipeng Li and
Shiyu Yu and
Di Wang and
Wei Yang Privacy Amplification via Shuffling:
Unified, Simplified, and Tightened . . . 1870--1883
Jiansen Song and
Wensheng Dou and
Yu Gao and
Ziyu Cui and
Yingying Zheng and
Dong Wang and
Wei Wang and
Jun Wei and
Tao Huang Detecting Metadata-Related Logic Bugs in
Database Systems via Raw Database
Construction . . . . . . . . . . . . . . 1884--1897
Biao Wu and
Qiang Huang and
Anthony K. H. Tung From Zero to Hero: Detecting Leaked Data
through Synthetic Data Injection and
Model Querying . . . . . . . . . . . . . 1898--1910
Guanduo Chen and
Zhenying He and
Meng Li and
Siqiang Luo Oasis: an Optimal Disjoint Segmented
Learned Range Filter . . . . . . . . . . 1911--1924
Yuhao Deng and
Chengliang Chai and
Lei Cao and
Qin Yuan and
Siyuan Chen and
Yanrui Yu and
Zhaoze Sun and
Junyi Wang and
Jiajun Li and
Ziqi Cao and
Kaisen Jin and
Chi Zhang and
Yuqing Jiang and
Yuanfang Zhang and
Yuping Wang and
Ye Yuan and
Guoren Wang and
Nan Tang LakeBench: a Benchmark for Discovering
Joinable and Unionable Tables in Data
Lakes . . . . . . . . . . . . . . . . . 1925--1938
Jiale Lao and
Yibo Wang and
Yufei Li and
Jianping Wang and
Yunjia Zhang and
Zhiyuan Cheng and
Wanghu Chen and
Mingjie Tang and
Jianguo Wang GPTuner: a Manual-Reading Database
Tuning System via GPT-Guided Bayesian
Optimization . . . . . . . . . . . . . . 1939--1952
Arik Ermshaus and
Patrick Schäfer and
Ulf Leser Raising the ClaSS of Streaming Time
Series Segmentation . . . . . . . . . . 1953--1966
Qiyan Li and
Jeffrey Xu Yu Fast Local Subgraph Counting . . . . . . 1967--1980
Yunjia Zhang and
Jordan Henkel and
Avrilia Floratou and
Joyce Cahoon and
Shaleen Deep and
Jignesh M. Patel ReAcTable: Enhancing ReAct for Table
Question Answering . . . . . . . . . . . 1981--1994
Xin Ai and
Qiange Wang and
Chunyu Cao and
Yanfeng Zhang and
Chaoyi Chen and
Hao Yuan and
Yu Gu and
Ge Yu NeutronOrch: Rethinking Sample-Based GNN
Training under CPU--GPU Heterogeneous
Environments . . . . . . . . . . . . . . 1995--2008
Zifan Liu and
Shaleen Deep and
Anna Fariha and
Fotis Psallidas and
Ashish Tiwari and
Avrilia Floratou Rapidash: Efficient Detection of
Constraint Violations . . . . . . . . . 2009--2021
Shubhankar Mohapatra and
Jianqiao Zong and
Florian Kerschbaum and
Xi He Differentially Private Data Generation
with Missing Data . . . . . . . . . . . 2022--2035
Zhaoyuan Su and
Ammar Ahmed and
Zirui Wang and
Ali Anwar and
Yue Cheng Everything You Always Wanted to Know
About Storage Compressibility of
Pre-Trained ML Models but Were Afraid to
Ask . . . . . . . . . . . . . . . . . . 2036--2049
Haoyang Li and
Shimin Di and
Calvin Hong Yi Li and
Lei Chen and
Xiaofang Zhou Fight Fire with Fire: Towards Robust
Graph Neural Networks on Dynamic Graphs
via Actively Defense . . . . . . . . . . 2050--2063
Farzaneh Zirak and
Farhana Choudhury and
Renata Borovica-Gajic SeLeP: Learning Based Semantic
Prefetching for Exploratory Database
Workloads . . . . . . . . . . . . . . . 2064--2076
Yiwei Chen and
Kaiyu Li and
Guoliang Li and
Yong Wang Contributions Estimation in Federated
Learning: a Comprehensive Experimental
Evaluation . . . . . . . . . . . . . . . 2077--2090
Stavros Maroulis and
Vassilis Stamatopoulos and
George Papastefanatos and
Manolis Terrovitis Visualization-Aware Time Series Min-Max
Caching with Error Bound Guarantees . . 2091--2103
Moe Kayali and
Anton Lykov and
Ilias Fountalis and
Nikolaos Vasiloglou and
Dan Olteanu and
Dan Suciu Chorus: Foundation Models for Unified
Data Discovery and Exploration . . . . . 2104--2114
Viktor Leis and
Christian Dietrich Cloud-Native Database Systems and
Unikernels: Reimagining OS Abstractions
for Modern Hardware . . . . . . . . . . 2115--2122
Haoran Xiong and
Hang Zhang and
Zeyu Wang and
Zhenying He and
Peng Wang and
X. Sean Wang CIVET: Exploring Compact Index for
Variable-Length Subsequence Matching on
Time Series . . . . . . . . . . . . . . 2123--2135
Chanwut Kittivorawong and
Yongming Ge and
Yousef Helal and
Alvin Cheung Spatialyze: a Geospatial Video Analytics
System with Spatial-Aware Optimizations 2136--2148
Hanyan Yin and
Dongxie Wen and
Jiajun Li and
Zhewei Wei and
Xiao Zhang and
Zengfeng Huang and
Feifei Li Optimal Matrix Sketching over Sliding
Windows . . . . . . . . . . . . . . . . 2149--2161
Radim Baca Window Function Expression: Let the
Self-Join Enter . . . . . . . . . . . . 2162--2174
Aarati Kakaraparthy and
Jignesh M. Patel SplitDF: Splitting Dataframes for
Memory-Efficient Data Analysis . . . . . 2175--2184
Majid Daliri and
Juliana Freire and
Christopher Musco and
Aécio Santos and
Haoxiang Zhang Sampling Methods for Inner Product
Sketching . . . . . . . . . . . . . . . 2185--2197
Han Hu and
Jiye Qiu and
Hongzhi Wang and
Bin Liang and
Songling Zou DIDS: Double Indices and Double
Summarizations for Fast Similarity
Search . . . . . . . . . . . . . . . . . 2198--2211
Qian Xu and
Juan Yang and
Feng Zhang and
Zheng Chen and
Jiawei Guan and
Kang Chen and
Ju Fan and
Youren Shen and
Ke Yang and
Yu Zhang and
Xiaoyong Du Improving Graph Compression for
Efficient Resource-Constrained Graph
Analytics . . . . . . . . . . . . . . . 2212--2226
Jianwei Wang and
Kai Wang and
Xuemin Lin and
Wenjie Zhang and
Ying Zhang Efficient Unsupervised Community Search
with Pre-Trained Graph Transformer . . . 2227--2240
Jiuqi Wei and
Botao Peng and
Xiaodong Lee and
Themis Palpanas DET-LSH: a Locality-Sensitive Hashing
Scheme with Dynamic Encoding Tree for
Approximate Nearest Neighbor Search . . 2241--2254
Haoyu Liu and
Siqiang Luo BIRD: Efficient Approximation of
Bidirectional Hidden Personalized
PageRank . . . . . . . . . . . . . . . . 2255--2268
Zihao Yu and
Ningyi Liao and
Siqiang Luo GENTI: GPU-Powered Walk-Based Subgraph
Extraction for Scalable Representation
Learning on Dynamic Graphs . . . . . . . 2269--2278
Benjamin Feuer and
Yurong Liu and
Chinmay Hegde and
Juliana Freire ArcheType: a Novel Framework for
Open-Source Column Type Annotation Using
Large Language Models . . . . . . . . . 2279--2292
Yanchuan Chang and
Egemen Tanin and
Gao Cong and
Christian S. Jensen and
Jianzhong Qi Trajectory Similarity Measurement: an
Efficiency Perspective . . . . . . . . . 2293--2306
Brian Wheatman and
Xiaojun Dong and
Zheqi Shen and
Laxman Dhulipala and
Jakub Lacki and
Prashant Pandey and
Helen Xu BYO: a Unified Framework for
Benchmarking Large-Scale Graph
Containers . . . . . . . . . . . . . . . 2307--2320
Yizheng Zhu and
Yuncheng Wu and
Zhaojing Luo and
Beng Chin Ooi and
Xiaokui Xiao Secure and Verifiable Data Collaboration
with Low-Cost Zero-Knowledge Proofs . . 2321--2334
Heena Nagda and
Shubhendra Pal Singhal and
Mohammad Javad Amiri and
Boon Thau Loo Rashnu: Data-Dependent Order-Fairness 2335--2348
Yuchuan Huang and
Mohamed F. Mokbel Sparcle: Boosting the Accuracy of Data
Cleaning Systems through Spatial
Awareness . . . . . . . . . . . . . . . 2349--2362
Xiangfei Qiu and
Jilin Hu and
Lekui Zhou and
Xingjian Wu and
Junyang Du and
Buang Zhang and
Chenjuan Guo and
Aoying Zhou and
Christian S. Jensen and
Zhenli Sheng and
Bin Yang TFB: Towards Comprehensive and Fair
Benchmarking of Time Series Forecasting
Methods . . . . . . . . . . . . . . . . 2363--2377
Chengjun Liu and
Zhuo Peng and
Weiguo Zheng and
Lei Zou FSM: a Fine-Grained Splitting and
Merging Framework for Dual-Balanced
Graph Partition . . . . . . . . . . . . 2378--2391
Stefano Marchesin and
Gianmaria Silvello Efficient and Reliable Estimation of
Knowledge Graph Accuracy . . . . . . . . 2392--2403
Wei Zhou and
Chen Lin and
Xuanhe Zhou and
Guoliang Li Breaking It Down: an In-Depth Study of
Index Advisors . . . . . . . . . . . . . 2405--2418
Wen Deng and
Weiguo Zheng and
Hong Cheng Accelerating Maximal Clique Enumeration
via Graph Reduction . . . . . . . . . . 2419--2431
Jiyang Bai and
Peixiang Zhao Poligras: Policy-Based Graph
Summarization . . . . . . . . . . . . . 2432--2444
Leqian Zheng and
Lei Xu and
Cong Wang and
Sheng Wang and
Yuke Hu and
Zhan Qin and
Feifei Li and
Kui Ren SWAT: a System-Wide Approach to Tunable
Leakage Mitigation in Encrypted Data
Stores . . . . . . . . . . . . . . . . . 2445--2458
Kai Wang and
Yuwei Xu and
Siqiang Luo TIGER: Training Inductive Graph Neural
Network for Large-Scale Knowledge Graph
Reasoning . . . . . . . . . . . . . . . 2459--2472
Chao Zhang and
Angela Bonifati and
M. Tamer Özsu Incremental Sliding Window Connectivity
over Streaming Graphs . . . . . . . . . 2473--2486
Qingpeng Cai and
Kaiping Zheng and
H. V. Jagadish and
Beng Chin Ooi and
James Yip CohortNet: Empowering Cohort Discovery
for Interpretable Healthcare Analytics 2487--2500
Jinghao Wang and
Yanping Wu and
Xiaoyang Wang and
Ying Zhang and
Lu Qin and
Wenjie Zhang and
Xuemin Lin Efficient Influence Minimization via
Node Blocking . . . . . . . . . . . . . 2501--2513
Xuanhe Zhou and
Guoliang Li and
Zhaoyan Sun and
Zhiyuan Liu and
Weize Chen and
Jianming Wu and
Jiesi Liu and
Ruohang Feng and
Guoyang Zeng D-Bot: Database Diagnosis System using
Large Language Models . . . . . . . . . 2514--2527
Yiming Qiao and
Yihan Gao and
Huanchen Zhang Blitzcrank: Fast Semantic Compression
for In-Memory Online Transaction
Processing . . . . . . . . . . . . . . . 2528--2540
Zhihao Chen and
Tianji Yang and
Yixiao Zheng and
Zhao Zhang and
Cheqing Jin and
Aoying Zhou Spectrum: Speedy and
Strictly-Deterministic Smart Contract
Transactions for Blockchain Ledgers . . 2541--2554
Zihao Zhang and
Huiqi Hu and
Xuan Zhou and
Yaofeng Tu and
Weining Qian and
Aoying Zhou Fast Commitment for Geo-Distributed
Transactions via Decentralized
Co-Coordinators . . . . . . . . . . . . 2555--2567
Alberto Lerner and
Gustavo Alonso CXL and the Return of Scale-Up Database
Engines . . . . . . . . . . . . . . . . 2568--2575
Shuheng Fang and
Kangfei Zhao and
Yu Rong and
Zhixun Li and
Jeffrey Xu Yu Inductive Attributed Community Search:
To Learn Communities Across Graphs . . . 2576--2589
Long Yuan and
Xia Li and
Zi Chen and
Xuemin Lin and
Xiang Zhao and
Wenjie Zhang I/O Efficient Label-Constrained
Reachability Queries in Large Graphs . . 2590--2602
Baotong Lu and
Kaisong Huang and
Chieh-Jan Mike Liang and
Tianzheng Wang and
Eric Lo DEX: Scalable Range Indexing on
Disaggregated Memory . . . . . . . . . . 2603--2616
Wei Ni and
Xiaoye Miao and
Xiangyu Zhao and
Yangyang Wu and
Shuwei Liang and
Jianwei Yin Automatic Data Repair: Are We Ready to
Deploy? . . . . . . . . . . . . . . . . 2617--2630
Chaokun Chang and
Eric Lo and
Chunxiao Ye Biathlon: Harnessing Model Resilience
for Accelerating ML Inference Pipelines 2631--2640
Yuanyuan Zeng and
Chenhao Ma and
Yixiang Fang Distributed Shortest Distance Labeling
on Large-Scale Graphs . . . . . . . . . 2641--2653
Wensheng Luo and
Yixiang Fang and
Chunxu Lin and
Yingli Zhou Efficient Parallel D-Core Decomposition
at Scale . . . . . . . . . . . . . . . . 2654--2667
Leonardo Pellegrina and
Fabio Vandin Efficient Discovery of Significant
Patterns with Few-Shot Resampling . . . 2668--2680
Qixu Chen and
Raymond Chi-Wing Wong Robust Best Point Selection under
Unreliable User Feedback . . . . . . . . 2681--2693
Audrey Cheng and
Aaron Kabcenell and
Jason Chan and
Xiao Shi and
Peter Bailis and
Natacha Crooks and
Ion Stoica Towards Optimal Transaction Scheduling 2694--2707
David Campos and
Bin Yang and
Tung Kieu and
Miao Zhang and
Chenjuan Guo and
Christian S. Jensen QCore: Data-Efficient, On-Device
Continual Calibration for Quantized
Models . . . . . . . . . . . . . . . . . 2708--2721
Yalong Zhang and
Rong-Hua Li and
Qi Zhang and
Hongchao Qin and
Lu Qin and
Guoren Wang Efficient Algorithms for
Pseudoarboricity Computation in Large
Static and Dynamic Graphs . . . . . . . 2722--2734
Meng Chen and
Kai Zhang and
Zhenying He and
Yinan Jing and
X. Sean Wang RoarGraph: a Projected Bipartite Graph
for Efficient Cross-Modal Approximate
Nearest Neighbor Search . . . . . . . . 2735--2749
Ju Fan and
Zihui Gu and
Songyue Zhang and
Yuxin Zhang and
Zui Chen and
Lei Cao and
Guoliang Li and
Samuel Madden and
Xiaoyong Du and
Nan Tang Combining Small Language Models and
Large Language Models for Zero-Shot
NL2SQL . . . . . . . . . . . . . . . . . 2750--2763
Rustam Guliyev and
Aparajita Haldar and
Hakan Ferhatosmanoglu D3-GNN: Dynamic Distributed Dataflow for
Streaming Graph Neural Networks . . . . 2764--2777
Yunxiang Su and
Shaoxu Song and
Xiangdong Huang and
Chen Wang and
Jianmin Wang Distance-Based Outlier Query
Optimization in Apache IoTDB . . . . . . 2778--2790
Jianye Yang and
Sheng Fang and
Zhaoquan Gu and
Ziyi Ma and
Xuemin Lin and
Zhihong Tian TC-Match: Fast Time-Constrained
Continuous Subgraph Matching . . . . . . 2791--2804
Michael Wornow and
Avanika Narayan and
Krista Opsahl-Ong and
Quinn McIntyre and
Nigam Shah and
Christopher Ré Automating the Enterprise with
Foundation Models . . . . . . . . . . . 2805--2812
Anxin Tian and
Alexander Zhou and
Yue Wang and
Xun Jian and
Lei Chen Efficient Index for Temporal Core
Queries over Bipartite Graphs . . . . . 2813--2825
Fumiyuki Kato and
Li Xiong and
Shun Takagi and
Yang Cao and
Masatoshi Yoshikawa Uldp-FL: Federated Learning with
Across-Silo User-Level Differential
Privacy . . . . . . . . . . . . . . . . 2826--2839
Junyong Yang and
Ming Zhong and
Yuanyuan Zhu and
Tieyun Qian and
Mengchi Liu and
Jeffrey Xu Yu Evolution Forest Index: Towards Optimal
Temporal $k$-Core Component Search via
Time-Topology Isomorphic Computation . . 2840--2853
Yuxin Ma and
Ping Gong and
Tianming Wu and
Jiawei Yi and
Chengru Yang and
Cheng Li and
Qirong Peng and
Guiming Xie and
Yongcheng Bao and
Haifeng Liu and
Yinlong Xu Eliminating Data Processing Bottlenecks
in GNN Training over Large Graphs via
Two-level Feature Compression . . . . . 2854--2866
Douglas B. Rumbaugh and
Dong Xie and
Zhuoyue Zhao Towards Systematic Index Dynamization 2867--2879
Dennis Treder-Tschechlov and
Manuel Fritz and
Holger Schwarz and
Bernhard Mitschang Ensemble Clustering Based on
Meta-Learning and Hyperparameter
Optimization . . . . . . . . . . . . . . 2880--2892
Chenjuan Guo and
Ronghui Xu and
Bin Yang and
Ye Yuan and
Tung Kieu and
Yan Zhao and
Christian S. Jensen Efficient Stochastic Routing in
Path-Centric Uncertain Road Networks . . 2893--2905
Angela Bonifati and
Filip Murlak and
Yann Ramusat Transforming Property Graphs . . . . . . 2906--2918
Yushi Sun and
Hao Xin and
Kai Sun and
Yifan Ethan Xu and
Xiao Yang and
Xin Luna Dong and
Nan Tang and
Lei Chen Are Large Language Models a Good
Replacement of Taxonomies? . . . . . . . 2919--2932
Yalong Zhang and
Rong-Hua Li and
Qi Zhang and
Hongchao Qin and
Guoren Wang Efficient Algorithms for Density
Decomposition on Large Static and
Dynamic Graphs . . . . . . . . . . . . . 2933--2945
Yingli Zhou and
Yixiang Fang and
Chenhao Ma and
Tianci Hou and
Xin Huang Efficient Maximal Motif-Clique
Enumeration over Large Heterogeneous
Information Networks . . . . . . . . . . 2946--2959
Zeang Sheng and
Wentao Zhang and
Yangyu Tao and
Bin Cui OUTRE: an OUT-of-Core De-REdundancy GNN
Training Framework for Massive Graphs
within A Single Machine . . . . . . . . 2960--2973
Chenguang Fang and
Zijie Chen and
Shaoxu Song and
Xiangdong Huang and
Chen Wang and
Jianmin Wang On Reducing Space Amplification with
Multi-Column Compaction in Apache IoTDB 2974--2986
Sebastian Schmidl and
Felix Naumann and
Thorsten Papenbrock AutoTSAD: Unsupervised Holistic Anomaly
Detection for Time Series Data . . . . . 2987--3002
Zheng Chen and
Feng Zhang and
Yang Chen and
Xiaokun Fang and
Guanyu Feng and
Xiaowei Zhu and
Wenguang Chen and
Xiaoyong Du Enabling Window-Based Monotonic Graph
Analytics with Reusable Transitional
Results for Pattern-Consistent Queries 3003--3016
Maria Ramos and
João Azevedo and
Kyle Kingsbury and
José Pereira and
Tânia Esteves and
Ricardo Macedo and
João Paulo When Amnesia Strikes: Understanding and
Reproducing Data Loss Bugs with Fault
Injection . . . . . . . . . . . . . . . 3017--3030
Leixia Wang and
Qingqing Ye and
Haibo Hu and
Xiaofeng Meng PriPL-Tree: Accurate Range Query for
Arbitrary Distribution under Local
Differential Privacy . . . . . . . . . . 3031--3044
Yu Sun and
Jingyu Zhu and
Xiao Xu and
Xian Xu and
Yuyao Sun and
Shaoxu Song and
Xiang Li and
Xiaojie Yuan Win-Win: On Simultaneous Clustering and
Imputing over Incomplete Data . . . . . 3045--3057
Shun Takagi and
Li Xiong and
Fumiyuki Kato and
Yang Cao and
Masatoshi Yoshikawa HRNet: Differentially Private
Hierarchical and Multi-Resolution
Network for Human Mobility Data
Synthesization . . . . . . . . . . . . . 3058--3071
Sijie Dong and
Qitong Wang and
Soror Sahri and
Themis Palpanas and
Divesh Srivastava Efficiently Mitigating the Impact of
Data Drift on Machine Learning Pipelines 3072--3081
Libin Wang and
Raymond Chi-Wing Wong PCSP: Efficiently Answering
Label-Constrained Shortest Path Queries
in Road Networks . . . . . . . . . . . . 3082--3094
Amela Fejza and
Pierre Genev\`es and
Nabil Laya\"\ida Efficient Enumeration of Recursive Plans
in Transformation-Based Query Optimizers 3095--3108
Mengyi Yan and
Wenfei Fan and
Yaoshu Wang and
Min Xie Enriching Relations with Additional
Attributes for ER . . . . . . . . . . . 3109--3123
Haibo Wang Enhancing Accuracy for Super Spreader
Identification in High-Speed Data
Streams . . . . . . . . . . . . . . . . 3124--3137
Jeremy Seeman and
William Sexton and
David Pujol and
Ashwin Machanavajjhala Privately Answering Queries on Skewed
Data via Per-Record Differential Privacy 3138--3150
Hani Al-Sayeh and
Muhammad Attahir Jibril and
Kai-Uwe Sattler Agile-Ant: Self-Managing Distributed
Cache Management for Cost Optimization
of Big Data Applications . . . . . . . . 3151--3164
Elias Alevizos and
Alexander Artikis and
Georgios Paliouras Complex Event Recognition with Symbolic
Register Transducers . . . . . . . . . . 3165--3177
Yupeng Xie and
Yuyu Luo and
Guoliang Li and
Nan Tang HAIChart: Human and AI Paired
Visualization System . . . . . . . . . . 3178--3191
Yun Wang and
Chrysanthi Kosyfaki and
Sihem Amer-Yahia and
Reynold Cheng A Sampling-Based Framework for
Hypothesis Testing on Large Attributed
Graphs . . . . . . . . . . . . . . . . . 3192--3200
Qinbin Li and
Junyuan Hong and
Chulin Xie and
Jeffrey Tan and
Rachel Xin and
Junyi Hou and
Xavier Yin and
Zhun Wang and
Dan Hendrycks and
Zhangyang Wang and
Bo Li and
Bingsheng He and
Dawn Song LLM-PBE: Assessing Data Privacy in Large
Language Models . . . . . . . . . . . . 3201--3214
Altan Birler and
Alfons Kemper and
Thomas Neumann Robust Join Processing with Diamond
Hardened Joins . . . . . . . . . . . . . 3215--3228
Rundong Zuo and
Guozhong Li and
Rui Cao and
Byron Choi and
Jianliang Xu and
Sourav S. Bhowmick DARKER: Efficient Transformer with
Data-Driven Attention Mechanism for Time
Series . . . . . . . . . . . . . . . . . 3229--3242
Yanping Wu and
Renjie Sun and
Xiaoyang Wang and
Dong Wen and
Ying Zhang and
Lu Qin and
Xuemin Lin Efficient Maximal Frequent Group
Enumeration in Temporal Bipartite Graphs 3243--3255
Daren Chao and
Yueting Chen and
Nick Koudas and
Xiaohui Yu Optimizing Video Queries with
Declarative Clues . . . . . . . . . . . 3256--3268
Lennart Behme and
Sainyam Galhotra and
Kaustubh Beedkar and
Volker Markl Fainder: a Fast and Accurate Index for
Distribution-Aware Dataset Search . . . 3269--3282
Ye Yuan and
Bo Tang and
Tianfei Zhou and
Zhiwei Zhang and
Jianbin Qin nsDB: Architecting the Next Generation
Database by Integrating Neural and
Symbolic Systems . . . . . . . . . . . . 3283--3289
Tobias Schmidt and
Dominik Durner and
Viktor Leis and
Thomas Neumann Two Birds With One Stone: Designing a
Hybrid Cloud Storage Engine for HTAP . . 3290--3303
Qizhen Zhang and
Philip A. Bernstein and
Badrish Chandramouli and
Jiasheng Hu and
Yiming Zheng DDS: DPU-Optimized Disaggregated Storage 3304--3317
Boyan Li and
Yuyu Luo and
Chengliang Chai and
Guoliang Li and
Nan Tang The Dawn of Natural Language to SQL: Are
We Fully Ready? . . . . . . . . . . . . 3318--3331
Michelle Si and
Jian Pei Counterfactual Explanation of Shapley
Value in Data Coalitions . . . . . . . . 3332--3345
Yi Zhang and
Peter Baile Chen and
Zachary G. Ives Searching Data Lakes for Nested and
Joined Data . . . . . . . . . . . . . . 3346--3359
Xinrui Wang and
Yiran Wang and
Xuemin Lin and
Jeffrey Xu Yu and
Hong Gao and
Xiuzhen Cheng and
Dongxiao Yu Efficient Betweenness Centrality
Computation over Large Heterogeneous
Information Networks . . . . . . . . . . 3360--3372
William Zhang and
Wan Shen Lim and
Matthew Butrovich and
Andrew Pavlo The Holon Approach for Simultaneously
Tuning Multiple Components in a
Self-Driving Database Management System
with Machine Learning via Synthesized
Proto-Actions . . . . . . . . . . . . . 3373--3387
Zhen Song and
Yu Gu and
Qing Sun and
Tianyi Li and
Yanfeng Zhang and
Yushuai Li and
Christian S. Jensen and
Ge Yu DynaHB: a Communication-Avoiding
Asynchronous Distributed Framework with
Hybrid Batches for Dynamic GNN Training 3388--3401
Qingshuai Feng and
Junhua Zhang and
Wenjie Zhang and
Lu Qin and
Ying Zhang and
Xuemin Lin Efficient $k$NN Search in Public
Transportation Networks . . . . . . . . 3402--3414
Yifan Yang and
Shimin Chen LITS: an Optimized Learned Index for
Strings . . . . . . . . . . . . . . . . 3415--3427
Alessandro Fogli and
Bo Zhao and
Peter Pietzuch and
Maximilian Bandle and
Jana Giceva OLAP on Modern Chiplet-Based Processors 3428--3441
Xiangpeng Hao and
Badrish Chandramouli Bf-Tree: a Modern Read-Write-Optimized
Concurrent Larger-Than-Memory Range
Index . . . . . . . . . . . . . . . . . 3442--3455
Patrick Hansert and
Sebastian Michel Partition, Don't Sort! Compression
Boosters for Cloud Data Ingestion
Pipelines . . . . . . . . . . . . . . . 3456--3469
Mahdi Erfanian and
H. V. Jagadish and
Abolfazl Asudeh Chameleon: Foundation Models for
Fairness-Aware Multi-Modal Data
Augmentation to Enhance Coverage of
Minorities . . . . . . . . . . . . . . . 3470--3483
Xiaoou Ding and
Yixing Lu and
Hongzhi Wang and
Chen Wang and
Yida Liu and
Jianmin Wang DAFDiscover: Robust Mining Algorithm for
Dynamic Approximate Functional
Dependencies on Dirty Data . . . . . . . 3484--3496
Hubert Mohr-Daurat and
Georgios Theodorakis and
Holger Pirk Hardware-Efficient Data Imputation
through DBMS Extensibility . . . . . . . 3497--3510
Immanuel Trummer Generating Succinct Descriptions of
Database Schemata for Cost-Efficient
Prompting of Large Language Models . . . 3511--3523
Tapan Srivastava and
Raul Castro Fernandez Saving Money for Analytical Workloads in
the Cloud . . . . . . . . . . . . . . . 3524--3537
Joohyung Yun and
Byungchul Tak and
Wook-Shin Han ReCG: Bottom-up JSON Schema Discovery
Using a Repetitive
Cluster-and-Generalize Framework . . . . 3538--3550
Gabriel Mersy and
Zhuo Wang and
Stavros Sintos and
Sanjay Krishnan Optimizing Collections of Bloom Filters
within a Space Budget . . . . . . . . . 3551--3564
Chenghao Lyu and
Qi Fan and
Philippe Guyard and
Yanlei Diao A Spark Optimizer for Adaptive,
Fine-Grained Parameter Tuning . . . . . 3565--3579
Zuozhi Wang and
Yicong Huang and
Shengquan Ni and
Avinash Kumar and
Sadeem Alsudais and
Xiaozhen Liu and
Xinyuan Lin and
Yunyan Ding and
Chen Li Texera: a System for Collaborative and
Interactive Data Analytics Using
Workflows . . . . . . . . . . . . . . . 3580--3588
Jin Ke and
Zenon Zacouris and
Maribel Acosta Efficient Validation of SHACL Shapes
with Reasoning . . . . . . . . . . . . . 3589--3601
Shuxian Wang and
Sicheng Pan and
Alvin Cheung QED: a Powerful Query Equivalence
Decider for SQL . . . . . . . . . . . . 3602--3614
Manish Kesarwani and
Jayant R. Haritsa Index Advisors on Quantum Platforms . . 3615--3628
Geoffrey X. Yu and
Ziniu Wu and
Ferdi Kossmann and
Tianyu Li and
Markos Markakis and
Amadou Ngom and
Samuel Madden and
Tim Kraska Blueprinting the Cloud: Unifying and
Automatically Optimizing Cloud Data
Infrastructures with BRAD . . . . . . . 3629--3643
Niv Dayan and
Ioana-Oriana Bercea and
Rasmus Pagh Aleph Filter: To Infinity in Constant
Time . . . . . . . . . . . . . . . . . . 3644--3656
Yuhang Chen and
Jiaxin Jiang and
Shixuan Sun and
Bingsheng He and
Min Chen RUSH: Real-Time Burst Subgraph Detection
in Dynamic Graphs . . . . . . . . . . . 3657--3665
Zhicheng Liang and
Yu Yang and
Xiangyu Ke and
Xiaokui Xiao and
Yunjun Gao A Benchmark Study of Deep-RL Methods for
Maximum Coverage Problems over Graphs 3666--3679
Wan Shen Lim and
Lin Ma and
William Zhang and
Matthew Butrovich and
Samuel Arch and
Andrew Pavlo Hit the Gym: Accelerating Query
Execution to Efficiently Bootstrap
Behavior Models for Self-Driving
Database Management Systems . . . . . . 3680--3693
Alexander van Renen and
Dominik Horn and
Pascal Pfeil and
Kapil Vaidya and
Wenjian Dong and
Murali Narayanaswamy and
Zhengchun Liu and
Gaurav Saxena and
Andreas Kipf and
Tim Kraska Why TPC is Not Enough: an Analysis of
the Amazon Redshift Fleet . . . . . . . 3694--3706
Lijun Chang and
Rashmika Gamage and
Jeffrey Xu Yu Efficient $k$-Clique Count Estimation
with Accuracy Guarantee . . . . . . . . 3707--3719
Tamer Eldeeb and
Sebastian Burckhardt and
Reuben Bond and
Asaf Cidon and
Junfeng Yang and
Philip A. Bernstein Cloud Actor-Oriented Database
Transactions in Orleans . . . . . . . . 3720--3730
Robert Schulze and
Tom Schreiber and
Ilya Yatsishin and
Ryadh Dahimene and
Alexey Milovidov ClickHouse --- Lightning Fast Analytics
for Everyone . . . . . . . . . . . . . . 3731--3744
Fusheng Han and
Hao Liu and
Bin Chen and
Debin Jia and
Jianfeng Zhou and
Xuwang Teng and
Chuanhui Yang and
Huafeng Xi and
Wei Tian and
Shuning Tao and
Sen Wang and
Quanqing Xu and
Zhenkun Yang PALF: Replicated Write-Ahead Logging for
Distributed Databases . . . . . . . . . 3745--3758
Yixin Wu and
Xiuqi Huang and
Zhongjia Wei and
Hang Cheng and
Chaohui Xin and
Zuzhi Chen and
Binbin Chen and
Yufei Wu and
Hao Wang and
Tieying Zhang and
Rui Shi and
Xiaofeng Gao and
Yuming Liang and
Pengwei Zhao and
Guihai Chen Towards Resource Efficiency: Practical
Insights into Large-Scale Spark
Workloads at ByteDance . . . . . . . . . 3759--3771
Cheng Chen and
Chenzhe Jin and
Yunan Zhang and
Sasha Podolsky and
Chun Wu and
Szu-Po Wang and
Eric Hanson and
Zhou Sun and
Robert Walzer and
Jianguo Wang SingleStore-V: an Integrated Vector
Database System in SingleStore . . . . . 3772--3785
Guoliang Li and
Wengang Tian and
Jinyu Zhang and
Ronen Grosman and
Zongchao Liu and
Sihao Li GaussDB: a Cloud-Native Multi-Primary
Database with Compute-Memory-Storage
Disaggregation . . . . . . . . . . . . . 3786--3798
Hao Wang and
Jiaxin Ou and
Ming Zhao and
Sheng Qiu and
Yizheng Jiao and
Yi Wang and
Qizhong Mao and
Zhengyu Yang and
Yang Liu and
Jianshun Zhang and
Jianyang Hu and
Jingwei Zhang and
Jinrui Liu and
Jiaqiang Chen and
Yong Shen and
Lixun Cao and
Heng Zhang and
Hongde Li and
Ming Li and
Yue Ma and
Lei Zhang and
Jian Liu and
Guanghui Zhang and
Fei Liu and
Jianjun Chen LavaStore: ByteDance's Purpose-Built,
High-Performance, Cost-Effective Local
Storage Engine for Cloud Services . . . 3799--3812
Andrei Paduroiu and
Sungheun Wi and
Yan Yan and
Roni Burd and
Ruhollah Farchtchi and
Giovanni Matteo Fumarola Membrane --- Safe and Performant Data
Access Controls in Apache Spark in the
Presence of Imperative Code . . . . . . 3813--3826
Minseon Ahn and
Thomas Willhalm and
Norman May and
Donghun Lee and
Suprasad Mutalik Desai and
Daniel Booss and
Jungmin Kim and
Navneet Singh and
Daniel Ritter and
Oliver Rebholz An Examination of CXL Memory Use Cases
for In-Memory Database Management
Systems Using SAP HANA . . . . . . . . . 3827--3840
Peng Yi and
Lei Liang and
Da Zhang and
Yong Chen and
Jinye Zhu and
Xiangyu Liu and
Kun Tang and
Jialin Chen and
Hao Lin and
Leijie Qiu and
Jun Zhou KGFabric: a Scalable Knowledge Graph
Warehouse for Enterprise Data
Interconnection . . . . . . . . . . . . 3841--3854
Alexander Bianchi and
Andrew Chai and
Vincent Corvinelli and
Parke Godfrey and
Jarek Szlichta and
Calisto Zuzarte Db2une: Tuning Under Pressure via Deep
Learning . . . . . . . . . . . . . . . . 3855--3868
Yuxing Chen and
Anqun Pan and
Hailin Lei and
Anda Ye and
Shuo Han and
Yan Tang and
Wei Lu and
Yunpeng Chai and
Feng Zhang and
Xiaoyong Du TDSQL: Tencent Distributed Database
System . . . . . . . . . . . . . . . . . 3869--3882
Tim Januschowski and
Yuyang Wang and
Jan Gasthaus and
Syama Rangapuram and
Caner Türkmen and
Jasper Zschiegner and
Lorenzo Stella and
Michael Bohlke-Schneider and
Danielle Maddix and
Konstantinos Benidis and
Alexander Alexandrov and
Christos Faloutsos and
Sebastian Schelter A Flexible Forecasting Stack . . . . . . 3883--3892
Bing Tong and
Yan Zhou and
Chen Zhang and
Jianheng Tang and
Jing Tang and
Leihong Yang and
Qiye Li and
Manwu Lin and
Zhongxin Bao and
Jia Li and
Lei Chen Galaxybase: a High Performance Native
Distributed Graph Database for HTAP . . 3893--3905
Xinying Yang and
Cong Yue and
Wenhui Zhang and
Yang Liu and
Beng Chin Ooi and
Jianjun Chen SecuDB: an In-Enclave Privacy-Preserving
and Tamper-Resistant Relational Database 3906--3919
Jun-Peng Zhu and
Peng Cai and
Kai Xu and
Li Li and
Yishen Sun and
Shuai Zhou and
Haihuang Su and
Liu Tang and
Qi Liu AutoTQA: Towards Autonomous Tabular
Question Answering through Multi-Agent
Large Language Models . . . . . . . . . 3920--3933
Xinchun Zhang and
Aqsa Kashaf and
Yihan Zou and
Wei Zhang and
Weibo Liao and
Haoxiang Song and
Jintao Ye and
Yakun Li and
Rui Shi and
Yong Tian and
Wei Feng and
Binbin Chen and
Zuzhi Chen and
Tieying Zhang and
Yongping Tang ResLake: Towards Minimum Job Latency and
Balanced Resource Utilization in
Geo-Distributed Job Scheduling . . . . . 3934--3946
Maryann Xue and
Yingyi Bu and
Abhishek Somani and
Wenchen Fan and
Ziqi Liu and
Steven Chen and
Herman van Hovell and
Bart Samwel and
Mostafa Mokhtar and
RK Korlapati and
Andy Lam and
Yunxiao Ma and
Vuk Ercegovac and
Jiexing Li and
Alexander Behm and
Yuanjian Li and
Xiao Li and
Sriram Krishnamurthy and
Amit Shukla and
Michalis Petropoulos and
Sameer Paranjpye and
Reynold Xin and
Matei Zaharia Adaptive and Robust Query Execution for
Lakehouses at Scale . . . . . . . . . . 3947--3959
Ed Davisson and
Tilo Dickopp and
David Gay and
Eric Karasuda and
Ram Kesavan and
Vadim Yushprakh Transparent Migration from Datastore to
Firestore . . . . . . . . . . . . . . . 3960--3972
Jinquan Hang and
Zhiqing Hong and
Xinyue Feng and
Guang Wang and
Dongjiang Cao and
Jiayang Qiao and
Haotian Wang and
Desheng Zhang Complex-Path: Effective and Efficient
Node Ranking with Paths in Billion-Scale
Heterogeneous Graphs . . . . . . . . . . 3973--3986
Wenjing Fang and
Shunde Cao and
Guojin Hua and
Junming Ma and
Yongqiang Yu and
Qunshan Huang and
Jun Feng and
Jin Tan and
Xiaopeng Zan and
Pu Duan and
Yang Yang and
Li Wang and
Ke Zhang and
Lei Wang SecretFlow-SCQL: a Secure Collaborative
Query Platform . . . . . . . . . . . . . 3987--4000
Hua Fan and
Dachao Fu and
Xu Wang and
Jiachi Zhang and
Chaoji Zuo and
Zhengyi Wu and
Miao Zhang and
Kang Yuan and
Xizi Ni and
Guocheng Huo and
Wenchao Zhou and
Feifei Li and
Jingren Zhou Towards Millions of Database
Transmission Services in the Cloud . . . 4001--4013
Tao Xiong and
Yong Wang Large-Scale Metric Computation in Online
Controlled Experiment Platform . . . . . 4014--4024
Hongyu Lei and
Chunhua Li and
Ke Zhou and
Jianping Zhu and
Kezhou Yan and
Fen Xiao and
Ming Xie and
Jiang Wang and
Shiyu Di X-Stor: a Cloud-Native NoSQL Database
Service with Multi-Model Support . . . . 4025--4037
Bradley Barnhart and
Marc Brooker and
Daniil Chinenkov and
Tony Hooper and
Jihoun Im and
Prakash Chandra Jha and
Tim Kraska and
Ashok Kurakula and
Alexey Kuznetsov and
Grant McAlister and
Arjun Muthukrishnan and
Aravinthan Narayanan and
Douglas Terry and
Bhuvan Urgaonkar and
Jiaming Yan Resource Management in Aurora Serverless 4038--4050
Jeff Shute and
Shannon Bales and
Matthew Brown and
Jean-Daniel Browne and
Brandon Dolphin and
Romit Kudtarkar and
Andrey Litvinov and
Jingchi Ma and
John Morcos and
Michael Shen and
David Wilhite and
Xi Wu and
Lulan Yu SQL Has Problems. We Can Fix Them: Pipe
Syntax In SQL . . . . . . . . . . . . . 4051--4063
Xin Zhao and
Jialin Qiao and
Xiangdong Huang and
Chen Wang and
Shaoxu Song and
Jianmin Wang Apache TsFile: an IoT-Native Time Series
File Format . . . . . . . . . . . . . . 4064--4076
Pranjal Shankhdhar and
Feilong Liu and
Jay Narale and
James Sun and
Rebecca Schlussel and
Lyublena Antova Presto's History-Based Query Optimizer 4077--4089
Ding Zou and
Wei Lu and
Zhibo Zhu and
Xingyu Lu and
Jun Zhou and
Xiaojin Wang and
Kangyu Liu and
Kefan Wang and
Renen Sun and
Haiqing Wang OptScaler: a Collaborative Framework for
Robust Autoscaling in the Cloud . . . . 4090--4103
Joshua Wu and
Dixin Tang and
Nithin Chalapathi and
Tristan Chambers and
Julie Ciccolini and
Cheryl Phillips and
Lisa Pickoff-White and
Aditya Parameswaran Dealing with Acronyms, Abbreviations,
and Typos in Real-World Entity Matching 4104--4116
Qianyu Ouyang and
Chunhui Shen and
Wenlong Yang and
Peng Yu and
Qiang Xiao and
Jianhui Lei and
Yadong Chen and
Qilu Zhong and
Xiang Wang and
Yong Lin and
Qingyi Meng and
Zhicheng Ji and
Wei Meng and
Cen Zheng and
Sheng Wang and
Dan Pei and
Wei Zhang and
Feifei Li and
Jingren Zhou Lindorm-UWC: an Ultra-Wide-Column
Database for Internet of Vehicles . . . 4117--4129
Qinlong Wang and
Tingfeng Lan and
Yinghao Tang and
Bo Sang and
Ziling Huang and
Yiheng Du and
Haitao Zhang and
Jian Sha and
Hui Lu and
Yuanchun Zhou and
Ke Zhang and
Mingjie Tang DLRover-RM: Resource Optimization for
Deep Recommendation Models Training in
the Cloud . . . . . . . . . . . . . . . 4130--4144
Bing Zhang and
Vadym Doroshenko and
Peter Kairouz and
Thomas Steinke and
Abhradeep Thakurta and
Ziyin Ma and
Eidan Cohen and
Himani Apte and
Jodi Spacek Differentially Private Stream Processing
at Scale . . . . . . . . . . . . . . . . 4145--4158
Anton Okolnychyi and
Chao Sun and
Kazuyuki Tanimura and
Russell Spitzer and
Ryan Blue and
Szehon Ho and
Yufei Gu and
Vishwanath Lakkundi and
DB Tsai Petabyte-Scale Row-Level Operations in
Data Lakehouses . . . . . . . . . . . . 4159--4172
Shreya Shankar and
Haotian Li and
Parth Asawa and
Madelon Hulsebos and
Yiming Lin and
J. D. Zamfirescu-Pereira and
Harrison Chase and
Will Fu-Hinthorn and
Aditya G. Parameswaran and
Eugene Wu spade: Synthesizing Data Quality
Assertions for Large Language Model
Pipelines . . . . . . . . . . . . . . . 4173--4186
Laith Sakka and
Pedro Pedreira and
Orri Erling and
Masha Basmanova and
Kevin Wilfong and
Wei He and
Xiaoxuan Meng and
Krishna Pai and
Bikramjeet Vig Simple (yet Efficient) Function
Authoring for Vectorized Engines . . . . 4187--4199
Rafi Ahmed and
Krishna Kantikiran Pasupuleti and
Sriram Tirupattur and
Lei Sheng and
Hong Su and
Mohamed Ziauddin Grouping, Subsumption, and Disjunctive
Join Optimizations in Oracle . . . . . . 4200--4212
Guoliang Li and
Xuanhe Zhou and
Xinyang Zhao LLM for Data Management . . . . . . . . 4213--4216
Quanqing Xu and
Chuanhui Yang and
Aoying Zhou Native Distributed Databases: Problems,
Challenges and Opportunities . . . . . . 4217--4220
Tim Fischer and
Denis Hirn and
Gökhan Kul A Reproducible Tutorial on
Reproducibility in Database Systems
Research . . . . . . . . . . . . . . . . 4221--4224
Senjuti Basu Roy and
Baruch Schieber and
Nimrod Talmon Fairness in Preference Queries: Social
Choice Theories Meet Data Management . . 4225--4228
Qinghua Liu and
Paul Boniol and
Themis Palpanas and
John Paparrizos Time-Series Anomaly Detection: Overview
and New Trends . . . . . . . . . . . . . 4229--4232
Faisal Nawab and
Mohammad Sadoghi Consensus in Data Management: With Use
Cases in Edge-Cloud and Blockchain
Systems . . . . . . . . . . . . . . . . 4233--4236
Yanyan Shen and
Lei Chen and
Jingzhi Fang and
Xin Zhang and
Shihong Gao and
Hongbo Yin Efficient Training of Graph Neural
Networks on Large Graphs . . . . . . . . 4237--4240
Marcos N. L. Carvalho and
Alkis Simitsis and
Anna Queralt and
Oscar Romero Workload Placement on Heterogeneous
CPU-GPU Systems . . . . . . . . . . . . 4241--4244
Xin Zhang and
Ahmed Eldawy Spatial Query Optimization With Learning 4245--4248
Pedro Pedreira and
Deepak Majeti and
Orri Erling Composable Data Management: an Execution
Overview . . . . . . . . . . . . . . . . 4249--4252
Jiaxin Jiang and
Zhen Zhang and
Bingqiao Luo and
Bingsheng He and
Min Chen and
WeiYang Wang and
Jia Chen Spade: a Real-Time Fraud Detection
Framework . . . . . . . . . . . . . . . 4253--4256
Dingyuan Shi and
Bingchen Song and
Yuanyuan Zhang and
Haolong Yang and
Ke Xu A Data-Driven Spatiotemporal Simulator
for Reinforcement Learning Methods . . . 4257--4260
Haoyun Qin and
Chenyuan Wu and
Mohammad Javad Amiri and
Ryan Marcus and
Boon Thau Loo BFTGym: an Interactive Playground for
BFT Protocols . . . . . . . . . . . . . 4261--4264
Angela Bonifati and
Yann Ramusat and
Filip Murlak and
Amela Fejza and
Rachid Echahed DTGraph: Declarative Transformations of
Property Graphs . . . . . . . . . . . . 4265--4268
Brian Kroth and
Sergiy Matusevych and
Rana Alotaibi and
Yiwen Zhu and
Anja Gruenheid and
Yuanyuan Tian MLOS in Action: Bridging the Gap Between
Experimentation and Auto-Tuning in the
Cloud . . . . . . . . . . . . . . . . . 4269--4272
Sebastian Schelter and
Stefan Grafberger and
Maarten de Rijke Snarcase --- Regain Control over Your
Predictions with Low-Latency Machine
Unlearning . . . . . . . . . . . . . . . 4273--4276
Yiqun Sun and
Qiang Huang and
Yanhao Wang and
Anthony K. H. Tung DiversiNews: Enriching News Consumption
with Relevant Yet Diverse News Articles
Retrieval . . . . . . . . . . . . . . . 4277--4280
Dean Light and
Ahmad Aiashy and
Mahmoud Diab and
Daniel Nachmias and
Stijn Vansummeren and
Benny Kimelfeld SpannerLib: Embedding Declarative
Information Extraction in an Imperative
Workflow . . . . . . . . . . . . . . . . 4281--4284
Roni Copul and
Nave Frost and
Tova Milo and
Kathy Razmadze Demonstrating TabEE: Tabular Embedding
Explanations . . . . . . . . . . . . . . 4285--4288
Daomin Ji and
Hui Luo and
Zhifeng Bao and
Shane Culpepper Navigating Data Repositories: Utilizing
Line Charts to Discover Relevant
Datasets . . . . . . . . . . . . . . . . 4289--4292
Wenfei Fan and
Daji Li and
Peiyu Liang and
Shuhao Liu and
Yaoshu Wang and
Yiming Wang and
Min Xie and
Runjie Zhang Graph Association Analyses for Early
Drug Discovery . . . . . . . . . . . . . 4293--4296
Lindsey Linxi Wei and
Chung Yik Edward Yeung and
Hongjian Yu and
Jingchuan Zhou and
Dong He and
Magdalena Balazinska Demonstration of MaskSearch: Efficiently
Querying Image Masks for Machine
Learning Workflows . . . . . . . . . . . 4297--4300
Stefan Halfpap and
Jan Kossmann and
Rainer Schlosser and
Volker Markl Looking Deeply into the Magic Mirror: an
Interactive Analysis of Database Index
Selection Approaches . . . . . . . . . . 4301--4304
Whanhee Cho and
Anna Fariha Utopia: Automatic Pivot Table Assistant 4305--4308
Yihao Ang and
Yifan Bao and
Qiang Huang and
Anthony K. H. Tung and
Zhiyong Huang TSGAssist: an Interactive Assistant
Harnessing LLMs and RAG for Time Series
Generation Recommendations and
Benchmarking . . . . . . . . . . . . . . 4309--4312
Wonseok Lee and
Jaehyun Ha and
Wook-Shin Han and
Changgyoo Park and
Myunggon Park and
Juhyeng Han DoppelGanger++ in Action: a Database
Replay System with Fast Dependency Graph
Generation . . . . . . . . . . . . . . . 4313--4316
Eugenie Lai and
Yuze Lou and
Brit Youngmann and
Michael Cafarella LucidScript: Bottom-Up Standardization
for Data Preparation . . . . . . . . . . 4317--4320
Ahmad Ghazal and
Zhiyuan Liang and
Sunny Bains and
Hanumath Maduri OSSInsight: Scalable GitHub Analysis . . 4321--4324
Long Gu and
Si Liu and
Tiancheng Xing and
Hengfeng Wei and
Yuxing Chen and
David Basin IsoVista: Black-Box Checking Database
Isolation Guarantees . . . . . . . . . . 4325--4328
Mourad Khayati and
Quentin Nater and
Jacques Pasquier ImputeVIS: an Interactive Evaluator to
Benchmark Imputation Techniques for Time
Series Data . . . . . . . . . . . . . . 4329--4332
Mengzhao Wang and
Haotian Wu and
Xiangyu Ke and
Yunjun Gao and
Xiaoliang Xu and
Lu Chen An Interactive Multi-Modal Query
Answering System with
Retrieval-Augmented Large Language
Models . . . . . . . . . . . . . . . . . 4333--4336
Victor Giannakouris and
Immanuel Trummer DBG-PT: a Large Language Model Assisted
Query Performance Regression Debugger 4337--4340
Felix S. Campbell and
Julia Stoyanovich and
Yuval Moskovitch Rodeo: Making Refinements for Diverse
Top-$K$ Queries . . . . . . . . . . . . 4341--4344
Xin Zhang and
Ahmed Eldawy QPJVis Demo: Quality-Boost Progressive
Join Query Processing System . . . . . . 4345--4348
Peter M. VanNostrand and
Dennis M. Hofmann and
Lei Ma and
Belisha Genin and
Randy Huang and
Elke. A. Rundensteiner Counterfactual Explanation Analytics:
Empowering Lay Users to Take Action
Against Consequential Automated
Decisions . . . . . . . . . . . . . . . 4349--4352
Zhenrong Xu and
Pengfei Wang and
Guoze Xue and
Qitong Yan and
Shenghao Gong and
Yelan Jiang and
Yuren Mao and
Yunjun Gao and
Shu Shen and
Wei Zhang and
Dan Luo and
Lu Chen UniView: a Unified Autonomous
Materialized View Management System for
Various Databases . . . . . . . . . . . 4353--4356
Yuanyuan Yao and
Shenjia Dai and
Yilin Li and
Lu Chen and
Dimeng Li and
Yunjun Gao and
Tianyi Li A Demonstration of TENDS: Time Series
Management System Based on Model
Selection . . . . . . . . . . . . . . . 4357--4360
Luca Althaus and
Mourad Khayati and
Abdelouahab Khelifati and
Anton Dignös and
Djellel Difallah and
Philippe Cudré-Mauroux SEER: an End-to-End Toolkit for
Benchmarking Time Series Database
Systems in Monitoring Applications . . . 4361--4364
Siqiao Xue and
Danrui Qi and
Caigao Jiang and
Fangyin Cheng and
Keting Chen and
Zhiping Zhang and
Hongyang Zhang and
Ganglin Wei and
Wang Zhao and
Fan Zhou and
Hong Yi and
Shaodong Liu and
Hongjun Yang and
Faqiang Chen Demonstration of DB-GPT: Next Generation
Data Interaction System Empowered by
Large Language Models . . . . . . . . . 4365--4368
Zheng Zhang and
Zhuhan Shao and
Andrew Crotty DeepSketch: a Query Sketching Interface
for Deep Time Series Similarity Search 4369--4372
Zian Bao and
Binbin Bie and
Wenfei Fan and
Daji Li and
Mengyun Li and
Kaiwen Lin and
Wei Lin and
Peijie Liu and
Peng Liu and
Zhicong Lv and
Mingliang Ouyang and
Chenyang Sun and
Shuai Tang and
Yaoshu Wang and
Qiyuan Wei and
Xiangqian Wu and
Min Xie and
Jing Zhang and
Runxiao Zhao and
Jie Zhu and
Yilin Zhu Rock: Cleaning Data with both ML and
Logic Rules . . . . . . . . . . . . . . 4373--4376
Xiaoou Ding and
Yichen Song and
Hongzhi Wang and
Donghua Yang and
Chen Wang and
Jianmin Wang Clean4TSDB: a Data Cleaning Tool for
Time Series Databases . . . . . . . . . 4377--4380
Chengliang Chai and
Yuhao Deng and
Yutong Zhan and
Ziqi Cao and
Yuanfang Zhang and
Lei Cao and
Yuping Wang and
Zhiwei Zhang and
Ye Yuan and
Guoren Wang and
Nan Tang LakeCompass: an End-to-End System for
Data Maintenance, Search and Analysis in
Data Lakes . . . . . . . . . . . . . . . 4381--4384
Jianzhe Yu and
Wei Dong and
Juanru Fang and
Dajun Sun and
Ke Yi DOP-SQL: a General-Purpose,
High-Utility, and Extensible Private SQL
System . . . . . . . . . . . . . . . . . 4385--4388
Baolong Mei and
Yafei Li and
Wei Chen and
Linshen Luan and
Guanglei Zhu and
Yuanyuan Jin and
Jianliang Xu Catcher: a Cache Analysis System for
Top-$k$ Pub/Sub Service . . . . . . . . 4389--4392
Sotiris Vasileiadis and
Matthew Paraskeva and
George Savva and
Andreas Efstathiou and
Edson Ramiro Lucas Filho and
Jianqiang Shen and
Lun Yang and
Kebo Fu and
Herodotos Herodotou Optimizing Distributed Tiered Data
Storage Systems with DITIS . . . . . . . 4393--4396
Zhaozhuo Li and
Xin Wang and
Meng Wang and
Yajun Yang and
Bohan Li and
Dong Han VQFT: a Visual Query Approach Based on
Full-Text Search for Knowledge Graphs 4397--4400
Zhen Zhu and
Yibo Wang and
Shouqing Yang and
Lin Long and
Runze Wu and
Xiu Tang and
Junbo Zhao and
Haobo Wang CORAL: Collaborative Automatic Labeling
System Based on Large Language Models 4401--4404
Wangze Ni and
Yiwei Zhao and
Pengze Chen and
Lei Chen and
Peng Cheng and
Chen Jason Zhang CMixing: an Efficient Coin Mixing
Platform to Enhance Anonymity in
Cryptocurrency Transactions . . . . . . 4405--4408
Eliot W. Robson and
Dhemath Reddy and
Abhishek K. Umrawal CyNetDiff: a Python Library for
Accelerated Implementation of Network
Diffusion Models . . . . . . . . . . . . 4409--4412
Zhe Fu and
Mo Sha and
Yiran Li and
Huorong Li and
Yubing Ma and
Sheng Wang and
Feifei Li EncChain: Enhancing Large Language Model
Applications with Advanced Privacy
Preservation Techniques . . . . . . . . 4413--4416
Nima Shahbazi and
Mahdi Erfanian and
Abolfazl Asudeh and
Fatemeh Nargesian and
Divesh Srivastava FairEM360: a Suite for Responsible
Entity Matching . . . . . . . . . . . . 4417--4420
Zan Ahmad Naeem and
Mohammad Shahmeer Ahmad and
Mohamed Eltabakh and
Mourad Ouzzani and
Nan Tang RetClean: Retrieval-Based Data Cleaning
Using LLMs and Data Lakes . . . . . . . 4421--4424
Franco Solleza and
Shihang Li and
William Sun and
Richard Tang and
Malte Schwarzkopf and
Nesime Tatbul and
Andrew Crotty and
David Cohen and
Stan Zdonik Mach: Firefighting Time-Critical Issues
in Complex Systems Using High-Frequency
Telemetry . . . . . . . . . . . . . . . 4425--4428
Renzhi Wu and
Pramod Chunduri and
Dristi J Shah and
Ashmitha Julius Aravind and
Ali Payani and
Xu Chu and
Joy Arulraj and
Kexin Rong SketchQL Demonstration: Zero-Shot Video
Moment Querying with Sketches . . . . . 4429--4432
Yiding Zhu and
Hongwei Zhang and
Jiayao Zhang and
Jinfei Liu and
Kui Ren DataPrice: an Interactive System for
Pricing Datasets in Data Marketplaces 4433--4436
Pinhan Zhao and
Yang He and
Xinyu Wang and
Yuepeng Wang Demonstration of the VeriEQL Equivalence
Checker for Complex SQL Queries . . . . 4437--4440
Zeqi Zhu and
Zeheng Fan and
Yuxiang Zeng and
Yexuan Shi and
Yi Xu and
Mengmeng Zhou and
Jin Dong FedSQ: a Secure System for Federated
Vector Similarity Queries . . . . . . . 4441--4444
Shuyue Wei and
Yuanyuan Zhang and
Zimu Zhou and
Tianlong Zhang and
Ke Xu FedSM: a Practical Federated Shared
Mobility System . . . . . . . . . . . . 4445--4448
Alexander van Renen and
Mihail Stoian and
Andreas Kipf DataLoom: Simplifying Data Loading with
LLMs . . . . . . . . . . . . . . . . . . 4449--4452
Jie Jeff Xu and
Saahir Dhanani and
Jorge Piazentin Ono and
Wenbin He and
Liu Ren and
Kexin Rong Demonstration of VCR: a Tabular Data
Slicing Approach to Understanding Object
Detection Model Performance . . . . . . 4453--4456
Mengying Wang and
Hanchao Ma and
Sheng Guan and
Yiyang Bian and
Haolai Che and
Abhishek Daundkar and
Alp Sehirlioglu and
Yinghui Wu ModsNet: Performance-Aware Top- k Model
Search Using Exemplar Datasets . . . . . 4457--4460
Linshan Jiang and
Moming Duan and
Bingsheng He and
Yulin Sun and
Peishen Yan and
Yang Hua and
Tao Song OFL-W3: a One-Shot Federated Learning
System on Web 3.0 . . . . . . . . . . . 4461--4464
Chang Gao and
Tianlong Zhang and
Yuxiang Zeng and
Yi Xu and
Shuyuan Li and
Yuanyuan Zhang Swift: a Data-Driven Flight Planning
System at Scale . . . . . . . . . . . . 4465--4468
Laila Abdelhafeez and
Andres Calderon-Romero and
Amr Magdy and
Vassilis J. Tsotras Pyneapple-G: Scalable Spatial Grouping
Queries . . . . . . . . . . . . . . . . 4469--4472
Itay Elyashiv and
Amir Gilad and
Edna Isakov and
Tal Tikochinsky and
Amit Somech PD-Explain: a Unified Python-Native
Framework for Query Explanations Over
DataFrames . . . . . . . . . . . . . . . 4473--4476
Jiawei Guan and
Feng Zhang and
Yuxin Tang and
Weitang Ye and
Xiaoyong Du HocoPG: a Database System with
Homomorphic Compression for Text
Processing . . . . . . . . . . . . . . . 4477--4480
Xinyang Zhao and
Xuanhe Zhou and
Guoliang Li Chat$2$Data: an Interactive Data
Analysis System with RAG, Vector
Databases and LLMs . . . . . . . . . . . 4481--4484
Shuhao Liu and
Yang Liu and
Wenfei Fan PrismX: a Single-Machine System for
Querying Big Graphs . . . . . . . . . . 4485--4488
Zhiyu Liang and
Chen Liang and
Zheng Liang and
Hongzhi Wang and
Bo Zheng TimeCSL: Unsupervised Contrastive
Learning of General Shapelets for
Explorable Time Series Analysis . . . . 4489--4492
Xiangmin Zhou and
Chengkun He and
Xi Chen and
Yanchun Zhang HSAP: a Human-in-the-Loop Social
Media-Based Situation Awareness Platform 4493--4496
Yiqi Shen and
Sijia Li and
Miaodong Shen and
Peng Cai and
Weiyuan Xu and
Kai Li and
Jinlong Cai DB-MAGS: Multi-Anomaly Data Generation
System for Transactional Databases . . . 4497--4500
Peizhi Wu and
Yi Zhang and
Wang-Chiew Tan and
Zachary G. Ives QuoteInspector: Gaining Insight about
Social Media Discussions . . . . . . . . 4501--4504
Jianguo Wang and
Eric Hanson and
Guoliang Li and
Yannis Papakonstantinou and
Harsha Simhadri and
Charles Xie Vector Databases: What's Really New and
What's Next? (VLDB 2024 Panel) . . . . . 4505--4506
Fusheng Wang and
Rubao Lee and
Dejun Teng and
Xiaodong Zhang and
Joel Saltz High-Performance Spatial Data Analytics:
Systematic R&D for Scale-Out and Scale-Up
Solutions from the Past to Now . . . . . 4507--4520
Sihem Amer-Yahia Intelligent Agents for Data Exploration 4521--4530
Arun Kumar Reimagining Deep Learning Systems
through the Lens of Data Systems . . . . 4531--4535
Viktor Leis LeanStore: a High-Performance Storage
Engine for NVMe SSDs . . . . . . . . . . 4536--4545
Samuel Madden and
Michael Cafarella and
Michael Franklin and
Tim Kraska Databases Unbound: Querying All of the
World's Bytes with AI . . . . . . . . . 4546--4554
Xiaokui Xiao Sharing Information with Differential
Privacy: a Database Perspective (VLDB
2024 Keynote) . . . . . . . . . . . . . 4555--4555
Fatma Ozcan Harmonizing ML and Databases: a Symphony
of Data (VLDB 2024 Keynote) . . . . . . 4556--4556
Michael Fruth and
Stefanie Scherzinger The Case for DBMS Live Patching . . . . 4557--4570
Guanghua Li and
Hao Zhang and
Xibo Sun and
Qiong Luo and
Yuanyuan Zhu TenGraph: a Tensor-Based Graph Query
Engine . . . . . . . . . . . . . . . . . 4571--4584
Tsz Nam Chan and
Bojian Zhu and
Dingming Wu and
Yun Peng and
Leong Hou U. LARGE: a Length-Aggregation-Based Grid
Structure for Line Density Visualization 4585--4598
Haoran Zhang and
Shuai Mu and
Sebastian Angel and
Vincent Liu CausalMesh: a Causal Cache for Stateful
Serverless Computing . . . . . . . . . . 4599--4613
Luigi Bellomarini and
Davide Benedetto and
Matteo Brandetti and
Emanuel Sallinger and
Adriano Vlad The Vadalog Parallel System: Distributed
Reasoning with Datalog+/- . . . . . . . 4614--4626
Haibo Xiu and
Pankaj K. Agarwal and
Jun Yang PARQO: Penalty-Aware Robust Plan
Selection in Query Optimization . . . . 4627--4640
Zezhong Xu and
Yincen Qu and
Wen Zhang and
Lei Liang and
Huajun Chen InBox: Recommendation with Knowledge
Graph using Interest Box Embedding . . . 4641--4654
Zongyan He and
Jeffrey Xu Yu A Branch-&-Bound Algorithm for Fractional
Hypertree Decomposition . . . . . . . . 4655--4667
Zeyu Wang and
Qitong Wang and
Xiaoxing Cheng and
Peng Wang and
Themis Palpanas and
Wei Wang Steiner-Hardness: a Query Hardness
Measure for Graph-Based ANN Indexes . . 4668--4682
Yiqi Wang and
Long Yuan and
Wenjie Zhang and
Zi Chen and
Xuemin Lin and
Qing Liu Simpler is More: Efficient Top-$K$
Nearest Neighbors Search on Large Road
Networks . . . . . . . . . . . . . . . . 4683--4695
Tim Fischer and
Denis Hirn and
Torsten Grust SQL Engines Excel at the Execution of
Imperative Programs . . . . . . . . . . 4696--4708
Bobbi Yogatama and
Weiwei Gong and
Xiangyao Yu Scaling your Hybrid CPU-GPU DBMS to
Multiple GPUs . . . . . . . . . . . . . 4709--4722
Abduvoris Abduvakhobov and
Sòren Kejser Jensen and
Torben Bach Pedersen and
Christian Thomsen Scalable Model-Based Management of
Massive High Frequency Wind Turbine Data
with ModelarDB . . . . . . . . . . . . . 4723--4732
Tejasvi Kashi and
Kenneth Salem and
Jaemyung Kim and
Khuzaima Daudjee Eventual Durability . . . . . . . . . . 4733--4745
Ruizhong Qiu and
Jun-Gi Jang and
Xiao Lin and
Lihui Liu and
Hanghang Tong TUCKET: a Tensor Time Series Data
Structure for Efficient and Accurate
Factor Analysis over Time Ranges . . . . 4746--4759
Yuchen Meng and
Rong-Hua Li and
Longlong Lin and
Xunkai Li and
Guoren Wang Topology-Preserving Graph Coarsening: an
Elementary Collapse-Based Approach . . . 4760--4772
Guanli Liu and
Lars Kulik and
Christian S. Jensen and
Tianyi Li and
Renata Borovica-Gajic and
Jianzhong Qi Efficient Cost Modeling of Space-Filling
Curves . . . . . . . . . . . . . . . . . 4773--4785
Eduardo Reis and
Mohamed Abdelaal and
Carsten Binnig Generalizable Data Cleaning of Tabular
Data in Latent Space . . . . . . . . . . 4786--4798
Angelos Christos Anadiotis and
Muhammad Ghufran Khan and
Ioana Manolescu Dynamic Graph Databases with
Out-of-Order Updates . . . . . . . . . . 4799--4812
Lingze Zeng and
Naili Xing and
Shaofeng Cai and
Gang Chen and
Beng Chin Ooi and
Jian Pei and
Yuncheng Wu Powering In-Database Dynamic Model
Slicing for Structured Data Analytics 4813--4826
Hongfu Li and
Qian Tao and
Song Yu and
Shufeng Gong and
Yanfeng Zhang and
Feng Yao and
Wenyuan Yu and
Ge Yu and
Jingren Zhou GastCoCo: Graph Storage and
Coroutine-Based Prefetch Co-Design for
Dynamic Graph Processing . . . . . . . . 4827--4839
Xiaoou Ding and
Yichen Song and
Hongzhi Wang and
Chen Wang and
Donghua Yang MTSClean: Efficient Constraint-Based
Cleaning for Multi-Dimensional Time
Series Data . . . . . . . . . . . . . . 4840--4852
Abd Errahmane Kiouche and
Julien Baste and
Mohammed Haddad and
Hamida Seba and
Angela Bonifati Neighborhood-Preserving Graph
Sparsification . . . . . . . . . . . . . 4853--4866
Matthias Urban and
Carsten Binnig ELEET: Efficient Learned Query Execution
over Text and Tables . . . . . . . . . . 4867--4880
Samuel Arch and
Yuchen Liu and
Todd C. Mowry and
Jignesh M. Patel and
Andrew Pavlo The Key to Effective UDF Optimization:
Before Inlining, First Perform Outlining 1--13
Milad Rezaei Hajidehi and
Sraavan Sridhar and
Margo Seltzer CUTTANA: Scalable Graph Partitioning for
Faster Distributed Graph Databases and
Analytics . . . . . . . . . . . . . . . 14--27
Guido Moerkotte Cardinality Estimation for
Having-Clauses . . . . . . . . . . . . . 28--41
Wenqi Jiang and
Marco Zeller and
Roger Waleffe and
Torsten Hoefler and
Gustavo Alonso Chameleon: a Heterogeneous and
Disaggregated Accelerator System for
Retrieval-Augmented Language Models . . 42--52
Zhaodonghui Li and
Haitao Yuan and
Huiming Wang and
Gao Cong and
Lidong Bing LLM-R$^2$: a Large Language Model
Enhanced Rule-Based Rewrite System for
Boosting Query Efficiency . . . . . . . 53--65
Hanfei Yu and
Jacob Carter and
Hao Wang and
Devesh Tiwari and
Jian Li and
Seung-Jong Park Nitro: Boosting Distributed
Reinforcement Learning with Serverless
Computing . . . . . . . . . . . . . . . 66--79
Silin Zhou and
Shuo Shang and
Lisi Chen and
Christian S. Jensen and
Panos Kalnis RED: Effective Trajectory Representation
Learning with Comprehensive Information 80--92
Anna Arpaci-Dusseau and
Zixiang Zhou and
Xuhao Chen Accurate and Fast Approximate Graph
Pattern Mining at Scale . . . . . . . . 93--107
Xiu Tang and
Wenhao Liu and
Sai Wu and
Chang Yao and
Gongsheng Yuan and
Shanshan Ying and
Gang Chen QueryArtisan: Generating Data
Manipulation Codes for Ad-hoc Analysis
in Data Lakes . . . . . . . . . . . . . 108--116
Yiqian Huang and
Shiqi Zhang and
Laks V. S. Lakshmanan and
Wenqing Lin and
Xiaokui Xiao and
Bo Tang Efficient and Effective Algorithms for A
Family of Influence Maximization
Problems with A Matroid Constraint . . . 117--129
Kyle Deeds and
Diandre Sabale and
Moe Kayali and
Dan Suciu Color: a Framework for Applying Graph
Coloring to Subgraph Cardinality
Estimation . . . . . . . . . . . . . . . 130--143
Xinle Wu and
Xingjian Wu and
Dalin Zhang and
Miao Zhang and
Chenjuan Guo and
Bin Yang and
Christian S. Jensen Fully Automated Correlated Time Series
Forecasting in Minutes . . . . . . . . . 144--157
Markos Markakis and
Brit Youngmann and
Trinity Gao and
Ziyu Zhang and
Rana Shahout and
Peter Baile Chen and
Chunwei Liu and
Ibrahim Sabek and
Michael Cafarella From Logs to Causal Inference:
Diagnosing Large Systems . . . . . . . . 158--172
Xin Ai and
Hao Yuan and
Zeyu Ling and
Qiange Wang and
Yanfeng Zhang and
Zhenbo Fu and
Chaoyi Chen and
Yu Gu and
Ge Yu NeutronTP: Load-Balanced Distributed
Full-Graph GNN Training with Tensor
Parallelism . . . . . . . . . . . . . . 173--186
Yuhan Liu and
Sheng Wang and
Yixuan Liu and
Feifei Li and
Hong Chen Unleash the Power of Ellipsis:
Accuracy-Enhanced Sparse Vector
Technique with Exponential Noise . . . . 187--199
Lijun Chang Maximum Defective Clique Computation:
Improved Time Complexities and Practical
Performance . . . . . . . . . . . . . . 200--212
Yanni Tang and
Zhuoxing Zhang and
Kaiqi Zhao and
Lanting Fang and
Zhenhua Li and
Wu Chen Substructure-Aware Log Anomaly Detection 213--225
Hao Miao and
Ziqiao Liu and
Yan Zhao and
Chenjuan Guo and
Bin Yang and
Kai Zheng and
Christian S. Jensen Less is More: Efficient Time Series
Dataset Condensation via Two-Fold Modal
Matching . . . . . . . . . . . . . . . . 226--238
Yunyao Cheng and
Chenjuan Guo and
Bin Yang and
Haomin Yu and
Kai Zhao and
Christian S. Jensen A Memory Guided Transformer for Time
Series Forecasting . . . . . . . . . . . 239--252
Chuxuan Hu and
Austin Peters and
Daniel Kang LEAP: LLM-Powered End-to-End Automatic
Library for Processing Social Science
Queries on Unstructured Data . . . . . . 253--264
Duc Kieu and
Tung Kieu and
Peng Han and
Bin Yang and
Christian S. Jensen and
Bac Le TEAM: Topological Evolution-Aware
Framework for Traffic Forecasting . . . 265--278
Geonho Lee and
Jeongho Park and
Min-Soo Kim Chimera: a System Design of Dual Storage
and Traversal-Join Unified Query
Processing for SQL\slash PGQ . . . . . . 279--292
Nikolai Merkel and
Pierre Toussing and
Ruben Mayer and
Hans-Arno Jacobsen Can Graph Reordering Speed Up Graph
Neural Network Training? An Experimental
Study . . . . . . . . . . . . . . . . . 293--307
Xiaoke Zhu and
Min Xie and
Ting Deng and
Qi Zhang HyperBlocker: Accelerating Rule-Based
Blocking in Entity Resolution Using GPUs 308--321
Yangfan Jiang and
Xinjian Luo and
Yin Yang and
Xiaokui Xiao Calibrating Noise for Group Privacy in
Subsampled Mechanisms . . . . . . . . . 322--334
Yi Liu and
Minghao Xie and
Shouqian Shi and
Yuanchao Xu and
Heiner Litz and
Chen Qian Outback: Fast and
Communication-Efficient Index for
Key--Value Store on Disaggregated Memory 335--348
Yunhao Mao and
Gengrui Zhang and
Zongxin Liu and
Pezhman Nasirifard and
Sofia Tijanic and
Hans-Arno Jacobsen Making CRDTs Not So Eventual . . . . . . 349--362
Shuohao Gao and
Kaiqiang Yu and
Shengxin Liu and
Cheng Long Maximum $k$-Plex Search: an Alternated
Reduction-and-Bound Method . . . . . . . 363--376
Patrick Schäfer and
Ulf Leser Discovering Leitmotifs in
Multidimensional Time Series . . . . . . 377--389
Chuang Yang and
Renhe Jiang and
Xiaohang Xu and
Chuan Xiao and
Kaoru Sezaki SIMformer: Single-Layer Vanilla
Transformer Can Learn Free-Space
Trajectory Similarity . . . . . . . . . 390--398
Junchang Wang and
Manos Athanassoulis CUBIT: Concurrent Updatable Bitmap
Indexing . . . . . . . . . . . . . . . . 399--412
Yunqing Ge and
Jianbin Qin and
Shuyuan Zheng and
Yongrui Zhong and
Bo Tang and
Yu-Xuan Qiu and
Rui Mao and
Ye Yuan and
Makoto Onizuka and
Chuan Xiao Privacy-Enhanced Database Synthesis for
Benchmark Publishing . . . . . . . . . . 413--425
Kijae Hong and
Kyoungmin Kim and
Young-Koo Lee and
Yang-Sae Moon and
Sourav S. Bhowmick and
Wook-Shin Han Themis: a GPU-Accelerated Relational
Query Execution Engine . . . . . . . . . 426--438
Shunit Agmon and
Amir Gilad and
Brit Youngmann and
Shahar Zoarets and
Benny Kimelfeld Finding Convincing Views to Endorse a
Claim . . . . . . . . . . . . . . . . . 439--452
Yumeng Song and
Yu Gu and
Tianyi Li and
Yushuai Li and
Christian S. Jensen and
Ge Yu Quantifying Point Contributions: a
Lightweight Framework for Efficient and
Effective Query-Driven Trajectory
Simplification . . . . . . . . . . . . . 453--465
Liwei Deng and
Tianfu Wang and
Yan Zhao and
Kai Zheng MILLION: a General Multi-Objective
Framework with Controllable Risk for
Portfolio Management . . . . . . . . . . 466--474
Yuxi Liu and
Fangzhu Shen and
Kushagra Ghosh and
Amir Gilad and
Benny Kimelfeld and
Sudeepa Roy The Cost of Representation by Subset
Repairs . . . . . . . . . . . . . . . . 475--487
Mark Zhao and
Emanuel Adamiak and
Christos Kozyrakis cedar: Optimized and Unified Machine
Learning Input Data Pipelines . . . . . 488--502
Monil Mukesh Sanghavi and
Ming-May Hu and
Zhenxiao Luo and
Xiao Li and
Kapil Bajaj Goku: a Schemaless Time Series Database
for Large Scale Monitoring at Pinterest 503--515
Zhangcheng Qiang and
Weiqing Wang and
Kerry Taylor Agent-OM: Leveraging LLM Agents for
Ontology Matching . . . . . . . . . . . 516--529
Xue Li and
Weibin Zeng and
Zhibin Wang and
Diwen Zhu and
Jingbo Xu and
Wenyuan Yu and
Jingren Zhou GraphAr: an Efficient Storage Scheme for
Graph Data in Data Lakes . . . . . . . . 530--543
Hai Lan and
Shixun Huang and
Zhifeng Bao and
Renata Borovica-Gajic Cardinality Estimation for Similarity
Search on High-Dimensional Data Objects:
The Impact of Reference Objects . . . . 544--556
Seonho Lee and
Yeunjun Lee and
Kunsoo Park Efficient Top-$k$ Frequent Subgraph
Mining using Tight Upper and Lower
Bounds . . . . . . . . . . . . . . . . . 557--570
Hao Liu and
Qianwen Yang and
Taoyong Cui and
Wei Wang MSGNN: Masked Schema based Graph Neural
Networks . . . . . . . . . . . . . . . . 571--584
Jawad Tahir and
Ruben Mayer and
Christoph Doblander and
Hans-Arno Jacobsen How Reliable are Streams? End-to-End
Processing-Guarantee Validation and
Performance Benchmarking of Stream
Processing Systems . . . . . . . . . . . 585--598
Yinnian Lin and
Lei Zou and
Xunbin Su Towards Sufficient GPU-Accelerated
Dynamic Graph Management: Survey and
Experiment . . . . . . . . . . . . . . . 599--612
Zhuocheng Shang and
Samriddhi Singla and
Ahmed Eldawy and
Elia Scudiero RDPro: Distributed Processing of Big
Raster Data: [Scalable Data Science] . . 613--622
Qi Zhang and
Yalong Zhang and
Rong-Hua Li and
Guoren Wang Approximate Anchored Densest Subgraph
Search on Large Static and Dynamic
Graphs . . . . . . . . . . . . . . . . . 623--636
Tianjing Zeng and
Junwei Lan and
Jiahong Ma and
Wenqing Wei and
Rong Zhu and
Yingli Zhou and
Pengfei Li and
Bolin Ding and
Defu Lian and
Zhewei Wei and
Jingren Zhou PRICE: a Pretrained Model for
Cross-Database Cardinality Estimation 637--650
Thomas Gilray and
Arash Sahebolamri and
Yihao Sun and
Sowmith Kunapaneni and
Sidharth Kumar and
Kristopher Micinski Datalog with First-Class Facts . . . . . 651--665
Junhao Zhu and
Tao Wang and
Danlei Hu and
Ziquan Fang and
Lu Chen and
Yunjun Gao and
Tianyi Li and
Christian S. Jensen T-Assess: an Efficient Data Quality
Assessment System Tailored for
Trajectory Data . . . . . . . . . . . . 666--674
Junhao Ye and
Jiahui Li and
Lu Chen and
Yuren Mao and
Yunjun Gao and
Tianyi Li LEAP: a Low-Cost Spark SQL Query
Optimizer using Pairwise Comparison . . 675--687
Xinle Cao and
Weiqi Feng and
Jian Liu and
Jinjin Zhou and
Wenjing Fang and
Lei Wang and
Quanqing Xu and
Chuanhui Yang and
Kui Ren Towards Practical Oblivious Map . . . . 688--701
Chaoyi Ruan and
Yingqiang Zhang and
Juncheng Zhang and
Cheng Li and
Xiaosong Ma and
Hao Chen and
Jie Zhou and
Feifei Li and
Xinjun Yang PolyBase: Adapting to Data Affinity
Changes in Geo-Replicated Database via
Row-Level Consensus-Group Affiliation
Re-Assignment . . . . . . . . . . . . . 702--714
Wenfei Fan and
Lihang Fan and
Dandan Lin and
Min Xie Explaining GNN-Based Recommendations in
Logic . . . . . . . . . . . . . . . . . 715--728
Haozhe Yin and
Kai Wang and
Wenjie Zhang and
Ying Zhang and
Ruijia Wu and
Xuemin Lin Efficient Computation of Hyper-Triangles
on Hypergraphs . . . . . . . . . . . . . 729--742
Yuwei Huang and
Guoliang Li Laser: Buffer-Aware Learned Query
Scheduling in Master-Standby Databases 743--755
Yang Liu and
Wenfei Fan and
Shuhao Liu and
Xiaoke Zhu and
Jianxin Li A Single Machine System for Querying Big
Graphs with PRAM . . . . . . . . . . . . 756--769
Jiajia Li and
Yongzhi Chen and
Mengxuan Zhang and
Lei Li A CPU-GPU Hybrid Labelling Algorithm for
Massive Shortest Distance Queries on
Road Networks . . . . . . . . . . . . . 770--783
Fei Ye and
Zikang Liu and
Xi Zhang and
Yinan Jing and
Zhenying He and
Yuxin Che and
Haoran Xiong and
Kai Zhang and
X. Sean Wang SDEcho: Efficient Explanation of
Aggregated Sequence Difference . . . . . 784--797
Qideng Tang and
Chaofan Dai and
Yahui Wu and
Haohao Zhou MLP-Mixer based Masked Autoencoders are
Effective, Explainable and Robust for
Time Series Anomaly Detection . . . . . 798--811
Liwei Deng and
Penghao Chen and
Ximu Zeng and
Tianfu Wang and
Yan Zhao and
Kai Zheng Efficient Data-Aware Distance Comparison
Operations for High-Dimensional
Approximate Nearest Neighbor Search . . 812--821
Shijie Zhang and
Ru Cheng and
Xinpeng Liu and
Jiang Xiao and
Hai Jin and
Bo Li Seer: Accelerating Blockchain
Transaction Execution by Fine-Grained
Branch Prediction . . . . . . . . . . . 822--835
Shangdi Yu and
Jessica Shi and
Jamison Meindl and
David Eisenstat and
Xiaoen Ju and
Sasan Tavakkol and
Laxman Dhulipala and
Jakub \L\kacki and
Vahab Mirrokni and
Julian Shun The ParClusterers Benchmark Suite
(PCBS): a Fine-Grained Analysis of
Scalable Graph Clustering . . . . . . . 836--849
Shuang Liu and
Chenglin Tian and
Jun Sun and
Ruifeng Wang and
Wei Lu and
Yongxin Zhao and
Yinxing Xue and
Junjie Wang and
Xiaoyong Du Semantic Conformance Testing of
Relational DBMS . . . . . . . . . . . . 850--862
Songsong Mo and
Yue Zhao and
Zhifeng Bao and
Quanqing Xu and
Chuanhui Yang and
Gao Cong RankPQO: Learning-to-Rank for Parametric
Query Optimization . . . . . . . . . . . 863--875
Aviv Hadar and
Tova Milo and
Kathy Razmadze Datamap-Driven Tabular Coreset Selection
for Classifier Training . . . . . . . . 876--888
Quinten De Man and
Laxman Dhulipala and
Adam Karczmarz and
Jakub \L\kacki and
Julian Shun and
Zhongqi Wang Towards Scalable and Practical
Batch-Dynamic Connectivity . . . . . . . 889--901
Shengquan Ni and
Yicong Huang and
Zuozhi Wang and
Chen Li IcedTea: Efficient and Responsive
Time-Travel Debugging in Dataflow
Systems . . . . . . . . . . . . . . . . 902--914
Ge Lee and
Shixun Huang and
Zhifeng Bao and
Yanchang Zhao Representative Time Series Discovery for
Data Exploration . . . . . . . . . . . . 915--928
Yifan Song and
Xiaolong Chen and
Wenqing Lin and
Jia Li and
Chen Zhang and
Yan Zhou and
Lei Chen and
Jing Tang Efficient Graph Embedding Generation and
Update for Large-Scale Temporal Graph 929--942
Haneen Mohammed and
Alexander Yao and
Charlie Summers and
Hongbin Zhong and
Gromit Yeuk-Yin Chan and
Subrata Mitra and
Lampros Flokas and
Eugene Wu FaDE: More Than a Million What-Ifs Per
Second . . . . . . . . . . . . . . . . . 943--955
Yuxin Yang and
Hongkuan Zhou and
Rajgopal Kannan and
Viktor Prasanna Towards Ideal Temporal Graph Neural
Networks: Evaluations and Conclusions
after 10,000 GPU Hours . . . . . . . . . 956--969
Zhaoheng Li and
Supawit Chockchowwat and
Ribhav Sahu and
Areet Sheth and
Yongjoo Park Kishu: Time-Traveling for Computational
Notebooks . . . . . . . . . . . . . . . 970--985
Ruiyao Ma and
Yifan Zhu and
Baihua Zheng and
Lu Chen and
Congcong Ge and
Yunjun Gao GTI: Graph-Based Tree Index with
Logarithm Updates for Nearest Neighbor
Search in High-Dimensional Spaces . . . 986--999
Youri Kaminsky and
Eduardo H. M. Pena and
Felix Naumann Incremental Detection of Denial
Constraint Violations . . . . . . . . . 1000--1012
Zhihao Chang and
Linzhu Yu and
Huan Li and
Sai Wu and
Gang Chen and
Dongxiang Zhang Revisiting CNNs for Trajectory
Similarity Learning . . . . . . . . . . 1013--1021
Deming Chu and
Zhizhi Gao and
Fan Zhang and
Wenjie Zhang and
Xuemin Lin and
Zhihong Tian Most Similar Biclique Search at Scale 1022--1034
Chenghong Wang and
Lina Qiu and
Johes Bater and
Yukui Luo SPECIAL: SynoPsis AssistEd Secure
Collaborative AnaLytics . . . . . . . . 1035--1048
Qingyin Lin and
Jiangsu Du and
Rui Li and
Zhiguang Chen and
Wenguang Chen and
Nong Xiao IncrCP: Decomposing and Orchestrating
Incremental Checkpoints for Effective
Recommendation Model Training . . . . . 1049--1062
Naiqing Guan and
Nick Koudas WeShap: Weak Supervision Source
Evaluation with Shapley Values . . . . . 1063--1076
Weiping Yu and
Fan Wang and
Xuwei Zhang and
Siqiang Luo Are Joins over LSM-Trees Ready? Take
RocksDB as an Example . . . . . . . . . 1077--1090
Zheng Wu and
Xuliang Zhu and
Yixiang Fang and
Jianliang Xu and
Xin Huang Interactive Graph Search for Multiple
Targets on DAGs . . . . . . . . . . . . 1091--1103
Xianghong Xu and
Tieying Zhang and
Xiao He and
Haoyang Li and
Rong Kang and
Shuai Wang and
Linhui Xu and
Zhimin Liang and
Shangyu Luo and
Lei Zhang and
Jianjun Chen AdaNDV: Adaptive Number of Distinct
Value Estimation via Learning to Select
and Fuse Estimators . . . . . . . . . . 1104--1117
Anqi Liang and
Pengcheng Zhang and
Bin Yao and
Zhongpu Chen and
Yitong Song and
Guangxu Cheng UNIFY: Unified Index for Range Filtered
Approximate Nearest Neighbors Search . . 1118--1130
Yingli Zhou and
Qingshuo Guo and
Yi Yang and
Yixiang Fang and
Chenhao Ma and
Laks V. S. Lakshmanan In-Depth Analysis of Densest Subgraph
Discovery in a Unified Framework . . . . 1131--1144
Fuheng Zhao and
Shaleen Deep and
Fotis Psallidas and
Avrilia Floratou and
Divyakant Agrawal and
Amr El Abbadi Sphinteract: Resolving Ambiguities in
NL2SQL through User Interaction . . . . 1145--1158
Zhihao Zhuang and
Yingying Zhang and
Kai Zhao and
Chenjuan Guo and
Bin Yang and
Qingsong Wen and
Lunting Fan Noise Matters: Cross Contrastive
Learning for Flink Anomaly Detection . . 1159--1168
Biao Ouyang and
Yingying Zhang and
Hanyin Cheng and
Yang Shu and
Chenjuan Guo and
Bin Yang and
Qingsong Wen and
Lunting Fan and
Christian S. Jensen RCRank: Multimodal Ranking of Root
Causes of Slow Queries in Cloud Database
Systems . . . . . . . . . . . . . . . . 1169--1182
Hassan Abdallah and
Béatrice Markhoff and
Arnaud Soulet Ranking Indicator Discovery from Very
Large Knowledge Graphs . . . . . . . . . 1183--1195
Saurabh Bajaj and
Hojae Son and
Juelin Liu and
Hui Guan and
Marco Serafini Graph Neural Network Training Systems: a
Performance Comparison of Full-Graph and
Mini-Batch . . . . . . . . . . . . . . . 1196--1209
Qingdong Su and
Zhikang Wang and
Zijing Tan and
Shuai Ma Discovering Approximate Inclusion
Dependencies . . . . . . . . . . . . . . 1210--1222
Zhutao Zhuang and
Xinqi Zeng and
Zhiguang Chen DumpKV: Learning Based Lifetime Aware
Garbage Collection for Key Value
Separation in LSM-Tree . . . . . . . . . 1223--1236
Boyu Zhang and
He Huang and
Yu-E Sun and
Guoju Gao RGS-Sketch: an Accurate, Invertible, and
Mergeable Sketch for Online Super
Spreader Detection in High-Speed Data
Streams . . . . . . . . . . . . . . . . 1237--1249
Yichao Yuan and
Advait Iyer and
Lin Ma and
Nishil Talati Vortex: Overcoming Memory Capacity
Limitations in GPU-Accelerated
Large-Scale Data Analytics . . . . . . . 1250--1263
Vasilis Gavrielatos and
Antonios Katsarakis and
Chris Jensen and
Nikos Ntarmos Dandelion: Smaller Clusters, Bigger
Speeds-Distributed Transactions
Redefined . . . . . . . . . . . . . . . 1264--1277
Xiaoying Wang and
Wentao Wu and
Vivek Narasayya and
Surajit Chaudhuri Esc: an Early-Stopping Checker for
Budget-Aware Index Tuning . . . . . . . 1278--1290
Yilei Wang and
Xiangdong Zeng and
Sheng Wang and
Feifei Li Jodes: Efficient Oblivious Join in the
Distributed Setting . . . . . . . . . . 1291--1304
Xunkai Li and
Yinlin Zhu and
Boyang Pang and
Guochen Yan and
Yeyu Yan and
Zening Li and
Zhengyu Wu and
Wentao Zhang and
Rong-Hua Li and
Guoren Wang OpenFGL: a Comprehensive Benchmark for
Federated Graph Learning . . . . . . . . 1305--1320
Samuele Langhi and
Angela Bonifati and
Riccardo Tommasini Evaluating Continuous Queries with
Inconsistency Annotations . . . . . . . 1321--1334
Zhi Wang and
Ming Zhong and
Yuanyuan Zhu and
Tieyun Qian and
Mengchi Liu and
Jeffrey Xu Yu On More Efficiently and Versatilely
Querying Historical $k$-Cores . . . . . 1335--1347
Yi Li and
Gao Cong GeoBloom: Revisiting Lightweight Models
for Geographic Information Retrieval . . 1348--1361
Shabnam Ghasemirad and
Si Liu and
Christoph Sprenger and
Luca Multazzu and
David Basin VerIso: Verifiable Isolation Guarantees
for Database Transactions . . . . . . . 1362--1375
Yinhao Hong and
Hongyao Zhao and
Wei Lu and
Xiaoyong Du and
Yuxing Chen and
Anqun Pan and
Lixiong Zheng A Hybrid Approach to Integrating
Deterministic and Non-Deterministic
Concurrency Control in Database Systems 1376--1389
Sandra Geisler and
Cinzia Cappiello and
Irene Celino and
David Chaves-Fraga and
Anastasia Dimou and
Ana Iglesias-Molina and
Maurizio Lenzerini and
Anisa Rula and
Dylan Van Assche and
Sascha Welten and
Maria-Esther Vidal From Genesis to Maturity: Managing
Knowledge Graph Ecosystems Through Life
Cycles . . . . . . . . . . . . . . . . . 1390--1397
Matthias Lanzinger and
Reinhard Pichler and
Alexander Selzer Avoiding Materialisation for Guarded
Aggregate Queries . . . . . . . . . . . 1398--1411
Chengyang Luo and
Qing Liu and
Yunjun Gao and
Jianliang Xu Synergetic Community Search over Large
Multilayer Graphs . . . . . . . . . . . 1412--1424
Shu Wang and
Yixiang Fang and
Wensheng Luo Searching and Detecting Structurally
Similar Communities in Large
Heterogeneous Information Networks . . . 1425--1438
Gengrui Zhang and
Shiquan Zhang and
Michail Bachras and
Yuqiu Zhang and
Hans-Arno Jacobsen Cabinet: Dynamically Weighted Consensus
Made Fast . . . . . . . . . . . . . . . 1439--1452
Mohsen Dehghankar and
Abolfazl Asudeh Mining the Minoria: Unknown,
Under-Represented, and Under-Performing
Minority Groups . . . . . . . . . . . . 1453--1480
Guanhao Hou and
Jinchao Huang and
Fangyuan Zhang and
Sibo Wang Efficient Concurrent Updates to
Persistent Randomized Binary Search
Trees . . . . . . . . . . . . . . . . . 1481--1494
Sariel Ofek and
Amit Somech Explaining Black-Box Clustering
Pipelines with Cluster-Explorer . . . . 1495--1508
Jianfeng Huang and
Yihao Cao and
Shubing Ren and
Baohua Wu and
Dongjing Miao BACH: Bridging Adjacency List and CSR
Format Using LSM-Trees for HGTAP
Workloads . . . . . . . . . . . . . . . 1509--1521
Hua Fan and
Hao Tan and
Wenchao Zhou and
Feifei Li FLEET: High-Performance Durable
Replicated State Machines Using
Scattered and Coordinated Log Entries 1522--1535
Guoxin Kang and
Zhongxin Ge and
Jingpei Hu and
Xueya Zhang and
Lei Wang and
Jianfeng Zhan BigVectorBench: Heterogeneous Data
Embedding and Compound Queries are
Essential in Evaluating Vector Databases 1536--1550
Jiawei Guan and
Feng Zhang and
Jiesong Liu and
Xiaoyong Du and
Xipeng Shen A Systematic Study on Early Stopping
Metrics in HPO and the Implications of
Uncertainty . . . . . . . . . . . . . . 1551--1564
Haoying Zhang and
Mariem Brahem and
Nicolas Anciaux and
Benjamin Nguyen and
Jose Maria de Fuentes TELESAFE: Detecting Private\slash Work
Boundary Crossings in Energy Consumption
Trails in Telework . . . . . . . . . . . 1565--1578
Yuan Chen and
Ao Li and
Wenhai Li and
Lingfeng Deng FB+-Tree: a Memory-Optimized B+-Tree
with Latch-Free Update . . . . . . . . . 1579--1592
Shenghao Gong and
Haobo Sun and
Ziquan Fang and
Liu Liu and
Lu Chen and
Yunjun Gao VStream: a Distributed Streaming Vector
Search System . . . . . . . . . . . . . 1593--1606
Qiuyang Mang and
Jingbang Chen and
Hangrui Zhou and
Yu Gao and
Yingli Zhou and
Qingyu Shi and
Richard Peng and
Yixiang Fang and
Chenhao Ma Efficient Historical Butterfly Counting
in Large Temporal Bipartite Networks via
Graph Structure-Aware Index . . . . . . 1607--1620
Abiram Mohanaraj and
Matteo Lissandrini and
Katja Hose PlanRGCN: Predicting SPARQL Query
Performance . . . . . . . . . . . . . . 1621--1634
Susan B. Davidson and
Tova Milo and
Kathy Razmadze and
Gal Zeevi Holistic Query Approximation via RL
Modeling . . . . . . . . . . . . . . . . 1635--1648
Lars Gottesbüren and
Laxman Dhulipala and
Rajesh Jayaram and
Jakub \L\kacki Unleashing Graph Partitioning for
Large-Scale Nearest Neighbor Search . . 1649--1662
Jiaming Ma and
Binwu Wang and
Pengkun Wang and
Zhengyang Zhou and
Xu Wang and
Yang Wang BiST: a Lightweight and Efficient
Bi-Directional Model for Spatiotemporal
Prediction . . . . . . . . . . . . . . . 1663--1676
Zhengxin You and
Qiaomu Shen and
Man Lung Yiu and
Bo Tang QOVIS: Understanding and Diagnosing
Query Optimizer via a
Visualization-Assisted Approach . . . . 1677--1690
Vincent Jacob and
Yanlei Diao Unsupervised Anomaly Detection in
Multivariate Time Series across
Heterogeneous Domains . . . . . . . . . 1691--1704
Zhenbo Fu and
Xin Ai and
Qiange Wang and
Yanfeng Zhang and
Shizhan Lu and
Chaoyi Chen and
Chunyu Cao and
Hao Yuan and
Zhewei Wei and
Yu Gu and
Yingyou Wen and
Ge Yu NeutronTask: Scalable and Efficient
Multi-GPU GNN Training with Task
Parallelism . . . . . . . . . . . . . . 1705--1719
Rihan Hai and
Shih-Han Hung and
Tim Coopmans and
Tim Littau and
Floris Geerts Quantum Data Management in the NISQ Era 1720--1729
Yunjia Zheng and
Charlotte Sacré and
Mohanna Shahrad and
Owen Lipchitz and
Yu Ting Gu and
Bettina Kemme G-View: View Management for Graph
Databases . . . . . . . . . . . . . . . 1730--1742
Rong Du and
Qingqing Ye and
Yue Fu and
Haibo Hu Privacy for Free: Leveraging Local
Differential Privacy Perturbed Data from
Multiple Services . . . . . . . . . . . 1743--1755
Haoze Song and
Yongqi Wang and
Xusheng Chen and
Hao Feng and
Yazhi Feng and
Xieyun Fang and
Heming Cui and
Linghe Kong K2: On Optimizing Distributed
Transactions in a Multi-Region Data
Store with TrueTime Clocks . . . . . . . 1756--1769
Tingyang Chen and
Cong Fu and
Kun Wang and
Xiangyu Ke and
Yunjun Gao and
Wenchao Zhou and
Yabo Ni and
Anxiang Zeng Maximum Inner Product is Query-Scaled
Nearest Neighbor . . . . . . . . . . . . 1770--1783
Rongzhao Chen and
Xiangpeng Hu and
Xiangdong Huang and
Chen Wang and
Shaoxu Song and
Jianmin Wang Migration-Free Elastic Storage of Time
Series in Apache IoTDB . . . . . . . . . 1784--1797
Amélie Gheerbrant and
Leonid Libkin and
Liat Peterfreund and
Alexandra Rogova GQL and SQL/PGQ: Theoretical Models and
Expressive Power . . . . . . . . . . . . 1798--1810
Peizhi Wu and
Haoshu Xu and
Ryan Marcus and
Zachary G. Ives A Practical Theory of Generalization in
Selectivity Learning . . . . . . . . . . 1811--1824
Shuo Yang and
Jiadong Xie and
Yingfan Liu and
Jeffrey Xu Yu and
Xiyue Gao and
Qianru Wang and
Yanguo Peng and
Jiangtao Cui Revisiting the Index Construction of
Proximity Graph-Based Approximate
Nearest Neighbor Search . . . . . . . . 1825--1838
Yijun Bei and
Teng Ma and
Dongxiang Zhang and
Sai Wu and
Kian-Lee Tan and
Gang Chen Mining Platoon Patterns from Traffic
Videos . . . . . . . . . . . . . . . . . 1839--1851
Botong Huang and
Lianggui Weng and
Wei Chen and
Kai Zeng and
Yihui Feng and
Bolin Ding and
Jingren Zhou and
Zuozhi Wang and
Chen Li Agamotto: Scheduling of
Deadline-Oriented Incremental Query
Execution under Uncertain Resource Price 1852--1864
Baoqing Cai and
Yu Liu and
Lin Ma and
Pingqi Huang and
Bingcheng Lian and
Ke Zhou and
Jia Yuan and
Jie Yang and
Xiaofan Cai and
Peijun Wu SCompression: Enhancing Database Knob
Tuning Efficiency Through Slice-Based
OLTP Workload Compression . . . . . . . 1865--1878
Xiyue Gao and
Zhuang Liu and
Yiran Shen and
Hui Li and
Yingfan Liu and
Hongjun Xiao and
Yanguo Peng and
Jiangtao Cui Fucci: Database Transaction Fuzzing via
Random Conflict Construction and
Multilevel Constraint Solving . . . . . 1879--1891
Wenjing Wang and
Ziyang Yue and
Bolong Zheng Streaming Time Series Subsequence
Anomaly Detection: a Glance and Focus
Approach . . . . . . . . . . . . . . . . 1892--1904
Leilei Du and
Peng Cheng and
Lei Chen and
Heng Tao Shen and
Xuemin Lin and
Wei Xi Infinite Stream Estimation under
Personalized $w$-Event Privacy . . . . . 1905--1918
Meng Wang and
Gus Waldspurger and
Naufal Ananda and
Yuyang Huang and
Kemas Wiharja and
John Bent and
Swaminathan Sundararaman and
Vijay Chidambaram and
Haryadi S. Gunawi GPEmu: a GPU Emulator for Faster and
Cheaper Prototyping and Evaluation of
Deep Learning System Research . . . . . 1919--1932
Anna Zeng and
Michael Cafarella and
Batya Kenig and
Markos Markakis and
Brit Youngmann and
Babak Salimi Causal DAG Summarization . . . . . . . . 1933--1947
Zhengmao Ye and
Dengchun Li and
Zetao Hu and
Tingfeng Lan and
Jian Sha and
Shicong Zhang and
Lei Duan and
Jie Zuo and
Hui Lu and
Yuanchun Zhou and
Mingjie Tang mLoRA: Fine-Tuning LoRA Adapters via
Highly-Efficient Pipeline Parallelism in
Multiple GPUs . . . . . . . . . . . . . 1948--1961
Abigale Kim and
Marco Slot and
David G. Andersen and
Andrew Pavlo Anarchy in the Database: a Survey and
Evaluation of Database Management System
Extensibility . . . . . . . . . . . . . 1962--1976
Longxu Sun and
Xin Huang and
Jiannan Wang and
Jianliang Xu A Flexible Framework for Query-Oriented
Interactive Community Search . . . . . . 1977--1990
Ziyi Yan and
Mohamed Farouk Drira and
Tianxun Hu and
Tianzheng Wang Tabular: Efficiently Building Efficient
Indexes . . . . . . . . . . . . . . . . 1991--2004
Yuanyuan Zeng and
Yixiang Fang and
Kun Chen and
Yangfan Li and
Chenhao Ma Efficient Maintenance of 2-Hop Labeling
Index on Dynamic Small-World Graphs . . 2005--2017
Kexin Zhu and
Michael Whittaker and
Srdjan Petrovic and
Robert Grandl and
Sanjay Ghemawat Vive la Différence: Practical Diff
Testing of Stateful Applications . . . . 2018--2030
Siyue Wu and
Dingming Wu and
Sinhong Cheuk and
Tsz Nam Chan and
Kezhong Lu GREAT: Generalized Reservoir Sampling
Based Triangle Counting Estimation over
Streaming Graphs . . . . . . . . . . . . 2031--2043
Mengran Li and
Zijing Tan and
Honghui Yang and
Shuai Ma Efficient Discovery of Relaxed
Functional Dependencies . . . . . . . . 2044--2056
Danlei Hu and
Yilin Li and
Lu Chen and
Ziquan Fang and
Yushuai Li and
Yunjun Gao and
Tianyi Li SimRN: Trajectory Similarity Learning in
Road Networks Based on Distributed Deep
Reinforcement Learning . . . . . . . . . 2057--2069
Yuxiang Guo and
Zhonghao Hu and
Yuren Mao and
Baihua Zheng and
Yunjun Gao and
Mingwei Zhou Birdie: Natural Language-Driven Table
Discovery Using Differentiate Search
Index . . . . . . . . . . . . . . . . . 2070--2083
Shu Liu and
Xiangxi Mo and
Moshik Hershcovitch and
Henric Zhang and
Audrey Cheng and
Guy Girmonsky and
Gil Vernik and
Michael Factor and
Tiemo Bang and
Soujanya Ponnapalli and
Natacha Crooks and
Joseph E. Gonzalez and
Danny Harnik and
Ion Stoica SkyStore: Cost-Optimized Object Storage
Across Regions and Clouds . . . . . . . 2084--2096
Tonghui Ren and
Chen Ke and
Yuankai Fan and
Yinan Jing and
Zhenying He and
Kai Zhang and
X. Sean Wang The Power of Constraints in Natural
Language to SQL Translation . . . . . . 2097--2111
Yufan Sheng and
Xin Cao and
Kaiqi Zhao and
Yixiang Fang and
Jianzhong Qi and
Wenjie Zhang and
Christian S. Jensen ACE: a Cardinality Estimator for
Set-Valued Queries . . . . . . . . . . . 2112--2125
Xiaoying Wang and
Jiannan Wang and
Tianzheng Wang and
Yong Zhang Accio: Bolt-on Query Federation . . . . 2126--2135
Xiaohai Dai and
Chaozheng Ding and
Wei Li and
Jiang Xiao and
Bolin Zhang and
Chen Yu and
Albert Y. Zomaya and
Hai Jin Falcon: Advancing Asynchronous BFT
Consensus for Lower Latency and Enhanced
Throughput . . . . . . . . . . . . . . . 2136--2148
Jindong Han and
Hao Wang and
Hui Xiong and
Hao Liu Scalable Pre-Training of Compact Urban
Spatio-Temporal Predictive Models on
Large-Scale Multi-Domain Data . . . . . 2149--2158
Xiaoyuan Liu and
Ni Trieu and
Trinabh Gupta and
Ishtiyaque Ahmad and
Dawn Song HADES: Range-Filtered Private
Aggregation on Public Data . . . . . . . 2159--2171
Zhaoxuan Ji and
Xinlu Wang and
Zhaojing Luo and
Zhongle Xie and
Meihui Zhang Optimized Batch Prompting for
Cost-Effective LLMs . . . . . . . . . . 2172--2184
Hongchao Qin and
Guang Zeng and
Rong-Hua Li and
Longlong Lin and
Ye Yuan and
Guoren Wang Truss Decomposition in Hypergraphs . . . 2185--2197
Jianting Zhang and
Zhongtang Luo and
Raghavendra Ramesh and
Aniket Kate Optimal Sharding for Scalable
Blockchains with Deconstructed SMR . . . 2198--2211
Eugenie Y. Lai and
Yeye He and
Surajit Chaudhuri Auto-Prep: Holistic Prediction of Data
Preparation Steps for Self-Service
Business Intelligence . . . . . . . . . 2212--2225
Valerio Guerrmi and
Thibaut Germain and
Charles Truong and
Laurent Oudre and
Paul Boniol Time Series Motif Discovery: a
Comprehensive Evaluation . . . . . . . . 2226--2239
Daniel Bourgeois and
Zhimin Ding and
Dimitrije Jankov and
Jiehui Li and
Mahmoud Sleem and
Yuxin Tang and
Jiawen Yao and
Xinyu Yao and
Chris Jermaine EinDecomp: Decomposition of
Declaratively-Specified Machine Learning
and Numerical Computations for Parallel
Execution . . . . . . . . . . . . . . . 2240--2253
Ruizhong Wu and
Mengxuan Zhang and
Shuxin Wang and
Frodo Kin Sun Chan and
Yan Nei Law and
Lei Li Continuous Lifelong Conflict-Aware AGV
Routing with Kinematic Constraints . . . 2254--2267
Dawei Liu and
Bolong Zheng and
Ziyang Yue and
Fuhao Ruan and
Xiaofang Zhou and
Christian S. Jensen Wolverine: Highly Efficient Monotonic
Search Path Repair for Graph-Based ANN
Index Updates . . . . . . . . . . . . . 2268--2280
Jiansen Song and
Wensheng Dou and
Yingying Zheng and
Yu Gao and
Ziyu Cui and
Wei Wang and
Jun Wei Detecting Schema-Related Logic Bugs in
Relational DBMSs via Equivalent Database
Construction . . . . . . . . . . . . . . 2281--2294
Weiyang Kong and
Kaiqi Wu and
Sen Zhang and
Yubao Liu GraphSparseNet: a Novel Method for Large
Scale Traffic Flow Prediction . . . . . 2295--2307
Danling Lai and
Jiajie Xu and
Jianfeng Qu and
Pingfu Chao and
Junhua Fang and
Chengfei Liu TMLKD: Few-Shot Trajectory Metric
Learning via Knowledge Distillation . . 2308--2320
Jun Nemoto and
Takashi Kambayashi and
Takashi Hoshino and
Hideyuki Kawashima Oze: Decentralized Graph-Based
Concurrency Control for Long-Running
Update Transactions . . . . . . . . . . 2321--2333
Elena Milkai and
Xiangyao Yu and
Jignesh M. Patel Hermes: Off-the-Shelf Real-Time
Transactional Analytics . . . . . . . . 2334--2347
Zeying Zhu and
Jonathan Chamberlain and
Kenny Wu and
David Starobinski and
Zaoxing Liu Approximation-First Timeseries Query At
Scale . . . . . . . . . . . . . . . . . 2348--2361
Ziheng Wang and
Junyu Wei and
Alex Aiken and
Guangyan Zhang and
Jacob O. Tòrring and
Rain Jiang and
Chenyu Jiang and
Wei Xu LogCIoud: Fast Search of Compressed Logs
on Object Storage . . . . . . . . . . . 2362--2370
Changlun Li and
Chenyu Yang and
Yuyu Luo and
Ju Fan and
Nan Tang Weak-to-Strong Prompts with
Lightweight-to-Powerful LLMs for
High-Accuracy, Low-Cost, and Explainable
Data Transformation . . . . . . . . . . 2371--2384
Zhe Xie and
Zeyan Li and
Xiao He and
Longlong Xu and
Xidao Wen and
Tieying Zhang and
Jianjun Chen and
Rui Shi and
Dan Pei ChatTS: Aligning Time Series with LLMs
via Synthetic Data for Enhanced
Understanding and Reasoning . . . . . . 2385--2398
Graham Cormode and
Daniel Ting Federated Data Distribution Shift
Estimation . . . . . . . . . . . . . . . 2399--2412
Liese Bekkers and
Frank Neven and
Stijn Vansummeren and
Yisu Remy Wang Instance-Optimal Acyclic Join Processing
Without Regret: Engineering the
Yannakakis Algorithm in Column Stores 2413--2426
Myles Thiessen and
Guy Khazma and
Sam Toueg and
Eyal de Lara Asymmetric Linearizable Local Reads . . 2427--2439
Jinyang Liu and
Pu Jiao and
Kai Zhao and
Xin Liang and
Sheng Di and
Franck Cappello QPET: a Versatile and Portable
Quantity-of-Interest-Preservation
Framework for Error-Bounded Lossy
Compression . . . . . . . . . . . . . . 2440--2453
Xinyi Zhu and
Yongqi Zhang and
Lei Chen OpenMEL: Unsupervised Multimodal Entity
Linking Using Noise-Free Expanded
Queries and Global Coherence . . . . . . 2454--2467
Ariane Ziehn and
Jan Szlang and
Steffen Zeuch and
Volker Markl Unraveling the Impact of Window
Semantics: Optimizing Join Order for
Efficient Stream Processing . . . . . . 2468--2481
Luca Zecchini and
Vasilis Efthymiou and
Felix Naumann and
Giovanni Simonini Deduplicated Sampling On-Demand . . . . 2482--2495
Theis E. Jendal and
Matteo Lissandrini and
Peter Dolog and
Katja Hose The Limits of Graph Samplers for
Training Inductive Recommender Systems 2496--2504
Zhiying Liang and
Vahab Jabrayilov and
Abutalib Aghayev and
Aleksey Charapko HoliPaxos: Towards More Predictable
Performance in State Machine Replication 2505--2518
Peizhi Wu and
Rong Kang and
Tieying Zhang and
Jianjun Chen and
Ryan Marcus and
Zachary G. Ives Data-Agnostic Cardinality Learning from
Imperfect Workloads . . . . . . . . . . 2519--2532
Song Wang and
Chen Wang and
Jianchun Wang and
Shengguo Li and
Rui Li and
Zhiyong Peng BLAEQ: a Multigrid Index for Spatial
Query on Geometry Data . . . . . . . . . 2533--2546
Ziyu Cui and
Wensheng Dou and
Yu Gao and
Rui Yang and
Yingying Zheng and
Jiansen Song and
Yuan Feng and
Jun Wei Simple Testing Can Expose Most Critical
Transaction Bugs: Understanding and
Detecting Write-Specific Serializability
Violations in Database Systems . . . . . 2547--2560
Ilie Sarpe and
Aristides Gionis Efficient and Adaptive Estimation of
Local Triadic Coefficients . . . . . . . 2561--2574
Wentao Zhang and
Jingyuan Wang and
Yifan Yang and
Leong Hou U VecCity: a Taxonomy-Guided Library for
Map Entity Representation Learning . . . 2575--2588
Jerin George Mathew and
Donatella Firmani and
Divesh Srivastava Evaluating Methods for Efficient Entity
Count Estimation . . . . . . . . . . . . 2589--2601
Audrey Cheng and
Aaron Kabcenell and
Xiao Shi and
Jolene Huey and
Peter Bailis and
Natacha Crooks and
Ion Stoica Fair Transaction Processing for
Multi-Tenant Databases . . . . . . . . . 2602--2615
Sheng Lin and
Fangcheng Fu and
Haoyang Li and
Hao Ge and
Xuanyu Wang and
Jiawen Niu and
Yaofeng Tu and
Bin Cui LobRA: Multi-Tenant Fine-Tuning over
Heterogeneous Data . . . . . . . . . . . 2616--2625
Amin Kamali and
Verena Kantere and
Calisto Zuzarte and
Vincent Corvinelli Robust Plan Evaluation Based on
Approximate Probabilistic Machine
Learning . . . . . . . . . . . . . . . . 2626--2638
Saeed Fathollahzadeh and
Essam Mansour and
Matthias Boehm CatDB: Data-Catalog-Guided, LLM-Based
Generation of Data-Centric ML Pipelines 2639--2652
Hanwen Liu and
Shashank Giridhara and
Ibrahim Sabek Conformal Prediction for Verifiable
Learned Query Optimization . . . . . . . 2653--2666
Neha Makhija and
Wolfgang Gatterbauer Is Integer Linear Programming All You
Need for Deletion Propagation? A Unified
and Practical Approach for Generalized
Deletion Propagation . . . . . . . . . . 2667--2680
Yurong Liu and
Eduardo H. M. Pena and
Aécio Santos and
Eden Wu and
Juliana Freire Magneto: Combining Small and Large
Language Models for Schema Matching . . 2681--2694
Qiuyu Guo and
Jianye Yang and
Wenjie Zhang and
Hanchen Wang and
Ying Zhang and
Xuemin Lin Efficient and Accurate Subgraph
Counting: a Bottom-up Flow-learning
Based Approach . . . . . . . . . . . . . 2695--2708
Lixiang Chen and
Yuxing Han and
Yu Chen and
Xing Chen and
Chengcheng Yang and
Weining Qian AQETuner: Reliable Query-Level
Configuration Tuning for Analytical
Query Engines . . . . . . . . . . . . . 2709--2721
Jun Liu and
Bingqian Du and
Ziyue Luo and
Sitian Lu and
Qiankun Zhang and
Hai Jin PipeTGL: (Near) Zero Bubble Memory-Based
Temporal Graph Neural Network Training
via Pipeline Optimization . . . . . . . 2722--2734
Yeounoh Chung and
Gaurav T. Kakkar and
Yu Gan and
Brenton Milne and
Fatma Özcan Is Long Context All You Need? Leveraging
LLM's Extended Context for NL2SQL . . . 2735--2747
Laurens Kuiper and
Paul Groß and
Peter Boncz and
Hannes Mühleisen Saving Private Hash Join . . . . . . . . 2748--2760
Weixing Zhou and
Yanfeng Zhang and
Xinji Zhou and
Zhiyou Wang and
Zeshun Peng and
Yang Ren and
Sihao Li and
Huanchen Zhang and
Guoliang Li and
Ge Yu Concurrency Control as a Service . . . . 2761--2774
Xiangfei Qiu and
Zhe Li and
Wanghui Qiu and
Shiyan Hu and
Lekui Zhou and
Xingjian Wu and
Zhengyu Li and
Chenjuan Guo and
Aoying Zhou and
Zhenli Sheng and
Jilin Hu and
Christian S. Jensen and
Bin Yang TAB: Unified Benchmarking of Time Series
Anomaly Detection Methods . . . . . . . 2775--2789
Yuchen Zhong and
Junwei Su and
Chuan Wu and
Minjie Wang Heta: Distributed Training of
Heterogeneous Graph Neural Networks . . 2790--2803
Yannis Foufoulas and
Theoni Palaiologou and
Alkis Simitsis The UDFBENCH Benchmark for
General-Purpose UDF Queries . . . . . . 2804--2817
Ye Sun and
Lei Shi and
Yongxin Tong eXpath: Explaining Knowledge Graph Link
Prediction with Ontological Closed Path
Rules . . . . . . . . . . . . . . . . . 2818--2830
Antonios Katsarakis and
Vasilis Gavrielatos and
Emmanouil Giortamis and
Pramod Bhatotia and
Aleksandar Dragojevic and
Boris Grot and
Vijay Nagarajan and
Panagiota Fatourou The LAW Theorem: Local Reads and
Linearizable Asynchronous Replication 2831--2845
Brecht Vandevoort and
Alan Fekete and
Bas Ketsman and
Frank Neven and
Stijn Vansummeren Using Read Promotion and Mixed Isolation
Levels for Performant Yet Serializable
Execution of Transaction Programs . . . 2846--2858
Zeynep Korkmaz and
M. Tamer Özsu and
Khuzaima Daudjee Locality-Aware Cache Replacement Policy
for Graph Traversals . . . . . . . . . . 2859--2871
Rúben Adão and
Zhongjie Wu and
Changjun Zhou and
Oana Balmau and
João Paulo and
Ricardo Macedo KEIGO: Co-Designing Log-Structured Merge
Key-Value Stores with a Non-Volatile,
Concurrency-Aware Storage Hierarchy . . 2872--2885
Qiyu Liu and
Siyuan Han and
Yanlin Qi and
Jingshu Peng and
Jin Li and
Longlong Lin and
Lei Chen Why Are Learned Indexes So Effective but
Sometimes Ineffective? . . . . . . . . . 2886--2898
Falaah Arif Khan and
Denys Herasymuk and
Nazar Protsiv and
Julia Stoyanovich Still More Shades of Null: an Evaluation
Suite for Responsible Missing Value
Imputation . . . . . . . . . . . . . . . 2899--2913
Tianji Cong and
Fatemeh Nargesian and
Junjie Xing and
H. V. Jagadish OpenForge: Probabilistic Metadata
Integration . . . . . . . . . . . . . . 2914--2927
Akhlaque Ahmad and
Da Yan and
Xiao Chen and
Lyuheng Yuan and
Qin Zhang and
Saugat Adhikari Maximum $k$-Plex Finding: Choices of
Pruning Techniques Matter! . . . . . . . 2928--2940
Xiaoxuan Gou and
Weiguo Zheng and
Yuxiang Wang and
Xiaoliang Xu and
Zhiyuan Yu A Comprehensive Survey and Experimental
Study of Learning-Based Community Search 2941--2954
Weizheng Lu and
Chao Hui and
Yunhai Wang and
Feng Zhang and
Yueguo Chen and
Bao Liu and
Chengjie Li and
Zhaoxin Wu and
Xuye Qin Decentralized Actor Scheduling and
Reference-Based Storage in Xorbits: a
Native Scalable Data Science Engine . . 2955--2963
Tao Kong and
Hui Li and
Yuxuan Zhao and
Liping Li and
Xiyue Gao and
Qilong Wu and
Jiangtao Cui STsCache: an Efficient Semantic Caching
Scheme for Time-Series Data Workloads
Based on Hybrid Storage . . . . . . . . 2964--2977
Ruihong Wang and
Jianguo Wang and
Walid G. Aref Cache Coherence Over Disaggregated
Memory . . . . . . . . . . . . . . . . . 2978--2991
Ruchi Bhoot and
Tuhin Khare and
Manoj Agarwal and
Siddharth Jaiswal and
Yogesh Simmhan Triparts: Scalable Streaming Graph
Partitioning to Enhance Community
Structure . . . . . . . . . . . . . . . 2992--3006
Shipeng Qi and
Bing Tong and
Jiatao Hu and
Heng Lin and
Yue Pang and
Wei Yuan and
Songlin Lyu and
Zhihui Guo and
Ke Huang and
Xujin Ba and
Qiang Yin and
Youren Shen and
Yan Zhou and
Tao Lv and
Jia Li and
Lei Zou and
Yongwei Wu and
Gábor Szárnyas and
Xiaowei Zhu and
Wenguang Chen and
Chuntao Hong The LDBC Financial Benchmark:
Transaction Workload . . . . . . . . . . 3007--3020
Donghyun Sohn and
Kelly Jiang and
Nicolas Hammer and
Jennie Rogers Alchemy: a Query Optimization Framework
for Oblivious SQL . . . . . . . . . . . 3021--3034
Shreya Shankar and
Tristan Chambers and
Tarak Shah and
Aditya G. Parameswaran and
Eugene Wu DocETL: Agentic Query Rewriting and
Evaluation for Complex Document
Processing . . . . . . . . . . . . . . . 3035--3048
Guoyu Hu and
Shaofeng Cai and
Tien Tuan Anh Dinh and
Zhongle Xie and
Cong Yue and
Gang Chen and
Beng Chin Ooi HAKES: Scalable Vector Database for
Embedding Search Service . . . . . . . . 3049--3062
Zhengdong Wang and
Qiang Yin and
Longbin Lai Path-Centric Cardinality Estimation for
Subgraph Matching . . . . . . . . . . . 3063--3076
Hong Lin and
Shixin Wan and
Zhongle Xie and
Ke Chen and
Meihui Zhang and
Lidan Shou and
Gang Chen A Comprehensive Study of Shapley Value
in Data Analytics . . . . . . . . . . . 3077--3092
Longjiao Zhang and
Rui Wang and
Tongya Zheng and
Ziqi Huang and
Wenjie Huang and
Xinyu Wang and
Can Wang and
Mingli Song and
Sai Wu and
Shuibing He Effective and Efficient Distributed
Temporal Graph Learning through Hotspot
Memory Sharing . . . . . . . . . . . . . 3093--3105
Riddho R. Haque and
Anh L. Mai and
Matteo Brucato and
Azza Abouzied and
Peter J. Haas and
Alexandra Meliou Stochastic SketchRefine: Scaling
In-Database Decision-Making under
Uncertainty to Millions of Tuples . . . 3106--3118
Marcel Weisgut and
Daniel Ritter and
Pinar Tözün and
Lawrence Benson and
Tilmann Rabl CXL Memory Performance for In-Memory
Data Processing . . . . . . . . . . . . 3119--3133
Fei Teng and
Haoyang Li and
Lei Chen LLMLog: Advanced Log Template Generation
via LLM-Driven Multi-Round Annotation 3134--3148
Vinh Quang Ngo and
Marina Papatriantafilou Cuckoo Heavy Keeper and the Balancing
Act of Maintaining Heavy Hitters in
Stream Processing . . . . . . . . . . . 3149--3161
Qian Zhang and
Yiwen Xiang and
Jianhao Wei and
Yang Yang and
Yifan Li and
Xueqing Gong and
Wanggen Liu Rebirth-Retire: a Concurrency Control
Protocol Adaptable to Different Levels
of Contention . . . . . . . . . . . . . 3162--3174
Ruikun Li and
Dai Shi and
Ye Xiao and
Junbin Gao UFGTime: Mining Intertwined Dependencies
in Multivariate Time Series via an
Efficient Pure Graph Approach . . . . . 3175--3188
Ruochen Jiang and
Spyros Blanas ArrayMorph: Optimizing Hyperslab Queries
on the Cloud for Machine Learning
Pipelines . . . . . . . . . . . . . . . 3189--3202
Yangxin Fan and
Haolai Che and
Yinghui Wu Inference-Friendly Graph Compression for
Graph Neural Networks . . . . . . . . . 3203--3215
Sacheendra Talluri and
Guido Walter Di Donato and
Luca Danelutti and
Koen R. Vlaswinkel and
Marco Arnaboldi and
Arnaud Delamare and
Marco D. Santambrogio and
Daniele Bonetta GpJSON: High-Performance JSON Data
Processing on GPUs . . . . . . . . . . . 3216--3229
Antonio Ferrara and
David García-Soriano and
Francesco Bonchi Beyond Shortest Paths: Node Fairness in
Route Recommendation . . . . . . . . . . 3230--3242
Yin Li and
Sharad Mehrotra and
Shantanu Sharma and
Komal Kumari Access Control for
Information-Theoretically Secure Data 3243--3255
Zhencan Peng and
Miao Qiao and
Wenchao Zhou and
Feifei Li and
Dong Deng Dynamic Range-Filtering Approximate
Nearest Neighbor Search . . . . . . . . 3256--3268
Yukun Cao and
Zengyi Gao and
Zhiyang Li and
Xike Xie and
S. Kevin Zhou and
Jianliang Xu LEGO-GraphRAG: Modularizing Graph-Based
Retrieval-Augmented Generation for
Design Space Exploration . . . . . . . . 3269--3283
Jinwoo Hwang and
Daeun Kim and
Sangyeop Lee and
Yoonsung Kim and
Guseul Heo and
Hojoon Kim and
Yunseok Jeong and
Tadiwos Meaza and
Eunhyeok Park and
Jeongseob Ahn and
Jongse Park Déj\`a Vu: Efficient Video-Language Query
Engine with Learning-Based Inter-Frame
Computation Reuse . . . . . . . . . . . 3284--3298
Mihail Stoian and
Andreas Zimmerer and
Skander Krid and
Amadou Latyr Ngom and
Jialin Ding and
Tim Kraska and
Andreas Kipf Parachute: Single-Pass Bi-Directional
Information Passing . . . . . . . . . . 3299--3311
Jiani Yang and
Sai Wu and
Yong Wang and
Dongxiang Zhang and
Yifei Liu and
Xiu Tang and
Gang Chen Twisted Twin: a Collaborative and
Competitive Memory Management Approach
in HTAP Systems . . . . . . . . . . . . 3312--3325
Muhammad Farhan and
Henning Koehler and
Qing Wang and
Jiawen Wang and
Moritz Laupichler and
Peter Sanders Customization Meets 2-Hop Labeling:
Efficient Routing in Road Networks . . . 3326--3338
Dayi Fan and
Rubao Lee and
Xiaodong Zhang X-Blossom: Massive Parallelization of
Graph Maximum Matching . . . . . . . . . 3339--3353
Chenyu Yang and
Yuyu Luo and
Chuanxuan Cui and
Ju Fan and
Chengliang Chai and
Nan Tang Data Imputation with Limited Data
Redundancy Using Data Lakes . . . . . . 3354--3367
Chunyue Huang and
Shuang Liu and
Xinyi Zhang and
Wenhao Li and
Wei Lu and
Xiaoyong Du Chimera: Mitigating Ownership Transfers
in Multi-Primary Shared-Storage
Cloud-Native Databases . . . . . . . . . 3368--3381
Minze Xu and
Zhentai Xie and
Zhibin Wang and
Guangzhan Wang and
Longbin Lai and
Yuan Zhang and
Chen Tian and
Sheng Zhong Sectric: Towards Accurate,
Privacy-Preserving and Efficient
Triangle Counting . . . . . . . . . . . 3382--3395
Haoyang Li and
Yuming Xu and
Yiming Li and
Hanmo Liu and
Darian Li and
Chen Jason Zhang and
Lei Chen and
Qing Li When Speed Meets Accuracy: an Efficient
and Effective Graph Model for Temporal
Link Prediction . . . . . . . . . . . . 3396--3405
Yuxin Tang and
Feng Zhang and
Jiawei Guan and
Yuan Tian and
Xiangdong Huang and
Chen Wang and
Jianmin Wang and
Xiaoyong Du Improving Time Series Data Compression
in Apache IoTDB . . . . . . . . . . . . 3406--3420
Jianwei Wang and
Kai Wang and
Ying Zhang and
Wenjie Zhang and
Xiwei Xu and
Xuemin Lin On LLM-Enhanced Mixed-Type Data
Imputation with High-Order Message
Passing . . . . . . . . . . . . . . . . 3421--3434
Vishal Chakraborty and
Youri Kaminsky and
Sharad Mehrotra and
Felix Naumann and
Faisal Nawab and
Primal Pappachan and
Mohammad Sadoghi and
Nalini Venkatasubramanian Meaningful Data Erasure in the Presence
of Dependencies . . . . . . . . . . . . 3435--3448
Chuzhe Tang and
Zhaoguo Wang and
Jinyang Li and
Haibo Chen Sonata: Multi-Database Transactions Made
Fast and Serializable . . . . . . . . . 3449--3462
Matteo Ceccarello and
Francesco Pio Monaco and
Francesco Silvestri MOMENTI: Scalable Motif Mining in
Multidimensional Time Series . . . . . . 3463--3476
Albert Martin and
Eduardo C. de Almeida and
Oscar Romero and
Anna Queralt How and Why False Denial Constraints are
Discovered . . . . . . . . . . . . . . . 3477--3489
Yingli Zhou and
Qingshuo Guo and
Yixiang Fang Efficient $k$-Clique Densest Subgraph
Discovery: Towards Bridging Practice and
Theory . . . . . . . . . . . . . . . . . 3490--3503
Meihao Fan and
Ju Fan and
Nan Tang and
Lei Cao and
Guoliang Li and
Xiaoyong Du AutoPrep: Natural Language
Question-Aware Data Preparation with a
Multi-Agent Framework . . . . . . . . . 3504--3517
Zengyang Gong and
Yuxiang Zeng and
Lei Chen Accelerating Approximate Nearest
Neighbor Search in Hierarchical Graphs:
Efficient Level Navigation with
Shortcuts . . . . . . . . . . . . . . . 3518--3530
Yan Zhang and
Shuwei Liang and
Xiaoye Miao and
Yangyang Wu and
Jianwei Yin Federated Incomplete Tabular Data
Prediction with Missing Complementarity 3531--3544
Yihao Hu and
Jin Wang and
Sajjadur Rahman LakeVisage: Towards Scalable, Flexible
and Interactive Visualization
Recommendation for Data Discovery over
Data Lakes . . . . . . . . . . . . . . . 3545--3558
Xiaokai Zhou and
Xiao Yan and
Fangcheng Fu and
Ziwen Fu and
Tieyun Qian and
Yuanyuan Zhu and
Qinbo Zhang and
Bin Cui and
Jiawei Jiang PS-MI: Accurate, Efficient, and Private
Data Valuation in Vertical Federated
Learning . . . . . . . . . . . . . . . . 3559--3572
Jiasheng Zhang and
Deqiang Ouyang and
Shuang Liang and
Jie Shao Towards Pattern-Aware Data Augmentation
for Temporal Knowledge Graph Completion 3573--3586
Chiyu Hao and
Jixian Su and
Shixuan Sun and
Hao Zhang and
Sen Gao and
Jianwen Zhao and
Chenyi Zhang and
Jieru Zhao and
Chen Chen and
Minyi Guo RapidStore: an Efficient Dynamic Graph
Storage System for Concurrent Queries 3587--3600
Xiaoyu Fan and
Kun Chen and
Jiping Yu and
Xiaowei Zhu and
Yunyi Chen and
Huanchen Zhang and
Wei Xu GORAM: Graph-Oriented ORAM for Efficient
Ego-Centric Queries on Federated Graphs 3601--3614
Weijie Sun and
Zihuan Xu and
Wangze Ni and
Lei Chen and
Peng Cheng and
Chen Jason Zhang Authenticated Aggregate Queries with
Boolean Range Predicates on Blockchains 3615--3627
Zemin Chao and
Qiaoyi Zheng and
Zhixin Qi and
Hongzhi Wang FSMDTW: a Fast Index-Free Subsequence
Matching Algorithm for Dynamic Time
Warping . . . . . . . . . . . . . . . . 3628--3640
Jianheng Tang and
Xi Zhao and
Lemin Kong and
Xiaofang Zhou and
Jia Li Fused Gromov--Wasserstein Alignment for
Graph Edit Distance Computation and
Beyond . . . . . . . . . . . . . . . . . 3641--3654
Tianshu Zhang and
Kun Qian and
Siddhartha Sahai and
Yuan Tian and
Shaddy Garg and
Huan Sun and
Yunyao Li Evoschema: Towards Text-to-SQL
Robustness against Schema Evolution . . 3655--3668
Shuai Han and
Yushi Tao and
Jingwen Tan and
Huanran Wang and
Wu Yang and
Yanmei Wang Effective and Efficient Community Search
for Complex Network Semantics Capture:
From Coarse-Grain to Fine-Grain . . . . 3669--3681
Rongrong Zhang and
Zhiwei Ye and
Jun-Peng Zhu and
Peng Cai and
Xuan Zhou and
Dunbo Cai and
Ling Qian HAWK: a Workload-Driven Hierarchical
Deadlock Detection Approach in
Distributed Database System . . . . . . 3682--3694
Chengliang Chai and
Jiajun Li and
Yuhao Deng and
Yuanhao Zhong and
Ye Yuan and
Guoren Wang and
Lei Cao Doctopus: Budget-Aware Structural Table
Extraction from Unstructured Documents 3695--3707
Qi Wen and
Yutong Ye and
Xiang Lian and
Mingsong Chen S$3$AND: Efficient Subgraph Similarity
Search under Aggregated Neighbor
Difference Semantics . . . . . . . . . . 3708--3720
Yanping Zheng and
Zhewei Wei and
Frank de Hoog and
Xu Chen and
Hongteng Xu and
Yuhang Ye and
Jiadeng Huang Lighter-X: an Efficient and
Plug-and-Play Strategy for Graph-Based
Recommendation through Decoupled
Propagation . . . . . . . . . . . . . . 3721--3729
Qiyu Liu and
Yanlin Qi and
Siyuan Han and
Jingshu Peng and
Jin Li and
Lei Chen Not Small Enough? SegPQ: a Learned
Approach to Compress Product
Quantization Codebooks . . . . . . . . . 3730--3743
Nazanin Rashedi and
Guido Moerkotte The Accuracy of Cardinality Estimators:
Unraveling the Evaluation Result
Conundrum . . . . . . . . . . . . . . . 3744--3756
Benzhao Tang and
Shiyu Yang and
Zhitao Shen and
Wenjie Zhang and
Xuemin Lin and
Zhihong Tian LogLite: Lightweight Plug-and-Play
Streaming Log Compression . . . . . . . 3757--3770
Saimon Amanuel Tsegai and
Xinyu Yang and
Haoyuan Liu and
Peng Gao Enabling Efficient Attack Investigation
via Human-in-the-Loop Security Analysis 3771--3783
Wenzhi Fu and
Yang Cao Shifting Transaction Isolation on
Graphs: From Systems to Data . . . . . . 3784--3796
Wenqi Jiang and
Hang Hu and
Torsten Hoefler and
Gustavo Alonso Fast Graph Vector Search via Hardware
Acceleration and Delayed-Synchronization
Traversal . . . . . . . . . . . . . . . 3797--3811
Hengyu Ye and
Jiadong Chen and
Fuxin Jiang and
Xiao He and
Tieying Zhang and
Jianjun Chen and
Xiaofeng Gao Fremer: Lightweight and Effective
Frequency Transformer for Workload
Forecasting in Cloud Services . . . . . 3812--3825
Arash Dargahi Nobari and
Davood Rafiei TabulaX: Leveraging Large Language
Models for Multi-Class Table
Transformations . . . . . . . . . . . . 3826--3839
Fan Cui and
Eric Lo and
Srijan Srivastava and
Ziliang Lai Bonspiel: Low Tail Latency Transactions
in Geo-Distributed Databases . . . . . . 3840--3853
Qiange Wang and
Yongze Yan and
Hongshi Tan and
Cheng Chen and
Cheng Zhao and
Jiaming Tian and
Jiaxin Jiang and
Xiaoliang Cong and
Yanfeng Zhang and
Ge Yu and
Weng-Fai Wong and
Bingsheng He Efficient Graph Data Access for
Out-of-Memory GPU Streaming Graph
Processing . . . . . . . . . . . . . . . 3854--3867
Daniel Schmitt and
Thomas Hütter and
Nikolaus Augsten Extensible and Robust Evaluation of
Similarity Queries . . . . . . . . . . . 3868--3882
Yan Zhou and
Chunwei Liu and
Bhuvan Urgaonkar and
Zhengle Wang and
Magnus Mueller and
Chao Zhang and
Songyue Zhang and
Pascal Pfeil and
Dominik Horn and
Zhengchun Liu and
Davide Pagano and
Tim Kraska and
Samuel Madden and
Ju Fan PBench: Workload Synthesizer with Real
Statistics for Cloud Analytics
Benchmarking . . . . . . . . . . . . . . 3883--3895
Yujie Lu and
Zhijie Zhang and
Weiguo Zheng and
Lei Zou Accelerating Subgraph Matching through
Fine-Grained and Powerful Equivalences 3896--3909
Luca Gretscher and
Jens Dittrich How to Optimize SQL Queries? A
Comparison Between Split, Holistic, and
Hybrid Approaches . . . . . . . . . . . 3910--3922
Navid Eslami and
Ioana O. Bercea and
Niv Dayan Diva: Dynamic Range Filter for
Var-Length Keys and Queries . . . . . . 3923--3936
Luca Becchetti and
Andrea Clementi and
Luciano Gualá and
Luca Pep\`e Sciarria and
Alessandro Straziota and
Matteo Stromieri Approximate 2-hop Neighborhoods on
Incremental Graphs: an Efficient Lazy
Approach . . . . . . . . . . . . . . . . 3937--3950
Vasilis Mageirakos and
Bowen Wu and
Gustavo Alonso Cracking Vector Search Indexes . . . . . 3951--3964
Alireza Heidari and
Amirhossein Ahmadi and
Wei Zhang DobLIX: a Dual-Objective Learned Index
for Log-Structured Merge Trees . . . . . 3965--3978
Xuhang Zhu and
Xiu Tang and
Sai Wu and
Jichen Li and
Haobo Wang and
Chang Yao and
Quanqing Xu and
Gang Chen CoLA: Model Collaboration for Log-Based
Anomaly Detection . . . . . . . . . . . 3979--3987
Michael Jungmair and
Jana Giceva Towards Designing Future-Proof Data
Processing Systems . . . . . . . . . . . 3988--3995
Omer Abramovich and
Daniel Deutch and
Nave Frost and
Ahmet Kara and
Dan Olteanu Advancing Fact Attribution for Query
Answering: Aggregate Queries and Novel
Algorithms . . . . . . . . . . . . . . . 3996--4008
Amedeo Pachera and
Mattia Palmiotto and
Angela Bonifati and
Andrea Mauri What If: Causal Analysis with Graph
Databases . . . . . . . . . . . . . . . 4009--4016
Mateusz Gienieczko and
Maximilian Kuschewski and
Thomas Neumann and
Viktor Leis and
Jana Giceva AnyBlox: a Framework for Self-Decoding
Datasets . . . . . . . . . . . . . . . . 4017--4031
Yushuai Ji and
Shengkun Zhu and
Shixun Huang and
Zepeng Liu and
Sheng Wang and
Zhiyong Peng Federated and Balanced Clustering for
High-Dimensional Data . . . . . . . . . 4032--4044
Mohamed Sabri Hafidi and
Ozan Kahramano\ugullarì and
Anton Dignös and
Johann Gamper Relational Data Models for Genetic VCF
data . . . . . . . . . . . . . . . . . . 4045--4053
Apostolos Giannoulidis and
Anastasios Gounaris and
John Paparrizos BURST: Rendering Clustering Techniques
Suitable for Evolving Streams . . . . . 4054--4063
Michail Bachras and
Hans-Arno Jacobsen Environmental Footprints of Query
Processing: a Vision for Sustainable
Database Architectures . . . . . . . . . 4064--4072
Fengxin Li and
Yi Li and
Yue Liu and
Chao Zhou and
Yuan Wang and
Xiaoxiang Deng and
Wei Xue and
Dapeng Liu and
Lei Xiao and
Haijie Gu and
Jie Jiang and
Hongyan Liu and
Biao Qin and
Jun He LEADRE: Multi-Faceted Knowledge Enhanced
LLM Empowered Display Advertisement
Recommender System . . . . . . . . . . . 4763--4776
Georgios Theodorakis and
Hugo Firth and
James Clarkson and
Natacha Crooks and
Jim Webber TuskFlow: an Efficient Graph Database
for Long-Running Transactions . . . . . 4777--4790
Panagiotis Antonopoulos and
Mansi Chauhan and
Shailender Dabas and
Rajat Jain and
Darshan Kattera and
Wonseok Kim and
Hanuma Kodavalla and
Nikolas Ogg and
Prashanth Purnananda and
Rahul Ranjan and
Alex Swanson and
Divyesh Tikmani MD-MVCC: Multi-Version Concurrency
Control for Schema Changes in Azure SQL
Database . . . . . . . . . . . . . . . . 4791--4803
Michael Carey and
Wail Alkowaileet and
Nick DiGeronimo and
Peeyush Gupta and
Sachin Smotra and
Till Westmann Towards Principled, Practical Document
Database Design . . . . . . . . . . . . 4804--4816
Manos Karpathiotakis and
Vlassios Rizopoulos and
Basri Kahveci and
Tiziano Carotti and
Artem Gelum and
Hazem Nada and
Yuri Dolgov Scribe: How Meta Transports Terabytes
per Second in Real Time . . . . . . . . 4817--4830
Daniel Ritter and
Mihnea Andrei and
Sukhyeun Cho and
Maik Görgens and
Taehyung Lee and
Norman May and
Amit Pathak and
Paul R. Willems The HANA Native Query Engine for
Lakehouse Systems . . . . . . . . . . . 4831--4845
Yuan Mei and
Rui Xia and
Zhaoqian Lan and
Kaitian Hu and
Lei Huang and
Paris Carbone and
Yanfei Lei and
Vasiliki Kalavri and
Han Yin and
Feng Wang Disaggregated State Management in Apache
Flink\reg 2.0 . . . . . . . . . . . . . 4846--4859
Jie Jiang and
Haining Xie and
Siqi Shen and
Yu Shen and
Zihan Zhang and
Meng Lei and
Yifeng Zheng and
Yang Li and
Chunyou Li and
Danqing Huang and
Yinjun Wu and
Wentao Zhang and
Bin Cui and
Peng Chen SiriusBI: a Comprehensive LLM-Powered
Solution for Data Analytics in Business
Intelligence . . . . . . . . . . . . . . 4860--4873
Edward Y. Chang and
Longling Geng SagaLLM: Context Management, Validation,
and Transaction Guarantees for
Multi-Agent LLM Planning . . . . . . . . 4874--4886
Hongtao Yang and
Zhichen Xu and
Sergey Yudin and
Andrew Davidson Unlocking the Power of CI/CD for Data
Pipelines in Distributed Data Warehouses 4887--4895
Jianjun Chen and
Li Zhang and
Yu Xie and
Wei Ding and
Lixun Cao and
Ye Liu and
Yonghua Ding and
Fangshi Li and
Ke Wu and
Haibo Xiu and
Kui Wei and
Le Cai and
Rui Chang and
Yuxiang Chen and
Yuanjin Lin and
Shangyu Luo and
Jianfeng Qian and
Xu Wang and
Zikang Wang and
Jian Zhang and
Mingyi Zhang and
Shicai Zeng and
Jason Sun and
Lei Zhang and
Rui Shi and
Pengwei Zhao veDB-HTAP: a Highly Integrated,
Efficient and Adaptive HTAP System . . . 4896--4909
Konstantinos Kanellis and
Badrish Chandramouli and
Ted Hart and
Shivaram Venkataraman From FASTER to F2: Evolving Concurrent
Key--Value Store Designs for Large
Skewed Workloads . . . . . . . . . . . . 4910--4923
Sunil Chakkappen and
Shreya Kunjibettu and
Daniel McGreer and
Masoomeh Javidi Kishi and
Hong Su and
Mohamed Ziauddin and
Mohamed Zait and
Zhan Li and
Yuying Zhang Automatic Indexing in Oracle . . . . . . 4924--4937
Murtadha Al Hubail and
Ali Alsuliman and
Wail Alkowaileet and
Michael Blow and
Michael Carey and
Savyasach Enukonda and
Peeyush Gupta and
Santosh Hegde and
Kamini Jagtiani and
Abhishek Jindal and
Mohammad Nawazish Khan and
Mehnaz Mahin and
Ian Maxon and
M. Muralikrishna and
Keshav Murthy and
Daniel Nagy and
Preetham Poluparthi and
Ankit Prabhu and
Ritik Raj and
Vijay Sarathy and
Shahrzad Shirazi and
Utsav Singh and
Hussain Towaileb and
Ayush Tripathi and
Janhavi Tripurwar and
Bo-Chun Wang and
Till Westmann Cloudy with a Chance of JSON . . . . . . 4938--4950
Ji Sun and
Guoliang Li and
James Pan and
Jiang Wang and
Yongqing Xie and
Ruicheng Liu and
Wen Nie GaussDB-Vector: a Large-Scale Persistent
Real-Time Vector Database for LLM
Applications . . . . . . . . . . . . . . 4951--4963
Jun Song and
Jingyi Ding and
Irshad Kandy and
Yanghao Lin and
Zhongjia Wei and
Zilong Zhou and
Zhiwei Peng and
Jixi Shan and
Hongyue Mao and
Xiuqi Huang and
Xun Song and
Cheng Chen and
Yanjia Li and
Tianhao Yang and
Wei Jia and
Xiaohong Dong and
Kang Lei and
Rui Shi and
Pengwei Zhao and
Wei Chen Magnus: a Holistic Approach to Data
Management for Large-Scale Machine
Learning Workloads . . . . . . . . . . . 4964--4977
Xin Gao and
Sibasish Acharya and
Sihui Han and
Yongxiong Ren and
Yanli Zhao and
Liang Luo and
Chucheng Wang and
Pradeep Fernando and
Saurabh Mishra and
Siqi Yan and
Yicong Du and
Elzbieta Krepska and
Intaik Park and
Min Ni and
Qunshu Zhang and
Shen Li DECK: Experiences on Delta Checkpointing
for Industrial Recommendation Systems 4978--4990
Zhe Jiang and
Zhaoguo Wang and
Haoning Lan and
Chuzhe Tang and
Haoran Ding and
Lefeng Wang and
Songyun Zou and
Zhuoran Wei and
Yongcun Liu and
Xiang Yu and
Yang Ren and
Guoliang Li and
Haibo Chen GRewriter: Practical Query Rewriting
with Automatic Rule Set Expansion in
GaussDB . . . . . . . . . . . . . . . . 4991--5003
Jun-Peng Zhu and
Lingfeng Zhang and
Peng Cai and
Xuan Zhou and
Peisen Zhao and
Xue Wang and
Linpeng Tang FDBKeeper: Enabling Scalable
Coordination Services for Metadata
Management Using Distributed Key--Value
Databases . . . . . . . . . . . . . . . 5004--5016
Xiaoyao Zhong and
Haotian Li and
Jiabao Jin and
Mingyu Yang and
Deming Chu and
Xiangyu Wang and
Zhitao Shen and
Wei Jia and
George Gu and
Yi Xie and
Xuemin Lin and
Heng Tao Shen and
Jingkuan Song and
Peng Cheng VSAG: an Optimized Search Framework for
Graph-Based Approximate Nearest Neighbor
Search . . . . . . . . . . . . . . . . . 5017--5030
Zhaoyan Sun and
Xuanhe Zhou and
Guoliang Li and
Xiang Yu and
Jianhua Feng and
Yong Zhang 12R-Bot: an LLM-Based Query Rewrite
System . . . . . . . . . . . . . . . . . 5031--5044
William Schultz and
Murat Demirbas Design and Modular Verification of
Distributed Transactions in MongoDB . . 5045--5058
Xinjun Yang and
Feifei Li and
Yingqiang Zhang and
Hao Chen and
Qingda Hu and
Panfeng Zhou and
Qiang Zhang and
Shuai Li and
Zongzhi Chen and
Zheyu Miao and
Rongbiao Xie and
Chuan Sun and
Zetao Wei and
Jing Fang and
Xingxuan Zhou and
Xiaofei Wu From Scale-Up to Scale-Out: PolarDB's
Journey to Achieving 2 Billion tpmC . . 5059--5072
Mingyu Liu and
Junbin Kang and
Kai Wang and
Lu Zhang and
Haibo Chen and
Xiuchang Li and
Tianhong Ding ScaleCache: Scalable and
Production-Grade Buffer Management for
Disk-Based Database Systems . . . . . . 5073--5085
Jun-Peng Zhu and
Boyan Niu and
Peng Cai and
Zheming Ni and
Jianwei Wan and
Kai Xu and
Jiajun Huang and
Shengbo Ma and
Bing Wang and
Xuan Zhou and
Guanglei Bao and
Donghui Zhang and
Liu Tang and
Qi Liu Towards Automated Cross-Domain
Exploratory Data Analysis through Large
Language Models . . . . . . . . . . . . 5086--5099
Bing Tong and
Yan Zhou and
Chen Zhang and
Jianheng Tang and
Jia Li and
Lei Chen GalaxyWeaver: Autonomous Table-to-Graph
Conversion and Schema Optimization with
Large Language Models . . . . . . . . . 5100--5112
Tim Gubner and
Rune Humborstad and
Manyi Lu Freely Moving between the OLTP and OLAP
Worlds: Hermes --- a High-Performance
OLAP Accelerator for MySQL . . . . . . . 5113--5125
Jan Vincent Szlang and
Sebastian Bress and
Sebastian Cattes and
Jonathan Dees and
Florian Funke and
Max Heimel and
Michel Oleynik and
Ismail Oukid and
Tobias Maltenberger Workload Insights from the Snowflake
Data Cloud: What Do Production Analytic
Queries Really Look Like? . . . . . . . 5126--5138
Fangyuan Zhang and
Caihua Yin and
Hua Fan and
Fenghua Fang and
Yineng Chen and
Xuqi Wang and
Mengqi Wu and
Bing Chen and
Tianbo Jin and
Sibo Wang and
Wenchao Zhou and
Feifei Li AnalyticDB-PG: a Cloud-Native
High-Performance Data Warehouse in
Alibaba Cloud . . . . . . . . . . . . . 5139--5152
Fangyuan Zhang and
Mengqi Wu and
Chunlei Xu and
Yunong Bao and
Jiyu Qiao and
Yingli Zhou and
Hua Fan and
Caihua Yin and
Wenchao Zhou and
Feifei Li Streaming View: an Efficient Data
Processing Engine for Modern Real-Time
Data Warehouse of Alibaba Cloud . . . . 5153--5165
Nitish Upreti and
Harsha Vardhan Simhadri and
Hari Sudan Sundar and
Krishnan Sundaram and
Samer Boshra and
Balachandar Perumalswamy and
Shivam Atri and
Martin Chisholm and
Revti Raman Singh and
Greg Yang and
Tamara Hass and
Nitesh Dudhey and
Subramanyam Pattipaka and
Mark Hildebrand and
Magdalen Manohar and
Jack Moffitt and
Haiyang Xu and
Naren Datha and
Suryansh Gupta and
Ravishankar Krishnaswamy and
Prashant Gupta and
Abhishek Sahu and
Hemeswari Varada and
Sudhanshu Barthwal and
Ritika Mor and
James Codella and
Shaun Cooper and
Kevin Pilch and
Simon Moreno and
Aayush Kataria and
Santosh Kulkarni and
Neil Deshpande and
Amar Sagare and
Dinesh Billa and
Zishan Fu and
Vipul Vishal Cost-Effective, Low Latency Vector
Search with Azure Cosmos DB . . . . . . 5166--5183
Matteo Merli and
Sijie Guo and
Penghui Li and
Hang Chen and
Neng Lu Ursa: a Lakehouse-Native Data Streaming
Engine for Kafka . . . . . . . . . . . . 5184--5196
Krishna Puttaswamy and
Abhijit Chakankar and
Tao Tao and
Zaheera Valani and
Ramesh Chandra and
William Chau and
Mengxi Chen and
Akram Chetibi and
Tianyi Huang and
Jonathan Keller and
Celia Kung and
Andy Liu and
Charlene Lyu and
Samarth Shetty and
Xiaotong Sun and
Steve Weis and
Lin Zhou and
Ryan Zhu and
Reynold Xin and
Matei Zaharia Delta Sharing: an Open Protocol for
Cross-Platform Data Sharing . . . . . . 5197--5209
Sam Lightstone and
Ping Wang SQL:Trek Automated Index Design at
Airbnb . . . . . . . . . . . . . . . . . 5210--5222
Marc Baeuerle and
Thomas Bodner and
Martin Boissier and
Tilmann Rabl and
Ricardo Salazar Díaz and
Florian Schmeller and
Nils Strassenburg and
Ilin Tolovski and
Marcel Weisgut and
Wang Yue TCO 2: Analyzing the Carbon Footprint of
Database Server Replacements . . . . . . 5223--5226
Pavel Koupil and
Jáchym Bártík and
Stefan Klessinger and
André Conrad and
Stefanie Scherzinger FDepHunter: Harnessing Negative Examples
to Expose Fakes and Reveal Ghosts . . . 5227--5230
Rong Kang and
Shuai Wang and
Tieying Zhang and
Xianghong Xu and
Linhui Xu and
Zhimin Liang and
Lei Zhang and
Rui Shi and
Jianjun Chen VIDEX: a Disaggregated and Extensible
Virtual Index for the Cloud and AI Era 5231--5234
Roi Yona and
Jonathan Breitman and
Benny Kimelfeld DVote: Constraining Committee Voting
with Database Dependencies . . . . . . . 5235--5238
Yangxin Fan and
Haolai Che and
Mingjian Lu and
Yinghui Wu Graph Compression for Interpretable
Graph Neural Network Inference At Scale 5239--5242
Bingnan Chen and
Binyang Dai and
Qichen Wang and
Ke Yi Query Running Too Slow? Rewrite it with
Quorion! . . . . . . . . . . . . . . . . 5243--5246
Sòren Kejser Jensen and
Christian Schmidt Godiksen and
Christian Thomsen and
Torben Bach Pedersen Demonstration of ModelarDB: Model-Based
Management of High-Frequency Time Series
Across Edge, Cloud, and Client . . . . . 5247--5250
Louisa Lambrecht and
Tim Findling and
Samuel Heid and
Marcel Knüdeler and
Torsten Grust Democratize MATCH_RECOGNIZE! . . . . . . 5251--5254
Roman Heinrich and
Oleksandr Havrylov and
Manisha Luthra and
Johannes Wehrstein and
Carsten Binnig Opening the Black-Box: Explaining
Learned Cost Models for Databases . . . 5255--5258
Venkatesh Emani and
Wenjing Wang and
Zi Ye and
Jia He and
Neel Ball and
Kumaraswamy Boora and
Carlo Curino and
Avrilia Floratou and
Manan Goenka and
Paridhi Gupta and
Vivek Gupta and
Katherine Lin and
Nick Litombe and
Jared Meade and
Suryakant Mutnal and
Mark Pryce-Maher and
Raghu Ramakrishnan and
Sudhir Raparla and
Dhruv Relwani and
Shyam Sai and
Vaibhave Sekar and
Roneet Shaw and
Harmeet Singh and
Prasanna Sridharan and
Mark Taylor and
Sunidhi Tiwari and
Yiwen Zhu Horizon: Robust Checks for SQL Migration
Using LLMs . . . . . . . . . . . . . . . 5259--5262
Wenhao Liu and
Xiu Tang and
Sai Wu and
Chang Yao and
Gongsheng Yuan and
Gang Chen A Demonstration of QueryArtisan:
Real-Time Data Lake Analysis via
Dynamically Generated Data Manipulation
Code . . . . . . . . . . . . . . . . . . 5263--5266
Dvir Cohen and
Liad Domb and
Avigdor Gal and
Lior Ganon and
Eliezer Gavriel and
Omri Lazover and
Coral Scharf and
Bar Shterenberg RecForUS: a Recommender System for
Uncertain Scores . . . . . . . . . . . . 5267--5270
Frederik Marinus Trudslev and
Matteo Lissandrini and
Juan Manuel Rodriguez and
Martin Bògsted and
Daniele Dell'Aglio PrivEval: a Tool for Interactive
Evaluation of Privacy Metrics in
Synthetic Data Generation . . . . . . . 5271--5274
Kyriakos Psarakis and
Oto Mraz and
George Christodoulou and
George Siachamis and
Marios Fragkoulis and
Asterios Katsifodimos Styx in Action: Transactional Cloud
Applications Made Easy . . . . . . . . . 5275--5278
Mathilde Marcy and
Jean-Marc Petit and
Vasile-Marian Scuturici and
Jocelyn Bonjour and
Camille Fertel and
Gerald Cavalier Can Surrogate Keys Negatively Impact
Data Quality? . . . . . . . . . . . . . 5279--5282
Benjamin Hättasch and
Leon Krüger and
Carsten Binnig JUSTINE (JUST-INsert Engine):
Demonstrating Self-Organizing Data
Schemas . . . . . . . . . . . . . . . . 5283--5286
Jiayi Wang and
Yuan Li and
Jianming Wu and
Shihui Xu and
Guoliang Li Unify: a System For Unstructured Data
Analytics . . . . . . . . . . . . . . . 5287--5290
Alexander Beischl and
Thomas Neumann UmbraPerf --- Profiling Results Tailored
for DBMS Developers . . . . . . . . . . 5291--5294
Abiram Mohanaraj and
Matteo Lissandrini and
Katja Hose Smart SPARQL Advisor: Guiding Users in
Query Formulation with Performance
Prediction . . . . . . . . . . . . . . . 5295--5298
Jiatang Zhou and
Kaisong Huang and
Zhuoyue Zhao and
Dong Xie and
Tianzheng Wang Analytics Are Heavy. The DBMS Is Busy.
When Will My Mission-Critical
Transaction Start Running? . . . . . . . 5299--5302
Enzo Veltri and
Donatello Santoro and
Jean-Flavien Bussotti and
Paolo Papotti Accelerating Tabular Inference: Training
Data Generation with TENET . . . . . . . 5303--5306
Xukang Zhang and
Huanchen Zhang and
Xiaofeng Meng Accordion: Balancing Performance and
Cost in Cloud--Native Data Analysis with
Intra-Query Runtime Elasticity . . . . . 5307--5310
Long Gu and
Shaza Zeitouni and
Carsten Binnig and
Zsolt István How SMPC Query Execution Can be Sped Up
through Efficient and Flexible
Intermediate Result Size Trimming . . . 5311--5314
Haralampos Gavriilidis and
Joel Ziegler and
Midhun Kaippillil Venugopalan and
Benedikt Didrich and
Matthias Boehm and
Volker Markl Enter the Warp: Fast and Adaptive Data
Transfer with XDBC . . . . . . . . . . . 5315--5318
Luca Zecchini and
Ziawasch Abedjan and
Vasilis Efthymiou and
Giovanni Simonini RadlER: Deduplicated Sampling On-Demand 5319--5322
Amey Shinde and
Viraj Sabhaya and
Kevin Farokhrouz and
Fariba Irany and
Ali Khan and
Sanjukta Bhowmick and
Abhishek Santra and
Sharma Chakravarthy MLN-geeWhiz: a Dashboard for Supporting
Complete Life-Cycle of Complex Data
Analysis Using Multilayer Networks . . . 5323--5326
Haibo Xiu and
Yang Li and
Qianyu Yang and
Weihang Guo and
Yuxi Liu and
Pankaj K. Agarwal and
Sudeepa Roy and
Jun Yang Hint-QPT: Hints for Robust Query
Performance Tuning . . . . . . . . . . . 5327--5330
Shunit Agmon and
David Avigdor and
Brit Youngmann and
Amir Gilad and
Benny Kimelfeld ClaimIt: Finding Convincing Views to
Endorse a Claim . . . . . . . . . . . . 5331--5334
Filip Je\vzek and
Pavel Koupil and
Michal Kopecký and
Jáchym Bártík and
Irena Holubová DortDB: Bridging Query Languages for
Multi-Model Data Ponds . . . . . . . . . 5335--5338
Zekai Qian and
Xiaoou Ding and
Chen Wang and
Hongzhi Wang DemandClean: a Multi-Objective Learning
Framework for Balancing Model Tolerance
to Data Authenticity and Diversity . . . 5339--5342
Xiaoou Ding and
Yanshuo Liu and
Zhounan Chen and
Hongzhi Wang and
Chen Wang and
Jianmin Wang TARImpute: Task-Aware Auto-Recommender
System for Missing Value Imputation
Algorithms with Clustering Case Studies 5343--5346
Yuchuan Huang and
Ana Elena Uribe and
Youssef Hussein and
Grant Ogren and
Kareem Eldahshoury and
Mohamed F. Mokbel A Demonstration of Polaris: an
Interactive and Scalable Data
Infrastructure for Polar Science . . . . 5347--5350
Zixuan Chen and
Jinyang Li and
H. V. Jagadish and
Mirek Riedewald GooseDB: a Database Engine that
Optimally Refines Top-$k$ Queries to
Satisfy Representation Constraints . . . 5351--5354
Ourania Ntouni and
Dimitrios Banelas and
Nikos Giatrakos NeuroFlinkCEP: Neurosymbolic Complex
Event Recognition Optimized across IoT
Platforms . . . . . . . . . . . . . . . 5355--5358
Stefan Grafberger and
Paul Groth and
Sebastian Schelter mlidea: Interactively Improving ML Data
Preparation Code via ``Shadow
Pipelines'' . . . . . . . . . . . . . . 5359--5362
Henning Koehler and
Sebastian Link Mining Meaningful Keys and Foreign Keys
with High Precision and Recall . . . . . 5363--5366
Wissal Benjira and
Nicolas Travers and
Faten Atigui and
Bénédicte Bucher and
Malika Grim-Yefsah SDG-KG: a Framework to Compute SDG
Indicators with Open Data . . . . . . . 5367--5370
Zeheng Fan and
Yuxiang Zeng and
Zhuanglin Zheng and
Yongxin Tong FedVSE: a Privacy-Preserving and
Efficient Vector Search Engine for
Federated Databases . . . . . . . . . . 5371--5374
Sebastian Eggers and
Nina Zukowska and
Ziawasch Abedjan APEX-DAG: Library and Language
independent Pipeline EXtraction . . . . 5375--5378
Fatemeh Ahmadi and
Julian Paulußen and
Ziawasch Abedjan Demonstrating Matelda for Multi-Table
Error Detection . . . . . . . . . . . . 5379--5382
Qingliu Wu and
Qingfeng Xiang and
Yingxia Shao and
Qiyao Luo and
Quanqing Xu DBPecker: a Graph-Based Compound Anomaly
Diagnosis System for Distributed RDBMSs 5383--5386
Zequn Li and
Yuanhao Zhong and
Chengliang Chai and
Zhaoze Sun and
Yuhao Deng and
Ye Yuan and
Guoren Wang and
Lei Cao DocDB: a Database for Unstructured
Document Analysis . . . . . . . . . . . 5387--5390
Jianxin Yan and
Wangze Ni and
Lei Chen and
Xuemin Lin and
Peng Cheng and
Zhan Qin and
Kui Ren ContextCache: Context-Aware Semantic
Cache for Multi-Turn Queries in Large
Language Models . . . . . . . . . . . . 5391--5394
Zenon Zacouris and
Maribel Acosta Simulating a Transactional Server for
Multi-Model Systems . . . . . . . . . . 5395--5398
Lingxi Cui and
Guanyu Jiang and
Huan Li and
Ke Chen and
Lidan Shou and
Gang Chen TableCopilot: a Table Assistant
Empowered by Natural Language
Conditional Table Discovery . . . . . . 5399--5402
Shuting Cao and
Zeping Niu and
Guoliang Li LETIndex: a Secure Learned Index with
TEE . . . . . . . . . . . . . . . . . . 5403--5406
Mauro Fam\`a and
Alessandro Ferri and
Samuele Langhi and
Riccardo Tommasini and
Angela Bonifati Play2Win: a Windowing Playground for
Continuous Queries . . . . . . . . . . . 5407--5410
Luigi Bellomarini and
Andrea Gentili and
Davide Magnanimi and
Emanuel Sallinger Vadacode: a Logician-Friendly IDE for
Datalog$^\pm $ . . . . . . . . . . . . . 5411--5414
Anas Dorbani and
Sunny Yasser and
Jimmy Lin and
Amine Mhedhbi Beyond Quacking: Deep Integration of
Language Models and RAG into DuckDB . . 5415--5418
Fan Yang and
John Paparrizos SAIL: a Voyage to Symbolic Approximation
Solutions for Time-Series Analysis . . . 5419--5422
Annabelle Warner and
Andrew McNutt and
Paul Rosen and
El Kindi Rezig Buckaroo: a Direct Manipulation Visual
Data Wrangler . . . . . . . . . . . . . 5423--5426
Akash Khatri and
Mir Mahathir Mohammad and
El Kindi Rezig Sort it Like You Mean It: Discovering
Semantically Interesting Attribute
Augmentations to Sort Tables . . . . . . 5427--5430
Qinghua Liu and
Seunghak Lee and
John Paparrizos EasyAD: a Demonstration of Automated
Solutions for Time-Series Anomaly
Detection . . . . . . . . . . . . . . . 5431--5434
Tarlan Bahadori and
Sai Sreekar Sarvepalli and
Ahmed Eldawy LASEK: LLM-Assisted Style Exploration
Kit for Geospatial Data . . . . . . . . 5435--5438
Hanwen Liu and
Federico Spedalieri and
Ibrahim Sabek A Demonstration of Q$^2$O:
Quantum-Augmented Query Optimizer . . . 5439--5443
Philipp Skavantzos and
Sebastian Link When Entity/Relationship Models Meet
Graph Databases . . . . . . . . . . . . 5444--5447
Graham Cormode and
Shripad Gade and
Samuel Maddock and
Enayat Ullah Synthetic Tabular Data: Methods, Attacks
and Defenses . . . . . . . . . . . . . . 5448--5450
Youssef Hussein and
Mohamed Hemdan and
Mohamed F. Mokbel Large Language Models for Spatial
Analysis Queries . . . . . . . . . . . . 5451--5454
Ziawasch Abedjan and
Mahdi Esmailoghli and
Sainyam Galhotra Data Discovery in Data Lakes:
Operations, Indexes, Systems . . . . . . 5455--5459
Da Yan and
Lyuheng Yuan and
Akhlaque Ahmad and
Saugat Adhikari Systems for Scalable Graph Analytics and
Machine Learning: Trends and Methods . . 5460--5465
Yuyu Luo and
Guoliang Li and
Ju Fan and
Chengliang Chai and
Nan Tang Natural Language to SQL: State of the
Art and Open Problems . . . . . . . . . 5466--5471
Ramon Rico and
Arno Siebes and
Yannis Velegrakis New Trends in Data Forgetting for
Sustainable Data Management . . . . . . 5472--5476
Haridimos Kondylakis and
Stefania Dumbrava and
Matteo Lissandrini and
Nikolay Yakovets and
Angela Bonifati and
Vasilis Efthymiou and
George Fletcher and
Dimitris Plexousakis and
Riccardo Tommasini and
Georgia Troullinou and
Elisjana Ymeralli Property Graph Standards: State of the
Art & Open Challenges . . . . . . . . . . 5477--5481
Roman Heinrich and
Xiao Li and
Manisha Luthra and
Zoi Kaoudi Learned Cost Models for Query
Optimization: From Batch to Streaming
Systems . . . . . . . . . . . . . . . . 5482--5487
Yannis Chronis and
Helena Caminal and
Yannis Papakonstantinou and
Fatma Özcan and
Anastasia Ailamaki Filtered Vector Search: State-of-the-Art
and Research Opportunities . . . . . . . 5488--5492
Mengying Wang and
Moming Duan and
Yicong Huang and
Chen Li and
Bingsheng He and
Yinghui Wu ML-Asset Management: Curation,
Discovery, and Utilization . . . . . . . 5493--5498
Hanchen Wang and
Ying Zhang and
Wenjie Zhang Machine Learning for Graph Data
Management and Query Processing . . . . 5499--5503
James Pan and
Guoliang Li Database Perspective on LLM Inference
Systems . . . . . . . . . . . . . . . . 5504--5507
Angela Bonifati Versatile Property Graph Transformations 5516--5526
Xiangyao Yu Disaggregation: a New Architecture for
Cloud Databases . . . . . . . . . . . . 5527--5530
Viktor Leis and
Andrey Gubichev and
Atanas Mirchev and
Peter Boncz and
Alfons Kemper and
Thomas Neumann Still Asking: How Good Are Query
Optimizers, Really? . . . . . . . . . . 5531--5536
Stratos Idreos Alphabets, Grammars, Calculators, and
the End of Hand-Crafted Systems . . . . 5537--5537
Juliana Freire Bridging Disciplines in Data Management
Research to Solve Complex Data Problems 5538--5538
Matei Zaharia Bringing the Operational and Analytical
Worlds Together with Lakebase . . . . . 5539--5539