Last update:
Wed Oct 8 06:54:23 MDT 2025
Stuart E. Madnick and
Yang W. Lee Editorial for the Inaugural Issue of the
ACM Journal of Data and Information
Quality (JDIQ) . . . . . . . . . . . . . 1:1--1:??
Stuart E. Madnick and
Richard Y. Wang and
Yang W. Lee and
Hongwei Zhu Overview and Framework for Data and
Information Quality Research . . . . . . 2:1--2:??
Xiao-Bai Li A Bayesian Approach for Estimating and
Replacing Missing Categorical Data . . . 3:1--3:??
Kristin Weber and
Boris Otto and
Hubert Österle One Size Does Not Fit All---A
Contingency Approach to Data Governance 4:1--4:??
B. Heinrich and
M. Klier and
M. Kaiser A Procedure to Develop Metrics for
Currency and its Application in CRM . . 5:1--5:??
Stuart E. Madnick and
Yang W. Lee Editorial Letter for the Special Issue
on Data Quality in Databases and
Information Systems . . . . . . . . . . 6:1--6:??
Felix Naumann and
Louiqa Raschid Guest Editorial for the Special Issue on
Data Quality in Databases . . . . . . . 7:1--7:??
Manoranjan Dash and
Ayush Singhania Mining in Large Noisy Domains . . . . . 8:1--8:??
George V. Moustakides and
Vassilios S. Verykios Optimal Stopping: a Record-Linkage
Approach . . . . . . . . . . . . . . . . 9:1--9:??
A. Klein and
W. Lehner Representing Data Quality in Sensor Data
Streaming Environments . . . . . . . . . 10:1--10:??
Suzanne M. Embury and
Paolo Missier and
Sandra Sampaio and
R. Mark Greenwood and
Alun D. Preece Incorporating Domain-Specific
Information Quality Constraints into
Database Queries . . . . . . . . . . . . 11:1--11:??
Stuart E. Madnick and
Yang W. Lee Call for Papers Special Issue on
Healthcare Information Quality: the
Challenges and Opportunities in
Healthcare Systems and Services . . . . 12:1--12:??
Stuart E. Madnick and
Yang W. Lee Editors' Comments: Where the JDIQ
Articles Come From: Incubating Research
in an Emerging Field . . . . . . . . . . 13:1--13:??
V. Sessions and
M. Valtorta Towards a Method for Data Accuracy
Assessment Utilizing a Bayesian Network
Learning Algorithm . . . . . . . . . . . 14:1--14:??
Adir Even and
G. Shankaranarayanan Dual Assessment of Data Quality in
Customer Databases . . . . . . . . . . . 15:1--15:??
Craig W. Fisher and
Eitel J. M. Lauria and
Carolyn C. Matheus An Accuracy Metric: Percentages,
Randomness, and Probabilities . . . . . 16:1--16:??
Sufyan Ababneh and
Rashid Ansari and
Ashfaq Khokhar Compensated Signature Embedding for
Multimedia Content Authentication . . . 17:1--17:??
Stuart E. Madnick and
Yang W. Lee Editors' Comments: ACM Journal of Data
and Information Quality (JDIQ) is alive
and well! . . . . . . . . . . . . . . . 1:1--1:??
Monica Chiarini Tremblay and
Kaushik Dutta and
Debra Vandermeer Using Data Mining Techniques to Discover
Bias Patterns in Missing Data . . . . . 2:1--2:??
Matthew L. Jensen and
Judee K. Burgoon and
Jay F. Nunamaker, Jr. Judging the Credibility of Information
Gathered from Face-to-Face Interactions 3:1--3:??
Hema S. Meda and
Anup Kumar Sen and
Amitava Bagchi On Detecting Data Flow Errors in
Workflows . . . . . . . . . . . . . . . 4:1--4:??
Matteo Magnani and
Danilo Montesi A Survey on Uncertainty Management in
Data Integration . . . . . . . . . . . . 5:1--5:??
John R. Talburt and
Stuart E. Madnick and
Yang W. Lee Call for Papers: Special Issue on Entity
Resolution . . . . . . . . . . . . . . . 6:1--6:??
Stuart E. Madnick and
Yang W. Lee Editorial: In Search of Novel Ideas and
Solutions with a Broader Context of Data
Quality in Mind . . . . . . . . . . . . 7:1--7:??
Roger Blake and
Paul Mangiameli The Effects and Interactions of Data
Quality and Problem Complexity on
Classification . . . . . . . . . . . . . 8:1--8:??
Irit Askira Gelman GIGO or not GIGO: The Accuracy of
Multi-Criteria Satisficing Decisions . . 9:1--9:??
Xiaoming Fan and
Jianyong Wang and
Xu Pu and
Lizhu Zhou and
Bing Lv On Graph-Based Name Disambiguation . . . 10:1--10:??
Benjamin Ngugi and
Beverly K. Kahn and
Marilyn Tremaine Typing Biometrics: Impact of Human
Learning on Performance Quality . . . . 11:1--11:??
Stuart E. Madnick and
Yang W. Lee Editorial Notes: Classification and
Assessment of Large Amounts of Data:
Examples in the Healthcare Industry and
Collaborative Digital Libraries . . . . 12:1--12:??
Eitel J. M. Lauría and
Alan D. March Combining Bayesian Text Classification
and Shrinkage to Automate Healthcare
Coding: a Data Quality Analysis . . . . 13:1--13:??
Daniel Hasan Dalip and
Marcos André Gonçalves and
Marco Cristo and
Pável Calado Automatic Assessment of Document Quality
in Web Collaborative Digital Libraries 14:1--14:??
Heiko Müller and
Johann-Christoph Freytag and
Ulf Leser Improving data quality by source
analysis . . . . . . . . . . . . . . . . 15:1--15:??
Irit Askira Gelman Biases in multi-criteria, satisfying
decisions due to data errors . . . . . . 16:1--16:??
Shelly Sachdeva and
Subhash Bhalla Semantic interoperability in
standardized electronic health record
databases . . . . . . . . . . . . . . . 1:1--1:??
Steven Brown and
Trent S. Rosenbloom and
Shawn P. Hardenbrook and
Terry Clark and
Elliot Fielstein and
Peter Elkin and
Ted Speroff Documentation quality and time costs: a
randomized controlled trial of
structured entry versus dictation . . . 2:1--2:??
Ali Sunyaev and
Dmitry Chornyi Supporting chronic disease care quality:
Design and implementation of a health
service and its integration with
electronic health records . . . . . . . 3:1--3:??
D. Shiloah Elizabeth and
H. Khanna Nehemiah and
C. Sunil Retmin Raj and
A. Kannan A novel segmentation approach for
improving diagnostic accuracy of CAD
systems for detecting lung cancer from
chest computed tomography images . . . . 4:1--4:??
Mohamed Yakout and
Mikhail J. Atallah and
Ahmed Elmagarmid Efficient and Practical Approach for
Private Record Linkage . . . . . . . . . 5:1--5:??
Yanjuan Yang and
Michael Mannino An Experimental Comparison of a Document
Deception Detection Policy using Real
and Artificial Deception . . . . . . . . 6:1--6:??
David A. Robb and
Paul L. Bowen and
A. Faye Borthick and
Fiona H. Rohde Improving New Users' Query Performance:
Deterring Premature Stopping of Query
Revision with Information for Forming Ex
Ante Expectations . . . . . . . . . . . 7:1--7:??
Cihan Varol and
Coskun Bayrak Hybrid Matching Algorithm for Personal
Names . . . . . . . . . . . . . . . . . 8:1--8:??
John O'Donoghue and
Jane Grimson and
Katherine Seelman Introduction to the Special Issue on
Information Quality: The Challenges and
Opportunities in Healthcare Systems and
Services . . . . . . . . . . . . . . . . 1:1--1:??
Claire Collins and
Kelly Janssens Creating a General (Family) Practice
Epidemiological Database in Ireland ---
Data Quality Issue Management . . . . . 2:1--2:??
Olivier Curé Improving the Data Quality of Drug
Databases using Conditional Dependencies
and Ontologies . . . . . . . . . . . . . 3:1--3:??
James McNaull and
Juan Carlos Augusto and
Maurice Mulvenna and
Paul McCullagh Data and Information Quality Issues in
Ambient Assisted Living Systems . . . . 4:1--4:??
John O'Donoghue and
John Herbert Data Management within mHealth
Environments: Patient Sensors, Mobile
Devices, and Databases . . . . . . . . . 5:1--5:??
John R. Talburt Special Issue on Entity Resolution
Overview: The Criticality of Entity
Resolution in Data and Information
Quality . . . . . . . . . . . . . . . . 6:1--6:??
Dezhao Song and
Jeff Heflin Domain-Independent Entity Coreference
for Linking Ontology Instances . . . . . 7:1--7:??
Rabia Nuray-Turan and
Dmitri V. Kalashnikov and
Sharad Mehrotra Adaptive Connection Strength Models for
Relationship-Based Entity Resolution . . 8:1--8:??
Fabian Panse and
Maurice van Keulen and
Norbert Ritter Indeterministic Handling of Uncertain
Decisions in Deduplication . . . . . . . 9:1--9:??
Yinle Zhou and
Eric Nelson and
Fumiko Kobayashi and
John R. Talburt A Graduate-Level Course on Entity
Resolution and Information Quality: a
Step toward ER Education . . . . . . . . 10:1--10:??
Lan Cao and
Hongwei Zhu Normal accidents: Data quality problems
in ERP-enabled manufacturing . . . . . . 11:1--11:??
Dov Biran and
Michael H. Zack and
Richard J. Briotta Competitive intelligence and information
quality: a game-theoretic perspective 12:1--12:??
Nitin R. Joglekar and
Edward G. Anderson and
G. Shankaranarayanan Accuracy of aggregate data in
distributed project settings: Model,
analysis and implications . . . . . . . 13:1--13:??
Louiqa Raschid Editorial . . . . . . . . . . . . . . . 14:1--14:??
Fons Wijnhoven and
Chintan Amrit and
Pim Dietz Value-Based File Retention: File
Attributes as File Value and Information
Waste Indicators . . . . . . . . . . . . 15:1--15:??
Wenfei Fan and
Shuai Ma and
Nan Tang and
Wenyuan Yu Interaction between Record Matching and
Data Repairing . . . . . . . . . . . . . 16:1--16:??
Nigel Martin and
Alexandra Poulovassilis and
Jianing Wang A Methodology and Architecture Embedding
Quality Assessment in Data Integration 17:1--17:??
Felix Naumann Editorial . . . . . . . . . . . . . . . 1:1--1:??
John Talburt and
Therese L. Williams and
Thomas C. Redman and
David Becker Information quality research challenge:
Predicting and quantifying the impact of
social issues on information quality
programs . . . . . . . . . . . . . . . . 2:1--2:??
Erhard Rahm Discovering product counterfeits in
online shops: a big data integration
challenge . . . . . . . . . . . . . . . 3:1--3:??
Peter Christen and
Dinusha Vatsalan and
Vassilios S. Verykios Challenges for privacy preservation in
data integration . . . . . . . . . . . . 4:1--4:??
Tobias Vogel and
Arvid Heise and
Uwe Draisbach and
Dustin Lange and
Felix Naumann Reach for gold: an annealing standard to
evaluate duplicate detection results . . 5:1--5:??
Wenfei Fan and
Floris Geerts and
Nan Tang and
Wenyuan Yu Conflict resolution with data currency
and consistency . . . . . . . . . . . . 6:1--6:??
Paul Glowalla and
Ali Sunyaev Process-driven data quality management:
a critical review on the application of
process modeling languages . . . . . . . 7:1--7:??
Khalid Belhajjame and
Domenico Beneventano and
Laure Berti-Equille and
James Cheney and
Victor Cuevas and
Tom De Nies and
Helena Galhardas and
Ashish Gehani and
Boris Glavic and
Paul Groth and
Olaf Hartig and
Scott Jensen and
Andrea Maurino and
Gianni Mecca and
Renee Miller and
Luc Moreau and
Mourad Ouzzani and
Jaehong Park Editorial . . . . . . . . . . . . . . . 8:1--8:??
You-Wei Cheah and
Beth Plale Provenance Quality Assessment
Methodology and Framework . . . . . . . 9:1--9:??
Melanie Herschel A Hybrid Approach to Answering Why-Not
Questions on Relational Query Results 10:1--10:??
Stephen Chong and
Christian Skalka and
Jeffrey A. Vaughan Self-Identifying Data for Fair Use . . . 11:1--11:??
Chris Baillie and
Peter Edwards and
Edoardo Pignotti QUAL: a Provenance-Aware Quality Model 12:1--12:??
Joshua Attenberg and
Panos Ipeirotis and
Foster Provost Beat the Machine: Challenging Humans to
Find a Predictive Model's ``Unknown
Unknowns'' . . . . . . . . . . . . . . . 1:1--1:??
Omar Alonso Challenges with Label Quality for
Supervised Learning . . . . . . . . . . 2:1--2:??
Roman Lukyanenko and
Jeffrey Parsons Information Quality Research Challenge:
Adapting Information Quality Principles
to User-Generated Content . . . . . . . 3:1--3:??
Felix Naumann Editorial . . . . . . . . . . . . . . . 4:1--4:??
Kush R. Varshney and
Dennis Wei and
Karthikeyan Natesan Ramamurthy and
Aleksandra Mojsilovi\'c Data Challenges in Disease Response: The
2014 Ebola Outbreak and Beyond . . . . . 5:1--5:??
Payam Barnaghi and
Maria Bermudez-Edo and
Ralf Tönjes Challenges for Quality of Data in Smart
Cities . . . . . . . . . . . . . . . . . 6:1--6:??
Christan Earl Grant and
Daisy Zhe Wang A Challenge for Long-Term Knowledge Base
Maintenance . . . . . . . . . . . . . . 7:1--7:??
Kewei Sha and
Sherali Zeadally Data Quality Challenges in
Cyber-Physical Systems . . . . . . . . . 8:1--8:??
Rosella Gennari and
Sara Tonelli and
Pierpaolo Vittorini Challenges in Quality of Temporal Data
--- Starting with Gold Standards . . . . 9:1--9:??
Rahul C. Basole and
Mark L. Braunstein and
Jimeng Sun Data and Analytics Challenges for a
Learning Healthcare System . . . . . . . 10:1--10:??
Ion-George Todoran and
Laurent Lecornu and
Ali Khenchaf and
Jean-Marc Le Caillec A Methodology to Evaluate Important
Dimensions of Information Quality in
Systems . . . . . . . . . . . . . . . . 11:1--11:??
Marta Zarraga-Rodriguez and
M. Jesus Alvarez Experience: Information Dimensions
Affecting Employees' Perceptions Towards
Being Well Informed . . . . . . . . . . 12:1--12:??
Alberto Bartoli and
Andrea De Lorenzo and
Eric Medvet and
Fabiano Tarlao Data Quality Challenge: Toward a Tool
for String Processing by Examples . . . 13:1--13:??
Dirk Ahlers and
John Krogstie Document and Corpus Quality Challenges
for Knowledge Management in Engineering
Enterprises . . . . . . . . . . . . . . 14:1--14:??
Banda Ramadan and
Peter Christen and
Huizhi Liang and
Ross W. Gayler Dynamic Sorted Neighborhood Indexing for
Real-Time Entity Resolution . . . . . . 15:1--15:??
Paolo Coletti and
Maurizio Murgia Design and Construction of a Historical
Financial Database of the Italian Stock
Market 1973--2011 . . . . . . . . . . . 16:1--16:??
Paolo Missier Corrigendum to the Special Issue
Editorial in JDIQ Volume 5, Issue 3 . . 17:1--17:??
Adriane P. Chapman and
Arnon Rosenthal and
Len Seligman The Challenge of ``Quick and Dirty''
Information Quality . . . . . . . . . . 1:1--1:??
Jeremy R. Millar and
Douglas D. Hodson and
Gilbert L. Peterson and
Darryl K. Ahner Data Quality Challenges in Distributed
Live-Virtual-Constructive Test
Environments . . . . . . . . . . . . . . 2:1--2:??
Roman Lukyanenko Information Quality Research Challenge:
Information Quality in the Age of
Ubiquitous Digital Intermediation . . . 3:1--3:??
Hongwei Zhu and
Yang W. Lee and
Arnon S. Rosenthal Data Standards Challenges for
Interoperable and Quality Data . . . . . 4:1--4:??
Robert Ulbricht and
Hilko Donker and
Claudio Hartmann and
Martin Hahmann and
Wolfgang Lehner Challenges for Context-Driven Time
Series Forecasting . . . . . . . . . . . 5:1--5:??
Davide Ceolin and
Paul Groth and
Valentina Maccatrozzo and
Wan Fokkink and
Willem Robert Van Hage and
Archana Nottamkandath Combining User Reputation and Provenance
Analysis for Trust Assessment . . . . . 6:1--6:??
Peter Christen and
Ross W. Gayler and
Khoi-Nguyen Tran and
Jeffrey Fisher and
Dinusha Vatsalan Automatic Discovery of Abnormal Values
in Large Textual Databases . . . . . . . 7:1--7:??
Peter Aiken EXPERIENCE: Succeeding at Data
Management-BigCo Attempts to Leverage
Data . . . . . . . . . . . . . . . . . . 8:1--8:??
Fei Chiang and
Siddharth Sitaramachandran Unifying Data and Constraint Repairs . . 9:1--9:??
Vincenzo Maltese and
Fausto Giunchiglia Search and Analytics Challenges in
Digital Libraries and Archives . . . . . 10:1--10:??
J. Gelernter and
J. Jha Challenges in Ontology Evaluation . . . 11:1--11:??
Laure Berti-Equille and
Mouhamadou Lamine Ba Veracity of Big Data: Challenges of
Cross-Modal Truth Discovery . . . . . . 12:1--12:??
Giannis Haralabopoulos and
Ioannis Anagnostopoulos and
Sherali Zeadally The Challenge of Improving Credibility
of User-Generated Content in Online
Social Networks . . . . . . . . . . . . 13:1--13:??
Ciro D'Urso EXPERIENCE: Glitches in Databases, How
to Ensure Data Quality by Outlier
Detection Techniques . . . . . . . . . . 14:1--14:??
Alan G. Labouseur and
Carolyn C. Matheus An Introduction to Dynamic Data Quality
Challenges . . . . . . . . . . . . . . . 6:1--6:??
Christoph Becker and
Kresimir Duretec and
Andreas Rauber The Challenge of Test Data Quality in
Data Processing . . . . . . . . . . . . 7:1--7:??
Nicola Ferro Reproducibility Challenges in
Information Retrieval Evaluation . . . . 8:1--8:??
G. Shankaranarayanan and
Roger Blake From Content to Context: The Evolution
and Growth of Data Quality Research . . 9:1--9:??
Sean Goldberg and
Daisy Zhe Wang and
Christan Grant A Probabilistically Integrated System
for Crowd-Assisted Text Labeling and
Extraction . . . . . . . . . . . . . . . 10:1--10:??
Philip Woodall The Data Repurposing Challenge: New
Pressures from Data Analytics . . . . . 11:1--11:??
Milan Markovic and
Peter Edwards The Challenge of Quality in Social
Computation . . . . . . . . . . . . . . 12:1--12:??
Leena Al-Hussaini Experience: Insights into the
Benchmarking Data of Hunspell and Aspell
Spell Checkers . . . . . . . . . . . . . 13:1--13:??
Sabrina Abdellaoui and
Fahima Nader and
Rachid Chalal QDflows: a System Driven by Knowledge
Bases for Designing Quality-Aware Data
flows . . . . . . . . . . . . . . . . . 14:1--14:??
Justin St-Maurice and
Catherine Burns An Exploratory Case Study to Understand
Primary Care Users and Their Data
Quality Tradeoffs . . . . . . . . . . . 15:1--15:??
Jiannan Wang and
Nan Tang Dependable Data Repairing with Fixing
Rules . . . . . . . . . . . . . . . . . 16:1--16:??
Diego Marcheggiani and
Fabrizio Sebastiani On the Effects of Low-Quality Training
Data on Information Extraction from
Clinical Reports . . . . . . . . . . . . 1:1--1:??
Aseel Basheer and
Kewei Sha Cluster-Based Quality-Aware Adaptive
Data Compression for Streaming Data . . 2:1--2:??
David Corsar and
Peter Edwards Challenges of Open Data Quality: More
Than Just License, Format, and Customer
Support . . . . . . . . . . . . . . . . 3:1--3:??
Nour El-Mawass and
Saad Alaboodi Data Quality Challenges in Social Spam
Research . . . . . . . . . . . . . . . . 4:1--4:??
Min Chen and
Roman Lukyanenko and
Monica Chiarini Tremblay Information Quality Challenges in Shared
Healthcare Decision Making . . . . . . . 5:1--5:??
Peter Arbuckle and
Ezra Kahn and
Adam Kriesberg Challenge Paper: Challenges to Sharing
Data and Models for Life Cycle
Assessment . . . . . . . . . . . . . . . 6:1--6:??
Louiqa Raschid Editor-in-Chief (January 2014--May 2017)
Farewell Report . . . . . . . . . . . . 7:1--7:??
Tiziana Catarci Foreword from the New JDIQ
Editor-in-Chief . . . . . . . . . . . . 8:1--8:??
Hong-Linh Truong and
Aitor Murguzur and
Erica Yang Challenges in Enabling Quality of
Analytics in the Cloud . . . . . . . . . 9:1--9:??
Kyu Han Koh and
Eric Fouh and
Mohammed F. Farghally and
Hossameldin Shahin and
Clifford A. Shaffer Experience: Learner Analytics Data
Quality for an eTextbook System . . . . 10:1--10:??
C. Cappiello and
C. Cerletti and
C. Fratto and
B. Pernici Validating Data Quality Actions in
Scoring Processes . . . . . . . . . . . 11:1--11:??
Bernd Heinrich and
Diana Hristova and
Mathias Klier and
Alexander Schiller and
Michael Szubartowicz Requirements for Data Quality Metrics 12:1--12:??
Floris Geerts and
Paolo Missier and
Norman Paton Editorial: Special Issue on Improving
the Veracity and Value of Big Data . . . 13:1--13:??
Leopoldo Bertossi and
Mostafa Milani Ontological Multidimensional Data Models
and Contextual Data Quality . . . . . . 14:1--14:??
Michalis Mountantonakis and
Yannis Tzitzikas Scalable Methods for Measuring the
Connectivity and Quality of Large
Numbers of Linked Datasets . . . . . . . 15:1--15:??
Diego Esteves and
Anisa Rula and
Aniketh Janardhan Reddy and
Jens Lehmann Toward Veracity Assessment in RDF
Knowledge Bases: an Exploratory Analysis 16:1--16:??
Qingyu Chen and
Yu Wan and
Xiuzhen Zhang and
Yang Lei and
Justin Zobel and
Karin Verspoor Comparative Analysis of Sequence
Clustering Methods for Deduplication of
Biological Databases . . . . . . . . . . 17:1--17:??
Avigdor Gal and
Arik Senderovich and
Matthias Weidlich Challenge Paper: Data Quality Issues in
Queue Mining . . . . . . . . . . . . . . 18:1--18:??
Fathoni A. Musyaffa and
Christiane Engels and
Maria-Esther Vidal and
Fabrizio Orlandi and
Sören Auer Experience: Open Fiscal Datasets, Common
Issues, and Recommendations . . . . . . 19:1--19:??
Mohammad Alshayeb and
Yasser Shaaban and
Jarallah Al-Ghamdi SPMDL: Software Product Metrics
Definition Language . . . . . . . . . . 20:1--20:??
Naveen Ashish and
Arihant Patawari Machine Reading of Biomedical Data
Dictionaries . . . . . . . . . . . . . . 21:1--21:??
Fei Chiang and
Dhruv Gairola InfoClean: Protecting Sensitive
Information in Data Cleaning . . . . . . 22:1--22:??
Elisa Bertino and
Mohammad R. Jahanshahi Adaptive and Cost-Effective Collection
of High-Quality Data for Critical
Infrastructure and Emergency Management
in Smart Cities-Framework and Challenges 1:1--1:??
Javier Flores and
Jun Sun Information Quality Awareness and
Information Quality Practice . . . . . . 2:1--2:??
Christian Bors and
Theresia Gschwandtner and
Simone Kriglstein and
Silvia Miksch and
Margit Pohl Visual Interactive Creation,
Customization, and Analysis of Data
Quality Metrics . . . . . . . . . . . . 3:1--3:??
Han Zhang and
Shawndra Hill and
David Rothschild Addressing Selection Bias in Event
Studies with General-Purpose Social
Media Panels . . . . . . . . . . . . . . 4:1--4:??
John Puentes and
Pedro Merino Laso and
David Brosset The Challenge of Quality Evaluation in
Fraud Detection . . . . . . . . . . . . 5:1--5:??
Elisa Bertino and
Amani Abu Jabal and
Seraphin Calo and
Dinesh Verma and
Christopher Williams The Challenge of Access Control Policies
Quality . . . . . . . . . . . . . . . . 6:1--6:??
Evanson Mwangi Karanja and
Shedden Masupe and
Mandu Gasennelwe-Jeffrey Challenge Paper: Towards Open Datasets
for Internet of Things Malware . . . . . 7:1--7:??
Ioannis Koumarelas and
Axel Kroschk and
Clifford Mosley and
Felix Naumann Experience: Enhancing Address Matching
with Geocoding and Similarity Measure
Selection . . . . . . . . . . . . . . . 8:1--8:??
Nicola Ferro and
Norbert Fuhr and
Andreas Rauber Introduction to the Special Issue on
Reproducibility in Information
Retrieval: Evaluation Campaigns,
Collections, and Analyses . . . . . . . 9:1--9:??
Alistair Moffat and
Falk Scholer and
Ziying Yang Estimating Measurement Uncertainty for
Information Retrieval Effectiveness
Metrics . . . . . . . . . . . . . . . . 10:1--10:??
Kevin Roitero and
Marco Passon and
Giuseppe Serra and
Stefano Mizzaro Reproduce. Generalize. Extend. On
Information Retrieval Evaluation without
Relevance Judgments . . . . . . . . . . 11:1--11:??
Kevin Roitero and
Michael Soprano and
Andrea Brunello and
Stefano Mizzaro Reproduce and Improve: an Evolutionary
Approach to Select a Few Good Topics for
Information Retrieval Evaluation . . . . 12:1--12:??
Rolf Jagerman and
Krisztian Balog and
Maarten De Rijke OpenSearch: Lessons Learned from an
Online Evaluation Campaign . . . . . . . 13:1--13:??
Nicola Ferro and
Norbert Fuhr and
Andreas Rauber Introduction to the Special Issue on
Reproducibility in Information
Retrieval: Tools and Infrastructures . . 14:1--14:??
Frank Hopfgartner and
Allan Hanbury and
Henning Müller and
Ivan Eggel and
Krisztian Balog and
Torben Brodt and
Gordon V. Cormack and
Jimmy Lin and
Jayashree Kalpathy-Cramer and
Noriko Kando and
Makoto P. Kato and
Anastasia Krithara and
Tim Gollub and
Martin Potthast and
Evelyne Viegas and
Simon Mercer Evaluation-as-a-Service for the
Computational Sciences: Overview and
Outlook . . . . . . . . . . . . . . . . 15:1--15:??
Peilin Yang and
Hui Fang and
Jimmy Lin Anserini: Reproducible Ranking Baselines
Using Lucene . . . . . . . . . . . . . . 16:1--16:??
Johannes Kiesel and
Florian Kneist and
Milad Alshomary and
Benno Stein and
Matthias Hagen and
Martin Potthast Reproducible Web Corpora: Interactive
Archiving with Automatic Quality
Assessment . . . . . . . . . . . . . . . 17:1--17:??
Dwaipayan Roy and
Mandar Mitra and
Debasis Ganguly To Clean or Not to Clean: Document
Preprocessing and Reproducibility . . . 18:1--18:??
Divesh Srivastava and
Monica Scannapieco and
Thomas C. Redman Ensuring High-Quality Private Data for
Responsible Data Science: Vision and
Challenges . . . . . . . . . . . . . . . 1:1--1:??
Julio César Cortés Ríos and
Norman W. Paton and
Alvaro A. A. Fernandes and
Edward Abel and
John A. Keane Crowdsourced Targeted Feedback
Collection for Multicriteria Data Source
Selection . . . . . . . . . . . . . . . 2:1--2:??
Michele Dallachiesa and
Charu C. Aggarwal and
Themis Palpanas Improving Classification Quality in
Uncertain Graphs . . . . . . . . . . . . 3:1--3:??
K. Michael Casey and
Kevin Casey Jr. Financial Regulatory and Risk Management
Challenges Stemming from Firm-Specific
Digital Misinformation . . . . . . . . . 4:1--4:??
Wenfei Fan Dependencies for Graphs: Challenges and
Opportunities . . . . . . . . . . . . . 5:1--5:??
Christian Sillaber and
Andrea Mussmann and
Ruth Breu Experience: Data and Information Quality
Challenges in Governance, Risk, and
Compliance Management . . . . . . . . . 6:1--6:??
Alina Lazar and
Ling Jin and
C. Anna Spurlock and
Kesheng Wu and
Alex Sim and
Annika Todd Evaluating the Effects of Missing Values
and Mixed Data Types on Social Sequence
Clustering Using t-SNE Visualization . . 7:1--7:??
Daniel Müller and
Pratiksha Jain and
Yieh-Funk Te Augmenting Data Quality through
High-Precision Gender Categorization . . 8:1--8:??
Naeemul Hassan and
Chengkai Li and
Jun Yang and
Cong Yu Introduction to the Special Issue on
Combating Digital Misinformation and
Disinformation . . . . . . . . . . . . . 9:1--9:??
Savvas Zannettou and
Michael Sirivianos and
Jeremy Blackburn and
Nicolas Kourtellis The Web of False Information: Rumors,
Fake News, Hoaxes, Clickbait, and
Various Other Shenanigans . . . . . . . 10:1--10:??
Hao Xue and
Qiaozhi Wang and
Bo Luo and
Hyunjin Seo and
Fengjun Li Content-Aware Trust Propagation Toward
Online Review Spam Detection . . . . . . 11:1--11:??
Pepa Atanasova and
Preslav Nakov and
Lluís M\`arquez and
Alberto Barrón-Cedeño and
Georgi Karadzhov and
Tsvetomila Mihaylova and
Mitra Mohtarami and
James Glass Automatic Fact-Checking Using Context
and Discourse Information . . . . . . . 12:1--12:??
Peng Lin and
Qi Song and
Yinghui Wu and
Jiaxing Pi Discovering Patterns for Fact Checking
in Knowledge Graphs . . . . . . . . . . 13:1--13:??
Luís Borges and
Bruno Martins and
Pável Calado Combining Similarity Features and Deep
Representation Learning for Stance
Detection in the Context of Checking
Fake News . . . . . . . . . . . . . . . 14:1--14:??
Serge Abiteboul and
Julia Stoyanovich Transparency, Fairness, Data Protection,
Neutrality: Data Management Challenges
in the Face of New Regulation . . . . . 15:1--15:??
Elisa Bertino and
Ahish Kundu and
Zehra Sura Data Transparency with Blockchain and AI
Ethics . . . . . . . . . . . . . . . . . 16:1--16:??
Amir Ebrahimi Fard and
Scott Cunningham Assessing the Readiness of Academia in
the Topic of False and Unverified
Information . . . . . . . . . . . . . . 17:1--17:??
Matthew Babcock and
David M. Beskow and
Kathleen M. Carley Different Faces of False: The Spread and
Curtailment of False Information in the
Black Panther Twitter Discussion . . . . 18:1--18:??
Michael F. Bosu and
Stephen G. Macdonell Experience: Quality Benchmarking of
Datasets Used in Software Effort
Estimation . . . . . . . . . . . . . . . 19:1--19:??
Junhua Ding and
Xinchuan Li and
Xiaojun Kang and
Venkat N. Gudivada A Case Study of the Augmentation and
Evaluation of Training Data for Deep
Learning . . . . . . . . . . . . . . . . 20:1--20:??
Zahaib Akhtar and
Anh Minh Le and
Yun Seong Nam and
Jessica Chen and
Ramesh Govindan and
Ethan Katz-Bassett and
Sanjay Rao and
Jibin Zhan Improving Adaptive Video Streaming
through Session Classification . . . . . 21:1--21:??
Tova Milo Getting Rid of Data . . . . . . . . . . 1:1--1:7
Donatella Firmani and
Letizia Tanca and
Riccardo Torlone Ethical Dimensions for Data Quality . . 2:1--2:5
Uwe Draisbach and
Peter Christen and
Felix Naumann Transforming Pairwise Duplicates to
Entity Clusters for High-quality
Duplicate Detection . . . . . . . . . . 3:1--3:30
Yusra Shakeel and
Jacob Krüger and
Ivonne Von Nostitz-Wallwitz and
Gunter Saake and
Thomas Leich Automated Selection and Quality
Assessment of Primary Studies: a
Systematic Literature Review . . . . . . 4:1--4:26
Al Hafiz Akbar Maulana Siagian and
Masayoshi Aritsugi Robustness of Word and Character
$N$-gram Combinations in Detecting
Deceptive and Truthful Opinions . . . . 5:1--5:24
Reema Aswani and
Arpan Kumar Kar and
P. Vigneswara Ilavarasan Experience: Managing Misinformation in
Social Media-Insights for Policymakers
from Twitter Analytics . . . . . . . . . 6:1--6:18
Anisa Rula and
Amrapali Zaveri and
Elena Simperl and
Elena Demidova Editorial: Special Issue on Quality
Assessment of Knowledge Graphs Dedicated
to the Memory of Amrapali Zaveri . . . . 7:1--7:4
Naser Ahmadi and
Viet-Phi Huynh and
Vamsi Meduri and
Stefano Ortona and
Paolo Papotti Mining Expressive Rules in Knowledge
Graphs . . . . . . . . . . . . . . . . . 8:1--8:27
Armin Haller and
Javier D. Fernández and
Maulik R. Kamdar and
Axel Polleres What Are Links in Linked Open Data? A
Characterization and Evaluation of Links
between Knowledge Graphs on the Web . . 9:1--9:34
Michalis Mountantonakis and
Yannis Tzitzikas Content-based Union and Complement
Metrics for Dataset Search over RDF
Knowledge Graphs . . . . . . . . . . . . 10:1--10:31
Leopoldo Bertossi and
Floris Geerts Data Quality and Explainable AI . . . . 11:1--11:9
Evaggelia Pitoura Social-minded Measures of Data Quality:
Fairness, Diversity, and Lack of Bias 12:1--12:8
Adrienne Colborne and
Michael Smit Characterizing Disinformation Risk to
Open Data in the Post-Truth Era . . . . 13:1--13:13
Karen Banahene Blay and
Steven Yeomans and
Peter Demian and
Danny Murguia The Information Resilience Framework:
Vulnerabilities, Capabilities, and
Requirements . . . . . . . . . . . . . . 14:1--14:25
Ioannis Koumarelas and
Lan Jiang and
Felix Naumann Data Preparation for Duplicate Detection 15:1--15:24
Larysa Visengeriyeva and
Ziawasch Abedjan Anatomy of Metadata for Data Curation 16:1--16:30
Giuseppe Polese and
Vincenzo Deufemia and
Shaoxu Song Editorial: Special Issue on Metadata
Discovery for Assessing Data Quality . . 17:1--17:2
Domenico Beneventano and
Sonia Bergamaschi and
Luca Gagliardelli and
Giovanni Simonini BLAST2: an Efficient Technique for Loose
Schema Information Extraction from
Heterogeneous Big Data Sources . . . . . 18:1--18:22
Loredana Caruccio and
Stefano Cirillo Incremental Discovery of Imprecise
Functional Dependencies . . . . . . . . 19:1--19:25
Sofía Maiolo and
Lorena Etcheverry and
Adriana Marotta Data Profiling in Property Graph
Databases . . . . . . . . . . . . . . . 20:1--20:27
Naser Ahmadi and
Thi-Thuy-Duyen Truong and
Le-Hong-Mai Dao and
Stefano Ortona and
Paolo Papotti RuleHub: a Public Corpus of Rules for
Knowledge Graphs . . . . . . . . . . . . 21:1--21:22
Philipp Lämmel and
Benjamin Dittwald and
Lina Bruns and
Nikolay Tcholtchev and
Yuri Glikman and
Silke Cuno and
Mathias Flügge and
Ina Schieferdecker Metadata Harvesting and Quality
Assurance within Open Urban Platforms 22:1--22:20
Yuliang Li and
Jinfeng Li and
Yoshihiko Suhara and
Jin Wang and
Wataru Hirota and
Wang-Chiew Tan Deep Entity Matching: Challenges and
Opportunities . . . . . . . . . . . . . 1:1--1:17
Michael Loster and
Ioannis Koumarelas and
Felix Naumann Knowledge Transfer for Entity Resolution
with Siamese Neural Networks . . . . . . 2:1--2:25
Nelson Novaes Neto and
Stuart Madnick and
Anchises Moraes G. De Paula and
Natasha Malara Borges Developing a Global Data Breach Database
and the Challenges Encountered . . . . . 3:1--3:33
Michela Fazzolari and
Francesco Buccafurri and
Gianluca Lax and
Marinella Petrocchi Experience: Improving Opinion Spam
Detection by Cumulative Relative
Frequency Distribution . . . . . . . . . 4:1--4:16
Rogério Luís C. Costa and
Enrico Miranda and
Paulo Dias and
José Moreira Experience: Quality Assessment and
Improvement on a Forest Fire Dataset . . 5:1--5:13
Shadi Aljawarneh and
Juan A. Lara Editorial: Special Issue on Quality
Assessment and Management in Big Data
--- Part I . . . . . . . . . . . . . . . 6:1--6:3
Mary L. Cummings and
Songpo Li Subjectivity in the Creation of Machine
Learning Models . . . . . . . . . . . . 7:1--7:19
Syed Iftikhar Hussain Shah and
Vassilios Peristeras and
Ioannis Magnisalis Government Big Data Ecosystem:
Definitions, Types of Data, Actors, and
Roles and the Impact in Public
Administrations . . . . . . . . . . . . 8:1--8:25
Abeer A. Al Batayneh and
Malik Qasaimeh and
Raad S. Al-Qassas A Scoring System for Information
Security Governance Framework Using Deep
Learning Algorithms: a Case Study on the
Banking Sector . . . . . . . . . . . . . 9:1--9:34
Salam Fraihat and
Walid A. Salameh and
Ammar Elhassan and
Bushra Abu Tahoun and
Maisa Asasfeh Business Intelligence Framework Design
and Implementation: a Real-estate Market
Case Study . . . . . . . . . . . . . . . 10:1--10:16
A. Khalemsky and
R. Gelbard ExpanDrogram: Dynamic Visualization of
Big Data Segmentation over Time . . . . 11:1--11:27
Vangipuram Radhakrishna and
Gali Suresh Reddy and
Puligadda Veereswara Kumar and
Vinjamuri Janaki Challenge Paper: The Vision for Time
Profiled Temporal Association Mining . . 12:1--12:8
Shadi Aljawarneh and
Juan A. Lara Editorial: Special Issue on Quality
Assessment and Management in Big Data
--- Part II . . . . . . . . . . . . . . 13:1--13:3
Sreelakshmy I. J. and
Binsu C. Kovoor A Hybrid Inpainting Model Combining
Diffusion and Enhanced Exemplar Methods 14:1--14:19
Rada Chirkova and
Jon Doyle and
Juan Reutter Ensuring Data Readiness for Quality
Requirements with Help from Procedure
Reuse . . . . . . . . . . . . . . . . . 15:1--15:15
Jeevamol Joy and
Nisha S. Raj and
Renumol V. G. Ontology-based E-learning Content
Recommender System for Addressing the
Pure Cold-start Problem . . . . . . . . 16:1--16:27
Anurag Roy and
Shalmoli Ghosh and
Kripabandhu Ghosh and
Saptarshi Ghosh An Unsupervised Normalization Algorithm
for Noisy Text: a Case Study for
Information Retrieval and Stance
Detection . . . . . . . . . . . . . . . 17:1--17:25
Zhicheng Liu and
Yang Zhang and
Ruihong Huang and
Zhiwei Chen and
Shaoxu Song and
Jianmin Wang EXPERIENCE: Algorithms and Case Study
for Explaining Repairs with Uniform
Profiles over IoT Data . . . . . . . . . 18:1--18:17
Jakub Kubiczek and
BartLomiej Hadasik Challenges in Reporting the COVID-19
Spread and its Presentation to the
Society . . . . . . . . . . . . . . . . 19:1--19:7
Mihnea Tufis and
Ludovico Boratto Toward a Complete Data Valuation
Process. Challenges of Personal Data . . 20:1--20:7
Stuti Nayak and
Amrapali Zaveri and
Pedro Hernandez Serrano and
Michel Dumontier Experience: Automated Prediction of
Experimental Metadata from Scientific
Publications . . . . . . . . . . . . . . 21:1--21:11
Jessica Chen and
Henry Milner and
Ion Stoica and
Jibin Zhan Benchmark of Bitrate Adaptation in Video
Streaming . . . . . . . . . . . . . . . 22:1--22:24
Gabriel Amaral and
Alessandro Piscopo and
Lucie-aimée Kaffee and
Odinaldo Rodrigues and
Elena Simperl Assessing the Quality of Sources in
Wikidata Across Languages: a Hybrid
Approach . . . . . . . . . . . . . . . . 23:1--23:35
Mahmoud Barhamgi and
Elisa Bertino Editorial: Special Issue on Data
Transparency-Data Quality, Annotation,
and Provenance . . . . . . . . . . . . . 1:1--1:3
Saravanan Thirumuruganathan and
Mayuresh Kunjir and
Mourad Ouzzani and
Sanjay Chawla Automated Annotations for AI Data and
Model Transparency . . . . . . . . . . . 2:1--2:9
Sandra Geisler and
Maria-Esther Vidal and
Cinzia Cappiello and
Bernadette Farias Lóscio and
Avigdor Gal and
Matthias Jarke and
Maurizio Lenzerini and
Paolo Missier and
Boris Otto and
Elda Paja and
Barbara Pernici and
Jakob Rehof Knowledge-Driven Data Ecosystems Toward
Data Transparency . . . . . . . . . . . 3:1--3:12
Khalid Belhajjame On the Anonymization of Workflow
Provenance without Compromising the
Transparency of Lineage . . . . . . . . 4:1--4:27
Tooska Dargahi and
Hossein Ahmadvand and
Mansour Naser Alraja and
Chia-Mu Yu Integration of Blockchain with Connected
and Autonomous Vehicles: Vision and
Challenge . . . . . . . . . . . . . . . 5:1--5:10
Mahmoud Barhamgi and
Elisa Bertino Editorial: Special Issue on Data
Transparency-Uses Cases and Applications 6:1--6:3
Youakim Badr and
Rahul Sharma Data Transparency and Fairness Analysis
of the NYPD Stop-and-Frisk Program . . . 7:1--7:14
Chien-Lun Chen and
Leana Golubchik and
Ranjan Pal Achieving Transparency Report Privacy in
Linear Time . . . . . . . . . . . . . . 8:1--8:56
Lara Mauri and
Ernesto Damiani Estimating Degradation of Machine
Learning Data Assets . . . . . . . . . . 9:1--9:15
Bin Wang and
Pengfei Guo and
Xing Wang and
Yongzhong He and
Wei Wang Transparent Aspect-Level Sentiment
Analysis Based on Dependency Syntax
Analysis and Its Application on COVID-19 10:1--10:24
Che-Yun Hsu and
Ting-Rui Chen and
Hung-Hsuan Chen Experience: Analyzing Missing Web Page
Visits and Unintentional Web Page Visits
from the Client-side Web Logs . . . . . 11:1--11:17
Sudhir Kumar Patnaik and
C. Narendra Babu A Web Information Extraction Framework
with Adaptive and Failure Prediction
Feature . . . . . . . . . . . . . . . . 12:1--12:21
Ihab F. Ilyas and
Theodoros Rekatsinas Machine Learning and Data Cleaning:
Which Serves the Other? . . . . . . . . 13:1--13:11
Donatello Santoro and
Saravanan Thirumuruganathan and
Paolo Papotti Editorial: Special Issue on Deep
Learning for Data Quality . . . . . . . 14:1--14:3
Renzhi Wu and
Nilaksh Das and
Sanya Chaba and
Sakshi Gandhi and
Duen Horng Chau and
Xu Chu A Cluster-then-label Approach for
Few-shot Learning with Application to
Automatic Image Data Labeling . . . . . 15:1--15:23
Roee Shraga and
Avigdor Gal PoWareMatch: a Quality-aware Deep
Learning Approach to Improve Human
Schema Matching . . . . . . . . . . . . 16:1--16:27
Md Enamul Haque and
Mehmet Engin Tozal Negative Insurance Claim Generation
Using Distance Pooling on Positive
Diagnosis-Procedure Bipartite Graphs . . 17:1--17:26
Camil Demetrescu and
Irene Finocchi and
Andrea Ribichini and
Marco Schaerf Which Conference Is That? A Case Study
in Computer Science . . . . . . . . . . 18:1--18:13
Dennis Gram and
Pantelis Karapanagiotis and
Marius Liebald and
Uwe Walz Design and Implementation of a
Historical German Firm-level Financial
Database . . . . . . . . . . . . . . . . 19:1--19:22
Zheng Zheng and
Longtao Zheng and
Morteza Alipourlangouri and
Fei Chiang and
Lukasz Golab and
Jaroslaw Szlichta and
Sridevi Baskaran Contextual Data Cleaning with Ontology
Functional Dependencies . . . . . . . . 20:1--20:26
Philipp Hacker and
Felix Naumann and
Tobias Friedrich and
Stefan Grundmann and
Anja Lehmann and
Herbert Zech AI Compliance --- Challenges of Bridging
Data Science and Law . . . . . . . . . . 21:1--21:4
Yuanxia Li and
Faiz Currim and
Sudha Ram Data Completeness and Complex Semantics
in Conceptual Modeling: The Need for a
Disaggregation Construct . . . . . . . . 22:1--22:??
Justin M. Johnson and
Taghi M. Khoshgoftaar A Survey on Classifying Big Data with
Label Noise . . . . . . . . . . . . . . 23:1--23:??
Donatella Firmani and
Letizia Tanca and
Riccardo Torlone Editorial: Special Issue on Data Quality
and Ethics . . . . . . . . . . . . . . . 24:1--24:??
Mariachiara Mecati and
Antonio Vetr\`o and
Marco Torchiano Detecting Risk of Biased Output with
Balance Measures . . . . . . . . . . . . 25:1--25:??
Chiara Accinelli and
Barbara Catania and
Giovanna Guerrini and
Simone Minisi A Coverage-based Approach to
Nondiscrimination-aware Data
Transformation . . . . . . . . . . . . . 26:1--26:??
H. Jagadish and
Julia Stoyanovich and
Bill Howe The Many Facets of Data Equity . . . . . 27:1--27:??
Lacramioara Mazilu and
Norman W. Paton and
Nikolaos Konstantinou and
Alvaro A. A. Fernandes Fairness-aware Data Integration . . . . 28:1--28:??
Fabio Azzalini and
Chiara Criscuolo and
Letizia Tanca E-FAIR-DB: Functional Dependencies to
Discover Data Bias and Enhance Data
Equity . . . . . . . . . . . . . . . . . 29:1--29:??
Dustin Wright and
Paolo Papotti and
Isabelle Augenstein Introduction to the Special Issue on
Truth and Trust Online . . . . . . . . . 1:1--1:??
Anna Gausen and
Wayne Luk and
Ce Guo Using Agent-Based Modelling to Evaluate
the Impact of Algorithmic Curation on
Social Media . . . . . . . . . . . . . . 2:1--2:??
Dominik Stammbach and
Boya Zhang and
Elliott Ash The Choice of Textual Knowledge Base in
Automated Claim Checking . . . . . . . . 3:1--3:??
Erik Brand and
Kevin Roitero and
Michael Soprano and
Afshin Rahimi and
Gianluca Demartini A Neural Model to Jointly Predict and
Explain Truthfulness of Statements . . . 4:1--4:??
Yunke Qu and
Kevin Roitero and
David La Barbera and
Damiano Spina and
Stefano Mizzaro and
Gianluca Demartini Combining Human and Machine Confidence
in Truthfulness Assessment . . . . . . . 5:1--5:??
Atijit Anuchitanukul and
Julia Ive and
Lucia Specia Revisiting Contextual Toxicity Detection
in Conversations . . . . . . . . . . . . 6:1--6:??
Subhadarshi Panda and
Sarah Levitan Deception Detection Within and Across
Domains: Identifying and Understanding
the Performance Gap . . . . . . . . . . 7:1--7:??
Asara Senaratne and
Peter Christen and
Graham Williams and
Pouya G. Omran Unsupervised Identification of Abnormal
Nodes and Edges in Graphs . . . . . . . 8:1--8:??
Chaoyuan Zuo and
Ritwik Banerjee and
Fateme Hashemi Chaleshtori and
Hossein Shirazi and
Indrakshi Ray Seeing Should Probably Not Be Believing:
The Role of Deceptive Support in
COVID-19 Misinformation on Twitter . . . 9:1--9:??
Roberto Navigli and
Simone Conia and
Björn Ross Biases in Large Language Models:
Origins, Inventory, and Discussion . . . 10:1--10:??
Maria Priestley and
Fionntán O'donnell and
Elena Simperl A Survey of Data Quality Requirements
That Matter in ML Development Pipelines 11:1--11:??
Eric Simon and
Bernd Amann and
Rutian Liu and
Stéphane Gançarski Controlling the Correctness of
Aggregation Operations During Sessions
of Interactive Analytic Queries . . . . 12:1--12:??
Philipp Skavantzos and
Uwe Leck and
Kaiqi Zhao and
Sebastian Link Uniqueness Constraints for Object Stores 13:1--13:??
Duncan Smith and
Mark Elliot and
Joseph W. Sakshaug To Link or Synthesize? An Approach to
Data Quality Comparison . . . . . . . . 14:1--14:??
Jing Ao and
Zehui Cheng and
Rada Chirkova and
Phokion G. Kolaitis Theory and Practice of Relational-to-RDF
Temporal Data Exchange and Query
Answering . . . . . . . . . . . . . . . 15:1--15:??
Christina Timko and
Malte Niederstadt and
Naman Goel and
Boi Faltings Incentive Mechanism Design for
Responsible Data Governance: a
Large-scale Field Experiment . . . . . . 16:1--16:??
Vanessa Simard and
Mikael Rönnqvist and
Luc Lebel and
Nadia Lehoux A Method to Classify Data Quality for
Decision Making Under Uncertainty . . . 17:1--17:??
Amal Tawakuli and
Daniel Kaiser and
Thomas Engel Experience: Differentiating Between
Isolated and Sequence Missing Data . . . 18:1--18:??
Gautam Srivastava and
Jerry Chun-Wei Lin and
Zhihan Lv Editorial for the Special Issue on
Quality Assessment of Data Security . . 19:1--19:??
Kyle Hoffpauir and
Jacob Simmons and
Nikolas Schmidt and
Rachitha Pittala and
Isaac Briggs and
Shanmukha Makani and
Yaser Jararweh A Survey on Edge Intelligence and
Lightweight Machine Learning Support for
Future Applications and Services . . . . 20:1--20:??
Kedar Nath Singh and
Amit Kumar Singh An Improved Encryption-Compression-based
Algorithm for Securing Digital Images 21:1--21:??
Y. Supriya and
Thippa Reddy Gadekallu A Survey on Soft Computing Techniques
for Federated Learning --- Applications,
Challenges and Future Directions . . . . 22:1--22:??
Kakali Chatterjee and
Ashish Singh and
Neha and
Keping Yu A Multifactor Ring Signature based
Authentication Scheme for Quality
Assessment of IoMT Environment in
COVID-19 Scenario . . . . . . . . . . . 23:1--23:??
Gautam Kumar and
Sambit Bakshi and
Arun Kumar Sangaiah and
Pankaj Kumar Sa Experimental Evaluation of Covariates
Effects on Periocular Biometrics: a
Robust Security Assessment Framework . . 24:1--24:??
Hadi Fadlallah and
Rima Kilany and
Houssein Dhayne and
Rami El Haddad and
Rafiqul Haque and
Yehia Taher and
Ali Jaber Context-aware Big Data Quality
Assessment: a Scoping Review . . . . . . 25:1--25:??
Ornella Irrera and
Andrea Mannocci and
Paolo Manghi and
Gianmaria Silvello A Novel Curated Scholarly Graph
Connecting Textual and Data Publications 26:1--26:??
Hadi Fadlallah and
Rima Kilany and
Houssein Dhayne and
Rami El Haddad and
Rafiqul Haque and
Yehia Taher and
Ali Jaber BIGQA: Declarative Big Data Quality
Assessment . . . . . . . . . . . . . . . 27:1--27:??
Viola Wenz and
Arno Kesper and
Gabriele Taentzer Clustering Heterogeneous Data Values for
Data Quality Analysis . . . . . . . . . 28:1--28:??
Arthur H. M. Ter Hofstede and
Agnes Koschmider and
Andrea Marrella and
Robert Andrews and
Dominik A. Fischer and
Sareh Sadeghianasl and
Moe Thandar Wynn and
Marco Comuzzi and
Jochen De Weerdt and
Kanika Goel and
Niels Martin and
Pnina Soffer Process-Data Quality: The True Frontier
of Process Mining . . . . . . . . . . . 29:1--29:??
Chinmay Chakraborty and
Mohammad Khosravi and
Muhammad Khurram Khan and
Houbing Herbert Song Editorial: Multimodality,
Multidimensional Representation, and
Multimedia Quality Assessment Toward
Information Quality in Social Web of
Things . . . . . . . . . . . . . . . . . 30:1--30:??
Cameron Aume and
Shantanu Pal and
Alireza Jolfaei and
Subhas Mukhopadhyay Multimodal Social Data Analytics on the
Design and Implementation of an
EEG-Mechatronic System Interface . . . . 31:1--31:??
Yang Jing and
Ma Haowei and
Arshiya S. Ansari and
G. Sucharitha and
Batyrkhan Omarov and
Sandeep Kumar and
Mohammad Sajid Mohammadi and
Khaled A. Z. Alyamani Soft Computing Techniques for Detecting
Cyberbullying in Social Multimedia Data 32:1--32:??
Khaled Matrouk and
Srikanth V and
Sumit Kumar and
Mohit Kumar Bhadla and
Mirza Sabirov and
Mohamed J. Saadh Deep Learning-based Dynamic User
Alignment in Social Networks . . . . . . 33:1--33:??
R. John Martin and
Rajvardhan Oak and
Mukesh Soni and
V. Mahalakshmi and
Arsalan Muhammad Soomar and
Anjali Joshi Fusion-based Representation Learning
Model for Multimode User-generated
Social Network Content . . . . . . . . . 34:1--34:??
Hani Attar Joint IoT/ML Platforms for Smart
Societies and Environments: a Review on
Multimodal Information-Based Learning
for Safety and Security . . . . . . . . 35:1--35:??
Ahmad Al-Qerem and
Ali Mohd Ali and
Shadi Nashwan and
Mohammad Alauthman and
Ala Hamarsheh and
Ahmad Nabot and
Issam Jibreen Transactional Services for Concurrent
Mobile Agents over Edge/Cloud
Computing-Assisted Social Internet of
Things . . . . . . . . . . . . . . . . . 36:1--36:??
Ahmad Al-Qerem and
Ali Mohd Ali and
Hani Attar and
Shadi Nashwan and
Lianyong Qi and
Mohammad Kazem Moghimi and
Ahmed Solyman Synthetic Generation of Multidimensional
Data to Improve Classification Model
Validity . . . . . . . . . . . . . . . . 37:1--37:??
Ahmad Alzu'bi and
Lojin Bani Younis and
Abdelrahman Abuarqoub and
Mohammad Hammoudeh Multimodal Deep Learning with
Discriminant Descriptors for Offensive
Memes Detection . . . . . . . . . . . . 38:1--38:??
Erfan Varedi and
Reza Boostani A Novel Feature Selection Method for
Risk Management in High-Dimensional Time
Series of Cryptocurrency Market . . . . 39:1--39:??
Marco Console and
Maurizio Lenzerini Editorial: Special Issue on Quality
Aspects of Data Preparation . . . . . . 40:1--40:??
Patrick Lambrix Completing and Debugging Ontologies:
State-of-the-art and Challenges in
Repairing Ontologies . . . . . . . . . . 41:1--41:??
Carlo A. Bono and
Cinzia Cappiello and
Barbara Pernici and
Edoardo Ramalli and
Monica Vitali Pipeline Design for Data Preparation for
Social Media Analysis . . . . . . . . . 42:1--42:??
Pavel Krasikov and
Christine Legner A Method to Screen, Assess, and Prepare
Open Data for Use: a Method to Screen,
Assess, and Prepare Open Data for Use 43:1--43:??
Hima Patel and
Shanmukha Guttula and
Nitin Gupta and
Sandeep Hans and
Ruhi Sharma Mittal and
Lokesh N. A Data-centric AI Framework for
Automating Exploratory Data Analysis and
Data Quality Tasks . . . . . . . . . . . 44:1--44:??
Luis Del Vasto-Terrientes Experience: Data Management for
Delivering COVID-19 Relief in Panama . . 45:1--45:??
Felix Naumann Editorial . . . . . . . . . . . . . . . 1:1--1:??
Tiziana Catarci Editor-in-Chief (June $ 2017$-November
2023) Farewell Report . . . . . . . . . 2:1--2:??
Gianluca Demartini and
Shazia Sadiq and
Jie Yang Editorial: Special Issue on Human in the
Loop Data Curation . . . . . . . . . . . 3:1--3:??
Stefani Tsaneva and
Marta Sabou Enhancing Human-in-the-Loop Ontology
Curation Results through Task Design . . 4:1--4:??
Timo Breuer and
Norbert Fuhr and
Philipp Schaer Validating Synthetic Usage Data in
Living Lab Environments . . . . . . . . 5:1--5:??
João L. M. Pereira and
Manuel J. Fonseca and
Antónia Lopes and
Helena Galhardas Cleenex: Support for User Involvement
during an Iterative Data Cleaning
Process . . . . . . . . . . . . . . . . 6:1--6:??
Julian Le Deunf and
Arwa Khannoussi and
Laurent Lecornu and
Patrick Meyer and
John Puentes Data Quality Assessment through a
Preference Model . . . . . . . . . . . . 7:1--7:??
Dakshi Tharanga Kapugama Geeganage and
Moe Thandar Wynn and
Arthur H. M. ter Hofstede Text2EL+: Expert Guided Event Log
Enrichment Using Unstructured Text . . . 8:1--8:??
Tobias Backes and
Stefan Dietze Connected Components for Scaling
Partial-order Blocking to Billion
Entities . . . . . . . . . . . . . . . . 9:1--9:??
Guy-Junior Richard and
Jérôme Habonneau and
Didier Guériot and
Jean-Marc Le Caillec AI Explainability and Acceptance: a Case
Study for Underwater Mine Hunting . . . 10:1--10:??
Na Li and
Yiyang Qi and
Chaoran Li and
Zhiming Zhao Active Learning for Data Quality
Control: a Survey . . . . . . . . . . . 11:1--11:??
Giansalvatore Mecca and
Paolo Papotti and
Donatello Santoro and
Enzo Veltri BUNNI: Learning Repair Actions in
Rule-driven Data Cleaning . . . . . . . 12:1--12:??
Florian Bachinger and
Lisa Ehrlinger and
Gabriel Kronberger and
Wolfram Wöss Data Validation Utilizing Expert
Knowledge and Shape Constraints . . . . 13:1--13:??
Michael Stenger and
André Bauer and
Thomas Prantl and
Robert Leppich and
Nathaniel Hudson and
Kyle Chard and
Ian Foster and
Samuel Kounev Thinking in Categories: a Survey on
Assessing the Quality for Time Series
Synthesis . . . . . . . . . . . . . . . 14:1--14:??
Sergei Chuprov and
Raman Zatsarenko and
Leon Reznik and
Igor Khokhlov Data Quality Based Intelligent
Instrument Selection with Security
Integration . . . . . . . . . . . . . . 15:1--15:??
Hichem Belgacem and
Xiaochen Li and
Domenico Bianculli and
Lionel Briand Automated anomaly detection for
categorical data by repurposing a form
filling recommender system . . . . . . . 16:1--16:??
Heinrich Peters and
Alireza Hashemi and
James Rae Generalizable Error Modeling for Human
Data Annotation: Evidence From an
Industry-Scale Search Data Annotation
Program . . . . . . . . . . . . . . . . 17:1--17:??
Naif Alzahrani and
Jacek Ca\la and
Paolo Missier Experience: a Comparative Analysis of
Multivariate Time-Series Generative
Models: a Case Study on Human Activity
Data . . . . . . . . . . . . . . . . . . 18:1--18:??
Flavia Serra and
Verónika Peralta and
Adriana Marotta and
Patrick Marcel Use of Context in Data Quality
Management: a Systematic Literature
Review . . . . . . . . . . . . . . . . . 19:1--19:??
Foutse Khomh and
Andreas Metzger and
Phu Nguyen and
Sagar Sen Editorial: Special Issue on Software
Engineering and AI for Data Quality . . 20:1--20:??
Valentina Golendukhina and
Harald Foidl and
Daniel Hörl and
Michael Felderer A Catalog of Consumer IoT Device
Characteristics for Data Quality
Estimation . . . . . . . . . . . . . . . 21:1--21:??
Edmon Begoli and
Maria Mahbub and
Linsey Passarella and
Sudarshan Srinivasan A Compound Data Poisoning Technique with
Significant Adversarial Effects on
Transformer-based Sentiment
Classification Tasks . . . . . . . . . . 22:1--22:??
Maria Gabriela Valeriano and
Ana Matran-Fernandez and
Carlos Kiffer and
Ana Carolina Lorena Understanding the performance of machine
learning models from data- to
patient-level . . . . . . . . . . . . . 23:1--23:??
Rui Filipe Ribeiro Jesus and
Ana Rodrigues and
Carlos Costa Unlocking AutoML: Enhancing Data with
Deep Learning Algorithms for Medical
Imaging . . . . . . . . . . . . . . . . 24:1--24:??
Hong-Linh Truong and
Ngoc Nhu Trang Nguyen TENSAI --- Practical and Responsible
Observability for Data Quality-aware
Large-scale Analytics . . . . . . . . . 25:1--25:??
Naeima Hamed and
Omer Rana and
Pablo Orozco-terWengel and
Beno\^\it Goossens and
Charith Perera A Comparison of Open Data Observatories 1:1--1:??
Toon Boeckling and
Antoon Bronselaer Cleaning data with Swipe . . . . . . . . 2:1--2:??
Adela Nedisan Videsjorden and
Arda Goknil and
Sagar Sen and
Erik Johannes Husom and
Phu Nguyen 3D-DaVa: Enhancing 3D Point Cloud Data
Reliability for Industrial Applications 3:1--3:??
Bismita Choudhury and
En-Tni Lin and
Jacqueline Speir A Quantitative Approach for Forensic
Footwear Quality Assessment using
Machine and Deep Learning . . . . . . . 4:1--4:??
Charini Nanayakkara and
Peter Christen and
Victor Christen Unsupervised Evaluation of Entity
Resolution . . . . . . . . . . . . . . . 5:1--5:??
Marco Comuzzi and
Jonghyeon Ko and
Fabrizio Maggi A Language to Model and Simulate Data
Quality Issues in Process Mining . . . . 6:1--6:36
Victor Christen and
Daniel Obraczka and
Marvin Hofer and
Martin Franke and
Erhard Rahm Graph Metrics-driven Record Cluster
Repair meets LLM-based active learning 7:1--7:25
Mathias Klier and
Andreas Obermeier and
Christian Sparn and
Torben Widmann Anomaly-based Assessment of Semantic
Consistency: Design and Evaluation of a
Novel Probability-based Metric in
Cooperation with a German Car
Manufacturer . . . . . . . . . . . . . . 8:1--8:24
Marco Rondina and
Antonio Vetr\`o and
Alessandro Fabris and
Gianmaria Silvello and
Gian Antonio Susto and
Marco Torchiano and
Juan Carlos De Martin Experience: Bridging Data Measurement
and Ethical Challenges with Extended
Data Briefs . . . . . . . . . . . . . . 9:1--9:22
Malick Ebiele and
Malika Bendechache and
Rob Brennan Quantitative Data Valuation Methods: a
Systematic Review and Taxonomy . . . . . 10:1--10:39
Shaohua Wan and
Carmen Bisogni and
Marco Zappatore and
Manoranjan Paul Editorial: Special Issue on Advanced
Artificial Intelligence Technologies for
Multimedia Big Data Quality . . . . . . 11:1--11:4
Abdullah Al-Ameri and
Waleed Al-Shammari and
Aniello Castiglione and
Michele Nappi and
Chiara Pero and
Muhammad Umer Student Academic Success Prediction
Using Learning Management Multimedia
Data With Convoluted Features and
Ensemble Model . . . . . . . . . . . . . 12:1--12:16
Zongda Wu and
Guoqi Lin and
Huawen Liu and
Jian Xie and
Guandong Xu and
Enhong Chen and
Gang Li How to Protect of Reader Preference
Privacy in Mobile Book Information
Services: a Technical Method . . . . . . 13:1--13:23
Kehui Tan and
Jiayang Yao and
Tianqi Pang and
Chenyou Fan and
Yu Song ELF: Educational LLM Framework of
Improving and Evaluating AI-generated
Content for Classroom Teaching . . . . . 14:1--14:23
Honghui Xu and
Zhipeng Cai and
Liran Ma and
Yingshu Li and
Daehee Seo and
Wei Li Overheard: Audio-based Integral Event
Inference . . . . . . . . . . . . . . . 15:1--15:17
Jungang Lou and
Xuhong Wu and
Kang Zhao and
Qing Shen and
Jinnan Yang DUTNG: Employing Dynamically Updating
Traffic Network Graph for Short-term
Traffic Flow Prediction . . . . . . . . 16:1--16:18
Xiaodong Wang and
Longyun Qi and
Xingshen Wei and
Weiping Zhu and
Haitao Jiang and
Zhitao Guan AED: a Novel Approach for Intrusion
Detection without Abnormal Samples in
Big Data Environment . . . . . . . . . . 17:1--17:20
Yanwei Zheng and
Yaling Li and
Changrui Li and
Taiqi Zhang and
Yifei Zou and
Dongxiao Yu Learning Attribute Attention and
Retrospect Location for Instance Object
Navigation . . . . . . . . . . . . . . . 18:1--18:20
Xinfu Liu and
Benze Wu and
Yirui Wu A Remote Sensing Image Classification
Method Based on Detail Attention
Sampling and Teacher-Student Network . . 19:1--19:19