MEPDaW'22 - Managing the Evolution and Preservation of the Data Web

8th MEPDaW Workshop at ISWC'22, October 23rd, 2022

Proceedings are available through CEUR-WS

https://ceur-ws.org/Vol-3339/

MEPDaW 2022 and the Covid-19 outbreak

The ISWC Organizing Committee and the Semantic Web Science Association (SWSA) have been closely monitoring the evolving Covid-19 situation. They have decided that ISWC 2022, together with its associated workshops, will be a fully remote event.

There is a vast and rapidly increasing quantity of scientific, corporate, government, and crowd-sourced data openly published on the Web. Open Data acts as a catalyst for the way structured information is exploited on a large scale. A traditional view of digitally preserving these datasets by “pickling and locking them away” for future use, like groceries, conflicts with their evolution. There are several approaches and frameworks (Linked Data Stack, PoolParty Suite, etc.) that manage the full life-cycle of the Data Web. More specifically, these solutions are expected to tackle major issues such as the synchronisation problem (monitoring changes), the curation problem (repairing data imperfections), the appraisal problem (assessing the quality of a dataset), the citation problem (how to cite a particular version of a dataset), the archiving problem (retrieving a specific version of a dataset), and the sustainability problem (preserving at scale, ensuring long-term access).

This workshop targets one of the emerging and fundamental problems in the Web, specifically the management and preservation of evolving knowledge graphs.

During the past seven years, the MEPDaW workshop series has gathered researchers from the community around these challenges. So far, the series has published more than 30 research efforts, allowing more than 60 individual authors to present and share their ideas.

This workshop aims to address the challenges of managing Knowledge Graph evolution and preservation by providing a forum for researchers and practitioners to discuss, exchange and disseminate their ideas and work, to network, and to cross-fertilise new ideas.

Topics of interest include, but are not limited to, the following themes related to the evolution and preservation of Knowledge Graphs:

Management and Governance of Evolution in Knowledge Graphs

Representation, maintenance of versions & changes (change representation and detection)
Efficient indexing and update of Knowledge Graphs
Synchronization of distributed versions
Federated Knowledge Graph governance

Reasoning and Prediction over Evolving Knowledge Graphs

Techniques for extracting and predicting evolving patterns
Maintenance of explicit and implicit knowledge
Trend analysis of evolving knowledge graphs
Concept drift detection and prediction over knowledge graphs

Visualization and Exploration of Evolving Knowledge Graphs

Visual summarization of evolving knowledge
User interfaces for exploring evolving knowledge graphs
Visualisation of quality in knowledge graphs

Preservation of Evolving Knowledge Graphs

Preservation of context, provenance and background knowledge
Efficient and effective solutions for preserving evolving knowledge graphs
Models for representing provenance and evolution

Quality of Evolving Knowledge Graphs

Change-detection based quality assessment and validation
Quality trends and prediction in evolving knowledge graphs
Hybrid approaches for knowledge graph curation

Evaluation of Knowledge Graph Evolution

Benchmarks for managing, predicting, and curating evolution
Real-world applications of evolving knowledge graphs
Automatic and human-based techniques for evaluating evolving knowledge graphs

We envision four types of submissions covering the entire workshop topics spectrum:

Research Papers (max 10 pages), presenting novel scientific research addressing topics of the workshop.
Position & Demo papers (max 5 pages), encouraging papers describing significant work in progress, late breaking results or ideas of the domain, as well as functional systems relevant to the community.
Industry & Use Case Presentations (max 5 pages), in which industry experts can present and discuss practical solutions, use case prototypes, best practices, etc. at any stage of implementation.
Expression of Interest (max 2 pages), presenting a research topic, a work in progress, practical applications or needs, etc.

The proceedings of the workshop will be published on CEUR-WS. Papers must be submitted in PDF according to the LLNCS format or in HTML. The PDF files must have all non-standard fonts embedded. Workshop submissions must be self-contained and in English.

All papers should be submitted to https://easychair.org/conferences/?conf=mepdaw2022.

Submission: September 16th, 2022 (extended from August 26th, 2022)
Notification: October 14th, 2022
Camera-ready: October 21st, 2022
Presentation: October 23rd, 2022

All deadlines are at 23:59 Anywhere on Earth (AoE).

Attending the workshop

MEPDaW will take place on Sunday, October 23rd, between 10am and 2pm CEST. The WebEx link is https://tib-eu.webex.com/tib-eu-en/j.php?MTID=m1f46c4371cea3559036df707e8fa0a83 (Meeting number: 2734 460 6343 and Password: sMPgs8YnW38), provided by TIB. All hours are Central European Summer Time (CEST). Please don't forget to register... ☺

Academic Keynote #1

Title: Querying the Web of data using sometimes available APIs
Professor Pieter Colpaert, IDLab, Ghent University - imec, Belgium
https://pietercolpaert.be/#me

Abstract: Keeping public APIs up and running is an art, from both the fund-raising perspective and the technical perspective. As a consequence, projects that are only funded for a short amount of time will often turn off their APIs when the project ends, rendering all dependent applications useless. Linked Data Event Streams (LDES) aims to make data publishers focus on their core task of making the dataset available for reuse, and advocates a well preservable (what we’ll call “materializable”) API structure. Projects can then republish the LDES in an API of choice. In this talk, we argue that these APIs should use hypermedia and announce their existence in a metadata catalogue. That way, query processors like Comunica can automatically use them when they are available, and ignore them when they aren’t, but always guarantee that the question can be answered until the end of time.

Academic Keynote #2

Title: Fostering a Lively and Tenacious Web of Data
Professor Aidan Hogan, DCC, University of Chile & IMFD, Santiago, Chile
https://aidanhogan.com/

Abstract: While the Web of Data continues to mature, it can still suffer from inertia (being slow to change) and impermanence (losing track of the past). Addressing these two flaws in a meaningful way requires additional work to better understand and harness dynamics on the Web of Data, not only in terms of data, but also in terms of queries, links, websites, definitions, demands, etc. In this talk, we highlight key challenges relating to dynamics on the Web of Data. We exemplify issues for the Web of Data that may arise if such challenges are left neglected: stale or forgotten data, incorrect results, unchecked vandalism, biased conclusions, etc. We discuss research lines to address such challenges relating to representations, modelling, prediction, revision, synchronisation and preservation. Within these research lines we identify key trade-offs, a better understanding of which may help us to transition towards a more lively and tenacious Web of Data that is better equipped to serve a changing world.

Please note that the following times are given in CEST (Central European Summer Time), the local time in Paris (France).

Agenda

Time (CEST)	Title
At 10:00am	Academic Keynote Session 1
10:00am-10:10am	Opening words
10:10am-10:50am	Querying the Web of data using sometimes available APIs By Professor Pieter Colpaert, IDLab, Ghent University - imec, Belgium
10:50am-11:00am	Break
At 11:00am	Paper Session I
11:00am-11:25am	Modelling & Analyzing Changes within LD source data By Alex Randles and Declan O'Sullivan
11:25am-11:50am	Event sourcing in Solid By Wout Slabbinck, Ruben Dedecker, Sindhu Vasireddy, Ruben Verborgh and Pieter Colpaert
11:50am-12:00pm	Break
At 12:00pm	Academic Keynote Session 2
12:00pm-12:50pm	Fostering a Lively and Tenacious Web of Data By Professor Aidan Hogan, DCC, University of Chile & IMFD, Santiago, Chile
At 12:50pm	Paper Session II
12:50pm-1:15pm	What does it mean when your URIs are redirected? Examining identity and redirection in the LOD cloud By Idries Nasim, Shuai Wang, Joe Raad, Peter Bloem and Frank van Harmelen
1:15pm-1:40pm	Describing a network of live datasets with the SDS vocabulary By Arthur Vercruysse, Sitt Min Oo and Pieter Colpaert
At 1:40pm	Discussion & wrap-up

Organisers

Fabrizio Orlandi (ADAPT Centre, Trinity College Dublin, Ireland) is a Senior Research Fellow at Trinity College Dublin. His research focuses on knowledge management, Linked (Open) Data, Big Data technologies, data governance and personalisation. Prior to joining ADAPT, he worked as a post-doctoral researcher at Fraunhofer IAIS, applying his research to large EU-funded and industry projects.
Damien Graux (Inria, France) is a researcher at Inria Sophia Antipolis based in the Wimmics group. He contributes to research efforts in Semantic Web technologies, focusing on distributed query evaluation and on designing complex transformation pipelines for heterogeneous Big Data. Prior to joining Inria, he held research positions at Trinity College Dublin (Ireland) and at Fraunhofer IAIS (Germany).
Emetis Niazmand (Leibniz Information Centre for Science and Technology (TIB) & Leibniz Universität Hannover, Germany) is a research assistant at the Scientific Data Management research group at TIB and a Computer Science PhD student at Leibniz Universität Hannover. Her research interests include knowledge management, semantic web, and query processing over knowledge graphs.
Gabriela Ydler (L3S Forschungszentrum, Research Center, Germany) is a student assistant at the Scientific Data Management research group at TIB. She studied at the University Santa Maria (Venezuela) and majored in Social Communications, specializing in corporate communications. She is currently supporting the group in the area of external communications and finishing her bachelor's degree in Anthropology and Geography at Universität Bonn.
Maria-Esther Vidal (Leibniz Information Centre for Science and Technology (TIB) & Leibniz University, Hannover, Germany) is the head of the Scientific Data Management group at the Leibniz Information Centre for Science and Technology (TIB) and a full professor at the Leibniz University of Hannover. Her interests include data and knowledge management, knowledge representation, big data, and semantic web.

Advisory Board

Philippe Cudré-Mauroux, eXascale Infolab, University of Fribourg, Switzerland
Jeremy Debattista, TopQuadrant Inc
Javier D. Fernández, Information Architect at Roche, Switzerland
Fabien Gandon, Inria, Université Côte d’Azur, CNRS, I3S, France

Program Committee

Name	Affiliation
Natanael Arndt	Eccenca GmbH, Germany
David Chaves-Fraga	Universidad Politécnica de Madrid, Spain
Ioannis Chrysakis	FORTH-ICS, Greece; and Ghent University - IDLab – imec, Belgium
Pieter Colpaert	Ghent University, Belgium
Marcos Da Silveira	LIST, Luxembourg
Christophe Debruyne	Université de Liège, Belgium
Javier D. Fernández	F. Hoffmann-La Roche AG, Switzerland
Pavel Klinov	Stardog Union, Germany
Harshvardhan J. Pandit	ADAPT Centre - Trinity College Dublin, Ireland
George Papastefanatos	IMIS / RC "Athena", Greece
Iliana Petrova	Inria, France
Philipp D. Rohde	TIB Leibniz Information Centre for Science and Technology, Germany
Fatiha Saïs	LRI & Paris Saclay University, France
Ruben Taelman	Ghent University – imec, Belgium
