MEPDaW'22 - Managing the Evolution and Preservation of the Data Web

8th MEPDaW Workshop at ISWC'22, October 23rd, 2022

visual banner

Proceedings are available through CEUR-WS

MEPDaW 2022 and the Covid-19 outbreak

The ISWC Organizing Committee and the Semantic Web Science Association (SWSA) have been closely monitoring the situation evolving around Covid-19. They have decided that ISWC 2022 together with its associated workshops will be fully remote events.

There is a vast and rapidly increasing quantity of scientific, corporate, government, and crowd-sourced data openly published on the Web. Open Data plays a catalyst role in the way structured information is exploited on a large scale. A traditional view of digitally preserving these datasets by “pickling and locking them away” for future use, like groceries, conflicts with their evolution. There are several approaches and frameworks (Linked Data Stack, PoolParty Suite, etc.) that manage a full life-cycle of the Data Web. More specifically, these solutions are expected to tackle major issues such as the synchronisation problem (monitoring changes), the curation problem (repairing data imperfections), the appraisal problem (assessing the quality of a dataset), the citation problem (how to cite a particular version of a dataset), the archiving problem (retrieving a specific version of a dataset), and the sustainability problem (preserving at scale, ensuring long-term access).

This workshop targets one of the emerging and fundamental problems in the Web, specifically the management and preservation of evolving knowledge graphs.

During the past seven years, the MEPDaW workshop series has been gathering researchers from the community around these challenges. So far the series successfully published more than 30 research efforts allowing more than 60 individual authors to present and share their ideas.

This workshop aims at addressing challenges and issues on managing Knowledge Graph evolution and preservation by providing a forum for researchers and practitioners to discuss, exchange and disseminate their ideas and work, to network and cross-fertilise new ideas.

Topics of interest include, but are not limited to themes related to the evolution and preservation of Knowledge Graphs:

  • Management and Governance of Evolution in Knowledge Graphs
    • Representation, maintenance of versions & changes (change representation and detection)
    • Efficient indexing and update of Knowledge Graphs
    • Synchronization of distributed versions
    • Federated Knowledge Graph governance
  • Reasoning and Prediction over Evolving Knowledge Graphs
    • Techniques for extracting and predicting evolving patterns
    • Maintenance of explicit and implicit knowledge
    • Trend analysis of evolving knowledge graphs
    • Concept drift detection and prediction over knowledge graphs
  • Visualization and Exploration of Evolving Knowledge Graphs
    • Visual summarization of evolving knowledge
    • User interfaces for exploring evolving knowledge graphs
    • Visualisation of quality in knowledge graphs
  • Preservation of Evolving Knowledge Graphs
    • Preservation of context, provenance and background knowledge
    • Efficient and effective solutions for preserving evolving knowledge graphs
    • Models for representing provenance and evolution
  • Quality of Evolving Knowledge Graphs
    • Change-detection based quality assessment and validation
    • Quality trends and prediction in evolving knowledge graphs
    • Hybrid approaches for knowledge graph curation
  • Evaluation of Knowledge Graph Evolution
    • Benchmarks for managing, predicting, and curating evolution
    • Real-world applications of evolving knowledge graphs
    • Automatic and human-based techniques for evaluating evolving knowledge graph

We envision four types of submissions covering the entire workshop topics spectrum:

  1. Research Papers (max 10 pages), presenting novel scientific research addressing topics of the workshop.
  2. Position & Demo papers (max 5 pages), encouraging papers describing significant work in progress, late breaking results or ideas of the domain, as well as functional systems relevant to the community.
  3. Industry & Use Case Presentations (max 5 pages), in which industry experts can present and discuss practical solutions, use case prototypes, best practices, etc. at any stage of implementation.
  4. Expression of Interest (max 2 pages), presenting a research topic, a work in progress, practical applications or needs, etc.

The proceedings of the workshops will be published in the CEUR-WS. Papers must be submitted in PDF according to the LLNCS format or in HTML. The PDF files must have all non-standard fonts embedded. Workshop submissions must be self-contained and in English.

All papers should be submitted to https://easychair.org/conferences/?conf=mepdaw2022.

  • Submission: August 26th, 2022 September 16th, 2022
  • Notification: October 14th, 2022
  • Camera-ready: October 21st, 2022
  • Presentation: October 23rd, 2022

All dates are given for 23:59 Anywhere on Earth.

Attending the workshop

MEPDaW will take place on Sunday 23rd of October between 10am and 2pm CEST. The WebEx link is https://tib-eu.webex.com/tib-eu-en/j.php?MTID=m1f46c4371cea3559036df707e8fa0a83 (Meeting number: 2734 460 6343 and Password: sMPgs8YnW38), provided by TIB. All hours are Central European Summer Time (CEST). Please, don't forget to register... ☺

Academic Keynote #1

Title: Querying the Web of data using sometimes available APIs
Professor Pieter Colpaert, IDLab, Ghent University - imec, Belgium
https://pietercolpaert.be/#me

Abstract: Keeping public APIs up and running is an art, both from the fund-raising perspective as from the technical perspective. As a consequence, projects that are only funded for a short amount of time, will often turn off their APIs when the project ends, turning all dependent applications useless. Linked Data Event Streams (LDES) aims to make data publishers focus on their core task of making the dataset available for reuse, and advocates a well preservable (what we’ll call “materializable”) API structure. Projects can then republish the LDES in an API of choice. In this talk, we argue that these APIs should use hypermedia and announce their existence in a metadata catalogue. That way, query processors like Comunica can automatically use them when they are available, and ignore them when they aren’t, but always guarantee that the question can be answered until the end of time.

Academic Keynote #2

Title: Fostering a Lively and Tenacious Web of Data
Professor Aidan Hogan, DCC, University of Chile & IMFD, Santiago, Chile
https://aidanhogan.com/

Abstract: While the Web of Data continues to mature, it can still suffer from inertia (being slow to change) and impermanence (losing track of the past). Addressing these two flaws in a meaningful way requires additional work to better understand and harness dynamics on the Web of Data, not only in terms of data, but also in terms of queries, links, websites, definitions, demands, etc. In this talk, we highlight key challenges relating to dynamics on the Web of Data. We exemplify issues for the Web of Data that may arise if such challenges are left neglected: stale or forgotten data, incorrect results, unchecked vandalism, biased conclusions, etc. We discuss research lines to address such challenges relating to representations, modelling, prediction, revision, synchronisation and preservation. Within these research lines we identify key trade-offs, a better understanding of which may help us to transition towards a more lively and tenacious Web of Data that is better equipped to serve a changing world.

Please note that the following times are according to the current CEST (Central European Summer Time), same as the actual time in Paris (France).

Agenda

Time (CEST)Title
At 10:00amAcademic Keynote Session 1
10:00am-10:10amOpening words
10:10am-10:50amQuerying the Web of data using sometimes available APIs
By Professor Pieter Colpaert, IDLab, Ghent University - imec, Belgium
10:50am-11:00amBreak
At 11:00amPaper Session I
11:00am-11:25amModelling & Analyzing Changes within LD source data
Alex Randles and Declan O'Sullivan
11:25am-11:50amEvent sourcing in Solid
Wout Slabbinck, Ruben Dedecker, Sindhu Vasireddy, Ruben Verborgh and Pieter Colpaert
11:50am-12:00pmBreak
At 12:00pmAcademic Keynote Session 2
12:00pm-12:50pmFostering a Lively and Tenacious Web of Data
By Professor Aidan Hogan, DCC, University of Chile & IMFD, Santiago, Chile
At 12:50pmPaper Session II
12:50pm-1:15pmWhat does it mean when your URIs are redirected? Examining identity and redirection in the LOD cloud
Idries Nasim, Shuai Wang, Joe Raad, Peter Bloem and Frank van Harmelen
1:15pm-1:40pmDescribing a network of live datasets with the SDS vocabulary
Arthur Vercruysse, Sitt Min Oo and Pieter Colpaert
At 1:40pmDiscussion & wrap-up

Organisers

  • Fabrizio Orlandi (ADAPT Centre, Trinity College Dublin, Ireland) is a Senior Research Fellow at Trinity College Dublin. His research focuses on knowledge management, Linked (Open) Data, Big Data technologies, data governance and personalisation. Prior to joining ADAPT he worked as post-doctoral researcher at Fraunhofer IAIS applying his research on large EU-funded and industry projects.
  • Damien Graux (Inria, France) is a researcher at Inria Sophia Antipolis based in the Wimmics group. He has been contributing to research efforts in Semantic Web technologies and focusing on distributed query evaluation and on designing complex transformation pipelines for heterogeneous Big Data. Prior to joining Inria, he had research positions at Trinity College Dublin (Ireland) and at Fraunhofer IAIS (Germany).
  • Emetis Niazmand (Leibniz Information Centre for Science and Technology (TIB) & Leibniz Universität Hannover, Germany) is a research assistant at the Scientific Data Management research group at TIB and Computer Science PhD student at Leibniz Universität Hannover. Her research interests include knowledge management, semantic web, and query processing over knowledge graphs.
  • Gabriela Ydler (L3S Forshungszentrum, Research Center, Germany) is a student assistant at the Scientific Data Management research group at TIB. She studied at the University Santa Maria (Venezuela) and majored in Social Communications, specializing in corporate communications. She is currently supporting the group in the area of external communications and finishing her bachelor degree in Anthropology and Geography at Universität Bonn.
  • Maria-Esther Vidal (Leibniz Information Centre for Science and Technology (TIB) & Leibniz University, Hannover, Germany) is the head of the Scientific Data Management group at the Leibniz Information Centre for Science and Technology (TIB) and a full professor at the Leibniz University of Hannover. Her interests include data and knowledge management, knowledge representation, big data, and semantic web.

Advisory Board

  • Philippe Cudré-Mauroux, eXascale Infolab, University of Fribourg, Switzerland
  • Jeremy Debattista, TopQuadrant Inc
  • Javier D. Fernández, Information Architect at Roche, Switzerland
  • Fabien Gandon, Inria, Université Côte d’Azur, CNRS, I3S, France

Program Committee

Name Affiliation
Natanael ArndtEccenca GmbH, Germany
David Chaves-FragaUniversidad Politécnica de Madrid, Spain
Ioannis ChrysakisFORTH-ICS, Greece; and Ghent University - IDLab – imec, Belgium
Pieter ColpaertGhent University, Belgium
Marcos Da SilveiraLIST, Luxembourg
Christophe DebruyneUniversité de Liège, Belgium
Javier D. FernándezF. Hoffmann-La Roche AG, Switzerland
Pavel KlinovStardog Union, Germany
Harshvardhan J. PanditADAPT Centre - Trinity College Dublin, Ireland
George PapastefanatosIMIS / RC "Athena", Greece
Iliana PetrovaInria, France
Philipp D. RohdeTIB Leibniz Information Centre for Science and Technology, Germany
Fatiha SaïsLRI & Paris Saclay University, France
Ruben Taelman Ghent University – imec, Belgium

Important Dates (AoE Time)

  • Submission (EasyChair): September 16th, 2022
  • Notification: October 14th, 2022
  • Camera-ready: October 21st, 2022
  • Presentation: October 23rd, 2022

Event Location

MEPDaW 2022 is co-located with ISWC 2022.

Past Editions

The main focus of the workshop has always been on the fundamental problem of managing the evolution and preservation of the Data Web.

  1. 2015 — MEPDaW was held for the first time at ESWC 2015.
  2. 2016 — MEPDaW was held at ESWC 2016.
  3. 2017 — MEPDaW was held at ESWC 2017.
  4. 2018 — MEPDaW was held at ESWC 2018.
  5. 2019 — MEPDaW was held at the Web Conference 2019.
  6. 2020 — MEPDaW was held at ISWC 2020.
  7. 2021 — MEPDaW was held at ISWC 2021.