MEPDaW'21 - Managing the Evolution and Preservation of the Data Web

7th MEPDaW Workshop at ISWC'21, October 25th, 2021 (virtual event)

Proceedings are available through CEUR-WS

MEPDaW 2021 and the Covid-19 outbreak

The ISWC Organizing Committee and the Semantic Web Science Association (SWSA) have been closely monitoring the situation evolving around Covid-19. They have decided that ISWC 2021 together with its associated workshops will be virtual events.

There is a vast and rapidly increasing quantity of scientific, corporate, government, and crowd-sourced data openly published on the Web. Open Data plays a catalyst role in the way structured information is exploited on a large scale. A traditional view of digitally preserving these datasets by “pickling and locking them away” for future use, like groceries, conflicts with their evolution. There are several approaches and frameworks (Linked Data Stack, PoolParty Suite, etc.) that manage a full life-cycle of the Data Web. More specifically, these solutions are expected to tackle major issues such as the synchronisation problem (monitoring changes), the curation problem (repairing data imperfections), the appraisal problem (assessing the quality of a dataset), the citation problem (how to cite a particular version of a dataset), the archiving problem (retrieving a specific version of a dataset), and the sustainability problem (preserving at scale, ensuring long-term access).

This workshop targets one of the emerging and fundamental problems in the Web, specifically the management and preservation of evolving knowledge graphs.

During the past six years, the MEPDaW workshop series has been gathering researchers from the community around these challenges. So far, the series has published more than 25 research efforts, allowing more than 50 individual authors to present and share their ideas.

This workshop aims at addressing challenges and issues on managing Knowledge Graph evolution and preservation by providing a forum for researchers and practitioners to discuss, exchange and disseminate their ideas and work, to network and cross-fertilise new ideas.

Topics of interest include, but are not limited to, themes related to the evolution and preservation of Knowledge Graphs:

  • Management and Governance of Evolution in Knowledge Graphs
    • Representation, maintenance of versions & changes (change representation and detection)
    • Efficient indexing and update of Knowledge Graphs
    • Synchronization of distributed versions
    • Federated Knowledge Graph governance
  • Reasoning and Prediction over Evolving Knowledge Graphs
    • Techniques for extracting and predicting evolving patterns
    • Maintenance of explicit and implicit knowledge
    • Trend analysis of evolving knowledge graphs
    • Concept drift detection and prediction over knowledge graphs
  • Visualization and Exploration of Evolving Knowledge Graphs
    • Visual summarization of evolving knowledge
    • User interfaces for exploring evolving knowledge graphs
    • Visualisation of quality in knowledge graphs
  • Preservation of Evolving Knowledge Graphs
    • Preservation of context, provenance and background knowledge
    • Efficient and effective solutions for preserving evolving knowledge graphs
    • Models for representing provenance and evolution
  • Quality of Evolving Knowledge Graphs
    • Change-detection based quality assessment and validation
    • Quality trends and prediction in evolving knowledge graphs
    • Hybrid approaches for knowledge graph curation
  • Evaluation of Knowledge Graph Evolution
    • Benchmarks for managing, predicting, and curating evolution
    • Real-world applications of evolving knowledge graphs
    • Automatic and human-based techniques for evaluating evolving knowledge graphs

We envision four types of submissions covering the entire workshop topics spectrum:

  1. Research Papers (max 10 pages), presenting novel scientific research addressing topics of the workshop.
  2. Position & Demo papers (max 5 pages), encouraging papers describing significant work in progress, late breaking results or ideas of the domain, as well as functional systems relevant to the community.
  3. Industry & Use Case Presentations (max 5 pages), in which industry experts can present and discuss practical solutions, use case prototypes, best practices, etc. at any stage of implementation.
  4. Expression of Interest (max 2 pages), presenting a research topic, a work in progress, practical applications or needs, etc.

The proceedings of the workshop will be published in CEUR-WS. Papers must be submitted in PDF according to the LLNCS format or in HTML. The PDF files must have all non-standard fonts embedded. Workshop submissions must be self-contained and in English.

All papers should be submitted to https://easychair.org/conferences/?conf=mepdaw2021.

  • Submission: September 3rd, 2021, AoI (extended; originally August 20th, 2021)
  • Notification: September 24th, 2021
  • Camera-ready: October 18th, 2021
  • Presentation: October 25th, 2021

Attending the workshop

MEPDaW will take place on the afternoon of Monday, October 25th, between 3pm and 6:30pm CET. The Zoom link is https://zoom.us/j/97654057442 (provided by ISWC'21). All times are Central European Time (CET). (Please don't forget to register... ☺)

Academic Keynote

Title: How can we fix the Web of Data?
Professor Katja Hose, Aalborg University, Department of Computer Science, Denmark
http://people.cs.aau.dk/~khose/About_me.html

Abstract: When querying knowledge, we typically consider the Web of Data to be a static point of reference that is always available and that never changes. However, when actually running queries "in the wild", we encounter a broad range of problems, ranging from the (un)availability of entire knowledge graphs (and their SPARQL endpoints) to outdated references between knowledge graphs caused by unilateral updates. Beyond that, we are almost entirely missing out on the availability of previous versions of knowledge graphs and provenance metadata about them. In this talk, I will highlight some of the solutions we developed to mitigate these problems. I will first show how to keep knowledge available for continuous and scalable querying. Afterwards, I will present an approach that enables community-driven updates so that mistakes can be corrected or missing information can be added. Finally, I will highlight what we can learn from RDF archiving solutions to better support evolving knowledge graphs.

Industrial Keynote

Title: RDF version management of Clinical Data Standards
Javier D. Fernández, information architect at F. Hoffmann-La Roche AG, Switzerland
Nelia Lasierra, information architect at F. Hoffmann-La Roche AG, Switzerland

Please note that the following times are given in CET (Central European Time), the local time in Paris (France).

Agenda

Time (CET)       Title
At 3:00pm        Academic Keynote Session
3:00pm-3:05pm    Opening words
3:05pm-3:50pm    How can we fix the Web of Data?
                 By Professor Katja Hose from Aalborg University, Denmark
3:50pm-4:00pm    Break
At 4:00pm        Paper Session I
4:00pm-4:20pm    De-icing federated SPARQL pipelines: a method for assessing the "freshness" of result sets
                 Damien Graux, Fabrizio Orlandi and Declan O'Sullivan
4:20pm-4:40pm    Interactive multimedia visualization for exploring and fixing a multi-dimensional metadata base of popular musics
                 Maroua Tikat, Marco Winckler and Michel Buffa
4:40pm-5:00pm    TrieDF: Efficient In-memory Indexing for Metadata-augmented RDF
                 Olivier Pelgrin, Luis Galárraga and Katja Hose
5:00pm-5:05pm    Break
At 5:05pm        Industrial Keynote Session
5:05pm-5:35pm    RDF version management of Clinical Data Standards
                 By Javier D. Fernández & Nelia Lasierra from F. Hoffmann-La Roche AG, Switzerland
At 5:35pm        Paper Session II
5:35pm-5:55pm    Don’t Stop Thinking about Tomorrow: Use Cases Demonstrating the Asymmetric Impact of Contextual Temporal Links in Knowledge Graph Evolution
                 K. Krasnow Waterman
5:55pm-6:15pm    UpLOD: A Tool for Inconsistent Links Repairment in the LOD
                 André Regino, Enio de Jesus Pontes Monteiro, Andressa Cristina dos Santos and Julio Cesar Dos Reis
At 6:15pm        Discussion & wrap-up

Organisers

  • Fabrizio Orlandi (ADAPT Centre, Trinity College Dublin, Ireland) is a Senior Research Fellow at Trinity College Dublin. His research focuses on knowledge management, Linked (Open) Data, Big Data technologies, data governance and personalisation. Prior to joining ADAPT, he worked as a post-doctoral researcher at Fraunhofer IAIS, applying his research to large EU-funded and industry projects.
  • Damien Graux (Inria, France) is a researcher at Inria Sophia Antipolis based in the Wimmics group. He has been contributing to research efforts in Semantic Web technologies and focusing on distributed query evaluation and on designing complex transformation pipelines for heterogeneous Big Data. Prior to joining Inria, he had research positions at Trinity College Dublin (Ireland) and at Fraunhofer IAIS (Germany).
  • Julio Cesar dos Reis (Institute of Computing of the University of Campinas: UNICAMP) is an associate professor at the Institute of Computing, UNICAMP, Brazil. He was formerly an assistant researcher at the Public Research Centre Henri Tudor, Luxembourg. He investigates engineering and alignment of ontologies, knowledge representation and evolution aspects in knowledge graphs.
  • Maria-Esther Vidal (Leibniz Information Centre for Science and Technology (TIB) & L3S, Germany) is the head of the Scientific Data Management group at the Leibniz Information Centre for Science and Technology (TIB) and L3S, Germany. Maria-Esther is also a full professor (retired) at Universidad Simón Bolívar (USB), Venezuela. Her interests include data and knowledge management, knowledge representation, big data, and semantic web.

Advisory Board

  • Philippe Cudré-Mauroux, eXascale Infolab, University of Fribourg, Switzerland
  • Jeremy Debattista, TopQuadrant Inc
  • Javier D. Fernández, Information Architect at Roche, Switzerland
  • Fabien Gandon, Inria, Université Côte d’Azur, CNRS, I3S, France
  • Axel Polleres, Vienna University of Economics and Business, Austria

Program Committee

Name                    Affiliation
Natanael Arndt          Leipzig University, Germany
Ioannis Chrysakis       FORTH-ICS, Greece; and Ghent University - IDLab – imec, Belgium
Pieter Colpaert         Ghent University, Belgium
Marcos Da Silveira      LIST, Luxembourg
Christophe Debruyne     Trinity College Dublin, Ireland
Javier D. Fernández     F. Hoffmann-La Roche AG, Switzerland
Luis Ibanez-Gonzalez    University of Southampton, England
Pavel Klinov            Stardog Union, Germany
Pierre Maillot          Inria, France
Harshvardhan J. Pandit  ADAPT Centre - Trinity College Dublin, Ireland
George Papastefanatos   IMIS / RC "Athena", Greece
Iliana Petrova          Inria, France
Fatiha Saïs             LRI & Paris Saclay University, France
Ruben Taelman           Ghent University – imec, Belgium

Event Location

MEPDaW 2021 is co-located with ISWC 2021.

Past Editions

The main focus of the workshop has always been on the fundamental problem of managing the evolution and preservation of the Data Web.

  1. 2015 — MEPDaW was held for the first time at ESWC 2015.
  2. 2016 — MEPDaW was held at ESWC 2016.
  3. 2017 — MEPDaW was held at ESWC 2017.
  4. 2018 — MEPDaW was held at ESWC 2018.
  5. 2019 — MEPDaW was held at the Web Conference 2019.
  6. 2020 — MEPDaW was held at ISWC 2020.