9th MEPDaW Workshop at ISWC'23, November 6th (afternoon), 2023
There is a vast and rapidly increasing quantity of scientific, corporate, government, and crowd-sourced data openly published on the Web. Open Data plays a catalyst role in the way structured information is exploited on a large scale. A traditional view of digitally preserving these datasets by “pickling and locking them away” for future use, like groceries, conflicts with their evolution. There are several approaches and frameworks (Linked Data Stack, PoolParty Suite, etc.) that manage a full life-cycle of the Data Web. More specifically, these solutions are expected to tackle major issues such as the synchronisation problem (monitoring changes), the curation problem (repairing data imperfections), the appraisal problem (assessing the quality of a dataset), the citation problem (how to cite a particular version of a dataset), the archiving problem (retrieving a specific version of a dataset), and the sustainability problem (preserving at scale, ensuring long-term access).
This workshop targets one of the emerging and fundamental problems in the Web, specifically the management and preservation of evolving knowledge graphs.
During the past eight years, the MEPDaW workshop series has been gathering researchers from the community around these challenges. So far the series successfully published more than 50 research efforts allowing more than 60 individual authors to present and share their ideas.
This workshop aims at addressing challenges and issues on managing Knowledge Graph evolution and preservation by providing a forum for researchers and practitioners to discuss, exchange and disseminate their ideas and work, to network and cross-fertilise new ideas.
Topics of interest include, but are not limited to themes related to the evolution and preservation of Knowledge Graphs:
We envision four types of submissions covering the entire workshop topics spectrum:
The proceedings of the workshops will be published in the CEUR-WS. Papers must be submitted in PDF according to the CEURART 1-column format. The PDF files must have all non-standard fonts embedded. Workshop submissions must be self-contained and in English. Note: The review process is single-blind, no need for authors to submit anonymous articles.
All papers should be submitted to https://easychair.org/conferences/?conf=mepdaw2023.
MEPDaW will take place on Monday 6th of November afternoon between 2pm and 5:20pm. All hours are Athens time. (Please, don't forget to register and attend... ☺)
Title: Challenges in Data Management for Evolving Knowledge Graphs
David Chaves-Fraga, an assistant professor, University of Santiago de Compostela, Spain
Abstract: KGs are playing an increasingly significant role in scientific and industrial domains as they have demonstrated their potential to represent the convergence between data and knowledge using a graph data model. KGs provide a solution to data heterogeneity and enable the representation of fine-grained domain knowledge. Despite the enormous efforts made by researchers and practitioners, the reality is that many data management challenges still appear in the development of KG-driven ecosystems. The construction and maintenance of these ecosystems suffer in key aspects such as maintainability, sustainability, scalability, and transparency, and they become even more complex when data and knowledge evolve. In this talk, we will analyze what are the main challenges that need to be addressed to ensure scalable, transparent, and maintainable data management techniques for the construction of evolving knowledge graphs. We will exemplify all these problems through the EU Public Procurement Data Space, a real-world project where a decentralized KG ecosystem is being built to enhance the transparency of public procurement across Europe.
Bio: David Chaves-Fraga is an assistant professor at the University of Santiago de Compostela (USC, Spain), a senior researcher at the Center for Research in Intelligent Technologies (CiTIUS@USC), and also a research collaborator at the Declarative Languages and Artificial Intelligence Group (DTAI) at KU Leuven, Belgium. His work is mostly focused on automating and optimizing the construction of knowledge graphs from heterogeneous data on the web using declarative mapping rules. He currently co-chairs the W3C Community Group Knowledge Graph Construction, where they pursue the standardization of the RDF Mapping Language (RML), and he is the main researcher of the EU Public Procurement Data Space. He received his PhD in Artificial Intelligence at Ontology Engineering Group (Universidad Politécnica de Madrid) in 2021.
|At 2:00pm||Session I|
|2:05pm-2:55pm||Challenges in Data
Management for Evolving Knowledge Graphs
By Dr. David Chaves-Fraga from University of Santiago de Compostela, Spain
|2:55pm-3:20pm||The Need for Better RDF
Olivier Pelgrin, Ruben Taelman, Luis Galárraga and Katja Hose
|At 4:00pm||Session II|
|4:00pm-4:25pm||Studying Linked Data
Accessibility Healthiness for the Long Tail of the
Johannes Frey, Marvin Hofer and Sebastian Hellmann
|4:25pm-4:50pm||SPARQL Statement Annotations
for Temporal Metadata in the Dydra RDF Store
James Anderson and Vimal Kumar
|4:50pm-5:15pm||Lenti: An Adaptive
Statistical Approach for Identifying Task-Specific
Data Quality Measures
|At 5:15pm||Discussion & wrap-up|
|David Chaves-Fraga||UPM, Spain|
|Pieter Colpaert||Ghent University, Belgium|
|Marcos Da Silveira||LIST, Luxembourg|
|Christophe Debruyne||Trinity College Dublin, Ireland|
|Javier D. Fernández||F. Hoffmann-La Roche AG, Switzerland|
|Pierre Maillot||Inria, France|
|Harshvardhan J. Pandit||ADAPT Centre - Trinity College Dublin, Ireland|
|George Papastefanatos||IMIS / RC "Athena", Greece|
|Iliana Petrova||Inria, France|
|Philipp D. Rohde||TIB, Germany|
|Ruben Taelman||Ghent University – imec, Belgium|
The main focus of the workshop has always been on the fundamental problem of managing the evolution and preservation of the Data Web.