Dagstuhl seminar on Knowledge Graphs

Returning home from a very interesting Dagstuhl seminar on Knowledge Graphs, it is time to collect some thoughts. In the seminar we developed a shared understanding of the current state of the art in Knowledge Graphs and more importantly mapped out the road ahead. The format of the seminar consisted of 5-min pitches on relevant topics, and then followed up by group discussions, to be summarised and consolidated in an upcoming report. In the spirit of true societal (and research) progress a large part of the seminar was devoted to discussing grand challenges in our society, where in this case the focus was on those where we believe Knowledge Graphs can play a crucial role in addressing the challenges. In the upcoming report those will be discussed in depth, but examples of such challenges include interaction between humanity and machines, the kind of explainable and human-centred AI that is required in various societal domains, such as medicine, keeping up with knowledge evolution and rapidly changing information in our society, and addressing information interoperability at scale.

The feeling I in particular take with me from this seminar is that we have a unique opportunity to really facilitate interaction and integration of major results from different areas, and that Knowledge Graphs may be the key that finally makes this possible at scale.

However, taking a step back, one may first ask the question: What is a Knowledge Graph? And how does it relate to previous objects of study, such as Linked Data or Ontologies? Although this was discussed at length in the seminar, my personal viewpoint is that we do not really need a strict scientific definition. Potentially a descriptive one could be useful, but even just exemplifying what we mean when talking about Knowledge Graphs should be enough. To me a Knowledge Graph is about two things: knowledge that is represented in some graph-like format, preferably machine readable, and (can be) used as the source of knowledge/information/data in some application. This subsumes both ontologies, Linked Data, and all the various Knowledge Graphs proposed by large companies so far. Although Google were the ones to popularise the term a few years ago, it has been around also before that, and can even be traced back to ancient times (as some people pointed out in the seminar). However, that does not reduce the importance of the Google Knowledge Graph, both as a positive example and inspiration for others (i.e., Knowledge Graphs of “everything” can really work at scale), and as a popular explanation of the term, or could maybe even be seen as a revitalisation of the whole knowledge representation field.

So, how does it relate to existing fields then? Here we come back to my key take-away from the seminar – integration of research fields. I do not see Knowledge Graphs as a new field, nor as a renaming of some existing area, such as the Semantic Web or ontologies, but rather it is what emerges when you marry ontologies and Linked Data with property graphs and graph databases and the web. Or macine learning models with graph formats and methods for symbolic knowledge representation, e.g., to create explainable AI. Of course, that means that everything we learned so far in these individual fields is very valuable, e.g., ontology engineering, representation formats and standards etc., but it is when you marry that with results from other fields that 1+1 becomes 3, or even 10. So if you ask for the relation to ontologies, for instance, I would say that Knowlege Graphs is a generalisation, where any Semantic Web ontology can probably be considered to be a Knowledge Graph, but not every Knowledge Graph (probably just a few) will be an ontology.

 

Related to our own research in the Linköping University Semantic Web group, we do have some very valuable pieces of this puzzle to offer. In the knowledge representation area we have worked a lot on ontology engineering and ontology design patterns, and this is a valuable input also for creation of Knowledge Graphs. In particular the notion of design patterns I believe is very valuable also when creating generic Knowledge Graphs. Especially since patterns are not only intended as a technical development tool, but can also support understandability, interoperability, reuse, and act as a least common denominator when matching and integrating data and knowledge. Also recent work on ontology matching will be directly applicable to Knowledge Graph matching and integration, as well as the work on ontology evolution and stream reasoning and complex event processing, for managing highly dynamic data and knowledge. All of this is highly relevant when generalised from ontologies to general Knowledge Graphs, maybe even more relevant than for the specific case of ontologies.

Then of course a Knowledge Graph needs to be represented in some way, preferably using a machine readable format and in a language with some formal semantics. RDF is an obvious candidate for representing Knowledge Graphs on the web. However, so far the RDF community has been quite separated from the community around property graphs (and graph databases), in my opinion mainly due to the difficulties of directly representing property graphs in RDF. Also here the LiU group has something to offer, in the form of the proposals by Olaf Hartig on RDF and SPARQL extensions to bridge this gap (called RDF* and SPARQL*) as well as our research on graph data, and models for that, in general.

I hope this seminar will really become the starting point of something new. New research directions, and a more inclusive community (than maybe the Semantic Web community has been, in retrospect) around Knowledge Graps that embraces the need for integrating approaches from various other fields, embraces variety and complexity, and embraces dynamics.

Workshop on Ontology Design and Patterns held at ISWC 2017

On Saturday October 21st the 8th WOP workshop was run during ISWC2017 in Vienna, Austria. Eva Blomqvist has been the general chair of this workshop instance, and co-organisers included Oscar Corcho (UPM), David Carral (TU Dresden), Rinke Hoekstra (Elsevier) and Matthew Horridge (Stanford). The workshop program and other information can be found on the WOP2017 page.

Pascal Hitzler (Wright State University, US) kicked off the workshop with a very interesting keynote on next generation ontology engineering, listing a number of open problems and practical showstoppers towards better utilisation of ODPs in ontology engineering. Then we heard a number of exciting talks presenting the latest research around ODPs, and a number of actual ODPs were also described. The workshop concluded with a discussion session, conducted in three groups, focusing on different problems bought forth by the participants. The groups talked about meta-languages to describe ODPs and ODP usage, relations between ODPs, and the relation between ontologies and thesauri. Notes are on the WOP page linked above.

An interesting thing to note is that ODP research is getting broader, with two main tracks; one considering the classical view of ODPs as design patterns for ontologies, and the second one more focusing on ODPs as templates or macros for generating OWL from a specification. A lot of discussions at the workshop centered around how to use both these views together to allow users to benefit from both of them. Interest in ODPs is also not only an academic thing, in the audience there were several industry representatives from around the world. Overall, the workshop attracted a quite large audience, counting around 45 people during the main sessions. In the evening almost half of the attendants met again, for a social dinner at a local restaurant.

Anyone who is interested in updates on ODP research and future events is encouraged to join the ODP mailing list (a google group).

Eva Blomqvist invited speaker at the ESSENCE final conference

ESSENCE is an EU-funded Marie Curie network, ending in October 2017. As a final event the network arranged the International Conference on Computational Approaches to Diversity in Interaction and Meaning, in San Servolo, Venice, Italy, from 6-9 October 2017. Eva Blomqvist was one of the invited speakers in this conference, talking about managing diversity of ontologies on the Semantic Web by means of Ontology Design Patterns (ODP). An ODP is not only an aid for constructing ontologies, but also a means of identifying commonalities in ontologies, i.e., based on the way they model various aspects. So at a certain level of abstraction ODPs can constitute a shared level of understanding between ontologies using those same ODPs.

The slides of Eva’s talk are available here.

A tweet containing some nice pictures from the talk.

Karl Hammar defended his PhD thesis on Ontology Design Patterns

On September 29, Karl Hammar successfully defended his PhD thesis entitled “Content Ontology Design Patterns: Qualities, Methods, and Tools”. The thesis was first presented by the opponent, professor Harald Sack, FIZ, Karlsruhe, Germany, who then continued to discuss the research results and future ideas with Karl. The three members of the examining committee then continued the discussion with Karl, before unanimously deciding to approve his thesis and award the PhD degree to Karl. At which point we could all congratulate him to a an interesting defence and an excellent thesis. In particular, his three supervisors, Henrik Eriksson (LiU), Eva Blomqvist (LiU) and Vladimir Tarassov (JTH) were of course the first to congratulate him, followed by colleagues and family.

Karl Hammar’s research has aimed to combine quantitative and qualitative research methods, primarily based on five ontology engineering projects involving inexperienced ontologists, studying how Ontology Design Patterns (ODPs) can support that specific group of users. A series of ontology engineering workshops and surveys provided data about developer preferences regarding ODP features and quality, ODP usage methodology, and ODP tooling needs. Other data sources were ontologies and ODPs published on the web, which have been studied in detail. To evaluate tooling improvements, experimental approaches provided data from comparison of new tools and techniques against established alternatives.

The analysis of the gathered data resulted in a set of measurable quality indicators that cover aspects of ODP documentation, formal representation or axiomatisation, and usage by ontologists. These indicators highlight quality trade-offs: for instance, between ODP Learnability and Reusability, or between Functional Suitability and Performance Efficiency. These are things that ontology engineers need to keep in mind when using ODPs in their ontologies, and in particular if they are inexperienced ontologists. Furthermore, the results demonstrated a need for ODP tools that support three novel property specialisation strategies, and highlighted the preference of inexperienced developers for template-based ODP instantiation, neither of which were supported in prior tooling. The studies also resulted in improvements to ODP search engines based on ODP-specific attributes. Finally, the analysis showed that a specific ontology engineering methodology, the eXtreme Design (XD), should include guidance for developer roles and responsibilities in ontology engineering projects, suggestions on how to reuse existing ontology resources, and approaches for adapting XD to project-specific contexts. Karl therefore proposed a new version of the XD methodology, specifically covering these aspects.

The thesis can be found here.



 

 

 

 

 

 

 

Congratulations Karl!

LiU Semantic Web group at ESWC2017

This week a couple of us have been at ESWC2017 in Portoroz, Slovenia. Eva Blomqvist was the general chair of the conference this year, hence, this was the culmination of a whole year of hard work for her. Olaf Hartig is the proceedings chair (proceedings part 1 and 2). He could not attend the conference this year, but has done great job with the Springer proceedings, and the upcoming post-proceedings volume with poster and demo papers among other things. In addition to this, Karl Hammar, was one of the organisers of the Modular Ontology Modeling with Ontology Design Patterns tutorial, together with Pascal Hitzler, Adila A. Krisnadhi, Agnieszka Lawrynowicz and Monika Solanki. In particular, Karl ran the hands-on session with his tool for ODP-based modelling in WebProtégé (called XDP). Finally, Henrik Eriksson, presented our EU-funded project VALCRI in the project networking session, and in the poster session.

The overall conference was interesting as always, and included a lot of networking opportunities, as well as interesting work to take a closer look at. A quick summary of some of the major events:

Crosbie

Kevin Crosbie, from Ravenpack, the first keynote speaker talking about how to model events in order to use them for predicting financial markets. Very interesting talk, describing how Ravenpack work with their data products and apply technologies very similar to Semantic Web, although technically not using the W3C standards, such as RDF.

Panel

At the end of the first day, Aldo Gangemi chaired a panel about the future of academic publishing, discussing the challenges and opportunities that lie ahead. It is clear that something needs to be done about both the reviewing situation in our field, the open access issue, and we want more focus on “eating our own dog food”. The discussions were also related to the paper that later won the best student paper award, on Linked Data Notifications.

Sheridan

Second keynote speaker, John Sheridan, from the National Archives in the UK, described how the National Archives heavily rely on Semantic Web technologies and standards to solve their archiving tasks. However there are also challenges of course, which can hopefully be solved by working together: academia and society at large. Particularly interesting for us at LiU to hear that the National Archives is in great need of a better solution for modelling trust and uncertainty in their data, which could be a potential use case for the recent research results on RDF* and SPARQL* by Olaf Hartig.

Dinner

Nice conference dinner at the beach, and a chance for the general chair to thank all the people in the organising committee.

Poster

Poster session with lots of interesting interactions and discussion, here with Diego Reforgirato, who later won both the best poster and best demo awards.

Unfortunately, we did not take any picture of the last keynote, Lora Aroyo, who gave a very interesting keynote on the last day. She started with an overview of the evolution of the field, pointing out that studying and using people to acquire knowledge has always been a central part of our research. However, by over simplifying, and trying to fit every answer into yes/no categories, we can introduce wrong conclusions. She means that we need to be aware of ambiguity and diversity in opinions, that there is usually not one true answer, and instead turn that to our advantage. Lora showed a vector-based model to represent diversity in opinions.

Finally, Aldo Gangemi will be the next general chair of ESWC in 2018, and he made a series of interesting promises for the next year, among others: double-open review process, improvements in the online pre-prints of the proceedings and the dataset, a resources track and an industry session á la ISWC, and better music in the social events. We all wish him the best of luck with the next conference, and we are excited to see all the innovations next year!