Workshop: Build a MarkLogic semantic search and discovery hub in under 3 hours


Date: November 7th, 2018
Time: 9:30 to 12:30 hours
Location: Beeld en Geluid, BENG LAB 1, Hilversum
Hosted by: Edward Thomas, Emil Zegers & Bas van Geffen

Data integration is time-consuming and difficult, right? What if you could take unstructured data, enrich it and combine it with structured and semantic data to build a working semantic search and discovery hub in under 3 hours? In the morning, directly before the main PLDN Linked Data conference, we invite you to this hands-on workshop. You will build a semantic data hub that combines structured, unstructured and semantic data in a single application using the MarkLogic database, which can be combined with an ontology management tool such as PoolParty, Smartlogic, TopBraid or Protégé. At the end of the workshop, participants will have a working semantic data hub and a good introduction to working with the technology. The workshop, which was previously held at the Semantics 2018 conference in Vienna, will be conducted in English, but Dutch-speaking support will be available.

Agenda

Data integration is a major theme across industries, academia, government, etc. Organisations of all kinds have realised that there is tremendous value in the data they create, collect and hold. But that data is more than likely spread across silos and stored in relational data models and/or documents. This proliferation of silos means that all data integration projects involve data in multiple shapes and sizes.

Semantics is becoming increasingly popular for simplifying complicated integration processes such as data harmonisation, mastering and discovery. But just as most organisations cannot convert all of their data to a fixed relational schema, neither can they turn all of their data into RDF. A multi-model approach, in which data of varying formats can be combined, lowers the barrier to entry for introducing semantic technologies into these organisations.

A second challenge to successful data integration is the lack of good metadata in legacy content. The idea of manually re-classifying legacy content is an almost insurmountable obstacle for most organisations. Content enrichment and semantics are often the most effective solution to this problem.

In this workshop we will show you how to overcome these obstacles using only MarkLogic software combined with an ontology management tool.

This is a hands-on session. Participants should bring their laptops to the session. All software and data will be provided on thumb drives.

  • Short introduction to MarkLogic
  • Overview of the semantic data hub design pattern
  • Software installation and setup
  • Ingest data from several sources into MarkLogic 9
  • Enrich unstructured data
  • Build a semantic search application

Participants will leave the workshop with a working semantic data hub application, the MarkLogic DBMS and ODH Framework code.

Registration

Registration for this workshop is closed.