Data Lifecycle Ontology

IRI:
http://aligned-project.eu/ontologies/dlo
Date:
11-02-2015
Current version:
2.12.000
Authors:
Bojan Božić (bozicb@scss.tcd.ie)
Contributors:
Kevin C. Feeney (kevin.feeney@cs.tcd.ie)
Rob Brennan (rob.brennan@cs.tcd.ie)
Publisher:
Trinity College Dublin
Imported Ontologies:
http://protege.stanford.edu/plugins/owl/dc/protege-dc.owl (visualise it with LODE)
http://www.w3.org/ns/dcat (visualise it with LODE)
http://www.w3.org/ns/prov-o-20130430 (visualise it with LODE)
Other visualisation:
Ontology source

Abstract

This ontology provides a description of the data lifecycle for Linked Data. It captures various processes involved in the lifecycle of data and answeres the following questions: - What lifecycle stage is a specific dataset or data item currently in? - What is the next lifecycle stage for a particular data item (workflows)? - What is the appropriate widget or form to display this data item in for a specific user role, given the data item’s state (lifecycle stage)? - What is the context for a specific data item (dataset name/URI/meta-data URI, PROV records, …)? - Which agents, processes, and entities are involved in a lifecycle run?

Table of Content

  1. Introduction
  2. Classes
  3. Object Properties
  4. Namespace Declarations

Introduction

The purpose of the Data Lifecycle Ontology is to provide a set of conceptual entities, agents, activities, and roles to represent the general data engineering process. Furthermore, it is the basis for deriving specific domain ontologies which represent lifecycles of concrete data engineering projects such as DBpedia or Seshat. DLO uses the W3C PROV ontology represented by the classes Role, Person, Entity, and Activity. It uses the Process class which is derived from Activity to implement the Linked Data Stack lifecycle stages as subclasses. This allows the user to represent linked open data activities in the data lifecycle metamodel. In addition datasets, data sources and data repositories have been modelled. For datasets it imports the W3C Data Catalog Vocabulary (DCAT) definition of a dataset as it is a broad definition that goes beyond representing only RDF-based datasets. The W3C PROV ontology is available at http://www.w3.org/TR/prov-o/. The concepts defined in the LOD2 project are available at http://stack.lod2.eu/blog/.

Classes

Authoringc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Authoring

The LOD2 Stack facilitates the authoring of rich semantic knowledge bases, by leveraging Semantic Wiki technology, the WYSIWYM paradigm (What You See Is What You Mean) and distributed social, semantic collaboration and networking techniques.
has super-classes
Data Lifecycle Processc

Classificationc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Classification

Linked Data on the Web is mainly raw instance data. For data integration, fusion, search and many other applications, however, we need this raw instance data to be classified into taxonomies. In the LOD2 stack, semi-automatic components for this purpose are included.
has super-classes
Data Lifecycle Processc

Data Artifactc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataArtifact

An artifact is a process-oriented item such as a design or report used in the data lifecycle.
has super-classes
Data Entityc
has sub-classes
Test Casec, Test Case Resultc

Data Engineerc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataEngineer

Data engineers are the designers, builders and managers of an information infrastructure. They develop the architecture that helps analyze and process data in the way the organization needs it. And they make sure those systems are performing smoothly. Data science is a team sport.
has super-classes
Data Process Personc

Data Entityc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataEntity

A class for general data entities.
has super-classes
entity
has sub-classes
Data Artifactc, Data Sourcec, Repositoryc, datasetc
is in range of
consumesop, producesop

Data Lifecycle Processc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataLifecycleProcess

A general class for describing specific steps during the processing of linked data.
has super-classes
activity
has sub-classes
Authoringc, Classificationc, Evolution/Repairc, Extractionc, Interlinkingc, Quality Analysisc, Search/Browsing/Explorationc, Storagec
is in domain of
consumesop, has sub processop, is supported byop, producesop
is in range of
has sub processop, initiatesop, is responsible forop, supportsop

Data Process Personc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataProcessPerson

A person who is involved in the data processing lifecycle.
has super-classes
person
has sub-classes
Data Engineerc, Domain Expertc, System Administratorc, Userc
is in domain of
initiatesop

Data Software Agentc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataSoftwareAgent

A specific software agent involved in the data lifecycle.
has super-classes
software agent
is in domain of
supportsop

Data Sourcec back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DataSource

A data source defines where data comes from.
has super-classes
Data Entityc

datasetc back to ToC or Class ToC

IRI: http://dataid.dbpedia.org/ns/core#Dataset

has super-classes
Data Entityc

Domain Expertc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#DomainExpert

A person who is an authority in a particular area or topic. The term domain expert is frequently used in expert systems software development, and there the term always refers to the domain other than the software domain.
has super-classes
Data Process Personc

Evolution/Repairc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Evolution

Data on the Web is dynamic. We need to facilitate the evolution of data while keeping things stable. Changes and modifications to knowledge bases, vocabularies and ontologies should be transparent and observable. The LOD2 Stack comprises methods to spot problems in knowledge bases and to automatically suggest repair strategies.
has super-classes
Data Lifecycle Processc

Extractionc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Extraction

Gathering data from unstructured, semi-structured, and structured sources.
has super-classes
Data Lifecycle Processc

Interlinkingc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Interlinking

Creating and maintaining links in a (semi-)automated fashion is still a major challenge and crucial for establishing coherence and facilitating data integration as outlined in the publishing usage scenario in the introduction. We seek linking approaches yielding high precision and recall, which configure themselves automatically or with end-user feedback.
has super-classes
Data Lifecycle Processc

Quality Analysisc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#QualityAnalysis

The quality of content on the Data Web varies, as the quality of content on the document web varies. The LOD2 Stack comprises techniques for assessing quality based on characteristics such as provenance, context, coverage or structure. The goal in our application scenarios is to assess whether data sources for a publisher are complete, consistent, reliable etc.
has super-classes
Data Lifecycle Processc

Repositoryc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Repository

A central location in which data is stored and managed.
has super-classes
Data Entityc
is in domain of
storesop

Search/Browsing/Explorationc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Search

For many users, the Data Web is still invisible below the surface. LOD2 develops search, browsing, exploration and visualization techniques for different kinds of Linked Data (i.e. spatial, temporal, statistical), which make the Data Web sensible for real users.
has super-classes
Data Lifecycle Processc

Storagec back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#Storage

Efficient RDF data management techniques fulfilling requirements of global publishers comprise column-store technology, dynamic query optimization, adaptive caching of joins, optimized graph processing and cluster/cloud scalability.
has super-classes
Data Lifecycle Processc

System Administratorc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#SystemAdministrator

A person who is responsible for managing the data engineering system.
has super-classes
Data Process Personc

Test Casec back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#TestCase

A data test case description.
has super-classes
Data Artifactc

Test Case Resultc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#TestCaseResult

A data test case result or report.
has super-classes
Data Artifactc

Userc back to ToC or Class ToC

IRI: http://aligned-project.eu/ontologies/dlo#User

A person who is using the data engineering system.
has super-classes
Data Process Personc

Object Properties

consumesop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#consumes

has domain
Data Lifecycle Processc
has range
Data Entityc
is inverse of
was attributed to

has sub processop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#hasSubProcess

initiatesop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#initiates

is responsible forop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#isResponsibleFor

has domain
person
has range
Data Lifecycle Processc

is supported byop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#isSupportedBy

has super-properties
was associated with
has domain
Data Lifecycle Processc
has range
software agent

producesop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#produces

has super-properties
generated
has domain
Data Lifecycle Processc
has range
Data Entityc

storesop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#stores

has domain
Repositoryc
has range
distributionc

supportsop back to ToC or Object Property ToC

IRI: http://aligned-project.eu/ontologies/dlo#supports

Namespace Declarations back to ToC

default namespace
http://aligned-project.eu/ontologies/dlo#
core
http://dataid.dbpedia.org/ns/core#
dc
http://purl.org/dc/elements/1.1/
images
https://www.scss.tcd.ie/~bozicb/images/
ns
http://www.w3.org/ns/
ontologies
http://aligned-project.eu/ontologies/
owl
http://www.w3.org/2002/07/owl#
prov
http://www.w3.org/ns/prov#
rdf
http://www.w3.org/1999/02/22-rdf-syntax-ns#
rdfs
http://www.w3.org/2000/01/rdf-schema#
terms
http://purl.org/dc/terms/
xsd
http://www.w3.org/2001/XMLSchema#

This HTML document was obtained by processing the OWL ontology source code through LODE, Live OWL Documentation Environment, developed by Silvio Peroni.