Difference between revisions of "Top Level Ontology"

Revision as of 15:23, 29 October 2014

Introduction

In a nutshell

One of the main characteristics of biodiversity data is its cross-disciplinary feature and the extremely broad range of data types, structures, and semantic concepts which encompasses. Moreover, biodiversity data, especially in the marine domain, is widely distributed, with few well-established repositories or standard protocols for their archiving, access, and retrieval. Queries like “Given the scientific name of a species, find its predators with the related taxon-rank classification and with the different codes that the organizations use to refer to them", cannot be formulated (and consequently nor answered) by any individual source. To formulate such queries we need an expressive conceptual model, while for answering them we also have to assemble pieces of information stored in different sources. To fill this gap, we have designed and implemented a top level ontology, called Marine Top Level Ontology (for short MarineTLO).

Motivating Scenarios

The availability of a top level ontology for the marine domain would be useful in various scenarios. Below we will describe them.

For Publishing Linked Data: There is a trend towards publishing Linked Data; consequently a rising issue concerns the structure that is beneficial to use during such publishing. The semantic structure that will be presented can be used by the involved organizations for anticipating future needs for information integration, and thus alleviating the required effort for (post) integration.

For Generating Fact Sheets: FactSheetGenerator is an application provided by IRD aiming at providing factual knowledge about the marine domain by mashing-up relevant knowledge distributed across several data sources. Currently, FactSheetGenerator uses only ECOSCOPE and related knowledge stored in other sources (e.g., about commercial codes or taxonomic information) cannot be exploited. MarineTLO could be exploited for advancing this application, i.e., for providing more complete semantic descriptions.

For Semantic Post-Processing of the Results of Keyword Search Queries: Another big challenge nowadays is how to integrate structured data with unstructured data (documents and text). The availability of harmonized structured knowledge about the marine domain can be exploited for a semantic post-processing of the search results (over dedicated or general purpose search systems). XSearch is a meta-search engine that offers semantic post-processing of search results and is able to analyze the returned results by exploiting also the availability of semantic repositories (e.g. SPARQL endpoints). Xsearch could exploit MarineTLO for providing more complete information about the identified entities.

For Enabling Complex Query Services over Integrated Data: MarineTLO can be used as the schema for setting up integrated repositories that offer more complex query services, which cannot be supported by the individual underlying sources. In general, there are two main approaches for building and querying such repositories: the materialized integration approach (or warehouse approach), and the virtual integration (or mediator) approach (more information about these sources can be found in REF_XXX). The key point is that in both cases we need a schema and MarineTLO can serve this requirement.

MarineTLO as a product

We used a set of underlying sources for integrating their concepts in MarineTLO. Below we briefly describe these sources, and then describe the ontology MarineTLO and its corresponding releases.

The main underlying sources

Fisheries Linked Open Data: FLOD, created and maintained by Food and Agriculture Organization (FAO), is dedicated to create a dense network of relationships among the entities of the Fishery domains, and to programmatically serve them to semantic and traditional application environments. The FLOD content is exposed either via a public SPARQL endpoint[1] (suitable for semantic applications) or via a JAVA API to be embedded in consumers’ application code. Currently, the FLOD network includes entities and relationships from the domains of Marine Species, Water Areas, Land Areas, Exclusive Economic Zones, and serves software applications in the domain of statistics and GIS.

ECOSCOPE Knowledge Base: IRD offers a public SPARQL endpoint[2] for its knowledge base containing geographical data, pictures and information about marine ecosystems (specifically data about fishes, sharks, related persons, countries and organizations, harbors, vessels, etc.).

WoRMS: The World Register of Marine Species[3] currently contains more than 200 thousand species, around 380 thousand species names including synonyms, and 470 thousands taxa (infraspecies to kingdoms).

FishBase: FishBase[4] is a global database of fish species. It is a relational database containing information about the taxonomy, geographical distribution, biometrics, population, genetic data and many more. Currently, it contains more the 32 thousand species and more than 300 thousand common names in various languages.

DBpedia: DBpedia[5] is a project focusing on the task of converting content from Wikipedia to structured knowledge so that Semantic Web techniques can be employed against it. At the time of writing this article, the English version of the knowledge base of DBpedia describes more than 4.5 million things, containing persons, places, works, species, etc. In our case, we are using a subset of DBpedia’s knowledge base containing only fishes (i.e. instances classified under the class http://dbpedia.org/ontology/Fish).

The Marine Top Level Ontology

MarineTLO is not supposed to be a single ontology covering the entirety of what exists. It aims at being a global core model that (a) covers with suitable abstractions the domains under consideration to enable the most fundamental queries, (b) can be extended to any level of detail on demand, and (c) can adequately map and integrate data originating from distinct sources. This approach has two main benefits:

reduced effort for improving and evolving it since the focus is given on one model rather than many
reduced effort for constructing mappings since this approach avoid the pair-wise mappings between individual metadata formats and/or ontologies.

For the development and evolution of MarineTLO we have adopted an iterative and incremental methodology comprising the following steps: (i) ontological analysis of the underlying sources, (ii) design, (iii) implementation and (iv) evaluation. The activities of each iteration has been monitored by opening and corresponding tickets. The section REF_TICKETS_SECTION contains a list of the tickets that have been opened. For the implementation of MarineTLO we have used OWL 2 (Web Ontology Language) while for the needs of evaluation we used the notion of competence queries.

Releases

In total we released 4 versions of the MarineTLO. Each release was able to cover various concepts for different sources. Below we report the contents of each version, some basic information and links its documentation.

MarineTLO Versions
Version	Classes and Properties	Underlying Sources	Concepts covered	OWL File	Documentation	Mappings	Competence Queries	Release Date
Version 1	17 classes and 8 properties	FLOD, ECOSCOPE, WORMS	Species, Scientific Names, Predators	http://goo.gl/ukxmAv		http://goo.gl/kEfp8g	http://goo.gl/3sMdwR	March 2013
Version 2	57 classes and 22 properties	FLOD, ECOSCOPE, WORMS, DBpedia	Species, Scientific Names, Predators, Authorships	http://goo.gl/JTh8p9	http://goo.gl/Y9bGFy	http://goo.gl/I6STNv	http://goo.gl/3ZNqb4	July 2013
Version 3	57 classes and 25 properties	FLOD, ECOSCOPE, WORMS, DBpedia, Fishbase	Species, Scientific Names, Common Names, Predators, Authorships, Ecosystems, Countries, Water Areas, Vessels, Gears, EEZ	http://goo.gl/J15OCE	http://goo.gl/FygSd6	http://goo.gl/vxESUF	http://goo.gl/yTaz7O	October 2013
Version 4	127 classes and 81 properties	FLOD, ECOSCOPE, WORMS, DBpedia, Fishbase	Species, Scientific Names, Common Names, Predators, Authorships, Ecosystems, Countries, Water Areas, Vessels, Gears, EEZ, Bibliography, Statistical Indicators	http://goo.gl/Yh1Uot	http://goo.gl/KIFY6e	http://goo.gl/WU3xkG	http://goo.gl/KIFY6e	July 2014

TLO-Development activity

General Description

This activity concerns with the development of a top level ontology (called MarineTLO) that will integrate the concepts currently existing in marine-domain knowledge bases (in particular FLOD and ECOSCOPE knowledge bases). The MarineTLO-development activity is dived into six sub-activities (or Tasks) and related to each other as shown in the diagram in Fig 1.

Methodology

The methodology is based on an Iterative and Incremental development approach. As such, one iteration will involve all the tasks of the Fig 1. that are described @ http://wiki.i-marine.eu/index.php/Top_Level_Ontology. All the iterations will be accurately described and evolution of the MarineTLO ontology will be released in each iteration, based on the acquirement of the specific marine domain knowledge.

Activities scheduled with deadlines

Each iteration is planned to be monitored by opening related tickets.

Related Wiki Pages

http://wiki.i-marine.eu/index.php/XSearch

Motivation - Goal - Requirements

At very abstract level we use the picture below as preview that focuses on viewpoints for motivations, goal and requirements of having such a ontology on top of marine-domain knowledge bases.

Motivation. Semantic technologies, applications and services for biodiversity mostly rely on the rise of an interconnected and shared tree-of-life like dataset scaling on the web. The various communities (including also marine one) are contributing to this joint effort aim to share domain data and their meaning, to provide a solid basis for biodiversity systems interoperability. One of the challenges in iMarine is how users could experice a coherent source of fact about marine resources, rather than a bag of contributed contents.

Goal. The goal in modelling and formalising MarineTLO ontology is for integrating and semantically extending the underlying models of existing marine data sources. Specifically, the MarineTLO is used on the top of a number of real and heterogeneous marine data sources, including FLOD and ECOSCOPE, as knowledge mediator to represent, manipulate and reason upon and across them.

Requirements. Focusing on Ecosystem Approach to Fisheries and Marine Resources, the MarineTLO ontology should be generic enough

to provide consistent abstraction or specification of concepts included in the data models or ontologies of marine data sources
to provide the necessary properties to make this distributed knowledge bases (FLOD, ECOSCOPE and others) a coherent source of facts, relating observation data to the respective space-temporal context and categorical domain knowledge

Related Meetings (and their slides)

TCOM in Italy, Rome, 03.11.2012

Minutes, http://wiki.i-marine.eu/index.php/03.11.2012.MeetingInRome

FLOD Ontological Analysis, 7.12.2012, meeting online

TCOM in Greece, Skiathos, 18.06.2013 (LATEST)

Analysis, Design and Implementation

FLOD Ontological Analysis

This activity has the primary goal to provide a common understanding of the underlying model of the FLOD source. It has been considered necessary for the development of the MarineTLO. The associated ticket is https://issue.imarine.research-infrastructures.eu/ticket/888

FLOD Ontological Analysis, 04-Dec-2012
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=04d81d92-d3e3-41d9-aeaf-605bd5c4ea33

FLOD Ontological Analysis, 07-Dec-2012
- http://wiki.i-marine.eu/index.php/7.12.2012-TLO_FLOD_Ontological_Analysis

FLOD Ontological Analysis, 31-Dec-2012
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=55f18f59-d807-474b-aa5a-88b123d0dae8

ECOSCOPE Ontological Analysis

This activity has the primary goal to provide a common understanding of the underlying model of the ECOSCOPE source. It has been considered necessary for the development of the MarineTLO. The associated ticket is https://issue.imarine.research-infrastructures.eu/ticket/889

ECOSCOPE Ontological Analysis, 13-Dec-2012
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=7414845a-942d-4c1d-b343-27caeee15053

ECOSCOPE Ontological Analysis, 11-Jan-2013, meeting, online
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=fc7d36b4-96d8-4083-a524-430487ffb5fd

MarineTLO Design

The activities is related to the design of MarineTLO ontology. The associated ticket is https://issue.imarine.research-infrastructures.eu/ticket/890

Describing the MarineTLO design, first draft, 21-Dec-2012
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=0233e4b5-b43b-4e0b-9754-a2c299e0cda7
Describing the MarineTLO design, second draft, 24-Dec-2012
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=6fe8c22f-1152-4caf-bfbe-47d07339f905
Describing the MarineTLO design, a complete report is going to be available soon.

MarineTLO Implementation

The activities is related to the implementation of MarineTLO ontology using OWL 2 language. The associated ticket is https://issue.imarine.research-infrastructures.eu/ticket/891

Please look at http://wiki.i-marine.eu/index.php/Top_Level_Ontology#TLO_Products for the versions of the MarineTLO

SPARQL endpoints (Virtuoso and OWLIM-Lite)
- http://62.217.127.213:8890/sparql with graph name http://www.ics.forth.gr/isl/TLObasedDataWarehouse

- http://62.217.127.213:8080/openrdf-workbench/repositories/TLODataWarehouse/

Competency queries
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=4700c39a-922e-48b2-ba2c-fe7e0e2074b8

MarineTLO Usage

This activity concerns with the identification of use cases motivating the need for having harmonized integrated information. It has been associated to the ticket https://issue.imarine.research-infrastructures.eu/ticket/900. Currently, we evaluate the MarineTLO ontology for the:

Fact Sheet Generator. In this case, MarineTLO is used as top model for FLOD, ECOSCOPE and WoRMS to support the development of FactSheetsGenerator applications. An concrete example is the one provided by IRD (SpeciesFactSheetsGenerator) aiming at providing factual knowledge about the marine domain by mashing-up relevant knowledge distributed across several data sources.

For Semantic Post-Processing of the results of keyword search queries. In this case, MarineTLO is used as knowledge model for semantic search in X-search meta-engine. In more detail, suppose that a user is looking for publications about tuna. Specifically he wants to find experiments that were applied to several species of tuna. So, he submits the query tuna and gets a sorted list of results and various categories of entities like Regional Fisheries Body, Species, FAO Country, etc. User realizes that the category Species may contain interesting entities. He notices that there is an entity with the label skipjack tuna which is a medium-sized fish in the tuna family found in tropical and warm-temperate waters. User wants to learn more information about that species. Specifically, he would like to see other species for which the skipjack tuna is predator or is prey. By clicking the icon next to the entity's name, user is able to instantly (at real-time) retrieve such information. In particular, in the back end, a SPARQL query is sent to the MarineTLO's endpoint asking for that information. Note that the 'Species' have been derived from FLOD, while the properties 'is predator of' and 'is prey of' have been derived from ECOSCOPE's knowledge base. That would be impossible without the exploitation of the MarineTLO.

MarineTLO Evaluation

This activity is related to the evaluation of the MarineTLO ontology. The associated ticket is https://issue.imarine.research-infrastructures.eu/ticket/892

A required activity for the MarineTLO evaluation is to populate it with concrete instances.

To populate just the class Species, please follow the link
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=2e1a3b06-5c2d-4b96-aea0-c43dc53e670a

To populate the class Species with predator and prey relationships, please follow the link
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=506fb989-7af4-492e-b654-19f4fd2113a5

Competence Queries, 22-03-2013
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=4700c39a-922e-48b2-ba2c-fe7e0e2074b8

MarineTLO Populating, 22-03-2013
- WoRMS-TLO
  - https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=9b36ae7b-bc10-43a2-95d1-3bec294f1c45
  - https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=69068055-db9a-4aec-a0ce-fe523c5aa782

MarineTLO Evaluation Report, 22-04-2013
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=4700c39a-922e-48b2-ba2c-fe7e0e2074b8

MarineTLO-related Products

The Notion of MarineTLO Version

Each MarineTLO version consists of

A release number
OWL files
A document that contains the scope notes of each class
A set of competency queries
A short description describing the changes
It could also contain a set of mappings between data source and MarineTLO, each of them described as an OWL file

Evolution Process

Since last meeting in Rome (26-03-2013), we planned to release a new version every two months (for correcting errors, based on requirements, priorities, usage needs, etc).

Version 1.0.0 released on 26-03-2013

Release Number: 1.0.0
Documentation of the MarineTLO version 1.0.0 https://issue.imarine.research-infrastructures.eu/ticket/1764#comment:1
The version 1.0.0 of the MarineTLO ontology contains classes, properties and instances to cover the species fact sheet
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=7b1791c6-b1f4-4223-807c-748e28ed83e8
Competence queries covering the species fact sheets
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=4700c39a-922e-48b2-ba2c-fe7e0e2074b8
Mappings in OWL: FLOD-TLO, Ecoscope-TLO and WoRMS-TLO
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=38385d9e-2ac9-4514-bc5c-144281b0b619
Mappings descriptions
- https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=0745f853-f81c-4317-ac72-b0e78c4fdaeb

Version 2.0.0 planned to be released by end of July 2013

Release Number: 2.0.0
Documentation of the MarineTLO version 2.0.0
- http://goo.gl/vmWVSj and Ticket #1764
The version 2.0.0 of the MarineTLO ontology in OWL
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=aa0fe288-3130-4613-b6a9-1c208cc88914
Competence queries
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=3683a581-d3ce-4537-85b4-cd3ff1c5f3a2
Mappings in OWL: FLOD-TLO, Ecoscope-TLO and WoRMS-TLO
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=a511a3a4-fbbe-451d-acf9-bb6c25f91e5c
A new WoRMS Schema (Ver2.0.0)
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=2f0b0a4c-15c1-4b0c-bf5b-d892b81094f9
Mappings descriptions
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=61b1fe20-a4b8-4f00-b2dc-246e93ec75e1

Version 3.0.0 planned to be released by middle of October 2013

Requirements, Management and Plan
- http://goo.gl/uBq0CH
- https://i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=92a68b46-8b6b-4774-9d16-af417daaa752
Release Number: 3.0.0
Documentation of the MarineTLO version 3.0.0
- http://goo.gl/athcIg
The version 3.0.0 of the MarineTLO ontology
- http://goo.gl/athcIg
Competence queries
- http://goo.gl/yTaz7O
Mappings in OWL: FLOD-TLO, Ecoscope-TLO and WoRMS-TLO
- http://goo.gl/XO9Ibm
Mappings descriptions
- http://goo.gl/vxESUF

Version 4.0.0

Release
- Version: 4.0.0
- Date: July 2014

MarineTLO ontology in OWL
- http://goo.gl/Yh1Uot
Documentation of the MarineTLO version 4.0.0
- http://goo.gl/KIFY6e
Requirements, Competency Queries
- http://goo.gl/iNPnDd
Mappings
- Description: http://goo.gl/SniVs8
- OWL implementation: http://goo.gl/WU3xkG

Previous TLO Versions

TLO Version 13/11/2012, https://portal.i-marine.d4science.org/group/data-e-infrastructure-gateway/workspace?itemid=832af2c7-70d2-4ac3-8985-ed1efcaa0a40

MarineTLO-based Warehouses

Warehouse 1 (June 2013)

Virtuoso Repository
- http://62.217.127.213:8890/sparql with graph name http://www.ics.forth.gr/isl/TLObasedDataWarehouse

- Virtuoso Repository Browsing
  - http://62.217.127.213:8890/fct

OWLIM Lite Repository
- http://62.217.127.213:8080/openrdf-workbench/repositories/TLODataWarehouse/

Warehouse 2 (By the end of July 2013)

The warehouse is available at http://62.217.127.213:8890/sparql and the graph name is <http://www.ics.forth.gr/isl/TLObasedDataWarehouseV2>. We have also installed some plugins for browsing the warehouse, specifically one can use http://62.217.127.213:8890/fct/ and http://62.217.127.213:8890/fct/demo_queries.vsp that has some demo queries (use “Run with iSPARQL”, because the “Run in SPARQL endpoint” plugin prunes whatever follows the ‘#’ character). The total number of results this plugin returns is limited to 50.

A summary of the warehouse contents follows:

TLO version 2
FLOD
ECOSCOPE
part of WoRMS (information about the taxonomies of approximately ~1100 species obtained through Species Discovery Service and the wrapping software developed)
(marine) part of DBpedia (containing various information about marine species)

In numbers, this warehouse contains approximately 1.6 million triples about 19,000 distinct marine species.

A paper describing the process of creating the MarineTLO-based warehouse can be found here.

EVALUATION and USAGE

The new warehouse has been evaluated using a new set of competency queries.

X-Search now uses this warehouse (instead of FLOD) and can identify 25,000 marine species (this number includes species genera and family names). Furthermore, each species in the TLOMarine-based warehouse has in average 30 properties, while in FLOD each species has in average only 6 properties.

We are also in continuous collaboration with IRD who is testing (and provides requirements) for the needs of FactSheetGenerator.

FUTURE

Continuous inspection of the warehouse contents, exploitation issues (new queries, etc.), documentation. A next version of the warehouse will be constructed certainly when we reach Marine TLO v3 (scheduled for Oct 2013) based on the requirements that we continuously receive from IRD.

Warehouse 3 (ONGOING, by the end of October 2013)

Virtuoso Repository
- http://62.217.127.213:8890/sparql
NameGraph
- http://www.ics.forth.gr/isl/TLObasedDataWarehouseV3

Virtuoso Repository Browsing
- http://62.217.127.213:8890/fct

Warehouse 3+ (ONGOING, by the middle of January 2014)

Virtuoso Repository
NameGraph

Virtuoso Repository Browsing

MarineTLO Related Tickets

First Iteration

FLOD Ontological Analysis: https://issue.imarine.research-infrastructures.eu/ticket/888 CLOSED

Ecoscope Ontological Analysis: https://issue.imarine.research-infrastructures.eu/ticket/889 CLOSED

MarineTLO Design: https://issue.imarine.research-infrastructures.eu/ticket/890 CLOSED

MarineTLO Implementation: https://issue.imarine.research-infrastructures.eu/ticket/891 CLOSED

MarineTLO Results: TLO Usage https://issue.imarine.research-infrastructures.eu/ticket/900 CLOSED

MarineTLO Evaluation, https://issue.imarine.research-infrastructures.eu/ticket/892 CLOSED

MarineTLO Population, https://issue.imarine.research-infrastructures.eu/ticket/1220 CLOSED

Second Iteration

MarineTLO Version 2.0.0, https://issue.imarine.research-infrastructures.eu/ticket/1603 CLOSED

MarineTLO Version 2.0.0 Documentation, https://issue.imarine.research-infrastructures.eu/ticket/1764 CLOSED

Documentation of the process used for creating MarineTLO-based warehouses, https://issue.imarine.research-infrastructures.eu/ticket/1848 OPEN

Creation of a MarineTLO-based warehouse 2 CLOSED
- http://wiki.i-marine.eu/index.php/Top_Level_Ontology#Warehouse_2_.28By_the_end_of_July_2013.29

Actions for exploiting the MarineTLO-based Warehouse 2 OPEN
- https://issue.imarine.research-infrastructures.eu/ticket/2050

Third Iteration

MarineTLO Version 3.0.0, https://issue.imarine.research-infrastructures.eu/ticket/2046 CLOSED

Fourth Iteration

MarineTLO Version 4.0.0, https://issue.imarine.research-infrastructures.eu/ticket/2319 OPEN

Related Papers

Y. Tzitzikas, C. Allocca, C. Bekiari, Y. Marketakis, P. Fafalios, M. Doerr, N. Minadakis, T. Patkos, and L. Candela , “Integrating Heterogeneous and Distributed Information about Marine Species through a Top Level Ontology”, 7th Metadata and Semantics Research Conference, MTSR'13, Thessaloniki, Greece, November 2013.

Y. Tzitzikas, C. Allocca, C. Bekiari, Y. Marketakis, P. Fafalios, and N. Minadakis, "Ontology-based Integration of Heterogeneous and Distributed Information of the Marine Domain", ERCIM News 2014 (96), Special theme: Linked Open Data, January 2014.

Y. Tzitzikas, N. Minadakis, Y. Marketakis, P. Fafalios, C. Allocca, and M. Mountantonakis, "Quantifying the Connectivity of a Semantic Warehouse", 4th International Workshop on Linked Web Data Management, LWDM'14, Athens, Greece, March 2014.

M. Mountantonakis, C. Allocca, P. Fafalios, N. Minadakis, Y. Marketakis, C. Lantzaki, and Y. Tzitzikas, "Extending VoID for Expressing the Connectivity Metrics of a Semantic Warehouse", 1st International Workshop on Dataset Profiling & Federated Search for Linked Data (PROFILES'14), in conjunction with the 11th Extended Semantic Web Conference (ESWC'14), Anissaras Hersonissou, Crete, Greece, May 2014.

Y. Tzitzikas, N. Minadakis, Y. Marketakis, P. Fafalios, C. Allocca, M. Mountantonakis, and I. Zidianaki, "MatWare: Constructing and Exploiting Domain Specific Warehouses by Aggregating Semantic Data", 11th Extended Semantic Web Conference (ESWC'14), Anissaras Hersonissou, Crete, Greece, May 2014.

@@ Line 18: / Line 18: @@
 == MarineTLO as a product ==
-...
+We used a set of underlying sources for integrating their concepts in MarineTLO. Below we briefly describe these sources, and then describe the ontology MarineTLO and its corresponding releases.
+=== The main underlying sources ===
+'''Fisheries Linked Open Data:''' FLOD, created and maintained by Food and Agriculture Organization (FAO), is dedicated to create a dense network of relationships among the entities of the Fishery domains, and to programmatically serve them to semantic and traditional application environments. The FLOD content is exposed either via a public SPARQL endpoint[http://www.fao.org/figis/flod/endpoint] (suitable for semantic applications) or via a JAVA API to be embedded in consumers’ application code. Currently, the FLOD network includes entities and relationships from the domains of Marine Species, Water Areas, Land Areas, Exclusive Economic Zones, and serves software applications in the domain of statistics and GIS.
+'''ECOSCOPE Knowledge Base:''' IRD  offers a public SPARQL endpoint[http://ecoscopebc.mpl.ird.fr/joseki/ecoscope] for its knowledge base containing geographical data, pictures and information about marine ecosystems (specifically data about fishes, sharks, related persons, countries and organizations, harbors, vessels, etc.).
+'''WoRMS:''' The World Register of Marine Species[http://www.marinespecies.org] currently contains more than 200 thousand species, around 380 thousand species names including synonyms, and 470 thousands taxa (infraspecies to kingdoms).
+'''FishBase:''' FishBase[http://www.fishbase.org] is a global database of fish species. It is a relational database containing information about the taxonomy, geographical distribution, biometrics, population, genetic data and many more. Currently, it contains more the 32 thousand species and more than 300 thousand common names in various languages.
+'''DBpedia:''' DBpedia[http://dbpedia.org] is a project focusing on the task of converting content from Wikipedia to structured knowledge so that Semantic Web techniques can be employed against it. At the time of writing this article, the English version of the knowledge base of DBpedia describes more than 4.5 million things, containing persons, places, works, species, etc. In our case, we are using a subset of DBpedia’s knowledge base containing only fishes (i.e. instances classified under the class <nowiki>http://dbpedia.org/ontology/Fish</nowiki>).
+=== The Marine Top Level Ontology ===
+MarineTLO is not supposed to be a single ontology covering the entirety of what exists. It aims at being a global core model that (a) covers with suitable abstractions the domains under consideration to enable the most fundamental queries, (b) can be extended to any level of detail on demand, and (c) can adequately map and integrate data originating from distinct sources. This approach has two main benefits:
+* reduced effort for improving and evolving it since the focus is given on one model rather than many
+* reduced effort for constructing mappings since this approach avoid the pair-wise mappings between individual metadata formats and/or ontologies.
+For the development and evolution of MarineTLO we have adopted an iterative and incremental methodology comprising the following steps: (i) ontological analysis of the underlying sources, (ii) design, (iii) implementation and (iv) evaluation. The activities of each iteration has been monitored by opening and corresponding tickets. The section '''REF_TICKETS_SECTION''' contains a list of the tickets that have been opened. For the implementation of MarineTLO we have used OWL 2 (Web Ontology Language) while for the needs of evaluation we used the notion of competence queries.
+=== Releases ===
+In total we released 4 versions of the MarineTLO. Each release was able to cover various concepts for different sources. Below we report the contents of each version, some basic information and links its documentation.
+{| border="1" class="wikitable"
+|+ MarineTLO Versions
+! Version
+! Classes and Properties
+! Underlying Sources
+! Concepts covered
+! OWL File
+! Documentation
+! Mappings
+! Competence Queries
+! Release Date
+|-
+! Version 1
+| 17 classes and 8 properties
+|| FLOD, ECOSCOPE, WORMS
+|| Species, Scientific Names, Predators
+|| http://goo.gl/ukxmAv
+||
+|| http://goo.gl/kEfp8g
+|| http://goo.gl/3sMdwR
+|| March 2013
+|-
+! Version 2
+| 57 classes and 22 properties
+|| FLOD, ECOSCOPE, WORMS, DBpedia
+|| Species, Scientific Names, Predators, Authorships
+|| http://goo.gl/JTh8p9
+|| http://goo.gl/Y9bGFy
+|| http://goo.gl/I6STNv
+|| http://goo.gl/3ZNqb4
+|| July 2013
+|-
+! Version 3
+| 57 classes and 25 properties
+|| FLOD, ECOSCOPE, WORMS, DBpedia, Fishbase
+|| Species, Scientific Names, Common Names, Predators, Authorships, Ecosystems, Countries, Water Areas, Vessels, Gears, EEZ
+|| http://goo.gl/J15OCE
+|| http://goo.gl/FygSd6
+|| http://goo.gl/vxESUF
+|| http://goo.gl/yTaz7O
+|| October 2013
+|-
+! Version 4
+| 127 classes and 81 properties
+|| FLOD, ECOSCOPE, WORMS, DBpedia, Fishbase
+|| Species, Scientific Names, Common Names, Predators, Authorships, Ecosystems, Countries, Water Areas, Vessels, Gears, EEZ, Bibliography, Statistical Indicators
+|| http://goo.gl/Yh1Uot
+|| http://goo.gl/KIFY6e
+|| http://goo.gl/WU3xkG
+|| http://goo.gl/KIFY6e
+|| July 2014
+|}
 == TLO-Development activity ==