Blue Hackathon iMarine Data Challenges

From D4Science Wiki
Revision as of 16:28, 11 June 2013 by John.gerbesiotis (Talk | contribs) (Objectives)

Jump to: navigation, search

Data Challenges

Challenge #1

Enrich HTML web content with RDF annotation, and enable annotation-based document discovery

Background

(Why this is relevant to blue-er world)

Objectives
  1. We ask the hackathon participants to find a technical solution to enrich the factsheets of the FIGIS portal with annotations in RDFa format. Can we write there? The annotation will consist at least of the URIs of the entities referenced in the factsheet, and of set of relevant relations provided with the datasets.
    1. GOAL: an RDFa client will be able to extract (is that the correct term?) the annotations
  2. We ask the hackathon to use the annotations produced at item one, as input to online search of factsheets (publication, GIS maps, images, statistical timeseries), to create enhanced discovery facility that complement the web page information content.
    1. GOAL: a set of factsheets is retrieved via online search services.
    2. GOAL: Access to the factsheets
Challenges

TBD

Datasets

TBD

APIs

TBD


Challenge #2

Generate RDF dataset from GIS layer in geonetwork, and map the geographic entities with existing LOD datasets

Background

(Why this is relevant to blue-er world)

Objectives
  1. We ask the hackathon to find a technical solution to produce LOD dataset from a collection of GIS layers accessed via GeoNetwork web services. The entities of the dataset will have to be mapped with existing LOD datasets in the GIS domain.
    1. GOAL: given an online service that list a collection of GIS layer, a new LOD dataset is produced.
    2. GOAL: enrich the geographic entities in that dataset with more data gathered trough the mapping with existing LOD GIS datasets (e.g. geonames, geopolitical ontology, dbpedia, etc).
Challenges

TBD

Datasets

TBD

APIs

TBD


Challenge #3

Generate RDF dataset from DarwinCore sources and map to existing biodiversity LOD

Background

(Why this is relevant to blue-er world)

Objectives
  1. We ask the hackathon participants to produce a LOD dataset from a source of DarwinCore data (XML or a service), and map its entities with existing LOD datasets in the biodiversity domain.
    1. GOAL: access complementary information with taxonomic data through the mappings (e.g. species conservation status, capture statistics, distribution map, etc)
Challenges

TBD

Datasets

TBD

APIs

TBD


Challenge #4

Generate dynamic fact-sheets mashing up data from distributed LOD datasets

Background

(Why this is relevant to blue-er world)

Objectives
  1. We ask the hackathon to find a technical solution based on LOD data mashup, to compose domain-based sections of a factsheet, taking data from distributed LOD datasets. The domain of the sections can be: economics, taxonomic, fishing technique, statistics, publications etc.
    1. GOAL: a web service responding with a collection of data clustered by domain-section, and display the result in HTML format
Challenges

TBD

Datasets

TBD

APIs

TBD


Challenge #5

Exploit iMarine Information Retrieval facilities in order to provide more complex search results and browsing capabilities

Background

(Why this is relevant to blue-er world)

Objectives
  1. We ask the hackathon participants to enrich the search results retrieved from iMarine Collections by identifying special keywords (related to the topic) with results retrieved from OpenSearch and other external(?) datasources.
  2. We ask the hackathon participants to explore the database by performing a number of predefined queries and keep statistics on them in order to enhance the existing browsing methods
Challenges

TBD

Datasets

TBD

APIs

TBD


Challenge #6

Visualization and processing of data sets

Background

(Why this is relevant to blue-er world)

Objectives
  1. Exploit the species occurrences data in order to calculate and visualize geographical trends (i.e. migration of species). The data may be provided by a GeoServer.
  1. Interactive Map Search. Search over data of multiple source, combine them and enrich results. Search results can be presented on a map.
    1. clustering, filtering
    2. trend identification
    3. interact with results, like clicking on a result or location would show related results, helpful things etc


Challenges

TBD

Datasets

TBD

APIs

TBD

Documentation

External Links