Project

General

Profile

Wiki » History » Version 33

Theodore Dalamagas, 10/29/2014 01:12 PM

1 9 Katerina Gkirtzou
h1. LoDGoV : Generate, manage, preserve, share and protect resources in the Web of Data
2 9 Katerina Gkirtzou
3 9 Katerina Gkirtzou
{{toc}}
4 9 Katerina Gkirtzou
5 9 Katerina Gkirtzou
6 25 Theodore Dalamagas
h2. CONCEPTS AND OBJECTIVES
7 9 Katerina Gkirtzou
8 9 Katerina Gkirtzou
The "Linked Data paradigm":http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/ involves practices to publish, share, and connect data on the Web, and offers a new way of data integration and interoperability. The driving force to implement Linked Data is the "RDF technology":http://www.w3.org/RDF/. The basic principles of the Linked Data paradigm is (a) use the RDF data model to publish structured data on the Web, and (b) use RDF links to interlink data from different data sources. Linked Data technologies have given rise to the "Web of Data":http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf. The Web of Data extents current Web to a global data space connecting data from diverse domains. This gives added value for decision support and business intelligence applications, and enables new types of services that operate on top of an unbound, global data space and not on a fixed set of data sources as in Web 2.0 mashups. The Web of Data is impelled by the current trend towards Open Data, i.e., public data which are easily discoverable, accessible, and available to people without any restriction. Linked Open Data (LOD) serve a great cause, enabling transparency, accountability and good governance for public administrations. This is evident from international (e.g., "data.gov.uk":http://data.gov.uk), and national efforts (e.g., "geodata.gov.gr":http://geodata.gov.gr, which was developed by applicant’s research team in "IMIS institute":http://www.imis.athena-innovation.gr). As a side effect, LOD promote sustainable growth and offer a new paradigm for business models and public/private partnerships.
9 9 Katerina Gkirtzou
10 9 Katerina Gkirtzou
"Data Governance":http://en.wikipedia.org/wiki/Data_governance is an emerging field that brings together data quality, data management, and process management, regarding the handling of data in an organization. It involves controlling the full lifecycle of data produced and consumed within an organization: generation, assessment, management and processing, monitoring, maintenance and protection. Further, it offers technical and organizational solutions for integrating external data sources transparently. The goals of Data Governance include improving decision making, ensuring data processing transparency, adopting common approaches to data maintenance, and minimizing rework.
11 9 Katerina Gkirtzou
12 9 Katerina Gkirtzou
> LODGOV’s vision is to establish applicant’s research team as the premium R&D pole in LOD management and governance. The aim is to provide innovative technologies for best governance and curation practices for LOD in order to produce sustainable LOD ecosystems. LODGOV will handle the full lifecycle of LOD ecosystems, from data extraction, storage and maintenance, to monitoring, protection and repair. 
13 9 Katerina Gkirtzou
14 9 Katerina Gkirtzou
To achieve its goal, the LODGOV project will pursue the following challenging scientific and technological objectives:
15 9 Katerina Gkirtzou
* Effective methods for exposing large volumes of structured and unstructured data as LOD.
16 9 Katerina Gkirtzou
* Efficient storage solutions for large volumes of LOD.
17 9 Katerina Gkirtzou
* Query methods, retrieval algorithms and ranking techniques for LOD.
18 9 Katerina Gkirtzou
* Methods for interlink and fuse LOD from different data sources on the Web.
19 9 Katerina Gkirtzou
* Models and query languages to represent and query changes in LOD spaces.
20 9 Katerina Gkirtzou
* Provenance models and methods to trace the origins and transformations in LOD spaces.
21 9 Katerina Gkirtzou
* Design principles and best practices to expose LOD with anonymity guarantees.
22 9 Katerina Gkirtzou
* Models and methods to ensure privacy for publishing LOD.
23 9 Katerina Gkirtzou
24 25 Theodore Dalamagas
h2. PARTICIPANTS
25 9 Katerina Gkirtzou
26 9 Katerina Gkirtzou
# Timos Sellis, Prof
27 24 Theodore Dalamagas
# Vasilis Christophides, Prof
28 24 Theodore Dalamagas
# Vasilis Vasalos, Prof
29 9 Katerina Gkirtzou
# Theodore Dalamagas, Senior Researcher
30 9 Katerina Gkirtzou
# Stelios Sartzetakis, Senior Researcher
31 1 Theodore Dalamagas
# Katerina Gkirtzou, Postdoc Researcher
32 30 Katerina Gkirtzou
# Giorgos Papadakis, Postdoc Researcher
33 24 Theodore Dalamagas
# Konstantinos Karozos, PhD student
34 9 Katerina Gkirtzou
# Thanasis Vergoulis, PhD student
35 24 Theodore Dalamagas
# Giorgos Alexiou, PhD student
36 24 Theodore Dalamagas
# Panos Georgantas, Tech staff
37 9 Katerina Gkirtzou
38 25 Theodore Dalamagas
h2. SCHEDULE
39 9 Katerina Gkirtzou
40 9 Katerina Gkirtzou
# *WP1 (Study and analysis of LOD landscape, 6M).* Prior to any core research activity, WP1 analyses the current landscape in Semantic Web and LOD, sets up the common ground, and defines the S&T agenda. 
41 9 Katerina Gkirtzou
# *WP2 (LOD management, 18M).* WP2 starts with an extensive study of RDF storage and query methods. Then, the involved tasks include: (a) developing efficient solutions for exposing and fusing large LOD volumes from heterogeneous sources, (b) developing efficient co-reference methods to automatically and effectively interlink LOD datasets, (c) implementing indexing structures and ranking methods to support efficient LOD keyword search, (d) developing methods to support efficient retrieval and ranking of LOD entities, and (e) designing optimization methods for processing SPARQL queries on LOD. 
42 9 Katerina Gkirtzou
# *WP3 (LOD dynamics, 18M).* WP3 will start with an extensive study of state-of-the-art methods in data evolution and change management. Then, the involved tasks include: (a) developing models and methods to support LOD provenance, (b) developing models and methods to support LOD preservation, (c) designing query languages to explore LOD trails, (d) designing methods and metrics to evaluate LOD ecosystems with respect to its ability to sustain and adapt to evolution events.
43 9 Katerina Gkirtzou
# *WP4 (LOD privacy, 18M).* WP4 will start with an extensive study of state-of-the-art for privacy-preserving methods in data publishing. Then, the involved tasks include: (a) exploring privacy-threatening scenarios in the LOD publication, as well as defining the privacy requirements and guarantees that must be maintained, (b) providing design principles and best practices to expose LOD with anonymity guarantees, (c) developing LOD anonymization methods. 
44 9 Katerina Gkirtzou
# *WP5 (LOD governance, 12M).* WP2, WP3, and WP4  form the basis for the LOD Data Governance infrastructure, envisioned by LODGOV. Thus, WP5 integrates the outcome of WP2, WP3 and WP4, providing best practices for designing and evaluation of sustainable LOD ecosystems. 
45 9 Katerina Gkirtzou
# *WP6 (Evaluation, 6M)*. In WP6, the LODGOV team will validate the LODGOV models, methods and algorithms for LOD governance. It is important to note that the host organization maintains two Open Data services: one in the area of life science data, and one in the area of governmental open data (see also Section 1.1). Both services can be used as excellent testbeds for validating LODGOV’s technology, and this is an important advantage for LODGOV project.
46 9 Katerina Gkirtzou
# *WP7 (Management and dissemination, 36M).* Finally, to ensure effective planning and implementation of project activities, and promoting project results, WP7 is foreseen. 
47 9 Katerina Gkirtzou
48 14 Katerina Gkirtzou
!https://web.imis.athena-innovation.gr/redmine/attachments/download/1799/WP_Schedule.jpg!
49 12 Katerina Gkirtzou
50 25 Theodore Dalamagas
h2. OUTCOME
51 9 Katerina Gkirtzou
52 25 Theodore Dalamagas
h3. REPORTS
53 9 Katerina Gkirtzou
54 9 Katerina Gkirtzou
| *Title* | *Type* | *Notes* |
55 17 Katerina Gkirtzou
| [[Deliverable1_1| Linked Open Data Study and Analysis Report]] | Technical Report | Deliverable 1.1 |
56 17 Katerina Gkirtzou
| document#213 | Technical Report | Deliverable 3.1 |
57 18 Katerina Gkirtzou
| document#214 | Technical Report | Deliverable 3.2 |
58 29 Theodore Dalamagas
| Privacy models for LOD | Technical Report | Deliverable 4.1 (prepared for publ. submission, "ask to download":https://web.imis.athena-innovation.gr/redmine/users/246) | 
59 33 Theodore Dalamagas
|Entity Resolution in the Web of Data "Tutorial Summary":http://hal.inria.fr/docs/00/96/21/65/PDF/tutorial.pdf, "Slides":http://www.csd.uoc.gr/~vefthym/er/material.html,  "Website":http://www.csd.uoc.gr/~vefthym/er/material.html | Tutorial | |
60 9 Katerina Gkirtzou
61 25 Theodore Dalamagas
h3. SOFTWARE
62 1 Theodore Dalamagas
63 1 Theodore Dalamagas
| *Title* | *Notes* | *Link* |
64 19 Katerina Gkirtzou
| document#215| Deliverable 2.1 | "link":http://snf-80575.vm.okeanos.grnet.gr/encode2/index.php |
65 21 Katerina Gkirtzou
| [[Models and query languages for changes in LOD - Tool]] | Deliverable 3.1 | "link":http://snf-494989.vm.okeanos.grnet.gr:8080/DIANA_RDF/ |