7 EU Publication Office - Metadata Registry

Figurer 11 - EU Publication Office - Metadata Registry

 

7.1 URL (Access)

 Publications Office of the EU maintains the Metadata Registry

 Metadata Registry (MDR) URL: http://publications.europa.eu/mdr

 Among others definition data EuroVoc is maintained in the Metadata Registry:

EuroVoc URL: http://eurovoc.europa.eu

 

7.2 Short description

The Metadata Registry of the Publications Office of the EU is the central reference location where all metadata definitions and related authority data used in Publications Office and in the data exchange between the institutions involved in the legal decision making process.

The Metadata Registry registers and maintains definition data (metadata elements, named authority lists, schemas, etc.) used by the different European Institutions involved in the legal decision making process gathered in the interinstitutional Metadata Maintenance Committee (IMMC) and by the Publications Office of the EU in its production and dissemination process.

Metadata Registry implementation is a combination of svn, JIRA and scripts running on a UNIX server. The source format of the registers (authority tables) is a proprietary XML format. This is converted to Excel for editing purposes and reconverted into XML. From the XML diff files are generated to verify the changes. The XML is used as basis for the different distributions (SKOS, HTML, …).

 

7.3 Registry content

The following definition data are maintained in the Metadata Registry:

·         Named Authority Lists (Common Authority Tables/Value lists)

·         IMMC Core metadata exchange protocol

·         OP Core metadata element set

·         EuroVoc thesaurus and alignments (SKOS/XML distributions)

·         Official Journal Electronic Exchange Protocol (OJEEP)

·         Style sheets for presentation

 The most relevant registers for the INSPIRE context are listed in the table below:

Name of register

Description of the content

Countries NAL

The Countries authority table is a controlled list of codes related to the names of countries with an associated label - in a short and in a long form - in the 24 official languages of the European Union. The authority code is the ISO 3166-1/α-3, with exceptional alphanumerical codes when an ISO code does not exist (e.g. Kosovo).

 

http://publications.europa.eu/mdr/authority/country/index.html

Places NAL

The Places authority table is a controlled list of codes related to the names of locations with an associated label in the 24 official languages of the European Union (if available). The authority code consists of a combination of the three-letter country code and the last part of the UN/LOCODE.


http://publications.europa.eu/mdr/authority/place/index.html

Languages NAL

The Languages authority table is a controlled list of codes related to the names of languages with an associated label in the 24 official languages of the European Union. The authority code is equivalent to ISO 639-3α-3.

 

http://publications.europa.eu/mdr/authority/country/index.html

Territorial units NAL (in development)

The Territorial units authority table is a controlled list of codes related to the names of territorial units, providing a mapping between the labels of the NUTS, the ISO administrative units and the judicial units

 

Under development. To be released in 2015

EuroVoc

EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU, the European Parliament in particular. It contains terms in 23 EU languages (Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish and Swedish), plus Serbian.

 

It is maintained in SKOS-XL directly in the open source vocabulary management tool VocBench developed in collaboration with the FAO. A SKOS distribution is exported to the EuroVoc front-office based on Drupal. In 2015-2016 the website of EuroVoc will migrate to the CMS Liferay (see above).


http://eurovoc.europa.eu/drupal/

Table 3 - EU Publication Office - Metadata Registry content

 

7.4 Information model

The EuroVoc Thesaurus Ontology Schema is available at:

http://lists.w3.org/Archives/Public/public-esw-thes/2010Feb/att-0023/Ontology.html

 

7.5 Applied standards, format – presentations

It is being considered to move from the proprietary XML format towards the rich RDF format “euvoc” with the aim to use the same back-office as for EuroVoc: VocBench.

With the aim to merge with the EuroVoc website under a common EU Vocabularies website the current website of the MDR will be move to another platform based on the CMS LifeRay.

Named Authority Lists:

Proprietary XML, SKOS, XSD instance, HTML, “euvoc” ontology profile (dcat, org, prov-o, skos-thes, skos-xl, skos, vcard-x, voaf, void, wgs84_pos)

EuroVoc:

SKOS-XL, proprietary XML, “euvoc” ontology profile (dcat, org, prov-o, skos-thes, skos-xl, skos, vcard-x, voaf, void, wgs84_pos), PDF

 The EuroVoc ontology is an extension of SKOS (Simple Knowledge Organization System) - W3C recommendation, including appendix B, SKOS eXtension for Labels (SKOS-XL).

 Initially, the EuroVoc thesaurus was compiled in accordance with the standards of the International Standards Organisation:

 ISO 2788-1986: Guidelines for the establishment and development of monolingual thesauri;

  • ISO 5964-1985: Guidelines for the establishment and development of multilingual thesauri.

 Since 2008, both standards have been revised, extended and combined into a new "ISO 25964 - Thesauri and Interoperability with other Vocabularies" standard in two parts:

 "Part 1: Thesauri for information retrieval" covers monolingual and multilingual thesaurus was issued as a draft international standard (DIS) in October 2009;

  • "Part 2: Interoperability with other vocabularies" is under development is in development.

 

7.6 Referenced and managed entities

No information available.

 

7.7 Status codes

Named Authority Lists: draft, draft.deprecated, candidate, candidate.deprecated, current, deprecated, retired

 

7.8 Life cycle, history and versioning

No information available.

 

7.9 URI (identifier) patterns

Currently the information are provided as distribution files in different formats.

Examples:

http://publications.europa.eu/mdr/resource/authority/country/skos/countries-skos.rdf

http://publications.europa.eu/mdr/resource/authority/country/html/countries-eng.html

http://publications.europa.eu/mdr/resource/authority/country/xml/countries.xml

URI pattern will evolve with next version of EuroVoc (4.5):

http://publications.europa.eu/resource/authority/<register>/<concept>

example: http://publications.europa.eu/resource/authority/country/NLD

http://eurovoc.europa.eu/<concept>

example: http://eurovoc.europa.eu/220810

 

7.10 API and connection points

EuroVoc alignment:

As EuroVoc is a generic thesaurus, it is aligned with more specialised thesauri. A list of alignments can be found at http://data.europa.eu/euodp/en/data/dataset/eurovoc .

An alignment of EuroVoc and INSPIRE is available. It relates the terms managed by EuroVoc to the content of the INSPIRE Registry.

Example:

The term identified by “http://eurovoc.europa.eu/362” is described by EuroVoc with its labels (see Figure 11), available at:

http://publications.europa.eu/mdr/resource/thesaurus/eurovoc/skos/eurovoc_skos.zip

Figure 12 - EuroVoc snippet

 

Within the alignment with INSPIRE available at http://publications.europa.eu/mdr/resource/thesaurus/eurovoc/skos/align_EuroVoc_Inspire.rdf

the resource http://eurovoc.europa.eu/362 is related to the corresponding INSPIRE theme and feature concepts (see Figure 12).

Figure 13 - EuroVoc alignment