Jump to content

Identifiers.org

From Wikipedia, the free encyclopedia

Identifiers.org is a project providing stable and perennial identifiers for data records used in the Life Sciences. The identifiers are provided in the form of Uniform Resource Identifiers (URIs). It is also a resolving system that relies on collections listed in the MIRIAM Registry to provide direct access to different instances of the identified records.

URIs and resolving system

[edit]

The Identifiers.org URIs[1][2] are perennial identifiers, that specify at once the data collection, using the namespaces of the Registry, and the record identifier within the collection in the form of a unique resolvable URI. The Identifiers.org resolving system is built upon the information stored in the MIRIAM Registry,[3] which is a database that stores namespaces assigned to commonly used data collections (databases and ontologies) for the Life Sciences. It transforms an Identifiers.org URI into the various URLs leading to the various instances of the record identified by the URI. Identifiers.org is part of the ELIXIR Interoperability Platform.

Identifier structure

[edit]

An Identifiers.org URI is formed of several parts:

  • Protocol. Identifiers.org URIs are HTTP URIs and start with "http:/"
  • Data collection. These are namespaces listed in the MIRIAM Registry. For instance "pubmed" for the publication resource PubMed, "ec-code" for the enzyme nomenclature and "go" for gene ontology
  • Record in the collection. For instance "9606" is "3-fluorotoluene" in the collection PubChem, it is "Homo sapiens" in the collection "taxonomy" and it is a social science publication in the collection "pubmed".
  • Optional: Identifiers.org URIs can be suffixed with parameters, for instance imposing which resource to use for resolving, "profiles" that control the resolver's behaviour etc.
Structure and examples of Identifiers.org URIs.

Usage

[edit]

The system allows a consistent and uniform annotation of datasets. This in turn facilitates data alignment and integration. Identifiers.org URIs are used to encode the metadata in the standard formats of the COMBINE initiative,[4] such as SBML. In particular, databases such as BioModels Database and Reactome export their data in SBML with cross-references encoded using Identifiers.org URIs. These URIs are also used in various semantic web projects such as Bio2RDF, Open PHACTS and the EBI RDF platform[5] Identifiers.org is part of the Interoperability platform of the European life-sciences Infrastructure for biological Information.

Comparison with other URI systems

[edit]

Identifiers.org URIs have been developed since 2011 as a resolvable version of the MIRIAM identifiers, developed since 2005, which were of a URN form, and not directly resolvable. Identifiers.org URIs are similar to PURLs, albeit providing alternative resolutions for collections with several instances. They are also similar to DOIs, but provide human readable collection names, and re-use the record identifier assigned by the data provider.

See also

[edit]

References

[edit]
  1. ^ Juty, N; Le Novère, N; Laibe, C (2012). "Identifiers.org and MIRIAM Registry: Community resources to provide persistent identification". Nucleic Acids Research. 40 (Database issue): D580–6. doi:10.1093/nar/gkr1097. PMC 3245029. PMID 22140103.
  2. ^ http://identifiers.org/ Identifiers.org Website
  3. ^ Laibe, C; Le Novère, N (2007). "MIRIAM Resources: tools to generate and resolve robust cross-references in Systems Biology". BMC Systems Biology. 1: 58. doi:10.1186/1752-0509-1-58. PMC 2259379. PMID 18078503.
  4. ^ http://co.mbine.org/ COmputational Modeling in BIology NEtwork Web site
  5. ^ S Jupp, J Malone, J Bolleman, M Brandizi, M Davies, L Garcia, A Gaulton, S Gehant, C Laibe, N Redaschi, SM Wimalaratne, M Martin, N Le Novère, H Parkinson, E Birney, AM Jenkinson (2014) The EBI RDF Platform: Linked Open Data for the Life Sciences. Bioinformatics doi:10.1093/bioinformatics/btt765
[edit]