Data publication and repositories

Persistent Identifiers (PIDs)

Spot the PIDs

https://doi.org/10.1093/plcell/koab243

Globally unique, stable, persistent identifiers (PIDs)

  • Ensure long-term findability
  • Make data, digital objects, people, … uniquely identifiable
  • Reduce “dead links”
  • Cope with name changes

Properties of a PID

Ideally, PIDs are

  • Stable and permanent
  • Location-independent
  • Globally unique and valid
  • Addressable (citable)
  • Clickable (resolvable)

Adapted from https://www.ebi.ac.uk/rdf/documentation/good_practice_uri/
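The "location-independent" and "resolvable" properties can be illustrated with a DOI: the identifier itself stays fixed, and a resolver URL is derived from it. The following is a minimal sketch, not an official client. The function names `doi_to_url` and `url_to_doi` are hypothetical, and the code only builds and parses strings; it does not contact the resolver.

```python
from urllib.parse import quote, unquote, urlparse

# Public DOI resolver prefix, as used in the links on these slides.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Build a clickable resolver URL from a bare DOI (hypothetical helper)."""
    # Percent-encode unusual characters but keep the prefix/suffix slash.
    return DOI_RESOLVER + quote(doi.strip(), safe="/")


def url_to_doi(url: str) -> str:
    """Extract the bare DOI from a doi.org URL (hypothetical helper)."""
    parsed = urlparse(url)
    if parsed.netloc.lower() not in {"doi.org", "dx.doi.org"}:
        raise ValueError(f"not a DOI resolver URL: {url}")
    return unquote(parsed.path.lstrip("/"))


if __name__ == "__main__":
    print(doi_to_url("10.1093/plcell/koab243"))
    print(url_to_doi("https://doi.org/10.1111/tpj.16474"))
```

The bare DOI is what should be stored and cited; the URL is only one way of resolving it to the current location of the object.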

Additional resources

Institutional publication guidelines

HHU Düsseldorf recommends use of ORCID and other PIDs

Publication guideline (Publikationsrichtlinie) of Heinrich-Heine-Universität Düsseldorf, dated 09.11.2023:

https://www.hhu.de/die-hhu/kontakt-und-services/zentrale-und-amtliche-bekanntmachungen/nr-34-2023

Domain-specific data repositories

Good

  • Assign PIDs / DOIs
  • Long-term accessible
  • Data type specific
  • Apply metadata standards
  • Usually recommended / required by journals
  • Mostly accepted by the community

Intermediate

  • User-friendliness
  • Different metadata schemas
  • Complex and versatile submission routines

Domain-specific data repositories

| Repository | Description | Biological data domain |
| --- | --- | --- |
| EBI-ENA | European Nucleotide Archive | genome / transcriptome sequences |
| EBI-ArrayExpress | Archive of Functional Genomics Data | transcriptome |
| EBI-MetaboLights | Database of Metabolomics | metabolome |
| EBI-PRIDE | PRoteomics IDEntifications Database | proteome |
| EBI-BioImage Archive | Stores and distributes biological images | imaging, microscopy |
| e!DAL-PGP | Plant Genomics & Phenomics Research Data Repository | phenome |
| NCBI-GEO | Gene Expression Omnibus | transcriptome |
| NCBI-GenBank | Genetic Sequence Database | genome |
| NCBI-SRA | Sequence Read Archive | genome / transcriptome sequences |
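The table can also be read as a lookup from data domain to candidate repositories. The sketch below encodes only the rows listed above. The names `REPOSITORIES_BY_DOMAIN` and `suggest_repositories` are hypothetical, and treating "genome / transcriptome sequences" as covering both domains is an assumption made for this illustration.

```python
# Candidate repositories per biological data domain, taken from the table above.
# Assumption: "genome / transcriptome sequences" is listed under both domains.
REPOSITORIES_BY_DOMAIN = {
    "genome": ["EBI-ENA", "NCBI-GenBank", "NCBI-SRA"],
    "transcriptome": ["EBI-ENA", "EBI-ArrayExpress", "NCBI-GEO", "NCBI-SRA"],
    "metabolome": ["EBI-MetaboLights"],
    "proteome": ["EBI-PRIDE"],
    "imaging": ["EBI-BioImage Archive"],
    "microscopy": ["EBI-BioImage Archive"],
    "phenome": ["e!DAL-PGP"],
}


def suggest_repositories(domain: str) -> list[str]:
    """Return the repositories listed for a data domain (hypothetical helper).

    An empty list means the table has no domain-specific entry.
    """
    return list(REPOSITORIES_BY_DOMAIN.get(domain.strip().lower(), []))


if __name__ == "__main__":
    for domain in ("proteome", "transcriptome", "soil chemistry"):
        print(domain, "->", suggest_repositories(domain))
```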

Choosing a data repository

Order of preference: domain-specific >> generic >> institutional

Find repositories at:

Generic data repositories

Good

  • Allow publication of any kind of data
  • Assign PIDs / DOIs
  • Long-term accessible
  • Very simple to use

Intermediate

  • Only generic / high-level metadata schema
  • Limited reusability

From ARC to repositories

Moving from paper to data publications

Publish your ARC, get a DOI

Publish your ARC with a few clicks

Validation

Adapted from Weil, H.L., Schneider, K., et al. (2023), PLANTdataHUB: a collaborative platform for continuous FAIR data sharing in plant research. Plant J. https://doi.org/10.1111/tpj.16474

Validate & publish

Weil, H.L., Schneider, K., et al. (2023), PLANTdataHUB: a collaborative platform for continuous FAIR data sharing in plant research. Plant J. https://doi.org/10.1111/tpj.16474

Receive a DOI

Contributors

Slides presented here include contributions by

![w:800px](https://www.nfdi4plants.org/nfdi4plants.knowledgebase/docs/img/FAIRData_ClassicalPublication.svg)