Flowchart of data and metadata in the migration process of SDDB from Contenido CMS to eSciDoc.

Source publication
Article
Full-text available
On timescales beyond the life of a research project, a core task in the curation of digital research data is the migration of data and metadata to new storage media, new hardware, and software systems. These migrations are necessitated by ageing software systems, ageing hardware systems, and the rise of new technologies in data management. Using th...
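Central to such a migration is the crosswalk of metadata fields from the legacy system to its successor. Below is a minimal sketch of that step in Python, assuming a hypothetical JSON export of legacy CMS records and invented field names; the actual Contenido-to-eSciDoc mapping described in the article will differ.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical mapping from legacy CMS field names to target metadata elements.
FIELD_MAP = {
    "art_title": "title",
    "author_name": "creator",
    "pub_date": "date",
    "summary": "description",
}

def crosswalk(record: dict) -> ET.Element:
    """Map one legacy record onto a flat, Dublin-Core-like XML element."""
    item = ET.Element("metadata")
    for legacy_key, target_key in FIELD_MAP.items():
        value = record.get(legacy_key)
        if value:  # skip empty legacy fields instead of emitting empty elements
            ET.SubElement(item, target_key).text = str(value).strip()
    return item

if __name__ == "__main__":
    # "legacy_export.json" is a hypothetical dump of CMS records.
    with open("legacy_export.json") as fh:
        for rec in json.load(fh):
            xml_str = ET.tostring(crosswalk(rec), encoding="unicode")
            print(xml_str)  # a real migration would POST this to the repository
```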

Similar publications

Article
Full-text available
Metadata plays an essential role in the long-term preservation, reuse, and interoperability of data. Nevertheless, creating useful metadata can be difficult enough, and so weakly incentivized, that many datasets may be accompanied by little or no metadata. One key challenge is, therefore, how to make metadata creation easier and more valu...

Citations

... The original data curation continuum concept has been well accepted and used across a range of settings: institutional repositories (Salo, 2008), environmental data (Baker, 2009), the sciences (Kowalczyk and Shanker, 2011), geosciences (Klump, Huber and Diepenbroek, 2016; Klump, Ulbricht and Conze, 2015), data staging repositories (Dietrich, 2010), and research data system design (Wehle, Wiebelt and Suchodoletz, 2017). ...
... For designing its institutional research data infrastructure, the Helmholtz Centre Potsdam German Research Centre for Geosciences (GFZ) in Potsdam, Germany, adopted a variation of the data curation continuum model (Klump, Ulbricht and Conze, 2015). The model was used to delineate the domains and functions of the project-specific data management portals and the generic institutional data access portal, all of which used the same institutional data storage infrastructure (Ulbricht, Elger, Bertelmann and Klump, 2016). ...
Article
Full-text available
The Data Curation Continuum was developed as a way of thinking about data repository infrastructure. Since its original development over a decade ago, a number of things have changed in the data infrastructure domain. This paper revisits the thinking behind the original data curation continuum and updates it to respond to changes in research objects, storage models, and the repository landscape in general.
... Moreover, these standards have to be modified or adjusted to the specific requirements of the data under consideration [8]. Hence, most research data repositories or services apply, or comply with, one or several existing standards that are common in their discipline or that meet the requirements of their data and data providers, as shown in a study by [9] and in examples such as [10][11][12][13][14]. Likewise, some data repositories apply multi-level approaches to data documentation [15]. ...
... Other repositories that support cross-disciplinary data have likewise found that available metadata standards do not meet their requirements [10][11][12][14]. Sometimes no one-size-fits-all metadata standard or schema is available. ...
Conference Paper
This paper presents an approach to managing metadata of (research) data from the interdisciplinary, long-term, DFG-funded collaborative research project ‘Patterns in Soil-Vegetation-Atmosphere Systems: Monitoring, Modelling, and Data Assimilation’. In this framework, a data repository, the so-called TR32DB project database, was established in 2008 with the aim of managing the data produced by the scientists involved. Documenting the data with accurate, extensive metadata has been a key task. Consequently, a standardized, interoperable, multi-level metadata schema has been designed and implemented to ensure proper documentation and publication of all project items (e.g. data, publications, reports), as well as to facilitate data search, exchange, and re-use. A user-friendly web interface was designed for simple metadata input and search.
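A multi-level schema of this kind can be modelled as a common core extended by type-specific blocks. The sketch below illustrates the pattern with invented field names; it is not the actual TR32DB schema.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class CoreMetadata:
    """Level 1: elements required for every project item (invented names)."""
    title: str
    creators: list[str]
    date: str
    item_type: str  # e.g. "dataset", "publication", "report"

@dataclass
class DatasetMetadata(CoreMetadata):
    """Level 2: additional elements that apply only to datasets."""
    spatial_extent: Optional[str] = None
    temporal_coverage: Optional[str] = None
    file_format: Optional[str] = None

# Example record using the type-specific level on top of the common core.
record = DatasetMetadata(
    title="Soil moisture time series, plot A",
    creators=["Example, A."],
    date="2012-06-01",
    item_type="dataset",
    temporal_coverage="2011/2012",
)
```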
... The strategy to migrate these data into a modern repository has been evaluated in a paper by Klump et al. (2015). ...
Article
Full-text available
The Continental Deep Drilling Program of Germany (in German: Kontinentales Tiefbohrprogramm der Bundesrepublik Deutschland, abbreviated as KTB) was a scientific drilling project near the town of Windischeschenbach, Bavaria. The KTB Depth Laboratory comprises two 9.1 km and 4 km deep, water-filled boreholes in crystalline basement rocks just 200 meters apart from each other. Available equipment such as cables, winches, geophysical borehole tools as well as workshops and office infrastructure allows for in-situ tests and experiments at different pressure and temperature conditions. The two stable wells are large-diameter steel-cased and have been geophysically monitored in detail since 1996.
... The curation policy of GFZ Data Services is based on a separation of concerns between research project and memory institution (e.g., library). In this concept, the data curation continuum from data generation through data storage to data access is divided into four "Domains of Responsibility" [14,15]. These "Domains of Responsibility" in research data management help to delineate the responsibilities of the actors involved. ...
... GFZ Data Services operate in the "Persistent" and "Access" domains and work with researchers to transfer their data into the "Persistent" domain for publication and archiving. More detail on how we implemented the curation domains based on eSciDoc can be found in Klump et al. (2015) [14]. ...
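The four-domain model can be pictured as a forward-only progression along the curation continuum. In the sketch below, the "Persistent" and "Access" domain names come from the text above, while the first two domain names and the transition rules are illustrative assumptions, not GFZ policy.

```python
from enum import Enum

class Domain(Enum):
    PRIVATE = "private"        # data generation within the research group (assumed name)
    SHARED = "shared"          # collaboration within the project (assumed name)
    PERSISTENT = "persistent"  # curated, archived, citable
    ACCESS = "access"          # publicly discoverable and accessible

# Assumed forward-only progression along the curation continuum.
NEXT_DOMAIN = {
    Domain.PRIVATE: Domain.SHARED,
    Domain.SHARED: Domain.PERSISTENT,
    Domain.PERSISTENT: Domain.ACCESS,
}

def promote(current: Domain) -> Domain:
    """Move a data object one step along the continuum, if a step exists."""
    if current not in NEXT_DOMAIN:
        raise ValueError(f"{current.value} is already the final domain")
    return NEXT_DOMAIN[current]
```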
Article
Full-text available
The GFZ German Research Centre for Geosciences is the national laboratory for Geosciences in Germany. As part of the Helmholtz Association, providing and maintaining large-scale scientific infrastructures is an essential part of GFZ activities. This includes the generation of significant volumes and numbers of research data, which subsequently become source materials for data publications. The development and maintenance of data systems is a key component of GFZ Data Services to support state-of-the-art research. A challenge lies not only in the diversity of scientific subjects and communities, but also in the different types and manifestations of how data are managed by research groups and individual scientists. The data repository of GFZ Data Services provides a flexible IT infrastructure for data storage and publication, including the minting of digital object identifiers (DOI). It was built as a modular system of several independent software components linked together through Application Programming Interfaces (APIs) provided by the eSciDoc framework. The principal application software components are panMetaDocs for data management and DOIDB for logging and moderating data publication activities. Wherever possible, existing software solutions were integrated or adapted. A summary of our experiences in operating this service is given. Data are described through comprehensive landing pages and supplementary documents, such as journal articles or data reports, thus augmenting the scientific usability of the service.
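The modular pattern the abstract describes, independent components composed through HTTP APIs, can be sketched as follows. The endpoint paths, payloads, and response shapes are hypothetical; they illustrate the composition pattern, not the actual eSciDoc or DOIDB interfaces.

```python
import requests

REPO_BASE = "https://repo.example.org"    # hypothetical eSciDoc-like deposit service
DOIDB_BASE = "https://doidb.example.org"  # hypothetical DOI moderation/logging proxy

def publish_dataset(metadata_xml: str, landing_url: str, auth: tuple[str, str]) -> str:
    """Deposit a metadata record, then request a DOI for its landing page."""
    # Step 1: deposit the record in the repository component.
    dep = requests.post(f"{REPO_BASE}/items", data=metadata_xml,
                        headers={"Content-Type": "application/xml"}, auth=auth)
    dep.raise_for_status()

    # Step 2: ask the DOI component to mint an identifier. Routing this
    # through a local proxy lets the institution log and moderate requests
    # before they reach the external DOI registration agency.
    doi = requests.post(f"{DOIDB_BASE}/doi", json={"url": landing_url}, auth=auth)
    doi.raise_for_status()
    return doi.json()["doi"]  # hypothetical response shape
```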
Article
Database maintenance and migration are critical but under‐supported activities in libraries, archives, museums (LAMs), and other scholarly spaces. Existing guidelines for digital curation rarely account for the maintenance needed to keep digital curation infrastructures functioning over time. Though many case studies have been published describing individual instances of migration, there has been little generalizable research done in this area. Thus, it is challenging to understand overall trends or best practices in this space. We bridge this gap by conducting an integrative literature review of papers describing database migrations and maintenance in LAMs and other scholarly contexts. By qualitatively coding 75 articles from 58 publication venues, we identify common motivations for database migrations and maintenance actions. We find that databases are migrated to support changing user needs as well as to ward off technological obsolescence; we also find that common challenges include schema crosswalking and a need for data cleaning. Practitioners describe community collaboration as key in surmounting these challenges. Through this integrative review, we build a base for further best practices development and identify a need to better model database curation as part of the digital curation lifecycle.
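Two of the recurring challenges named here, schema crosswalking and data cleaning, often appear together in practice. Below is a minimal sketch of a cleaning pass that might accompany a crosswalk; the date formats and rules are illustrative assumptions, not drawn from the reviewed case studies.

```python
from datetime import datetime

# Date formats assumed to occur in legacy records (illustrative).
DATE_FORMATS = ("%Y-%m-%d", "%d.%m.%Y", "%Y")

def clean_date(raw: str) -> "str | None":
    """Normalise a legacy date string to ISO 8601, or None if unparseable."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None

def clean_record(record: dict) -> "tuple[dict, list[str]]":
    """Return a cleaned copy of the record plus a list of problems found."""
    problems = []
    # Trim whitespace and drop fields that are empty after trimming.
    cleaned = {k: v.strip() for k, v in record.items()
               if isinstance(v, str) and v.strip()}
    if "date" in cleaned:
        iso = clean_date(cleaned["date"])
        if iso is None:
            problems.append(f"unparseable date: {cleaned['date']!r}")
        else:
            cleaned["date"] = iso
    return cleaned, problems
```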