Describe data by scripts for future reuse

Objectives

FAIR data principles

ExPaNDS’s ambition is to coordinate activities that enable national PaN research infrastructures (RIs) to make the majority of their data open, following the FAIR principles: Findable, Accessible, Interoperable, and Reusable.

Harmonising EOSC services

ExPaNDS will allow data to be tailored to the user’s needs, and will harmonise their efforts to migrate data analysis workflows to EOSC platforms, allowing them to be shared in a uniform way.

Nearly 120 IT professionals, scientists and managers from the Photon and Neutron (PaN) community attended the 2nd European PaN EOSC Symposium organised jointly by PaNOSC and ExPaNDS on 26th October 2021. The second part of the first session focused on a selection of use cases relating to some of the tools and services developed in the EOSC projects, for FAIR data catalogues, data analysis and simulation.

Here is the first use case. The presentation starts at 31 min 53 s in the video.

There is a long-running discussion in the PaN community about how to describe data properly and which metadata are useful. To fulfil the last letter in FAIR, data need to be reusable, which is often the most difficult task for users of large research infrastructures.

Petr Čermák presented an easy and convenient way of describing data through user scripts: taking publicly available data at PaNOSC ILL, treating them with open-source software, and publishing the scripts in a GitHub repository. He mirrored the repository at Figshare to obtain a citable entity, and showed how to use Binder to re-evaluate the data from any computer in the world “even after 100 years”.
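As a rough illustration of the Binder step, the sketch below builds a mybinder.org launch link for a GitHub repository. The owner, repository, tag, and notebook names are hypothetical placeholders, not the repository used in the presentation; the sketch assumes the public mybinder.org service and its `/v2/gh/<owner>/<repo>/<ref>` URL scheme.

```python
from urllib.parse import quote

# Public Binder deployment; other BinderHub instances use the same URL layout.
BINDER_HOST = "https://mybinder.org"


def binder_url(owner, repo, ref="HEAD", notebook=None):
    """Return a Binder launch URL for a GitHub repository.

    owner, repo: GitHub account and repository name (placeholders here).
    ref: branch, tag, or commit; pinning a tag or commit keeps the link
         pointing at one fixed version of the scripts.
    notebook: optional path of a notebook to open in JupyterLab.
    """
    # safe="" also encodes "/" so that refs such as "feature/x" stay one path segment.
    url = f"{BINDER_HOST}/v2/gh/{quote(owner)}/{quote(repo)}/{quote(ref, safe='')}"
    if notebook:
        url += "?labpath=" + quote(notebook, safe="")
    return url


# Hypothetical repository and notebook names, for illustration only.
print(binder_url("example-user", "example-data-treatment",
                 ref="v1.0", notebook="analysis.ipynb"))
```

Pinning `ref` to a tag or commit rather than a branch means the link always opens the same version of the scripts, which matches the goal of a citable, re-runnable record.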

This approach documents how the processed data are obtained, through a transparent evaluation. Referees of the upcoming publication can easily verify the data treatment process; other scientists can easily learn how the data can be treated; and, most importantly, the data treatment process is intended to keep working indefinitely.
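One way a referee might check a re-run against the published results is to compare file checksums. The sketch below is an assumption for illustration, not part of the presented workflow: the function names and the JSON manifest layout are hypothetical. The author would write a manifest of SHA-256 digests next to the processed files, and a reader would verify it after re-running the scripts.

```python
import hashlib
import json
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(files, manifest_path):
    """Record the digest of each processed file under its file name (author side)."""
    manifest = {Path(f).name: sha256_of(f) for f in files}
    Path(manifest_path).write_text(json.dumps(manifest, indent=2, sort_keys=True))
    return manifest


def verify_manifest(directory, manifest_path):
    """Compare re-generated files in `directory` with the manifest (referee side).

    Returns a dict mapping each file name to True (match) or False (mismatch).
    """
    expected = json.loads(Path(manifest_path).read_text())
    return {
        name: sha256_of(Path(directory) / name) == digest
        for name, digest in expected.items()
    }
```

Byte-identical output is a strict test: floating-point results can differ slightly between platforms or library versions, so a mismatch calls for a closer numerical comparison and does not by itself prove an error.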

DOI: 10.6084/m9.figshare.16869467.v1

Licence: Creative Commons Attribution 4.0 International

Keywords: metadata, open data, FAIR, figshare, binder, data processing, wp5-ExPaNDS

Resource type: Video Lecture, slides

External resources:

2nd PaN EOSC Symposium
