EOSCpilot Science Demonstrator: CryoEM - Life Science data workflows

SCIENTIFIC OBJECTIVES OF THE DEMONSTRATOR

After one year of work, users of a representative subset of major CryoEM Facilities in Europe will leave the Facility not only with raw and preprocessed data but also with a file linking the acquired data to the analysis workflows. This file:

  1. will contain detailed information enabling the reproduction of processing steps (assuming access to the original software),
  2. will be ready and accepted to be deposited in cryo-EM databases like EMDB and EMPIAR, and
  3. will be easy to browse and analyze over the Web.

MAIN ACHIEVEMENTS

IMPACT

By providing the means to properly record data analysis workflows in cryoEM, this Demonstrator will improve the reproducibility of cryoEM structures, allow better interoperation with distributed data and analysis sources, address provenance, and increase data reuse in a multidisciplinary context.

More generally, the Demonstrator will be an exemplar for workflows that involve the acquisition of complex experimental data and the application of sophisticated processing and modelling. Such workflows may be more complicated than typical use cases for e-infrastructure, but would provide real benefit to European scientists.

RECOMMENDATIONS FOR THE IMPLEMENTATION

  • Create a public repository of acquisition metadata and image processing workflows for new acquisitions, as a temporary repository until the data is finally analyzed and deposited in the standard public databases (EMDB and EMPIAR).
  • Create an authentication policy that allows biologists leaving an EM facility to continue their image processing on EOSC cloud machines.

Find more about this Science Demonstrator on the EOSCpilot website.


The EOSC portal has been jointly developed and maintained by the eInfraCentral, EOSC-hub, EOSCpilot and OpenAIRE-Advance projects, funded by the European Union’s Horizon 2020 research and innovation programme with the contribution of the European Commission.