Wf4Ever project
Wf4Ever was a research project funded by EU Framework 7 to investigate how scientific workflows and their data could be better preserved for reproducibility, reuse and resilience against workflow decay.
For instance, a particular computational analysis may require certain software to be preinstalled and rely on web services and databases on the network. The workflow is also executed in a particular workflow engine, possibly using plugins or additional shims for data conversions. Reference datasets are used for comparisons. All of these components are, however, subject to change, both desirable (e.g. a web service gets a new feature) and undesirable (e.g. a software binary no longer runs on a newer OS). Over time, such changes can make the computational method captured as a workflow inaccessible, incomprehensible, incompatible or unavailable.
In the project, researchers from Spain, the UK, Poland and the Netherlands collaborated to develop methods and demonstrators for workflow preservation in bioinformatics and astrophysics. They developed the concept of Research Objects to maturity and added detailed provenance capture and metadata support for the workflow systems Taverna and WINGS, while co-developing Linked Data standards such as W3C PROV and the W3C Web Annotation model.
- Homepage: http://wf4ever.org/ (was: www.wf4ever-project.org) [archived 2020-08-03]
- Funding: STREP FP7-ICT-2007-6 270192
- Duration: 2010-12-01 – 2013-11-30
- Institutions:
- iSOCO
- The University of Manchester
- Universidad Politécnica de Madrid
- University of Oxford
- Poznan Supercomputing and Networking Center
- Instituto de Astrofísica de Andalucía
- Leiden University Medical Centre
- Wikis:
- Software:
- Specifications:
- Influenced (to varying degrees and in no particular order):
- https://www.w3.org/TR/prov-overview/
- https://www.w3.org/TR/annotation-model/
- http://co.mbine.org/documents/archive
- https://fairdomhub.org/
- https://w3id.org/ro/bagit
- https://github.com/fair-research/bdbag
- http://purl.org/net/ldp4ro/spec
- http://www.rohub.org/
- https://taverna.incubator.apache.org/
- https://wings-workflows.org/
- https://workflowhub.eu/
- https://www.commonwl.org/
- https://w3id.org/cwl/prov
- https://w3id.org/ro/crate/
- http://nanopub.org/
- http://amiga.iaa.es/p/294-open-science-canube.htm
- https://scape-project.eu/
- https://esciencelab.org.uk/projects/openphacts/
- https://ever-est.eu/
- https://bioexcel.eu/
Publications
See also:
- http://wf4ever.org/web/guest/publications.html
- https://www.researchobject.org/publications/
- http://www.rohub.org/about
Khalid Belhajjame, Jun Zhao, Daniel Garijo, Matthew Gamble, Kristina Hettne, Raul Palma, Eleni Mina, Oscar Corcho, José Manuel Gómez-Pérez, Sean Bechhofer, Graham Klyne, Carole Goble (2015) Using a suite of ontologies for preserving workflow-centric research objects, Web Semantics: Science, Services and Agents on the World Wide Web, https://doi.org/10.1016/j.websem.2015.01.003
Stian Soiland-Reyes, Pinar Alper, Carole Goble (2016): Tracking workflow execution with TavernaProv. At: PROV: Three Years Later, Provenance Week 2016. https://doi.org/10.5281/zenodo.51314 [html] [pdf] [source]
Eleni Mina, Mark Thompson, Rajaram Kaliyaperumal, Jun Zhao, Eelke van der Horst, Zuotian Tatum, Kristina M Hettne, Erik A Schultes, Barend Mons, Marco Roos (2015): Nanopublications for exposing experimental data in the life-sciences: a Huntington’s Disease case study. Journal of Biomedical Semantics 6(5). https://doi.org/10.1186/2041-1480-6-5
Daniel Garijo, Nandana Mihindukulasooriya, Oscar Corcho (2015): LDP4ROs: Managing Research Objects with the W3C Linked Data Platform. WWW ’15 Companion. Proceedings of the 24th International Conference on World Wide Web. https://doi.org/10.1145/2740908.2742016 [pdf]
Kristina M Hettne, Harish Dharuri, Jun Zhao, Katherine Wolstencroft, Khalid Belhajjame, Stian Soiland-Reyes, Eleni Mina, Mark Thompson, Don Cruickshank, Lourdes Verdes-Montenegro, Julian Garrido, David de Roure, Oscar Corcho, Graham Klyne, Reinout van Schouwen, Peter A ’t Hoen, Sean Bechhofer, Carole Goble, Marco Roos (2014) Structuring research methods and data with the research object model: genomics workflows as a case study, Journal of Biomedical Semantics 5(1), p. 41, https://doi.org/10.1186/2041-1480-5-41
Raúl Palma, Piotr Hołubowicz, Kevin Page, Stian Soiland-Reyes, Graham Klyne, Cezary Mazurek (2014) A Suite of APIs for the Management of Research Objects, Proceedings of the ISWC Developers Workshop 2014, ISWC-DEV 2014, CEUR-WS.org, online http://ceur-ws.org/Vol-1268/paper8.pdf
Raúl Palma, Piotr Hołubowicz, Oscar Corcho, José Manuel Gómez-Pérez, Cezary Mazurek (2014): ROHub — A Digital Library of Research Objects Supporting Scientists Towards Reproducible Science. Semantic Web Evaluation Challenge. SemWebEval 2014 at ESWC 2014. https://doi.org/10.1007/978-3-319-12024-9_9 [pdf, archived 2016-04-12] [preprint o:378711] [proceedings p361]
Kevin R. Page, Raul Palma, Piotr Hołubowicz, Graham Klyne, Stian Soiland-Reyes, Daniel Garijo, Khalid Belhajjame, and Rudolf Mayer: Research Objects for Audio Processing: Capturing Semantics for Reproducibility, 53rd Audio Engineering Society International Conference on Semantic Audio, January 2014. ISBN: 978-0-937803-96-7 [Proceedings] [Preprint] [escholar]
David De Roure (2014): Executable Music Documents. DLfM ’14 Proceedings of the 1st International Workshop on Digital Libraries for Musicology. https://doi.org/10.1145/2660168.2660183 [preprint] [preprint server]
Sean Bechhofer, Iain Buchan, David De Roure, Paolo Missier, John Ainsworth, Jiten Bhagat, Phillip Couch, Don Cruickshank, Mark Delderfield, Ian Dunlop, Matthew Gamble, Danius Michaelides, Stuart Owen, David Newman, Shoaib Sufi, Carole Goble (2013) Why Linked Data is Not Enough for Scientists, Future Generation Computer Systems 29(2), February 2013, Pages 599-611, ISSN 0167-739X, https://doi.org/10.1016/j.future.2011.08.004 [preprint] [researchgate]
Jun Zhao, Graham Klyne, Matthew Gamble and Carole Goble (2013): A Checklist-Based Approach for Quality Assessment of Scientific Information. 3rd International Workshop on Linked Science 2013—Supporting Reproducibility, Scientific Investigations and Experiments (LISC2013). [preprint] [slides]
Carole Goble, David De Roure, Sean Bechhofer (2013) Accelerating Scientists’ Knowledge Turns, Communications in Computer and Information Science 348, p. 3-25, https://doi.org/10.1007/978-3-642-37186-8_1
David De Roure (2013) Towards Computational Research Objects, Proceedings of the 1st International Workshop on Digital Preservation of Research Methods and Artefacts, p. 16-19, ACM, https://doi.org/10.1145/2499583.2499590
Khalid Belhajjame, Oscar Corcho, Daniel Garijo, Jun Zhao, Paolo Missier, David R. Newman, Raul Palma, Sean Bechhofer, Esteban Garcia Cuesta, Jose Manuel Gomez-Perez, Graham Klyne, Kevin Page, Marco Roos, José Enrique Ruiz, Stian Soiland-Reyes, Lourdes Verdes-Montenegro, David De Roure, Carole Goble (2012): Workflow-Centric Research Objects: A First Class Citizen in the Scholarly Discourse. Proceedings of the Workshop on the Semantic Publishing (SePublica 2012) (i), p. 1-12. CEUR Workshop Proceedings 903. http://ceur-ws.org/Vol-903/paper-01.pdf
Jun Zhao, Jose Manuel Gomez-Perez, Khalid Belhajjame, Graham Klyne, Esteban Garcia-Cuesta (2012) Why Workflows Break – Understanding and Combating Decay in Taverna Workflows, 8th International Conference on E-Science (e-Science), IEEE. https://doi.org/10.1109/eScience.2012.6404482 [preprint]
Matthew Gamble, Carole Goble, Graham Klyne, Jun Zhao (2012) MIM: A minimum information model vocabulary and framework for scientific linked data, 2012 IEEE 8th International Conference on E-Science, e-Science 2012. https://doi.org/10.1109/eScience.2012.6404489 [preprint]
David De Roure, Khalid Belhajjame, Paolo Missier, José Manuel Gómez-Pérez, Raúl Palma, José Enrique Ruiz, Kristina Hettne, Marco Roos, Graham Klyne, Carole Goble (2011): Towards the Preservation of Scientific Workflows. iPRES 2011 – 8th International Conference on Preservation of Digital Objects. https://doi.org/11353/10.294253 [researchgate] [preprint o:294253] [proceedings p228]
Sean Bechhofer, John Ainsworth, Jiten Bhagat, Iain Buchan, Phillip Couch, Don Cruickshank, David De Roure, Mark Delderfield, Ian Dunlop, Matthew Gamble, Carole Goble, Danius Michaelides, Paolo Missier, Stuart Owen, David Newman, Shoaib Sufi (2010): Why Linked Data is Not Enough for Scientists, Sixth IEEE e-Science conference (e-Science 2010) https://doi.org/10.1109/eScience.2010.21 [preprint] [pdf]
Sean Bechhofer, David De Roure, Matthew Gamble, Carole Goble, Iain Buchan (2010): Research Objects: Towards Exchange and Reuse of Digital Knowledge, The Future of the Web for Collaborative Science (FWCS 2010). https://doi.org/10.1038/npre.2010.4626.1
Deliverables
This list of project deliverables is adapted from http://wf4ever.org/web/guest/deliverables.html, with updated hyperlinks because repo.wf4ever-project.org is no longer available. Except where noted as confidential, these project deliverables are all Open Access, licensed under Creative Commons Attribution 3.0 (CC-BY-3.0).
- WP1: Software Architecture and Technology for Scientific Workflow Preservation
- D1.1 Setup of software development technologies oai:repo.wf4ever-project.org:9 (31/12/2010)
- D1.2: Wf4Ever Sandbox v1 oai:repo.wf4ever-project.org:14 (31/05/2011)
- D1.2v2 Wf4Ever Sandbox - Phase II oai:repo.wf4ever-project.org:35 (31/07/2012)
- D1.2v3 Wf4Ever Sandbox - Phase III oai:repo.wf4ever-project.org:47 (31/07/2013)
- D1.2v4 Wf4Ever Sandbox - Phase IV Final oai:repo.wf4ever-project.org:54 (30/11/2013)
- D1.3v1: Wf4Ever Architecture - Phase I oai:repo.wf4ever-project.org:28 (30/11/2011)
- D1.3v1: Addendum: Wf4Ever Architecture - Phase I oai:repo.wf4ever-project.org:43 (16/03/2012)
- D1.3v2 Wf4Ever Architecture - Phase II oai:repo.wf4ever-project.org:46 (31/03/2013)
- D1.4v1 Reference Wf4Ever Implementation - Phase I oai:repo.wf4ever-project.org:36 (31/07/2012)
- D1.4v2 Reference Wf4Ever Implementation - Phase II oai:repo.wf4ever-project.org:48 (31/07/2013)
- D1.5 Opportunities for applying Wf4Ever architecture and technologies oai:repo.wf4ever-project.org:55 (30/11/2013)
- WP2: Work package description: Workflow Lifecycle Management
- D2.1 Workflow Lifecycle Management Initial Requirements oai:repo.wf4ever-project.org:12 (26/05/2011)
- D2.2v1 Design, implementation and deployment of workflow lifecycle management components - Phase I oai:repo.wf4ever-project.org:64 (31/07/2012) https://github.com/wf4ever/deliverables/tree/master/D2.2v1
- D2.2v2 Design, implementation and deployment of workflow lifecycle management components - Phase II oai:repo.wf4ever-project.org:49 (31/07/2013) https://github.com/wf4ever/deliverables/tree/master/D2.2v2
- D2.3 Final evaluation report of workflow lifecycle management components oai:repo.wf4ever-project.org:56 (30/11/2013)
- WP3: Work package description: Workflow Evolution, Sharing and Collaboration
- D3.1: Workflow Evolution, Sharing and Collaboration Initial Requirements oai:repo.wf4ever-project.org:17 (31/05/2011)
- D3.2v1: Design, implementation and deployment of Workflow Evolution, Sharing and Collaboration components oai:repo.wf4ever-project.org:65 (31/07/2012)
- D3.2v2 Design, implementation and deployment of Workflow Evolution, Sharing and Collaboration components - Phase II oai:repo.wf4ever-project.org:50 (31/07/2013)
- D3.3 Final evaluation report of Workflow Evolution, Sharing and Collaboration components oai:repo.wf4ever-project.org:57 (30/11/2013)
- WP4: Work package description: Workflow Integrity and Authenticity Maintenance
- D4.1: Workflow Integrity and Authenticity Maintenance Initial Requirements oai:repo.wf4ever-project.org:18 (31/05/2011)
- D4.2v1: Design, implementation and deployment of Workflow Integrity and Authenticity Maintenance components - Phase I oai:repo.wf4ever-project.org:66 (31/07/2012)
- D4.2v2 Design, implementation and deployment of Workflow Integrity and Authenticity Maintenance components - Phase II oai:repo.wf4ever-project.org:51 (31/07/2013) https://github.com/wf4ever/deliverables/tree/master/D4.2v2
- D4.3 Final evaluation report of the workflow integrity and authenticity maintenance components oai:repo.wf4ever-project.org:58 (30/11/2013) https://github.com/wf4ever/deliverables/tree/master/D4.3
- WP5: Work package description: Workflow Preservation in Astronomy
- D5.1: Astronomy Workflow Preservation Requirements oai:repo.wf4ever-project.org:23 (31/05/2011)
- D5.2: Creation of an Astrophysical Community of Preserved Workflow Users oai:repo.wf4ever-project.org:59 (30/11/2013)
- D5.3v1: Propagation of interdependent quantities in the calculation of luminosities of galaxies oai:repo.wf4ever-project.org:29 (30/11/2011) https://doi.org/10.5281/zenodo.6804281
- D5.3v2 Calculation of Luminosity profiles for a Sample of Galaxies extracted from Catalogues using Isolation Criteria oai:repo.wf4ever-project.org:41 (30/09/2012) https://doi.org/10.5281/zenodo.6804335
- D5.3v3 Astronomy Workflows v3 oai:repo.wf4ever-project.org:63 (30/09/2013)
- WP6: Work package description: Workflow Preservation for Genome Wide Analyses for Genomics and BioBanking communities
- D6.1: Genomics Workflow Preservation Requirements oai:repo.wf4ever-project.org:21 (31/05/2011) https://doi.org/10.5281/zenodo.6759827
- D6.1: Addendum: Biological User Requirements oai:repo.wf4ever-project.org:30 (3/12/2011) https://doi.org/10.5281/zenodo.6759872
- D6.2: Final Report on the creation of a Community of Users in Genomics oai:repo.wf4ever-project.org:53 (30/11/2013) https://doi.org/10.5281/zenodo.6759940
- D6.3v1: Genome Wide Association Study Workflows v1 oai:repo.wf4ever-project.org:31 (2/12/2011) https://doi.org/10.5281/zenodo.3967616
- D6.3v2 Genomic Wide Association Study Workflows v2 oai:repo.wf4ever-project.org:42 (30/09/2012) https://doi.org/10.5281/zenodo.3967633
- D6.3v3 Genomic Wide Association Study Workflows v3 oai:repo.wf4ever-project.org:52 (30/09/2013) https://doi.org/10.5281/zenodo.3970778
- WP7: Work package description: Project Management
- D7.1: Quality and Risk Contingency Plan oai:repo.wf4ever-project.org:33 (30/11/2011)
- D7.2v1 Periodic Activity Report (30/11/2011) confidential
- D7.2v2 Periodic Activity Report (30/11/2012) confidential
- D7.3 Periodic Activity Report (30/11/2013) confidential
- WP8: Work package description: Dissemination, Transfer and Exploitation
- D8.1: Wf4Ever web site oai:repo.wf4ever-project.org:11 (1/10/2011)
- D8.2: Dissemination, Transfer and Exploitation oai:repo.wf4ever-project.org:24 (31/05/2011)
- D8.3: SWOT Analysis (30/11/2011) confidential
- D8.4v1 Report on Dissemination Activities v1 oai:repo.wf4ever-project.org:44 (31/05/2012)
- D8.4v2 Report on Dissemination Activities v2 oai:repo.wf4ever-project.org:61 (30/11/2013)
- D8.5v1 Exploitation Plan v1 (31/05/2012) confidential
- D8.5v2 Exploitation Plan v2 (30/11/2013) confidential
Knowledge Hub
Adapted from http://wf4ever.org/web/guest/knowledge-hub.html: