tl;dr: Senior Lecturer in FAIR, Open and Reproducible Digital Research. Research interests include Linked Data, RESTful web services, metadata, provenance, annotations, open source and software development best practices.
- scholar.social/@soilandreyes (Mastodon)
Stian Soiland-Reyes is a Senior Lecturer in FAIR, Open and Reproducible Digital Research and co-leader of the eScience Lab in the Department of Computer Science at The University of Manchester.
In his research he is working on reproducibility, provenance and metadata to improve FAIR sharing of open research data and computational workflows, within ELIXIR Europe, European Open Science Cloud (EOSC) projects and the workflow repository WorkflowHub.eu, as well as building FAIR approaches for computational workflows and FAIR Digital Objects. Stian contributed to the W3C PROV standard and its implementation in workflow systems.
As one of the early founders to the Research Object approach for sharing research outputs using Linked Data, he is the co-lead of RO-Crate, a community-based lightweight approach to packaging and publishing research data with structured metadata.
Stian is on the leadership team of Common Workflow Language, a standard for interoperable computational workflows, as well as on the technical steering committee for BioCompute Object, an approach for describing workflows for submission to regulatory authorities. Stian is the Technical and Community Advisor for the recently started Workflows Community Initiative.
Stian graduated in 2006 with an MSc in Computer Science from the Norwegian University of Science and Technology (NTNU) and in 2023 submitted his PhD thesis at the Informatics Institute of the University of Amsterdam.
He has more than two decades of experience as an IT professional, including 16 years as a software developer working across academia and industry. He joined the eScience Lab in 2005 (then called the myGrid team), where he became the Technical Lead Developer of the open-source scientific workflow management system Taverna. As part of the EU FP7-funded Wf4Ever project, Stian contributed to the W3C PROV specifications for capturing provenance and co-developed the Research Object model and ontologies. More recently Stian has continued this passion for reproducibility, structured metadata and open scholarship with engagement in several projects related to FAIR, computational workflows, and RO-Crate. In 2023, Stian became a Research Fellow and later a Senior Lecturer at the Department of Computer Science.
Stian is co-leading and contributing to eScience Lab’s participation in several European-wide research projects:
- EVERSE – European Virtual Institute for Research Software Excellence. Starting March 2024. Includes support for RO-Crate in the Workflow Execution Service (WfExS), continuing work started in EOSC-Life, BY-COVID and TRE-FX.
- HDR Federated Analytics – understand the challenges of computational reproducibility and FAIR data sharing within HDR UK federated data infrastructures, extends on TRE-FX prototype.
HDR UK, UKRI HDR-CORE, HDR QQ2
- EuroScienceGateway – making a federated open access computational infrastructure through Galaxy as a gateway for data resources, tools and applications following the FAIR principles.
http://eurosciencegateway.eu/ Horizon Europe 101057388, UKRI 10038963
- FAIR-IMPACT – which is improving the FAIR publication and indexing of semantic artefacts and their catalogues, with associated Linked Data tooling including RO-Crate.
http://fair-impact.eu/ Horizon Europe 101057344, UKRI 10038992
- BioDT – making Digital Twins for BioDiversity using FAIR Digital Object, RO-Crate, FAIR Workflows and WorkflowHub.
https://biodt.eu/ Horizon Europe 101057437, UKRI 10038930)
- BY-COVID – making COVID-19 data open, standardized and linked using FAIR metadata and computational workflows. Stian is a task leader and focus on metadata standards, provenance and Workflow Run RO-Crate.
https://by-covid.eu/ Horizon Europe #101046203
- Biodiversity Genomics Europe (BGE) – where the eScience Lab will help develop FAIR Data infrastructures.
https://biodiversitygenomics.eu/ Horizon Europe #101059492, UKRI 10040409
- TRE-FX – Delivering a federated network of TREs to enable safe analytics. Workflow execution in Trusted Research Environments using 5-Safe RO-Cratre.
https://trefx.uk/ DARE UK, UKRI MC_PC_23007
- EOSC-Life – FAIR data processing for life sciences. Here Stian mainly contributed to the Tools Collaboratory for WorkflowHub, using RO-Crate, CWL, Galaxy and other existing workflow systems.
https://www.eosc-life.eu/ Horizon 2020 #824087
- Synthesys+ where Stian led Manchester’s part of developing the Specimen Data Refinery using Galaxy workflows, RO-Crate for provenance and FAIR Digital Objects using Open Digital Specimens.
https://www.synthesys.info/ Horizon 2020 #823827
- BioExcel, a Centre of Excellence for Computational Biomolecular Research, where Stian was deputy work package leader with focus on workflows, scalability, reproducibility, metadata.
http://bioexcel.eu/ Horizon 2020 #823830, #675728
- BCO-RO-Crate (NIH), consultancy for HIVE Lab at George Washington University to develop tutorial for packaging BioCompute Objects as RO-Crate Research Objects.
- FAIR-CURES-RO (NIH/NHLBI), subcontract for Mendeley Data to compose Research Objects through a REST API to be used in the FAIR4CURES participation in the NHLBI BioData Catalyst project.
- Open PHACTS, bringing together pharmacological data resources in an integrated, interoperable infrastructure using Linked Data.
https://www.openphactsfoundation.org/ IMI 115191
- Wf4Ever, developing methods for preservation of computational workflows, their dependencies and provenance. This was demonstrated using Taverna and Research Objects.
http://wf4ever.org/ FP7 #270192
- caGRID, where Stian was integrating Taverna into the caBIG infrastructure using Grid/WSDL/WSRF
- OMII-UK (EPS), where the myGrid team developed the Taverna workflow system.
Teaching and supervising
Stian teaches provenance models as part of the course unit Understanding Data and their Environment in the cross-faculty MSc Data Science programme at The University of Manchester.
As of 2023 Stian is supervising 1 PhD student in Manchester in collaboration with Professor Carole Goble, and will also supervise MSc projects for 2023/2024.
In 2023 the eScience Lab welcomed applications for a PhD studentship for Sharing reproducible analyses in Big Healthcare Data Infrastructures, funded by Health Data Research UK to be supervised by Stian and Carole Goble.
He has previously co-supervised 1 PhD student and multiple BSc/MSc student projects. He has informally mentored PhD students internationally, as well as mentoring several student projects of Google Summer of Code.
In July 2019, Stian started as a PhD Candidate in the INDElab at the Informatics Institute of the University of Amsterdam.
Member of committees and professional bodies
Stian has participated in W3C activities, co-authoring W3C Recommendations for Provenance, and collaborating on W3C Community efforts like Open Annotation (now Web Annotation Model), JSON-LD and w3id. He edited the ORE JSON-LD specification. He is part of the Common Workflow Language leadership team and co-author of the CWL specification. He is a Force11 member, where he promotes reproducibility best practices and open science. He is a member of the Technical Steering Committee for BioCompute Objects, and co-chair of Research Object Crate.
Stian is an active open source developer, and have contributed to software like ORCID, Apache Jena, OWL API, JSON-LD Java, as well as maintaining several Docker images. He is a Foundation Member of the Apache Software Foundation, where he is a on the Project Management Committee of Apache Commons, and Apache Incubator. He was a committer on Taverna in Apache Incubator and worked on Commons RDF. He is an occassional curator of abandomware including jai-imageio-core and beanshell.