BioCompute Object
Abbreviation | BCO |
---|---|
Status | Active IEEE Working Group |
Related standards | Common Workflow Language |
License | BSD-3-clause |
Website | osf |
The BioCompute Object (BCO) project is a community-driven initiative to build a framework for standardizing and sharing computations and analyses generated from
Originally started as a collaborative contract between the George Washington University and the Food and Drug Administration, the project has grown to include over 20 universities, biotechnology companies, public-private partnerships and pharmaceutical companies including Seven Bridges and Harvard Medical School.[2] The BCO aims to ease the exchange of HTS workflows between various organizations, such as the FDA, pharmaceutical companies, contract research organizations, bioinformatic platform providers, and academic researchers. Due to the sensitive nature of regulatory filings, few direct references to material can be published. However, the project is currently funded to train FDA Reviewers and administrators to read and interpret BCOs, and currently has 4 publications either submitted or nearly submitted.
Background
One of the biggest challenges in bioinformatics is documenting and sharing
Utility
As a standardization for genomic data, BioCompute Objects are mostly useful to three groups of users: 1) academic researchers carrying out new genetic experiments, 2) pharma/biotech companies that wish to submit work to the FDA for regulatory review, and 3) clinical settings (hospitals and labs) that offer genetic tests and personalized medicine. The utility to academic researchers is the ability to reproduce experimental data more accurately and with less uncertainty. The utility to entities wishing to submit work to the FDA is a streamlined approach, again with less uncertainty and with the ability to more accurately reproduce work. For clinical settings, it is critical that HTS data and clinical metadata be transmitted in an accurate way, ideally in a standardized way that is readable by any stakeholder, including regulatory partners.
Format
The BioCompute Object is in
BCO Consortium
The BioCompute Object working group facilitated a means for different stakeholders to provide input on current practices on the BCO. This working group was formed during preparation for the 2017 HTS Computational Standards for Regulatory Sciences Workshop, and was initially made up of the workshop participants. The growth and work of the BCO working group, as a direct result of the interaction between a variety of stakeholders from all interested communities, culminated in the official standard, IEEE 2791-2020, which was approved in January 2020. A Public-Private partnerships was formed between GWU and CBER and has become an easy point of entry for new individuals or institutions into the BCO project to participate in the discussion of best practices for the objects.
Implementations
The simple R package biocompute[8] can create, validate, and export BioCompute Objects. The Genomics Compliance Suite is a Shiny app that offers similar features to regular expressions found in all modern text editors. There are several internally developed open source software packages and web applications that implement the BioCompute specification, three of which have been deployed in a publicly accessible AWS EC2 cloud. These include an instance of the High-performance Integrated Virtual Environment, the BioCompute Portal[9] (a form-based web application that can create and edit BioCompute Objects based on the IEEE-2791-2020 standard, and a BioCompute compliant instance of Galaxy.
Some bioinformatics platforms have built-in support for Biocompute, which let a user automatically create a BCO from a workflow and edit the descriptive content.
- DNAnexus and PrecisionFDA facilitate the generation of BCOs by importing workflows, allowing users to edit descriptive content. The platform supports metadata import and export of WDL and CWL scripts, and offers the BCOnexus tool, which is a high-level, platform-free tool with a graphical user interface that lets a user merge BCOs.
- Velsera's Seven Bridges Genomics and Cancer Genomics Cloud also have support for BioCompute by enabling direct pre-population of BCO fields from workflows.
- BioCompute has also been integrated into HIVE and the main Galaxy instance, both of which similarly enable users to automatically generate BCOs and edit content within these platforms.
- BioCompute has also been implemented in the Common Fund Data Elements Playbook Partnership project. This implementation lets a user save a workflow when they're satisfied with the results, which aids in traceability through the network of independently-versioned resources, allowing users to save queries and annotate them for future use, sharing, or repeatability, aligning with its role in advancing bioinformatics practices.
Integration into platforms is meant to improve data handling and collaboration and provide effective ways for users to execute a workflow, and graphical representations of BCOs are often more intuitive ways of browsing or reading BCOs.
References
- ^ Simonyan V, Goecks J, Mazumder R. Biocompute Objects—A Step towards Evaluation and Validation of Biomedical Scientific Computations. PDA journal of pharmaceutical science and technology. 2017;71(2):136-146. doi:10.5731/pdajpst.2016.006734.
- ^ "BioCompute Objects specifications to advance genomic data analysis". www.europeanpharmaceuticalreview.com. Retrieved 2017-12-21.
- PMID 24204232.
- PMID 26978244.
- PMID 30596645– via www.biorxiv.org.
- ^ "Provenance-example-biocompute-object". HL7 FHIR Release 3 (STU).
- ^ Soiland-Reyes, Stian (2020-09-01), hive-cwl-examples: Packaging BioCompute Objects using RO-Crate
- ^ "CRAN - Package biocompute". cran.r-project.org. Retrieved 2019-11-28.
- ^ "BioCompute Portal". github.com/biocompute-objects. Retrieved 2020-06-25.