Back to Search Start Over

Incorporating biological structure into machine learning models in biomedicine.

Authors :
Crawford, Jake
Greene, Casey S
Source :
Current Opinion in Biotechnology. Jun2020, Vol. 63, p126-134. 9p.
Publication Year :
2020

Abstract

Schematic showing the main categories of models incorporating structured biological data covered in this review. The first panel shows an example of a model operating on sequence data, the second panel shows a model in which dimension reduction is influenced by the connections in a gene network, and the third panel shows a neural network with structure constrained by a phylogeny or ontology. The 'x' values in the data tables represent gene expression measurements. In biomedical applications of machine learning, relevant information often has a rich structure that is not easily encoded as real-valued predictors. Examples of such data include DNA or RNA sequences, gene sets or pathways, gene interaction or coexpression networks, ontologies, and phylogenetic trees. We highlight recent examples of machine learning models that use structure to constrain model architecture or incorporate structured data into model training. For machine learning in biomedicine, where sample size is limited and model interpretability is crucial, incorporating prior knowledge in the form of structured data can be particularly useful. The area of research would benefit from performant open source implementations and independent benchmarking efforts. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09581669
Volume :
63
Database :
Academic Search Index
Journal :
Current Opinion in Biotechnology
Publication Type :
Academic Journal
Accession number :
143825834
Full Text :
https://doi.org/10.1016/j.copbio.2019.12.021