Back to Search
Start Over
PrISM: precision for integrative structural models
- Source :
- Bioinformatics. 38:3837-3839
- Publication Year :
- 2022
- Publisher :
- Oxford University Press (OUP), 2022.
-
Abstract
- Integrative modeling of macromolecular structures usually results in an ensemble of models that satisfy the input information. The model precision, or variability among these models is estimated globally, i.e., a single precision value is reported for the model. However, it would be useful to identify regions of high and low precision. For instance, low-precision regions can suggest where the next experiments could be performed and high-precision regions can be used for further analysis, e.g., suggesting mutations. We develop PrISM (Precision for Integrative Structural Models), using autoencoders, a type of unsupervised deep neural network, to efficiently and accurately annotate precision for integrative models. The method is benchmarked and tested on five examples of binary protein complexes and five examples of large protein assemblies. The annotated precision is shown to be consistent with localization density maps, while providing more fine-grained information. Finally, the generated networks are also interpreted by gradient-based attention analysis. Significance Statement Validation of integrative models and data is an open research challenge. This is timely due to the new worldwide Protein Data Bank archive for integrative structures (http://pdb-dev.wwpdb.org). Currently, a single precision value is reported for an integrative model. However, precision may vary for different regions of an integrative model owing to varying amounts of information available for different regions. We develop a method using unsupervised deep learning to efficiently and accurately annotate precision for regions of an integrative model. This will ultimately improve the quality and utility of deposited structures.
- Subjects :
- Statistics and Probability
Artificial neural network
Computer science
business.industry
Deep learning
Binary number
Value (computer science)
computer.file_format
Protein Data Bank
computer.software_genre
Biochemistry
Single-precision floating-point format
Computer Science Applications
Models, Structural
Benchmarking
Computational Mathematics
Open research
Computational Theory and Mathematics
Data mining
Artificial intelligence
business
computer
Molecular Biology
Software
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 38
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....68027ec6ca3145d8dc42a3ef06fc1ebd