114 results on '"Carazo, J M"'
Search Results
2. A deep learning approach to the automatic detection of alignment errors in cryo-electron tomographic reconstructions
- Author
-
Ministerio de Ciencia, Innovación y Universidades (España), de Isidro-Gómez, F P, Vilas, J L, Losana, P, Carazo, J M, Sorzano, Carlos Óscar S., Ministerio de Ciencia, Innovación y Universidades (España), de Isidro-Gómez, F P, Vilas, J L, Losana, P, Carazo, J M, and Sorzano, Carlos Óscar S.
- Abstract
Electron tomography is an imaging technique that allows for the elucidation of three-dimensional structural information of biological specimens in a very general context, including cellular in situ observations. The approach starts by collecting a set of images at different projection directions by tilting the specimen stage inside the microscope. Therefore, a crucial preliminary step is to precisely define the acquisition geometry by aligning all the tilt images to a common reference. Errors introduced in this step will lead to the appearance of artifacts in the tomographic reconstruction, rendering them unsuitable for the sample study. Focusing on fiducial-based acquisition strategies, this work proposes a deep-learning algorithm to detect misalignment artifacts in tomographic reconstructions by analyzing the characteristics of these fiducial markers in the tomogram. In addition, we propose an algorithm designed to detect fiducial markers in the tomogram with which to feed the classification algorithm in case the alignment algorithm does not provide the location of the markers. This open-source software is available as part of the Xmipp software package inside of the Scipion framework, and also through the command-line in the standalone version of Xmipp.
- Published
- 2024
3. Biclustering of gene expression data by non-smooth non-negative matrix factorization
- Author
-
Carmona Saez, P., Pascual Marqui, R. D., Tirado Fernández, Francisco, Carazo, J. M., Pascual Montano, Alberto, Carmona Saez, P., Pascual Marqui, R. D., Tirado Fernández, Francisco, Carazo, J. M., and Pascual Montano, Alberto
- Abstract
© 2006 Carmona-Saez et al; licensee BioMed Central Ltd. This work has been supported by the Spanish grants GR/SAL/0653/2004, CICYT BFU2004-00217/BMC, GEN2003-20235-c05-05, TIN2005-5619, PR27/05-13964-BSCH and a collaborative grant between the Spanish Research Council and the National Research Council of Canada (CSIC050402040003). The authors also thank the KEY Foundation for Brain-Mind Research in Zurich for partial economical support of this work. P.C.S. is the recipient of a fellowship from Comunidad de Madrid (CAM). A.P.M. acknowledges the support of the Spanish Ramón y Cajal program., Background: The extended use of microarray technologies has enabled the generation and accumulation of gene expression datasets that contain expression levels of thousands of genes across tens or hundreds of different experimental conditions. One of the major challenges in the analysis of such datasets is to discover local structures composed by sets of genes that show coherent expression patterns across subsets of experimental conditions. These patterns may provide clues about the main biological processes associated to different physiological states. Results: In this work we present a methodology able to cluster genes and conditions highly related in sub-portions of the data. Our approach is based on a new data mining technique, Non-smooth Non-Negative Matrix Factorization (nsNMF), able to identify localized patterns in large datasets. We assessed the potential of this methodology analyzing several synthetic datasets as well as two large and heterogeneous sets of gene expression profiles. In all cases the method was able to identify localized features related to sets of genes that show consistent expression patterns across subsets of experimental conditions. The uncovered structures showed a clear biological meaning in terms of relationships among functional annotations of genes and the phenotypes or physiological states of the associated conditions. Conclusion: The proposed approach can be a useful tool to analyze large and heterogeneous gene expression datasets. The method is able to identify complex relationships among genes and conditions that are difficult to identify by standard clustering algorithms., Spanish Research Council, National Research Council of Canada, KEY Foundation for Brain- Mind Research (Zurich), Comunidad de Madrid (CAM)., Spanish Ramón y Cajal program, Sección Deptal. de Arquitectura de Computadores y Automática (Físicas), Fac. de Ciencias Físicas, TRUE, pub
- Published
- 2023
4. Image processing tools for the validation of CryoEM maps
- Author
-
Sorzano, Carlos O. S, Vilas Prieto, José Luis, Ramírez Aportela, Erney, Krieger, J., Hoyo, D. del, Herreros, David, Fernández Giménez, Estrella, Marchán, D., Macías, J. R., Sánchez, I., Caño, L. del, Fonseca Reyna, Y., Conesa, A., García Mena, A., Burguet Castell, Jordi, García Condado, J., Méndez García, J., Martínez Orts, Mónica, Muñoz Barrutia, A., Marabini, R., Vargas Balbuena, Javier, Carazo, J. M., Sorzano, Carlos O. S, Vilas Prieto, José Luis, Ramírez Aportela, Erney, Krieger, J., Hoyo, D. del, Herreros, David, Fernández Giménez, Estrella, Marchán, D., Macías, J. R., Sánchez, I., Caño, L. del, Fonseca Reyna, Y., Conesa, A., García Mena, A., Burguet Castell, Jordi, García Condado, J., Méndez García, J., Martínez Orts, Mónica, Muñoz Barrutia, A., Marabini, R., Vargas Balbuena, Javier, and Carazo, J. M.
- Abstract
© The Royal Society of Chemistry 2022. We are thankful to Philip Baldwin, Dmitry Lyumkis, and Gabriel Lander for making their validation methods available (4.h and A.c). Javier Vargas and Jordi Burguet would like to thank the Spanish Ministry of Science and Innovation for financial support through the call 2019 Proyectos de I+D+i - RTI Tipo A (PID2019-108850RA-I00) and Arrate Muñoz Barrutia, PID2019-109820RB-I00, MCIN/AEI/10.13039/501100011033/, cofinanced by European Regional Development Fund (ERDF), "A way of making Europe.". The authors acknowledge the economic support from MICIN of the Instruct Image Processing Center (I2PC) as part of the Spanish participation in Instruct-ERIC, the European Strategic Infrastructure Project (ESFRI) in the area of Structural Biology, Grant PID2019-104757RB-I00 funded by MCIN/AEI/10.13039/501100011033/ and "ERDF A way of making Europe", by the European Union and Grant PRE2020 - 093527 funded by MCIN/AEI/ 10.13039/501100011033 and by "ESF Investing in your future". We also acknowledge support from "Comunidad Autonoma de Madrid" through Grant: S2017/BMD-3817, Instituto de Salud Carlos III (project IMPaCT-Data, exp. IMP/00019), co-funded by the European Union, European Regional Development Fund (ERDF, "A way to make Europe"), and European Union (EU) and Horizon 2020 through grants: EOSC Life (INFRAEOSC-04-2018, Proposal: 824087), HighResCells (ERC - 2018 - SyG, Proposal: 810057), IMpaCT (WIDESPREAD-03-2018 - Proposal: 857203), EOSC - Synergy (EINFRA-EOSC-5, Proposal: 857647), iNEXT-Discovery (Proposal: 871037), EnLaCES (H2020-MSCA-IF-2020, Proposal: 101024130)., The number of maps deposited in public databases (Electron Microscopy Data Bank, EMDB) determined by cryo-electron microscopy has quickly grown in recent years. With this rapid growth, it is critical to guarantee their quality. So far, map validation has primarily focused on the agreement between maps and models. From the image processing perspective, the validation has been mostly restricted to using two half-maps and the measurement of their internal consistency. In this article, we suggest that map validation can be taken much further from the point of view of image processing if 2D classes, particles, angles, coordinates, defoci, and micrographs are also provided. We present a progressive validation scheme that qualifies a result validation status from 0 to 5 and offers three optional qualifiers (A, W, and O) that can be added. The simplest validation state is 0, while the most complete would be 5AWO. This scheme has been implemented in a website https://biocomp.cnb.csic.es/EMValidationService/ to which reconstructed maps and their ESI can be uploaded., Unión Europea. Horizonte 2020, Ministerio de Ciencia e Innovación (MICINN)/FEDER, Ministerio de Ciencia e Innovación (MICINN), Comunidad de Madrid, Depto. de Óptica, Fac. de Ciencias Físicas, TRUE, pub
- Published
- 2022
5. Optimization problems in electron microscopy of single particles
- Author
-
Sorzano, C. O. S., Marabini, R., Pascual-Montano, A., Scheres, S. H. W., and Carazo, J. M.
- Published
- 2006
- Full Text
- View/download PDF
6. A pattern matching approach to the automatic selection of particles from low-contrast electron micrographs
- Author
-
Abrishami, V., Zaldívar-Peraza, A., de la Rosa-Trevín, J. M., Vargas, J., Otón, J., Marabini, R., Shkolnisky, Y., Carazo, J. M., and Sorzano, C. O. S.
- Published
- 2013
- Full Text
- View/download PDF
7. Automatic Classification of Biological Particles from Electron-microscopy Images Using Conventional and Genetic-algorithm Optimized Learning Vector Quantization
- Author
-
Merelo, J. J., Prieto, A., Morán, F., Marabini, R., and Carazo, J. M.
- Published
- 1998
- Full Text
- View/download PDF
8. Designing and executing scientific workflows with a programmable integrator
- Author
-
Chagoyen, M., Kurul, M. E., De-Alarcón, P. A., Carazo, J. M., and Gupta, A.
- Published
- 2004
9. Survey of the analysis of continuous conformational variability of biological macromolecules by electron microscopy
- Author
-
Ministerio de Economía y Competitividad (España), Comunidad de Madrid, Instituto de Salud Carlos III, National Institutes of Health (US), European Commission, Carazo, J. M [0000-0003-0788-8447], Sorzano, Carlos Óscar S. [0000-0002-9473-283X], Marabini, Roberto [0000-0001-7876-1684], Melero, Roberto [0000-0001-9467-9381], Sorzano, Carlos Óscar S., Jiménez, A., Vilas, J. L., Maluenda, D., Martínez, Marta, Ramírez-Aportela, Erney, Majtner, T., Segura, J., Sánchez-García, R., Caño, Laura del, Conesa Mingo, Pablo, Melero, Roberto, Jonic, Slavica, Vargas, Javier, Cazals, F., Freyberg, Z., Krieger, James, Bahar, Ivet, Marabini, Roberto, Carazo, José M., Ministerio de Economía y Competitividad (España), Comunidad de Madrid, Instituto de Salud Carlos III, National Institutes of Health (US), European Commission, Carazo, J. M [0000-0003-0788-8447], Sorzano, Carlos Óscar S. [0000-0002-9473-283X], Marabini, Roberto [0000-0001-7876-1684], Melero, Roberto [0000-0001-9467-9381], Sorzano, Carlos Óscar S., Jiménez, A., Vilas, J. L., Maluenda, D., Martínez, Marta, Ramírez-Aportela, Erney, Majtner, T., Segura, J., Sánchez-García, R., Caño, Laura del, Conesa Mingo, Pablo, Melero, Roberto, Jonic, Slavica, Vargas, Javier, Cazals, F., Freyberg, Z., Krieger, James, Bahar, Ivet, Marabini, Roberto, and Carazo, José M.
- Abstract
Single-particle analysis by electron microscopy is a well established technique for analyzing the three-dimensional structures of biological macromolecules. Besides its ability to produce high-resolution structures, it also provides insights into the dynamic behavior of the structures by elucidating their conformational variability. Here, the different image-processing methods currently available to study continuous conformational changes are reviewed.
- Published
- 2019
10. Photosystem II supercomplexes form pairs by functionally interacting across the stromal gap: implications for light harvesting and thylakoid dynamics
- Author
-
Albanese, P., Melero, R., Engel, B. D., Saracco, G., Carazo, J. M., and Pagliano, C
- Published
- 2018
11. 3DBIONOTES v3.0: crossing molecular and structural biology data with genomic variations
- Author
-
Segura, Joan, primary, Sanchez-Garcia, Ruben, additional, Sorzano, C O S, additional, and Carazo, J M, additional
- Published
- 2019
- Full Text
- View/download PDF
12. Survey of the analysis of continuous conformational variability of biological macromolecules by electron microscopy
- Author
-
Sorzano, C. O. S., primary, Jiménez, A., additional, Mota, J., additional, Vilas, J. L., additional, Maluenda, D., additional, Martínez, M., additional, Ramírez-Aportela, E., additional, Majtner, T., additional, Segura, J., additional, Sánchez-García, R., additional, Rancel, Y., additional, del Caño, L., additional, Conesa, P., additional, Melero, R., additional, Jonic, S., additional, Vargas, J., additional, Cazals, F., additional, Freyberg, Z., additional, Krieger, J., additional, Bahar, I., additional, Marabini, R., additional, and Carazo, J. M., additional
- Published
- 2019
- Full Text
- View/download PDF
13. BIPSPI: a method for the prediction of partner-specific protein–protein interfaces
- Author
-
Sanchez-Garcia, Ruben, primary, Sorzano, C O S, additional, Carazo, J M, additional, and Segura, Joan, additional
- Published
- 2018
- Full Text
- View/download PDF
14. Using Scipion for stream image processing at Cryo-EM facilities
- Author
-
Gomez-Blanco, J., de la Rosa-Trevin, J. M., Marabini, R., del Cano, L., Jimenez, A., Martinez, M., Melero, R., Majtner, T., Maluenda, D., Mota, J., Rancel, Y., Ramirez-Aportela, E., Vilas, J. L., Carroni, M., Fleischmann, S., Lindahl, Erik, Ashton, A. W., Basham, M., Clare, D. K., Savage, K., Siebert, C. A., Sharov, G. G., Sorzano, C. O. S., Conesa, P., Carazo, J. M., Gomez-Blanco, J., de la Rosa-Trevin, J. M., Marabini, R., del Cano, L., Jimenez, A., Martinez, M., Melero, R., Majtner, T., Maluenda, D., Mota, J., Rancel, Y., Ramirez-Aportela, E., Vilas, J. L., Carroni, M., Fleischmann, S., Lindahl, Erik, Ashton, A. W., Basham, M., Clare, D. K., Savage, K., Siebert, C. A., Sharov, G. G., Sorzano, C. O. S., Conesa, P., and Carazo, J. M.
- Abstract
Three dimensional electron microscopy is becoming a very data-intensive field in which vast amounts of experimental images are acquired at high speed. To manage such large-scale projects, we had previously developed a modular workflow system called Scipion (de la Rosa-Trevfn et al., 2016). We present here a major extension of Scipion that allows processing of EM images while the data is being acquired. This approach helps to detect problems at early stages, saves computing time and provides users with a detailed evaluation of the data quality before the acquisition is finished. At present, Scipion has been deployed and is in production mode in seven Cryo-EM facilities throughout the world., QC 20190118
- Published
- 2018
- Full Text
- View/download PDF
15. A Survey of the Use of Iterative Reconstruction Algorithms in Electron Microscopy
- Author
-
Sorzano, C. O. S., Vargas, J., Otón, J., de la Rosa-Trevín, J. M., Vilas, J. L., Kazemi, M., Melero, R., del Caño, L., Cuenca, J., Conesa, P., Gómez-Blanco, J., Marabini, R., and Carazo, J. M.
- Subjects
Article Subject - Abstract
One of the key steps in Electron Microscopy is the tomographic reconstruction of a three-dimensional (3D) map of the specimen being studied from a set of two-dimensional (2D) projections acquired at the microscope. This tomographic reconstruction may be performed with different reconstruction algorithms that can be grouped into several large families: direct Fourier inversion methods, back-projection methods, Radon methods, or iterative algorithms. In this review, we focus on the latter family of algorithms, explaining the mathematical rationale behind the different algorithms in this family as they have been introduced in the field of Electron Microscopy. We cover their use in Single Particle Analysis (SPA) as well as in Electron Tomography (ET).
- Published
- 2017
- Full Text
- View/download PDF
16. 3DBIONOTES v2.0: a web server for the automatic annotation of macromolecular structures
- Author
-
Segura, Joan, primary, Sanchez-Garcia, Ruben, additional, Martinez, Marta, additional, Cuenca-Alba, Jesus, additional, Tabas-Madrid, Daniel, additional, Sorzano, C O S, additional, and Carazo, J M, additional
- Published
- 2017
- Full Text
- View/download PDF
17. Quantitative analysis of 3D alignment quality: its impact on soft-validation, particle pruning and homogeneity analysis
- Author
-
Vargas, J., primary, Melero, R., additional, Gómez-Blanco, J., additional, Carazo, J. M., additional, and Sorzano, C. O. S., additional
- Published
- 2017
- Full Text
- View/download PDF
18. Organising multi-dimensional biological image information: The BioImage Database
- Author
-
Carazo, J. M., Stelzer, E. H. K., Engel, A., Fita, I., Henn, C., Machtynger, J., McNeil, P., Shotton, D. M., Chagoyen, M., de Alarcón, P. A., Fritsch, R., Heymann, J. B., Kalko, S., Pittet, J. J., Rodriguez-Tomé, P., Boudier, T., Carazo, J. M., Stelzer, E. H. K., Engel, A., Fita, I., Henn, C., Machtynger, J., McNeil, P., Shotton, D. M., Chagoyen, M., de Alarcón, P. A., Fritsch, R., Heymann, J. B., Kalko, S., Pittet, J. J., Rodriguez-Tomé, P., and Boudier, T.
- Abstract
Nowadays it is possible to unravel complex information at all levels of cellular organization by obtaining multi-dimensional image information. at the macromolecular level, three-dimensional (3D) electron microscopy, together with other techniques, is able to reach resolutions at the nanometer or subnanometer level. The information is delivered in the form of 3D volumes containing samples of a given function, for example, the electron density distribution within a given macromolecule. The same situation happens at the cellular level with the new forms of light microscopy, particularly confocal microscopy, all of which produce biological 3D volume information. Furthermore, it is possible to record sequences of images over time (videos), as well as sequences of volumes, bringing key information on the dynamics of living biological systems. It is in this context that work on bioimage started two years ago, and that its first version is now presented here. In essence, Bioimage is a database specifically designed to contain multi-dimensional images, perform queries and interactively work with the resulting multi-dimensional information on the World Wide Web, as well as accomplish the required cross-database links. Two sister home pages of bioimage can be accessed at http://www.bioimage.org and http://www-embl.bioimage.org
- Published
- 2017
19. BIPSPI: a method for the prediction of partner-specific protein–protein interfaces.
- Author
-
Sanchez-Garcia, Ruben, Sorzano, C O S, Carazo, J M, and Segura, Joan
- Subjects
PROTEIN-protein interactions ,CELL proliferation ,BIOCHEMICAL genetics ,AMINO acid sequence ,RESIDUE codes - Abstract
Motivation Protein–Protein Interactions (PPI) are essentials for most cellular processes and thus, unveiling how proteins interact is a crucial question that can be better understood by identifying which residues are responsible for the interaction. Computational approaches are orders of magnitude cheaper and faster than experimental ones, leading to proliferation of multiple methods aimed to predict which residues belong to the interface of an interaction. Results We present BIPSPI, a new machine learning-based method for the prediction of partner-specific PPI sites. Contrary to most binding site prediction methods, the proposed approach takes into account a pair of interacting proteins rather than a single one in order to predict partner-specific binding sites. BIPSPI has been trained employing sequence-based and structural features from both protein partners of each complex compiled in the Protein–Protein Docking Benchmark version 5.0 and in an additional set independently compiled. Also, a version trained only on sequences has been developed. The performance of our approach has been assessed by a leave-one-out cross-validation over different benchmarks, outperforming state-of-the-art methods. Availability and implementation BIPSPI web server is freely available at http://bipspi.cnb.csic.es. BIPSPI code is available at https://github.com/bioinsilico/BIPSPI. Docker image is available at https://hub.docker.com/r/bioinsilico/bipspi/. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
20. Particle alignment reliability in single particle electron cryomicroscopy: a general approach
- Author
-
Vargas, J., primary, Otón, J., additional, Marabini, R., additional, Carazo, J. M., additional, and Sorzano, C. O. S., additional
- Published
- 2016
- Full Text
- View/download PDF
21. OmicsHub Proteomics Software Tool
- Author
-
Lloret, T., Carmona Tamayo, J. L., Sierra Cantero, J. M., Carazo, J. M., Couto, E. González, and Yankilevich, P.
- Subjects
Poster Session Abstracts - Abstract
RP-6
- Published
- 2010
22. Cryo-EM and the elucidation of new macromolecular structures: Random Conical Tilt revisited
- Author
-
Sorzano, C. O. S., primary, Alcorlo, M., additional, de la Rosa-Trevín, J. M., additional, Melero, R., additional, Foche, I., additional, Zaldívar-Peraza, A., additional, del Cano, L., additional, Vargas, J., additional, Abrishami, V., additional, Otón, J., additional, Marabini, R., additional, and Carazo, J. M., additional
- Published
- 2015
- Full Text
- View/download PDF
23. Using neighborhood cohesiveness to infer interactions between protein domains
- Author
-
Segura, Joan, primary, Sorzano, C. O. S., additional, Cuenca-Alba, Jesus, additional, Aloy, Patrick, additional, and Carazo, J. M., additional
- Published
- 2015
- Full Text
- View/download PDF
24. Biclustering of gene expression data by non-smooth non-negative matrix factorization
- Author
-
Carmona Saez, P., Pascual Marqui, R. D., Tirado Fernández, José Francisco, Carazo, J. M., Pascual Montano, Alberto, Carmona Saez, P., Pascual Marqui, R. D., Tirado Fernández, José Francisco, Carazo, J. M., and Pascual Montano, Alberto
- Abstract
© 2006 Carmona-Saez et al; licensee BioMed Central Ltd. This work has been supported by the Spanish grants GR/SAL/0653/2004, CICYT BFU2004-00217/BMC, GEN2003-20235-c05-05, TIN2005-5619, PR27/05-13964-BSCH and a collaborative grant between the Spanish Research Council and the National Research Council of Canada (CSIC050402040003). The authors also thank the KEY Foundation for Brain-Mind Research in Zurich for partial economical support of this work. P.C.S. is the recipient of a fellowship from Comunidad de Madrid (CAM). A.P.M. acknowledges the support of the Spanish Ramón y Cajal program., Background: The extended use of microarray technologies has enabled the generation and accumulation of gene expression datasets that contain expression levels of thousands of genes across tens or hundreds of different experimental conditions. One of the major challenges in the analysis of such datasets is to discover local structures composed by sets of genes that show coherent expression patterns across subsets of experimental conditions. These patterns may provide clues about the main biological processes associated to different physiological states. Results: In this work we present a methodology able to cluster genes and conditions highly related in sub-portions of the data. Our approach is based on a new data mining technique, Non-smooth Non-Negative Matrix Factorization (nsNMF), able to identify localized patterns in large datasets. We assessed the potential of this methodology analyzing several synthetic datasets as well as two large and heterogeneous sets of gene expression profiles. In all cases the method was able to identify localized features related to sets of genes that show consistent expression patterns across subsets of experimental conditions. The uncovered structures showed a clear biological meaning in terms of relationships among functional annotations of genes and the phenotypes or physiological states of the associated conditions. Conclusion: The proposed approach can be a useful tool to analyze large and heterogeneous gene expression datasets. The method is able to identify complex relationships among genes and conditions that are difficult to identify by standard clustering algorithms., Spanish Research Council, National Research Council of Canada, KEY Foundation for Brain- Mind Research (Zurich), Comunidad de Madrid (CAM)., Spanish Ramón y Cajal program, Sección Deptal. de Arquitectura de Computadores y Automática (Físicas), Fac. de Ciencias Físicas, TRUE, pub
- Published
- 2006
25. bioNMF: a versatile tool for non-negative matrix factorization in biology
- Author
-
Pascual-Montano, A, Carmona-Saez, P, Chagoyen, M, Tirado, F, Carazo, J M, Pascual-Marqui, R D, Pascual-Montano, A, Carmona-Saez, P, Chagoyen, M, Tirado, F, Carazo, J M, and Pascual-Marqui, R D
- Abstract
BACKGROUND: In the Bioinformatics field, a great deal of interest has been given to Non-negative matrix factorization technique (NMF), due to its capability of providing new insights and relevant information about the complex latent relationships in experimental data sets. This method, and some of its variants, has been successfully applied to gene expression, sequence analysis, functional characterization of genes and text mining. Even if the interest on this technique by the bioinformatics community has been increased during the last few years, there are not many available simple standalone tools to specifically perform these types of data analysis in an integrated environment. RESULTS: In this work we propose a versatile and user-friendly tool that implements the NMF methodology in different analysis contexts to support some of the most important reported applications of this new methodology. This includes clustering and biclustering gene expression data, protein sequence analysis, text mining of biomedical literature and sample classification using gene expression. The tool, which is named bioNMF, also contains a user-friendly graphical interface to explore results in an interactive manner and facilitate in this way the exploratory data analysis process. CONCLUSION: bioNMF is a standalone versatile application which does not require any special installation or libraries. It can be used for most of the multiple applications proposed in the bioinformatics field or to support new research using this method. This tool is publicly available at http://www.dacya.ucm.es/apascual/bioNMF.
- Published
- 2006
26. Biclustering of gene expression data by Non-smooth Non-negative Matrix Factorization
- Author
-
Carmona-Saez, P, Pascual-Marqui, R D, Tirado, F, Carazo, J M, Pascual-Montano, A, Carmona-Saez, P, Pascual-Marqui, R D, Tirado, F, Carazo, J M, and Pascual-Montano, A
- Abstract
BACKGROUND: The extended use of microarray technologies has enabled the generation and accumulation of gene expression datasets that contain expression levels of thousands of genes across tens or hundreds of different experimental conditions. One of the major challenges in the analysis of such datasets is to discover local structures composed by sets of genes that show coherent expression patterns across subsets of experimental conditions. These patterns may provide clues about the main biological processes associated to different physiological states. RESULTS: In this work we present a methodology able to cluster genes and conditions highly related in sub-portions of the data. Our approach is based on a new data mining technique, Non-smooth Non-Negative Matrix Factorization (nsNMF), able to identify localized patterns in large datasets. We assessed the potential of this methodology analyzing several synthetic datasets as well as two large and heterogeneous sets of gene expression profiles. In all cases the method was able to identify localized features related to sets of genes that show consistent expression patterns across subsets of experimental conditions. The uncovered structures showed a clear biological meaning in terms of relationships among functional annotations of genes and the phenotypes or physiological states of the associated conditions. CONCLUSION: The proposed approach can be a useful tool to analyze large and heterogeneous gene expression datasets. The method is able to identify complex relationships among genes and conditions that are difficult to identify by standard clustering algorithms.
- Published
- 2006
27. Shack–Hartmann centroid detection using the spiral phase transform
- Author
-
Vargas, J., primary, Restrepo, R., additional, Estrada, J. C., additional, Sorzano, C. O. S., additional, Du, Yong-Zhao, additional, and Carazo, J. M., additional
- Published
- 2012
- Full Text
- View/download PDF
28. GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information
- Author
-
Nogales-Cadenas, R., primary, Carmona-Saez, P., additional, Vazquez, M., additional, Vicente, C., additional, Yang, X., additional, Tirado, F., additional, Carazo, J. M., additional, and Pascual-Montano, A., additional
- Published
- 2009
- Full Text
- View/download PDF
29. SENT: semantic features in text
- Author
-
Vazquez, M., primary, Carmona-Saez, P., additional, Nogales-Cadenas, R., additional, Chagoyen, M., additional, Tirado, F., additional, Carazo, J. M., additional, and Pascual-Montano, A., additional
- Published
- 2009
- Full Text
- View/download PDF
30. CentrosomeDB: a human centrosomal proteins database
- Author
-
Nogales-Cadenas, R., primary, Abascal, F., additional, Diez-Perez, J., additional, Carazo, J. M., additional, and Pascual-Montano, A., additional
- Published
- 2009
- Full Text
- View/download PDF
31. Organising multi-dimensional biological image information: The BioImage Database
- Author
-
Carazo, J. M., primary, Stelzer, E. H. K., additional, Engel, A., additional, Fita, I., additional, Henn, C., additional, Machtynger, J., additional, McNeil, P., additional, Shotton, D. M., additional, Chagoyen, M., additional, de Alarcon, P. A., additional, Fritsch, R., additional, Heymann, J. B., additional, Kalko, S., additional, Pittet, J. J., additional, Rodriguez-Tome, P., additional, and Boudier, T., additional
- Published
- 1999
- Full Text
- View/download PDF
32. Computational space reduction and parallelization of a new clustering approach for large groups of sequences.
- Author
-
Trelles, O, primary, Andrade, M A, additional, Valencia, A, additional, Zapata, E L, additional, and Carazo, J M, additional
- Published
- 1998
- Full Text
- View/download PDF
33. Self-organizing tree growing network for classifying amino acids.
- Author
-
Wang, H C, primary, Dopazo, J, additional, and Carazo, J M, additional
- Published
- 1998
- Full Text
- View/download PDF
34. New phylogenetic venues opened by a novel implementation of the DNAml algorithm.
- Author
-
Trelles, O, primary, Ceron, C, additional, Wang, H C, additional, Dopazo, J, additional, and Carazo, J M, additional
- Published
- 1998
- Full Text
- View/download PDF
35. Marker-free image registration of electron tomography tilt-series.
- Author
-
Sanchez Sorzano, Carlos Oscar, Messaoudi, Cédric, Eibauer, Matthias, Bilbao-Castro, J. R., Hegerl, R., Nickell, S., Marco, S., and Carazo, J. M.
- Subjects
TOMOGRAPHY ,MEDICAL radiography ,DIAGNOSTIC imaging ,ALGORITHMS ,BIOINFORMATICS - Abstract
Background: Tilt series are commonly used in electron tomography as a means of collecting three-dimensional information from two-dimensional projections. A common problem encountered is the projection alignment prior to 3D reconstruction. Current alignment techniques usually employ gold particles or image derived markers to correctly align the images. When these markers are not present, correlation between adjacent views is used to align them. However, sequential pairwise correlation is prone to bias and the resulting alignment is not always optimal. Results: In this paper we introduce an algorithm to find regions of the tilt series which can be tracked within a subseries of the tilt series. These regions act as landmarks allowing the determination of the alignment parameters. We show our results with synthetic data as well as experimental cryo electron tomography. Conclusion: Our algorithm is able to correctly align a single-tilt tomographic series without the help of fiducial markers thanks to the detection of thousands of small image patches that can be tracked over a short number of images in the series. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
36. Nonsmooth Nonnegative Matrix Factorization (nsNMF).
- Author
-
Pascual-Montano, Alberto, Carazo, J. M., Kochi, Kieko, Lehmann, Dietrich, and Pascual-Marqui, Roberto D.
- Subjects
- *
MATRICES (Mathematics) , *MULTIVARIATE analysis , *FACTORIZATION , *SMOOTHNESS of functions , *MATHEMATICAL functions , *VECTOR algebra - Abstract
We propose a novel nonnegative matrix factorization model that aims at finding localized, part-based, representations of nonnegative multivariate data items. Unlike the classical nonnegative matrix factorization (NMF) technique, this new model, denoted "nonsmooth nonnegative matrix factorization" (nsNMF), corresponds to the optimization of an unambiguous cost function designed to explicitly represent sparseness, in the form of nonsmoothness, which is controlled by a single parameter. In general, this method produces a set of basis and encoding vectors that are not only capable of representing the original data, but they also extract highly localized patterns, which generally lend themselves to improved interpretability. The properties of this new method are illustrated with several data sets. Comparisons to previously published methods show that the new nsNMF method has some advantages in keeping faithfulness to the data in the achieving a high degree of sparseness for both the estimated basis and the encoding vectors and in better interpretability of the factors. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
37. The RepA protein of plasmid RSF1010 is a replicative DNA helicase.
- Author
-
Scherzinger, E, Ziegelin, G, Bárcena, M, Carazo, J M, Lurz, R, and Lanka, E
- Abstract
The RepA protein of the mobilizable broad host range plasmid RSF1010 has a key function in its replication. RepA is one of the smallest known helicases. The protein forms a homohexamer of 29,896-Da subunits. A variety of methods were used to analyze the quaternary structure of RepA. Gel filtration and cross-linking experiments demonstrated the hexameric structure, which was confirmed by electron microscopy and image reconstruction. These results agree with recent data obtained from RepA crystals diffracting at 3.5-A resolution (Röleke, D., Hoier, H., Bartsch, C., Umbach, P., Scherzinger, E., Lurz, R., and Saenger, W. (1997) Acta Crystallogr. Sec. D 53, 213-216). The RepA helicase has 5' --> 3' polarity. As do most true replicative helicases, RepA prefers a tailed substrate with an unpaired 3'-tail mimicking a replication fork. Optimal unwinding activity was achieved at the remarkably low pH of 5.5. In the presence of Mg2+ (Mn2+) ions, the RepA activity is fueled by ATP, dATP, GTP, and dGTP and less efficiently by CTP and dCTP. UTP and dTTP are poor effectors. Nonhydrolyzable ATP analogues, ADP, and pyrophosphate inhibit the helicase activity, whereas inorganic phosphate does not. The presence of Escherichia coli single-stranded DNA-binding protein stimulates unwinding at physiological pH 2-3-fold, whereas the RSF1010 replicon-specific primase, RepB' protein, has no effect, either in the presence or in the absence of single-stranded DNA-binding protein.
- Published
- 1997
38. Organising multi-dimensional biological image information: The BioImage Database
- Author
-
Carazo, J. M., Stelzer, E. H. K., Engel, A., Fita, I., Henn, C., Machtynger, J., McNeil, P., Shotton, D. M., Chagoyen, M., de Alarcón, P. A., Fritsch, R., Heymann, J. B., Kalko, S., Pittet, J. J., Rodriguez-Tomé, P., Boudier, T., Carazo, J. M., Stelzer, E. H. K., Engel, A., Fita, I., Henn, C., Machtynger, J., McNeil, P., Shotton, D. M., Chagoyen, M., de Alarcón, P. A., Fritsch, R., Heymann, J. B., Kalko, S., Pittet, J. J., Rodriguez-Tomé, P., and Boudier, T.
- Abstract
Nowadays it is possible to unravel complex information at all levels of cellular organization by obtaining multi-dimensional image information. at the macromolecular level, three-dimensional (3D) electron microscopy, together with other techniques, is able to reach resolutions at the nanometer or subnanometer level. The information is delivered in the form of 3D volumes containing samples of a given function, for example, the electron density distribution within a given macromolecule. The same situation happens at the cellular level with the new forms of light microscopy, particularly confocal microscopy, all of which produce biological 3D volume information. Furthermore, it is possible to record sequences of images over time (videos), as well as sequences of volumes, bringing key information on the dynamics of living biological systems. It is in this context that work on bioimage started two years ago, and that its first version is now presented here. In essence, Bioimage is a database specifically designed to contain multi-dimensional images, perform queries and interactively work with the resulting multi-dimensional information on the World Wide Web, as well as accomplish the required cross-database links. Two sister home pages of bioimage can be accessed at http://www.bioimage.org and http://www-embl.bioimage.org
39. Survey of the analysis of continuous conformational variability of biological macromolecules by electron microscopy
- Author
-
Zachary Freyberg, Roberto Marabini, Carlos Oscar S. Sorzano, F Cazals, A. Jimenez, David Maluenda, Y. Rancel, Rubén J. Sánchez-García, María Jesús Martínez, L. del Cano, Javier Mota, Roberto Melero, José María Carazo, José Luis Vilas, Pablo Conesa, Joan Segura, Slavica Jonic, Javier Vargas, Ivet Bahar, James Krieger, Tomáš Majtner, Erney Ramírez-Aportela, Centro Nacional de Biotecnología [Madrid] (CNB-CSIC), Biocomputing Unit [Madrid], Institut de minéralogie, de physique des matériaux et de cosmochimie (IMPMC), Muséum national d'Histoire naturelle (MNHN)-Institut de recherche pour le développement [IRD] : UR206-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), McGill University = Université McGill [Montréal, Canada], Algorithms, Biology, Structure (ABS), Inria Sophia Antipolis - Méditerranée (CRISAM), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), University of Pittsburgh (PITT), Pennsylvania Commonwealth System of Higher Education (PCSHE), Universidad Autonoma de Madrid (UAM), ANR-19-P3IA-0002,3IA@cote d'azur,3IA Côte d'Azur(2019), UAM. Departamento de Ingeniería Informática, Tratamiento de Señales Biomédicas (ING EPS-011), Ministerio de Economía y Competitividad (España), Comunidad de Madrid, Instituto de Salud Carlos III, National Institutes of Health (US), European Commission, Carazo, J. M, Sorzano, Carlos Óscar S., Marabini, Roberto, Melero, Roberto, Consejo Superior de Investigaciones Científicas [Madrid] (CSIC), Universidad Autónoma de Madrid (UAM), Carazo, J. M [0000-0003-0788-8447], Sorzano, Carlos Óscar S. [0000-0002-9473-283X], Marabini, Roberto [0000-0001-7876-1684], and Melero, Roberto [0000-0001-9467-9381]
- Subjects
Materials science ,Macromolecular Substances ,Molecular Conformation ,Biophysics ,Single particle analysis ,Electrons ,Molecular Dynamics Simulation ,[INFO.INFO-CG]Computer Science [cs]/Computational Geometry [cs.CG] ,Biochemistry ,law.invention ,03 medical and health sciences ,Imaging, Three-Dimensional ,Image processing ,Structural Biology ,Normal mode ,law ,Continuous heterogeneity ,Image Processing, Computer-Assisted ,Genetics ,Electron microscopy ,Humans ,ComputingMilieux_MISCELLANEOUS ,030304 developmental biology ,Informática ,Principal Component Analysis ,0303 health sciences ,030302 biochemistry & molecular biology ,Proteins ,Single-particle analysis ,Condensed Matter Physics ,3. Good health ,Microscopy, Electron ,Thermodynamics ,Normal-mode analysis ,Electron microscope ,Cryo-Electron Microscopy ,Algorithms ,Macromolecule - Abstract
Single-particle analysis by electron microscopy is a well established technique for analyzing the three-dimensional structures of biological macromolecules. Besides its ability to produce high-resolution structures, it also provides insights into the dynamic behavior of the structures by elucidating their conformational variability. Here, the different image-processing methods currently available to study continuous conformational changes are reviewed., Spanish Ministry of Economy and Competitiveness through grants BIO2013-44647-R and BIO2016-76400-R (AEI/FEDER, UE), Comunidad Autonoma de Madrid through grant S2017/BMD-3817, Instituto de Salud Carlos III through grants PT13/0001/0009 and PT17/0009/0010, the European Union (EU) and Horizon 2020 through West-Life (EINFRA-2015-1, Proposal 675858), CORBEL (INFRADEV-1-2014-1, Proposal 654248), ELIXIR–EXCELERATE (INFRADEV-3-2015, Proposal 676559), iNEXT (INFRAIA-1-2014-2015, Proposal 653706), EOSCpilot (INFRADEV-04-2016, Proposal 739563) and the National Institutes of Health (P41GM 103712) (IB).
- Published
- 2019
- Full Text
- View/download PDF
40. Real-space heterogeneous reconstruction, refinement, and disentanglement of CryoEM conformational states with HetSIREN.
- Author
-
Herreros D, Mata CP, Noddings C, Irene D, Krieger J, Agard DA, Tsai MD, Sorzano COS, and Carazo JM
- Abstract
Single-particle analysis by Cryo-electron microscopy (CryoEM) provides direct access to the conformation of each macromolecule. However, the image's signal-to-noise ratio is low, and some form of classification is usually performed at the image processing level to allow structural modeling. Classical classification methods imply the existence of a discrete number of structural conformations. However, new heterogeneity algorithms introduce a novel reconstruction paradigm, where every state is represented by a lower number of particles, potentially just one, allowing the estimation of conformational landscapes representing the different structural states a biomolecule explores. In this work, we present a novel deep learning-based method called HetSIREN. HetSIREN can fully reconstruct or refine a CryoEM volume in real space based on the structural information summarized in a conformational latent space. The unique characteristics that set HetSIREN apart start with the definition of the approach as a real space-based only method, a fact that allows spatially focused analysis, but also the introduction of a novel network architecture specifically designed to make use of meta-sinusoidal activations, with proven high analytics capacities. Continuing with innovations, HetSIREN can also refine the pose parameters of the images at the same time that it conditions the network with prior information/constraints on the maps, such as Total Variation and L 1 denoising, ultimately yielding cleaner volumes with high-quality structural features. Finally, but very importantly, HetSIREN addresses one of the most confusing issues in heterogeneity analysis, as it is the fact that real structural heterogeneity estimation is entangled with pose estimation (and to a lesser extent with CTF estimation), in this way, HetSIREN introduces a novel encoding architecture able to decouple pose and CTF information from the conformational landscape, resulting in more accurate and interpretable conformational latent spaces. We present results on computer-simulated data, public data from EMPIAR, and data from experimental systems currently being studied in our laboratories. An important finding is the sensitivity of the structure and dynamics of the SARS-CoV-2 Spike protein on the storage temperature., Competing Interests: Competing Interests The authors declare that they have no competing interests.
- Published
- 2024
- Full Text
- View/download PDF
41. Improvements on marker-free images alignment for electron tomography.
- Author
-
Sorzano COS, de Isidro-Gómez F, Fernández-Giménez E, Herreros D, Marco S, Carazo JM, and Messaoudi C
- Abstract
Electron tomography is a technique to obtain three-dimensional structural information of samples. However, the technique is limited by shifts occurring during acquisition that need to be corrected before the reconstruction process. In 2009, we proposed an approach for post-acquisition alignment of tilt series images. This approach was marker-free, based on patch tracking and integrated in free software. Here, we present improvements to the method to make it more reliable, stable and accurate. In addition, we modified the image formation model underlying the alignment procedure to include different deformations occurring during acquisition. We propose a new way to correct these computed deformations to obtain reconstructions with reduced artifacts. The new approach has demonstrated to improve the quality of the final 3D reconstruction, giving access to better defined structures for different transmission electron tomography methods: resin embedded STEM-tomography and cryo-TEM tomography. The method is freely available in TomoJ software., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2020 The Author(s).)
- Published
- 2020
- Full Text
- View/download PDF
42. Measurement of local resolution in electron tomography.
- Author
-
Vilas JL, Oton J, Messaoudi C, Melero R, Conesa P, Ramirez-Aportela E, Mota J, Martinez M, Jimenez A, Marabini R, Carazo JM, Vargas J, and Sorzano COS
- Abstract
Resolution (global and local) is one of the most reported metrics of quality measurement in Single Particle Analysis (SPA). However, in electron tomography, the situation is different and its computation is not straightforward. Typically, resolution estimation is global and, therefore, reduces the assessment of a whole tomogram to a single number. However, it is known that tomogram quality is spatially variant. Still, up to our knowledge, a method to estimate local quality metrics in tomography is lacking. This work introduces MonoTomo , a method developed to estimate locally in a tomogram the highest reliable frequency component, expressed as a form of local resolution. The fundamentals lie in a local analysis of the density map via monogenic signals, which, in analogy to MonoRes , allows for local estimations. Results with experimental data show that the local resolution range that MonoTomo casts agrees with reported resolution values for experimental data sets, with the advantage of providing a local estimation. A range of applications of MonoTomo are suggested for further exploration., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2019 Published by Elsevier Inc.)
- Published
- 2019
- Full Text
- View/download PDF
43. The DnaB.DnaC complex: a structure based on dimers assembled around an occluded channel.
- Author
-
Bárcena M, Ruiz T, Donate LE, Brown SE, Dixon NE, Radermacher M, and Carazo JM
- Subjects
- Cryoelectron Microscopy, DNA Replication, Dimerization, DnaB Helicases, Escherichia coli genetics, Image Processing, Computer-Assisted, Models, Structural, Protein Structure, Quaternary, Protein Structure, Tertiary, Recombinant Proteins ultrastructure, Bacterial Proteins ultrastructure, DNA Helicases ultrastructure, Escherichia coli Proteins
- Abstract
Replicative helicases are motor proteins that unwind DNA at replication forks. Escherichia coli DnaB is the best characterized member of this family of enzymes. We present the 26 A resolution three-dimensional structure of the DnaB hexamer in complex with its loading partner, DnaC, obtained from cryo-electron microscopy. Analysis of the volume brings insight into the elaborate way the two proteins interact, and provides a structural basis for control of the symmetry state and inactivation of the helicase by DnaC. The complex is arranged on the basis of interactions among DnaC and DnaB dimers. DnaC monomers are observed for the first time to arrange as three dumb-bell-shaped dimers that interlock into one of the faces of the helicase. This could be responsible for the freezing of DnaB in a C(3) architecture by its loading partner. The central channel of the helicase is almost occluded near the end opposite to DnaC, such that even single-stranded DNA could not pass through. We propose that the DnaB N-terminal domain is located at this face.
- Published
- 2001
- Full Text
- View/download PDF
44. Sequence-related protein export NTPases encoded by the conjugative transfer region of RP4 and by the cag pathogenicity island of Helicobacter pylori share similar hexameric ring structures.
- Author
-
Krause S, Barcena M, Pansegrau W, Lurz R, Carazo JM, and Lanka E
- Subjects
- Acid Anhydride Hydrolases chemistry, Acid Anhydride Hydrolases metabolism, Amino Acid Sequence, Bacterial Proteins chemistry, Biological Transport, Conjugation, Genetic, Helicobacter pylori pathogenicity, Membrane Proteins chemistry, Molecular Sequence Data, Nucleoside-Triphosphatase, Protein Conformation, Sequence Homology, Amino Acid, Acid Anhydride Hydrolases genetics, Bacterial Proteins genetics, Fimbriae Proteins, Genes, Bacterial, Helicobacter pylori genetics, Membrane Proteins genetics
- Abstract
RP4 TrbB, an essential component of the conjugative transfer apparatus of the broad-host-range plasmid RP4, is a member of the PulE protein superfamily involved in multicomponent machineries transporting macromolecules across the bacterial envelope. PulE-like proteins share several well conserved motifs, most notable a nucleoside triphosphate binding motif (P-loop). Helicobacter pylori HP0525 also belongs to the PulE superfamily and is encoded by the pathogenicity island cag, involved in the inflammatory response of infected gastric epithelial cells in mammals. The native molecular masses of TrbB and HP0525 as determined by gel filtration and glycerol gradient centrifugation suggested a homohexameric structure in the presence of ATP and Mg(2+). In the absence of nucleotides and bivalent cations, TrbB behaved as a tetramer whereas the hexameric state of HP0525 remained unaffected. Electron microscopy and image processing demonstrated that TrbB and HP0525 form ring-shaped complexes (diameter: 12 nm) with a central region (diameter: 3 nm) of low electron density when incubated in the presence of ATP and Mg(2+). However, the TrbB average image appeared to be more elliptical with strong twofold rotational symmetry whereas HP0525 complexes are regular hexagons. Six well defined triangle-shaped areas of high electron density were distinguishable in both cases. Covalent crosslinking of TrbB suggests that the hexameric ring is composed from a trimer of dimers, because only dimeric, tetrameric, and hexameric species were detectable. The toroidal structure of TrbB and HP0525 suggests that both proteins catalyze a repetitive process, most probably translocating a cognate substrate across the inner membrane.
- Published
- 2000
- Full Text
- View/download PDF
45. Large T-antigen double hexamers imaged at the simian virus 40 origin of replication.
- Author
-
Valle M, Gruss C, Halmer L, Carazo JM, and Donate LE
- Subjects
- Antigens, Viral, Tumor chemistry, Binding Sites genetics, DNA Replication genetics, DNA, Viral chemistry, Protein Binding, Antigens, Viral, Tumor genetics, DNA, Viral genetics, Replication Origin genetics, Simian virus 40 genetics
- Abstract
The initial step of simian virus 40 (SV40) DNA replication is the binding of the large tumor antigen (T-Ag) to the SV40 core origin. In the presence of Mg(2+) and ATP, T-Ag forms a double-hexamer complex covering the complete core origin. By using electron microscopy and negative staining, we visualized for the first time T-Ag double hexamers bound to the SV40 origin. Image processing of side views of these nucleoprotein complexes revealed bilobed particles 24 nm long and 8 to 12 nm wide, which indicates that the two T-Ag hexamers are oriented head to head. Taking into account all of the biochemical data known on the T-Ag-DNA interactions at the replication origin, we present a model in which the DNA passes through the inner channel of both hexamers. In addition, we describe a previously undetected structural domain of the T-Ag hexamer and thereby amend the previously published dimensions of the T-Ag hexamer. This domain we have determined to be the DNA-binding domain of T-Ag.
- Published
- 2000
- Full Text
- View/download PDF
46. The three-dimensional structure of a DNA translocating machine at 10 A resolution.
- Author
-
Valpuesta JM, Fernández JJ, Carazo JM, and Carrascosa JL
- Subjects
- Bacillus Phages ultrastructure, Capsid physiology, Capsid ultrastructure, Crystallography, X-Ray, Image Processing, Computer-Assisted, Microscopy, Electron, Models, Molecular, Morphogenesis, Recombinant Fusion Proteins chemistry, Recombinant Fusion Proteins ultrastructure, Rotation, Bacillus Phages chemistry, Capsid chemistry, Capsid Proteins, DNA, Viral metabolism, Protein Conformation
- Abstract
Background: Head-tail connectors are viral substructures that are very important in the viral morphogenetic cycle, having roles in the formation of the precursor capsid (prohead), DNA packaging, tail binding to the mature head and in the infection process. Structural information on the connector would, therefore, help us to understand how this structure is related to a multiplicity of functions., Results: Recombinant bacteriophage phi29 connectors have been crystallized in two-dimensional aggregates. An average projection image and a three-dimensional map have been obtained at 8 A and 10 A resolution, respectively, from untilted and tilted images of vitrified specimens of the two-dimensional crystals. The average projection image reveals a central mass surrounding a channel with 12 appendages protruding from the central mass. The three-dimensional map reveals a wide domain surrounded by 12 appendages that interact with the prohead vertex, and a narrow domain that interacts with the bacteriophage tail. At the junction of the two domains, 12 smaller appendages are visualized. A channel runs along the axis of the connector structure and is sufficiently wide to allow a double-stranded DNA molecule to pass through., Conclusions: The propeller-like structure of the phi29 connector strengthens the notion of the connector rotating during DNA packaging. The groove formed by the two lanes of large and small appendages may act as a rail to prevent the liberation of the connector from the prohead vertex during rotation.
- Published
- 1999
- Full Text
- View/download PDF
47. Self-organizing tree-growing network for the classification of protein sequences.
- Author
-
Wang HC, Dopazo J, de la Fraga LG, Zhu YP, and Carazo JM
- Subjects
- Algorithms, Cytochrome c Group chemistry, Decision Trees, Hemoglobins chemistry, Interleukins chemistry, Receptors, Interleukin chemistry, Sequence Alignment, Software Design, Triose-Phosphate Isomerase chemistry, Phylogeny, Proteins chemistry, Proteins classification, Software
- Abstract
The self-organizing tree algorithm (SOTA) was recently introduced to construct phylogenetic trees from biological sequences, based on the principles of Kohonen's self-organizing maps and on Fritzke's growing cell structures. SOTA is designed in such a way that the generation of new nodes can be stopped when the sequences assigned to a node are already above a certain similarity threshold. In this way a phylogenetic tree resolved at a high taxonomic level can be obtained. This capability is especially useful to classify sets of diversified sequences. SOTA was originally designed to analyze pre-aligned sequences. It is now adapted to be able to analyze patterns associated to the frequency of residues along a sequence, such as protein dipeptide composition and other n-gram compositions. In this work we show that the algorithm applied to these data is able to not only successfully construct phylogenetic trees of protein families, such as cytochrome c, triosephophate isomerase, and hemoglobin alpha chains, but also classify very diversified sequence data sets, such as a mixture of interleukins and their receptors.
- Published
- 1998
- Full Text
- View/download PDF
48. Three-dimensional reconstructions from cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its loading partner DnaC.
- Author
-
San Martin C, Radermacher M, Wolpensinger B, Engel A, Miles CS, Dixon NE, and Carazo JM
- Subjects
- Bacterial Proteins ultrastructure, DNA Helicases ultrastructure, Dimerization, DnaB Helicases, Freezing, Image Processing, Computer-Assisted, Microscopy, Electron, Protein Conformation, Bacterial Proteins chemistry, DNA Helicases chemistry, Escherichia coli enzymology, Escherichia coli Proteins
- Abstract
Background: DNA helicases play a fundamental role in all aspects of nucleic acid metabolism and defects in these enzymes have been implicated in a number of inherited human disorders. DnaB is the major replicative DNA helicase in Escherichia coli and has been used as a model system for studying the structure and function of hexameric helicases. The native protein is a hexamer of identical subunits, which in solution forms a complex with six molecules of the loading protein DnaC. DnaB is delivered from this complex onto the DNA template, with the subsequent release of DnaC. We report here the structures of the DnaB helicase hexamer and its complex with DnaC under a defined set of experimental conditions, as determined by three-dimensional cryoelectron microscopy. It was hoped that the structures would provide insight into the mechanisms of helicase activity., Results: The DnaB structure reveals that six DnaB monomers assemble as three asymmetric dimers to form a polar, ring-like hexamer. The hexamer has two faces, one displaying threefold and the other sixfold symmetry. The six DnaC protomers bind tightly to the sixfold face of the DnaB hexamer. This is the first report of a three-dimensional structure of a helicase obtained using cryoelectron microscopy, and the first report of the structure of a helicase in complex with a loading protein., Conclusions: The structures of the DnaB helicase and its complex with DnaC reveal some interesting structural features relevant to helicase function and to the assembly of the two-protein complex. The results presented here provide a basis for a more complete understanding of the structure and function of these important proteins.
- Published
- 1998
- Full Text
- View/download PDF
49. Phylogenetic reconstruction using an unsupervised growing neural network that adopts the topology of a phylogenetic tree.
- Author
-
Dopazo J and Carazo JM
- Subjects
- Algorithms, Computer Simulation, Humans, Nucleic Acids genetics, Proteins genetics, RNA-Binding Proteins classification, Neural Networks, Computer, Nucleic Acids classification, Phylogeny, Proteins classification, Ribosomal Proteins
- Abstract
We propose a new type of unsupervised, growing, self-organizing neural network that expands itself by following the taxonomic relationships that exist among the sequences being classified. The binary tree topology of this neutral network, contrary to other more classical neural network topologies, permits an efficient classification of sequences. The growing nature of this procedure allows to stop it at the desired taxonomic level without the necessity of waiting until a complete phylogenetic tree is produced. This novel approach presents a number of other interesting properties, such as a time for convergence which is, approximately, a lineal function of the number of sequences. Computer simulation and a real example show that the algorithm accurately finds the phylogenetic tree that relates the data. All this makes the neural network presented here an excellent tool for phylogenetic analysis of a large number of sequences.
- Published
- 1997
- Full Text
- View/download PDF
50. An image-processing approach to dotplots: an X-Window-based program for interactive analysis of dotplots derived from sequence and structural data.
- Author
-
Trelles-Salazar O, Zapata EL, Dopazo J, Coulson AF, and Carazo JM
- Subjects
- Evaluation Studies as Topic, Data Interpretation, Statistical, Image Processing, Computer-Assisted, Molecular Structure, Sequence Analysis statistics & numerical data, Software
- Abstract
We present an approach to the study of the relationships between biological sequences and structures applying image analysis methods to dotplots. We introduce a set of analytical tools based on different types of digital image-processing filters that are new within the context of dotplots. We have reformulated some of the usual approaches in dotplot analysis as mathematical operations on images within the framework of mathematical morphology. An X-Window-based implementation of this new approach has been developed and is available by anonymous FTP.
- Published
- 1995
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.