1. Exhaustive variant interaction analysis using multifactor dimensionality reduction
- Author
-
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Barcelona Supercomputing Center, Universitat Politècnica de Catalunya. CROMAI - Computing Resources Orchestration and Management for AI, Gómez Sánchez, Gonzalo, Alonso Parrilla, Lorena, Pérez Elena, Miguel Ángel, Morán, Ignasi, Torrents Arenales, David, Berral García, Josep Lluís, Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Barcelona Supercomputing Center, Universitat Politècnica de Catalunya. CROMAI - Computing Resources Orchestration and Management for AI, Gómez Sánchez, Gonzalo, Alonso Parrilla, Lorena, Pérez Elena, Miguel Ángel, Morán, Ignasi, Torrents Arenales, David, and Berral García, Josep Lluís
- Abstract
One of the main goals of human genetics is to understand the connections between genomic variation and the predisposition to develop a complex disorder. These disease–variant associations are usually studied in a single independent manner, disregarding the possible effect derived from the interaction between genomic variants. In particular, in a background of complex diseases, these interactions can be directly linked to the disorder and may play an important role in disease development. Although their study has been suggested to help complete the understanding of the genetic bases of complex diseases, this still represents a big challenge due to large computing demands. Here, we take advantage of high-performance computing technologies to tackle this problem by using a combination of machine learning methods and statistical approaches. As a result, we created a containerized framework that uses multifactor dimensionality reduction (MDR) to detect pairs of variants associated with type 2 diabetes (T2D). This methodology was tested on the Northwestern University NUgene project cohort using a dataset of 1,883,192 variant pairs with a certain degree of association with T2D. Out of the pairs studied, we identified 104 significant pairs: two of which exhibit a potential functional relationship with T2D. These results place the proposed MDR method as a valid, efficient, and portable solution to study variant interaction in real reduced genomic datasets., This work was partially financed by the European Commission (EU-HORIZON NEARDATA GA.101092644) and by the Universitat Politècnica de Catalunya (45-FPIUPC2018); it was also partially financed by Generalitat de Catalunya (AGAUR) under grant agreement 2021-SGR-00478; it was also partially financed by the Spanish Ministry of Science (MICINN), the Research State Agency (AEI), and European Regional Development Funds (ERDF/FEDER) under grant agreement PID2021-126248OB-I00, MCIN/AEI/10.13039/501100011033/FEDER, UE., Peer Reviewed, Postprint (published version)
- Published
- 2024