1. Identification of constrained sequence elements across 239 primate genomes
- Author
-
Natural Environment Research Council (UK), UK Research and Innovation, National Human Genome Research Institute (US), Fundación la Caixa, Vienna Science and Technology Fund, European Commission, Conselho Nacional de Desenvolvimento Científico e Tecnológico (Brasil), Fonds de la Recherche Scientifique (Fédération Wallonie-Bruxelles), Fundação de Amparo à Pesquisa do Estado do Amazonas, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Brasil), German Research Foundation, Ministry of Science and Technology of Vietnam, Agencia Estatal de Investigación (España), Generalitat de Catalunya, Natural Sciences and Engineering Research Council of Canada, Canada Research Chairs, Wenner-Gren Foundation, Leakey Foundation, National Science Foundation (US), National Geographic Society, National Institute on Aging (US), Swedish Research Council, National Research Foundation Singapore, European Research Council, Ministerio de Ciencia e Innovación (España), Ministerio de Ciencia, Innovación y Universidades (España), Kuderna, Lukas F. K. [0000-0002-9992-9295], Kuhlwilm, Martin [0000-0002-0115-1797], Valenzuela, Alejandro [0000-0001-6120-6246], Juan, David [0000-0003-1912-9667], Lizano, Esther [0000-0003-3304-9807], Navarro, Arcadi [0000-0003-2162-8246], Marqués-Bonet, Tomàs [0000-0002-5597-3075], Kuderna, Lukas F. K., Ulirsch, Jacob C., Rashid, Sabrina, Ameen, Mohamed, Sundaram, Laksshman, Hickey, Glenn, Cox, Anthony J., Gao, Hong, Kumar, Arvind, Aguet, Francois, Christmas, Matthew J., Clawson, Hiram, Haeussler, Maximilian, Janiak, Mareike C., Kuhlwilm, Martin, Orkin, Joseph D., Bataillon, Thomas, Manu, Shivakumara, Valenzuela, Alejandro, Bergman, Juraj, Rousselle, Marjolaine, Silva, Felipe Ennes, Agueda, Lidia, Blanc, Julie, Gut, Marta, Vries, Dorien de, Goodhead, Ian, Harris, R. Alan, Raveendran, Muthuswamy, Jensen, Axel, Chuma, Idriss S., Horvath, Julie E., Hvilsom, Christina, Juan, David, Frandsen, Peter, Schraiber, Joshua G., Melo, Fabiano R. de, Bertuol, Fabrício, Byrne, Hazel, Sampaio, Iracilda, Farias, Izeni, Valsecchi, João, Messias, Malu, Silva, Maria N. F. da, Trivedi, Mihir, Rossi, Rogerio, Hrbek, Tomas, Andriaholinirina, Nicole, Rabarivola, Clément J., Zaramody, Alphonse, Jolly, Clifford J., Phillips-Conroy, Jane, Wilkerson, Gregory, Abee, Christian, Simmons, Joe H., Fernandez-Duque, Eduardo, Kanthaswamy, Sree, Shiferaw, Fekadu, Wu, Dongdong, Zhou, Long, Shao, Yong, Zhang, Guojie, Keyyu, Julius D., Knauf, Sascha, Le, Minh D., Lizano, Esther, Merker, Stefan, Navarro, Arcadi, Nadler, Tilo, Khor, Chiea Chuen, Lee, Jessica, Tan, Patrick, Lim, Weng Khong, Kitchener, Andrew C., Zinner, Dietmar, Gut, Ivo, Melin, Amanda D., Guschanski, Katerina, Schierup, Mikkel Heide, Beck, Robin M. D., Karakikes, Ioannis, Wang, Kevin C., Umapathy, Govindhaswamy, Roos, Christian, Boubli, Jean P., Siepel, Adam, Kundaje, Anshul, Paten, Benedict, Lindblad-Toh, Kerstin, Rogers, Jeffrey, Marqués-Bonet, Tomàs, Farh, Kyle Kai-How, Natural Environment Research Council (UK), UK Research and Innovation, National Human Genome Research Institute (US), Fundación la Caixa, Vienna Science and Technology Fund, European Commission, Conselho Nacional de Desenvolvimento Científico e Tecnológico (Brasil), Fonds de la Recherche Scientifique (Fédération Wallonie-Bruxelles), Fundação de Amparo à Pesquisa do Estado do Amazonas, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Brasil), German Research Foundation, Ministry of Science and Technology of Vietnam, Agencia Estatal de Investigación (España), Generalitat de Catalunya, Natural Sciences and Engineering Research Council of Canada, Canada Research Chairs, Wenner-Gren Foundation, Leakey Foundation, National Science Foundation (US), National Geographic Society, National Institute on Aging (US), Swedish Research Council, National Research Foundation Singapore, European Research Council, Ministerio de Ciencia e Innovación (España), Ministerio de Ciencia, Innovación y Universidades (España), Kuderna, Lukas F. K. [0000-0002-9992-9295], Kuhlwilm, Martin [0000-0002-0115-1797], Valenzuela, Alejandro [0000-0001-6120-6246], Juan, David [0000-0003-1912-9667], Lizano, Esther [0000-0003-3304-9807], Navarro, Arcadi [0000-0003-2162-8246], Marqués-Bonet, Tomàs [0000-0002-5597-3075], Kuderna, Lukas F. K., Ulirsch, Jacob C., Rashid, Sabrina, Ameen, Mohamed, Sundaram, Laksshman, Hickey, Glenn, Cox, Anthony J., Gao, Hong, Kumar, Arvind, Aguet, Francois, Christmas, Matthew J., Clawson, Hiram, Haeussler, Maximilian, Janiak, Mareike C., Kuhlwilm, Martin, Orkin, Joseph D., Bataillon, Thomas, Manu, Shivakumara, Valenzuela, Alejandro, Bergman, Juraj, Rousselle, Marjolaine, Silva, Felipe Ennes, Agueda, Lidia, Blanc, Julie, Gut, Marta, Vries, Dorien de, Goodhead, Ian, Harris, R. Alan, Raveendran, Muthuswamy, Jensen, Axel, Chuma, Idriss S., Horvath, Julie E., Hvilsom, Christina, Juan, David, Frandsen, Peter, Schraiber, Joshua G., Melo, Fabiano R. de, Bertuol, Fabrício, Byrne, Hazel, Sampaio, Iracilda, Farias, Izeni, Valsecchi, João, Messias, Malu, Silva, Maria N. F. da, Trivedi, Mihir, Rossi, Rogerio, Hrbek, Tomas, Andriaholinirina, Nicole, Rabarivola, Clément J., Zaramody, Alphonse, Jolly, Clifford J., Phillips-Conroy, Jane, Wilkerson, Gregory, Abee, Christian, Simmons, Joe H., Fernandez-Duque, Eduardo, Kanthaswamy, Sree, Shiferaw, Fekadu, Wu, Dongdong, Zhou, Long, Shao, Yong, Zhang, Guojie, Keyyu, Julius D., Knauf, Sascha, Le, Minh D., Lizano, Esther, Merker, Stefan, Navarro, Arcadi, Nadler, Tilo, Khor, Chiea Chuen, Lee, Jessica, Tan, Patrick, Lim, Weng Khong, Kitchener, Andrew C., Zinner, Dietmar, Gut, Ivo, Melin, Amanda D., Guschanski, Katerina, Schierup, Mikkel Heide, Beck, Robin M. D., Karakikes, Ioannis, Wang, Kevin C., Umapathy, Govindhaswamy, Roos, Christian, Boubli, Jean P., Siepel, Adam, Kundaje, Anshul, Paten, Benedict, Lindblad-Toh, Kerstin, Rogers, Jeffrey, Marqués-Bonet, Tomàs, and Farh, Kyle Kai-How
- Abstract
Noncoding DNA is central to our understanding of human gene regulation and complex diseases1,2, and measuring the evolutionary sequence constraint can establish the functional relevance of putative regulatory elements in the human genome3,4,5,6,7,8,9. Identifying the genomic elements that have become constrained specifically in primates has been hampered by the faster evolution of noncoding DNA compared to protein-coding DNA10, the relatively short timescales separating primate species11, and the previously limited availability of whole-genome sequences12. Here we construct a whole-genome alignment of 239 species, representing nearly half of all extant species in the primate order. Using this resource, we identified human regulatory elements that are under selective constraint across primates and other mammals at a 5% false discovery rate. We detected 111,318 DNase I hypersensitivity sites and 267,410 transcription factor binding sites that are constrained specifically in primates but not across other placental mammals and validate their cis-regulatory effects on gene expression. These regulatory elements are enriched for human genetic variants that affect gene expression and complex traits and diseases. Our results highlight the important role of recent evolution in regulatory sequence elements differentiating primates, including humans, from other placental mammals.
- Published
- 2024