205 results on '"Emrich, Scott J."'
Search Results
2. Synonymous codon substitutions modulate transcription and translation of a divergent upstream gene by modulating antisense RNA production.
- Author
-
Rodriguez, Anabel, Diehl, Jacob D., Wright, Gabriel S., Bonar, Christopher D., Lundgren, Taylor J., Moss, McKenze J., Jun Li, Milenkovic, Tijana, Huber, Paul W., Champion, Matthew M., Emrich, Scott J., and Clark, Patricia L.
- Subjects
GENE expression ,ANTISENSE RNA ,GENETIC transcription ,NUCLEOTIDE sequence ,AMINO acid sequence - Abstract
Synonymous codons were originally viewed as interchangeable, with no phenotypic consequences. However, substantial evidence has now demonstrated that synonymous substitutions can perturb a variety of gene expression and protein homeostasis mechanisms, including translational efficiency, translational fidelity, and cotranslational folding of the encoded protein. To date, most studies of synonymous codon-derived perturbations have focused on effects within a single gene. Here, we show that synonymous codon substitutions made far within the coding sequence of Escherichia coli plasmid-encoded chloramphenicol acetyltransferase (cat) can significantly increase expression of the divergent upstream tetracycline resistance gene, tetR. In four out of nine synonymously recoded cat sequences tested, expression of the upstream tetR gene was significantly elevated due to transcription of a long antisense RNA (asRNA) originating from a transcription start site within cat. Surprisingly, transcription of this asRNA readily bypassed the native tet transcriptional repression mechanism. Even more surprisingly, accumulation of the TetR protein correlated with the level of asRNA, rather than total tetR RNA. These effects of synonymous codon substitutions on transcription and translation of a neighboring gene suggest that synonymous codon usage in bacteria may be under selection to both preserve the amino acid sequence of the encoded gene and avoid DNA sequence elements that can significantly perturb expression of neighboring genes. Avoiding such sequences may be especially important in plasmids and prokaryotic genomes, where genes and regulatory elements are often densely packed. Similar considerations may apply to the design of genetic circuits for synthetic biology applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Publisher Correction: The genome of the stable fly, Stomoxys calcitrans, reveals potential mechanisms underlying reproduction, host interactions, and novel targets for pest control
- Author
-
Olafson, Pia U., Aksoy, Serap, Attardo, Geoffrey M., Buckmeier, Greta, Chen, Xiaoting, Coates, Craig J., Davis, Megan, Dykema, Justin, Emrich, Scott J., Friedrich, Markus, Holmes, Christopher J., Ioannidis, Panagiotis, Jansen, Evan N., Jennings, Emily C., Lawson, Daniel, Martinson, Ellen O., Maslen, Gareth L., Meisel, Richard P., Murphy, Terence D., Nayduch, Dana, Nelson, David R., Oyen, Kennan J., Raszick, Tyler J., Ribeiro, José M. C., Robertson, Hugh M., Rosendale, Andrew J., Sackton, Timothy B., Saelao, Perot, Swiger, Sonja L., Sze, Sing-Hoi, Tarone, Aaron M., Taylor, David B., Warren, Wesley C., Waterhouse, Robert M., Weirauch, Matthew T., Werren, John H., Wilson, Richard K., Zdobnov, Evgeny M., and Benoit, Joshua B.
- Published
- 2021
- Full Text
- View/download PDF
4. The genome of the stable fly, Stomoxys calcitrans, reveals potential mechanisms underlying reproduction, host interactions, and novel targets for pest control
- Author
-
Olafson, Pia U., Aksoy, Serap, Attardo, Geoffrey M., Buckmeier, Greta, Chen, Xiaoting, Coates, Craig J., Davis, Megan, Dykema, Justin, Emrich, Scott J., Friedrich, Markus, Holmes, Christopher J., Ioannidis, Panagiotis, Jansen, Evan N., Jennings, Emily C., Lawson, Daniel, Martinson, Ellen O., Maslen, Gareth L., Meisel, Richard P., Murphy, Terence D., Nayduch, Dana, Nelson, David R., Oyen, Kennan J., Raszick, Tyler J., Ribeiro, José M. C., Robertson, Hugh M., Rosendale, Andrew J., Sackton, Timothy B., Saelao, Perot, Swiger, Sonja L., Sze, Sing-Hoi, Tarone, Aaron M., Taylor, David B., Warren, Wesley C., Waterhouse, Robert M., Weirauch, Matthew T., Werren, John H., Wilson, Richard K., Zdobnov, Evgeny M., and Benoit, Joshua B.
- Published
- 2021
- Full Text
- View/download PDF
5. PeakMatcher facilitates updated Aedes aegypti embryonic cis-regulatory element map
- Author
-
Nowling, Ronald J., Behura, Susanta K., Halfon, Marc S., Emrich, Scott J., and Duman-Scheel, Molly
- Published
- 2021
- Full Text
- View/download PDF
6. Whole genome comparisons reveal panmixia among fall armyworm (Spodoptera frugiperda) from diverse locations
- Author
-
Schlum, Katrina A., Lamour, Kurt, de Bortoli, Caroline Placidi, Banerjee, Rahul, Meagher, Robert, Pereira, Eliseu, Murua, Maria Gabriela, Sword, Gregory A., Tessnow, Ashley E., Viteri Dillon, Diego, Linares Ramirez, Angela M., Akutse, Komivi S., Schmidt-Jeffris, Rebecca, Huang, Fangneng, Reisig, Dominic, Emrich, Scott J., and Jurat-Fuentes, Juan Luis
- Published
- 2021
- Full Text
- View/download PDF
7. The power and promise of genetic mapping from Plasmodium falciparum crosses utilizing human liver-chimeric mice
- Author
-
Button-Simons, Katrina A., Kumar, Sudhir, Carmago, Nelly, Haile, Meseret T., Jett, Catherine, Checkley, Lisa A., Kennedy, Spencer Y., Pinapati, Richard S., Shoue, Douglas A., McDew-White, Marina, Li, Xue, Nosten, François H., Kappe, Stefan H., Anderson, Timothy J. C., Romero-Severson, Jeanne, Ferdig, Michael T., Emrich, Scott J., Vaughan, Ashley M., and Cheeseman, Ian H.
- Published
- 2021
- Full Text
- View/download PDF
8. Climate-mediated hybrid zone movement revealed with genomics, museum collection, and simulation modeling
- Author
-
Ryan, Sean F., Deines, Jillian M., Scriber, J. Mark, Pfrender, Michael E., Jones, Stuart E., Emrich, Scott J., and Hellmann, Jessica J.
- Published
- 2018
9. Author Correction: GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison
- Author
-
Faisal, Fazle E., Newaz, Khalique, Chaney, Julie L., Li, Jun, Emrich, Scott J., Clark, Patricia L., and Milenković, Tijana
- Published
- 2020
- Full Text
- View/download PDF
10. Radical remodeling of the Y chromosome in a recent radiation of malaria mosquitoes
- Author
-
Hall, Andrew Brantley, Papathanos, Philippos-Aris, Sharma, Atashi, Cheng, Changde, Akbari, Omar S., Assour, Lauren, Bergman, Nicholas H., Cagnetti, Alessia, Crisanti, Andrea, Dottorini, Tania, Fiorentini, Elisa, Galizi, Roberto, Hnath, Jonathan, Jiang, Xiaofang, Koren, Sergey, Nolan, Tony, Radune, Diane, Sharakhova, Maria V., Steele, Aaron, Timoshevskiy, Vladimir A., Windbichler, Nikolai, Zhang, Simo, Hahn, Matthew W., Phillippy, Adam M., Emrich, Scott J., Sharakhov, Igor V., Tu, Zhijian Jake, and Besansky, Nora J.
- Published
- 2016
11. Evolutionary superscaffolding and chromosome anchoring to improve Anopheles genome assemblies
- Author
-
Waterhouse, Robert M., Aganezov, Sergey, Anselmetti, Yoann, Lee, Jiyoung, Ruzzante, Livio, Reijnders, Maarten J. M. F., Feron, Romain, Bérard, Sèverine, George, Phillip, Hahn, Matthew W., Howell, Paul I., Kamali, Maryam, Koren, Sergey, Lawson, Daniel, Maslen, Gareth, Peery, Ashley, Phillippy, Adam M., Sharakhova, Maria V., Tannier, Eric, Unger, Maria F., Zhang, Simo V., Alekseyev, Max A., Besansky, Nora J., Chauve, Cedric, Emrich, Scott J., and Sharakhov, Igor V.
- Published
- 2020
- Full Text
- View/download PDF
12. MIReAD, a minimum information standard for reporting arthropod abundance data
- Author
-
Rund, Samuel S. C., Braak, Kyle, Cator, Lauren, Copas, Kyle, Emrich, Scott J., Giraldo-Calderón, Gloria I., Johansson, Michael A., Heydari, Naveed, Hobern, Donald, Kelly, Sarah A., Lawson, Daniel, Lord, Cynthia, MacCallum, Robert M., Roche, Dominique G., Ryan, Sadie J., Schigel, Dmitry, Vandegrift, Kurt, Watts, Matthew, Zaspel, Jennifer M., and Pawar, Samraat
- Published
- 2019
- Full Text
- View/download PDF
13. Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection
- Author
-
Mesquita, Rafael D., Vionette-Amaral, Raquel J., Lowenberger, Carl, Rivera-Pomar, Rolando, Monteiro, Fernando A., Minx, Patrick, Spieth, John, Carvalho, A. Bernardo, Panzera, Francisco, Lawson, Daniel, Torres, André Q., Ribeiro, Jose M. C., Sorgine, Marcos H. F., Waterhouse, Robert M., Montague, Michael J., Abad-Franch, Fernando, Alves-Bezerra, Michele, Amaral, Laurence R., Araujo, Helena M., Araujo, Ricardo N., Aravind, L., Atella, Georgia C., Azambuja, Patricia, Berni, Mateus, Bittencourt-Cunha, Paula R., Braz, Gloria R. C., Calderón-Fernández, Gustavo, Carareto, Claudia M. A., Christensen, Mikkel B., Costa, Igor R., Costa, Samara G., Dansa, Marilvia, Daumas-Filho, Carlos R. O., De-Paula, Iron F., Dias, Felipe A., Dimopoulos, George, Emrich, Scott J., Esponda-Behrens, Natalia, Fampa, Patricia, Fernandez-Medina, Rita D., da Fonseca, Rodrigo N., Fontenele, Marcio, Fronick, Catrina, Fulton, Lucinda A., Gandara, Ana Caroline, Garcia, Eloi S., Genta, Fernando A., Giraldo-Calderón, Gloria I., Gomes, Bruno, Gondim, Katia C., Granzotto, Adriana, Guarneri, Alessandra A., Guigó, Roderic, Harry, Myriam, Hughes, Daniel S. T., Jablonka, Willy, Jacquin-Joly, Emmanuelle, Juárez, M. Patricia, Koerich, Leonardo B., Lange, Angela B., Latorre-Estivalis, José Manuel, Lavore, Andrés, Lawrence, Gena G., Lazoski, Cristiano, Lazzari, Claudio R., Lopes, Raphael R., Lorenzo, Marcelo G., Lugon, Magda D., Majerowicz, David, Marcet, Paula L., Mariotti, Marco, Masuda, Hatisaburo, Megy, Karine, Melo, Ana C. A., Missirlis, Fanis, Mota, Theo, Noriega, Fernando G., Nouzova, Marcela, Nunes, Rodrigo D., Oliveira, Raquel L. L., Oliveira-Silveira, Gilbert, Ons, Sheila, Orchard, Ian, Pagola, Lucia, Paiva-Silva, Gabriela O., Pascual, Agustina, Pavan, Marcio G., Pedrini, Nicolás, Peixoto, Alexandre A., Pereira, Marcos H., Pike, Andrew, Polycarpo, Carla, Prosdocimi, Francisco, Ribeiro-Rodrigues, Rodrigo, Robertson, Hugh M., Salerno, Ana Paula, Salmon, Didier, Santesmasses, Didac, Schama, Renata, Seabra-Junior, Eloy S., Silva-Cardoso, Livia, Silva-Neto, Mario A. C., Souza-Gomes, Matheus, Sterkel, Marcos, Taracena, Mabel L., Tojo, Marta, Tu, Zhijian Jake, Tubio, Jose M. C., Ursic-Bedoya, Raul, Venancio, Thiago M., Walter-Nuno, Ana Beatriz, Wilson, Derek, Warren, Wesley C., Wilson, Richard K., Huebner, Erwin, Dotson, Ellen M., and Oliveira, Pedro L.
- Published
- 2015
14. Extensive introgression in a malaria vector species complex revealed by phylogenomics
- Author
-
Fontaine, Michael C., Pease, James B., Steele, Aaron, Waterhouse, Robert M., Neafsey, Daniel E., Sharakhov, Igor V., Jiang, Xiaofang, Hall, Andrew B., Catteruccia, Flaminia, Kakani, Evdoxia, Mitchell, Sara N., Wu, Yi-Chieh, Smith, Hilary A., Love, R. Rebecca, Lawniczak, Mara K., Slotman, Michel A., Emrich, Scott J., Hahn, Matthew W., and Besansky, Nora J.
- Published
- 2015
15. Reduced-representation sequencing identifies small effective population sizes of Anopheles gambiae in the north-western Lake Victoria basin, Uganda
- Author
-
Wiltshire, Rachel M., Bergey, Christina M., Kayondo, Jonathan K., Birungi, Josephine, Mukwaya, Louis G., Emrich, Scott J., Besansky, Nora J., and Collins, Frank H.
- Published
- 2018
- Full Text
- View/download PDF
16. HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
- Author
-
Choudhury, Olivia, Chakrabarty, Ankush, and Emrich, Scott J.
- Published
- 2018
- Full Text
- View/download PDF
17. Screening for resistance alleles to Cry1 proteins through targeted sequencing in the native and invasive range of Spodoptera frugiperda (Lepidoptera: Noctuidae).
- Author
-
Tandy, Peter, Lamour, Kurt, Bortoli, Caroline Placidi de, Nagoshi, Rodney, Emrich, Scott J, and Jurat-Fuentes, Juan Luis
- Subjects
FALL armyworm ,NOCTUIDAE ,ALLELES ,LEPIDOPTERA ,TRANSGENIC plants - Abstract
The fall armyworm, Spodoptera frugiperda (J. E. Smith), is a highly polyphagous pest native to the tropical Americas that has recently spread to become a global super-pest threatening food and fiber production. Transgenic crops producing insecticidal Cry and Vip3Aa proteins from Bacillus thuringiensis (Bt) are used for control of this pest in its native range. The evolution of practical resistance represents the greatest threat to sustainability of this technology and its potential efficacy in the S. frugiperda invasive range. Monitoring for resistance is vital to management approaches delaying S. frugiperda resistance to Bt crops. DNA-based resistance screening provides higher sensitivity and cost-effectiveness than currently used bioassay-based monitoring. So far, practical S. frugiperda resistance to Bt corn-producing Cry1F has been genetically linked to mutations in the SfABCC2 gene, providing a model to develop and test monitoring tools. In this study, we performed targeted SfABCC2 sequencing followed by Sanger sequencing to confirm the detection of known and candidate resistance alleles to Cry1F corn in field-collected S. frugiperda from continental USA, Puerto Rico, Africa (Ghana, Togo, and South Africa), and Southeast Asia (Myanmar). Results confirm that the distribution of a previously characterized resistance allele (SfABCC2mut) is limited to Puerto Rico and identify 2 new candidate SfABCC2 alleles for resistance to Cry1F, one of them potentially spreading along the S. frugiperda migratory route in North America. No candidate resistance alleles were found in samples from the invasive S. frugiperda range. These results provide support for the potential use of targeted sequencing in Bt resistance monitoring programs. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison
- Author
-
Faisal, Fazle E., Newaz, Khalique, Chaney, Julie L., Li, Jun, Emrich, Scott J., Clark, Patricia L., and Milenković, Tijana
- Published
- 2017
- Full Text
- View/download PDF
19. The B73 Maize Genome: Complexity, Diversity, and Dynamics
- Author
-
Schnable, Patrick S., Ware, Doreen, Fulton, Robert S., Stein, Joshua C., Wei, Fusheng, Pasternak, Shiran, Liang, Chengzhi, Zhang, Jianwei, Fulton, Lucinda, Graves, Tina A., Minx, Patrick, Reily, Amy Denise, Courtney, Laura, Kruchowski, Scott S., Tomlinson, Chad, Strong, Cindy, Delehaunty, Kim, Fronick, Catrina, Courtney, Bill, Rock, Susan M., Belter, Eddie, Du, Feiyu, Kim, Kyung, Abbott, Rachel M., Cotton, Marc, Levy, Andy, Marchetto, Pamela, Ochoa, Kerri, Jackson, Stephanie M., Gillam, Barbara, Chen, Weizu, Yan, Le, Higginbotham, Jamey, Cardenas, Marco, Waligorski, Jason, Applebaum, Elizabeth, Phelps, Lindsey, Falcone, Jason, Kanchi, Krishna, Thane, Thynn, Scimone, Adam, Thane, Nay, Henke, Jessica, Wang, Tom, Ruppert, Jessica, Shah, Neha, Rotter, Kelsi, Hodges, Jennifer, Ingenthron, Elizabeth, Cordes, Matt, Kohlberg, Sara, Sgro, Jennifer, Delgado, Brandon, Mead, Kelly, Chinwalla, Asif, Leonard, Shawn, Crouse, Kevin, Collura, Kristi, Kudrna, Dave, Currie, Jennifer, He, Ruifeng, Angelova, Angelina, Rajasekar, Shanmugam, Mueller, Teri, Lomeli, Rene, Scara, Gabriel, Ko, Ara, Delaney, Krista, Wissotski, Marina, Lopez, Georgina, Campos, David, Braidotti, Michele, Ashley, Elizabeth, Golser, Wolfgang, Kim, HyeRan, Lee, Seunghee, Lin, Jinke, Dujmic, Zeljko, Kim, Woojin, Talag, Jayson, Zuccolo, Andrea, Fan, Chuanzhu, Sebastian, Aswathy, Kramer, Melissa, Spiegel, Lori, Nascimento, Lidia, Zutavern, Theresa, Miller, Beth, Ambroise, Claude, Muller, Stephanie, Spooner, Will, Narechania, Apurva, Ren, Liya, Wei, Sharon, Kumari, Sunita, Faga, Ben, Levy, Michael J., McMahan, Linda, Van Buren, Peter, Vaughn, Matthew W., Ying, Kai, Yeh, Cheng-Ting, Emrich, Scott J., Jia, Yi, Kalyanaraman, Ananth, Hsia, An-Ping, Barbazuk, W. Brad, Baucom, Regina S., Brutnell, Thomas P., Carpita, Nicholas C., Chaparro, Cristian, Chia, Jer-Ming, Deragon, Jean-Marc, Estill, James C., Fu, Yan, Jeddeloh, Jeffrey A., Han, Yujun, Lee, Hyeran, Li, Pinghua, Lisch, Damon R., Liu, Sanzhen, Liu, Zhijie, Nagel, Dawn Holligan, McCann, Maureen C., SanMiguel, Phillip, Myers, Alan M., Nettleton, Dan, Nguyen, John, Penning, Bryan W., Ponnala, Lalit, Schneider, Kevin L., Schwartz, David C., Sharma, Anupma, Soderlund, Carol, Springer, Nathan M., Sun, Qi, Wang, Hao, Waterman, Michael, Westerman, Richard, Wolfgruber, Thomas K., Yang, Lixing, Yu, Yeisoo, Zhang, Lifang, Zhou, Shiguo, Zhu, Qihui, Bennetzen, Jeffrey L., Dawe, R. Kelly, Jiang, Jiming, Jiang, Ning, Presting, Gernot G., Wessler, Susan R., Aluru, Srinivas, Martienssen, Robert A., Clifton, Sandra W., McCombie, W. Richard, Wing, Rod A., and Wilson, Richard K.
- Published
- 2009
- Full Text
- View/download PDF
20. Genomic analysis of two phlebotomine sand fly vectors of leishmania from the new and old World.
- Author
-
Labbé, Frédéric, Abdeladhim, Maha, Abrudan, Jenica, Araki, Alejandra Saori, Araujo, Ricardo N., Arensburger, Peter, Benoit, Joshua B., Brazil, Reginaldo Pecanha, Bruno, Rafaela V., Bueno da Silva Rivas, Gustavo, Carvalho de Abreu, Vinicius, Charamis, Jason, Coutinho-Abreu, Iliano V., da Costa-Latgé, Samara G., Darby, Alistair, Dillon, Viv M., Emrich, Scott J., Fernandez-Medina, Daniela, Figueiredo Gontijo, Nelder, and Flanley, Catherine M.
- Subjects
SAND flies ,LEISHMANIA mexicana ,GENOMICS ,MOBILE genetic elements ,LEISHMANIA ,NEGLECTED diseases ,LYME disease ,BURULI ulcer - Abstract
Phlebotomine sand flies are of global significance as important vectors of human disease, transmitting bacterial, viral, and protozoan pathogens, including the kinetoplastid parasites of the genus Leishmania, the causative agents of devastating diseases collectively termed leishmaniasis. More than 40 pathogenic Leishmania species are transmitted to humans by approximately 35 sand fly species in 98 countries with hundreds of millions of people at risk around the world. No approved efficacious vaccine exists for leishmaniasis and available therapeutic drugs are either toxic and/or expensive, or the parasites are becoming resistant to the more recently developed drugs. Therefore, sand fly and/or reservoir control are currently the most effective strategies to break transmission. To better understand the biology of sand flies, including the mechanisms involved in their vectorial capacity, insecticide resistance, and population structures we sequenced the genomes of two geographically widespread and important sand fly vector species: Phlebotomus papatasi, a vector of Leishmania parasites that cause cutaneous leishmaniasis, (distributed in Europe, the Middle East and North Africa) and Lutzomyia longipalpis, a vector of Leishmania parasites that cause visceral leishmaniasis (distributed across Central and South America). We categorized and curated genes involved in processes important to their roles as disease vectors, including chemosensation, blood feeding, circadian rhythm, immunity, and detoxification, as well as mobile genetic elements. We also defined gene orthology and observed micro-synteny among the genomes. Finally, we present the genetic diversity and population structure of these species in their respective geographical areas. These genomes will be a foundation on which to base future efforts to prevent vector-borne transmission of Leishmania parasites. Author summary: The leishmaniases are a group of neglected tropical diseases caused by protist parasites from the Genus Leishmania. Different Leishmania species present a wide clinical profile, ranging from mild, often self-resolving cutaneous lesions that can lead to protective immunity, to severe metastatic mucosal disease, to visceral disease that is ultimately fatal. Leishmania parasites are transmitted by the bites of sand flies, and as no approved human vaccine exists, available drugs are toxic and/or expensive and parasite resistance to them is emerging, new dual control strategies to combat these diseases must be developed, combining interventions on human infections and integrated sand fly population management. Effective vector control requires a comprehensive understanding of the biology of sand flies. To this end, we sequenced and annotated the genomes of two sand fly species that are important leishmaniasis vectors from the Old and New Worlds. These genomes allow us to better understand, at the genetic level, processes important in the vector biology of these species, such as finding hosts, blood-feeding, immunity, and detoxification. These genomic resources highlight the driving forces of evolution of two major Leishmania vectors and provide foundations for future research on how to better prevent leishmaniasis by control of the sand fly vectors. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
21. Standing genetic variation and chromosome differences drove rapid ecotype formation in a major malaria mosquito.
- Author
-
Small, Scott T., Costantini, Carlo, Sagnon, N'Fale, Guelbeogo, Moussa W., Emrich, Scott J., Kern, Andrew D., Fontaine, Michael C., and Besansky, Nora J.
- Subjects
GENETIC variation ,CHROMOSOME inversions ,CHROMOSOMES ,GENETIC recombination ,MALARIA - Abstract
Species distributed across heterogeneous environments often evolve locally adapted ecotypes, but understanding of the genetic mechanisms involved in their formation and maintenance in the face of gene flow is incomplete. In Burkina Faso, the major African malaria mosquito Anopheles funestus comprises two strictly sympatric and morphologically indistinguishable yet karyotypically differentiated forms reported to differ in ecology and behavior. However, knowledge of the genetic basis and environmental determinants of An. funestus diversification was impeded by lack of modern genomic resources. Here, we applied deep whole-genome sequencing and analysis to test the hypothesis that these two forms are ecotypes differentially adapted to breeding in natural swamps versus irrigated rice fields. We demonstrate genomewide differentiation despite extensive microsympatry, synchronicity, and ongoing hybridization. Demographic inference supports a split only ~1,300 y ago, closely following the massive expansion of domesticated African rice cultivation ~1,850 y ago. Regions of highest divergence, concentrated in chromosomal inversions, were under selection during lineage splitting, consistent with local adaptation. The origin of nearly all variations implicated in adaptation, including chromosomal inversions, substantially predates the ecotype split, suggesting that rapid adaptation was fueled mainly by standing genetic variation. Sharp inversion frequency differences likely facilitated adaptive divergence between ecotypes by suppressing recombination between opposing chromosomal orientations of the two ecotypes, while permitting free recombination within the structurally monomorphic rice ecotype. Our results align with growing evidence from diverse taxa that rapid ecological diversification can arise from evolutionarily old structural genetic variants that modify genetic recombination. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
22. VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases
- Author
-
Giraldo-Calderón, Gloria I., Emrich, Scott J., MacCallum, Robert M., Maslen, Gareth, Dialynas, Emmanuel, Topalis, Pantelis, Ho, Nicholas, Gesing, Sandra, Madey, Gregory, Collins, Frank H., and Lawson, Daniel
- Published
- 2015
- Full Text
- View/download PDF
23. MOSQUITO GENOMICS: Extensive introgression in a malaria vector species complex revealed by phylogenomics
- Author
-
Fontaine, Michael C., Pease, James B., Steele, Aaron, Waterhouse, Robert M., Neafsey, Daniel E., Sharakhov, Igor V., Jiang, Xiaofang, Hall, Andrew B., Catteruccia, Flaminia, Kakani, Evdoxia, Mitchell, Sara N., Wu, Yi-Chieh, Smith, Hilary A., Love, Rebecca R., Lawniczak, Mara K., Slotman, Michel A., Emrich, Scott J., Hahn, Matthew W., and Besansky, Nora J.
- Published
- 2015
- Full Text
- View/download PDF
24. High-throughput genetic mapping of mutants via quantitative Single nucleotide polymorphism typing
- Author
-
Liu, Sanzhen, Chen, Hsin D., Makarevitch, Irina, Shirmer, Rebecca, Emrich, Scott J., Dietrich, Charles R., Barbazuk, W. Brad, Springer, Nathan M., and Schnable, Patrick S.
- Subjects
DNA sequencing -- Research ,Nucleotide sequencing -- Research ,Gene mutations -- Research ,Genetic polymorphisms -- Research ,Population genetics -- Research ,Biological sciences - Published
- 2010
25. Freshwater bacterial lifestyles inferred from comparative genomics
- Author
-
Livermore, Joshua A., Emrich, Scott J., Tan, John, and Jones, Stuart E.
- Published
- 2014
- Full Text
- View/download PDF
26. Multi‐layer sequential network analysis improves protein 3D structural classification.
- Author
-
Newaz, Khalique, Piland, Jacob, Clark, Patricia L., Emrich, Scott J., Li, Jun, and Milenković, Tijana
- Abstract
Protein structural classification (PSC) is a supervised problem of assigning proteins into pre‐defined structural (e.g., CATH or SCOPe) classes based on the proteins' sequence or 3D structural features. We recently proposed PSC approaches that model protein 3D structures as protein structure networks (PSNs) and analyze PSN‐based protein features, which performed better than or comparable to state‐of‐the‐art sequence or other 3D structure‐based PSC approaches. However, existing PSN‐based PSC approaches model the whole 3D structure of a protein as a static (i.e., single‐layer) PSN. Because folding of a protein is a dynamic process, where some parts (i.e., sub‐structures) of a protein fold before others, modeling the 3D structure of a protein as a PSN that captures the sub‐structures might further help improve the existing PSC performance. Here, we propose to model 3D structures of proteins as multi‐layer sequential PSNs that approximate 3D sub‐structures of proteins, with the hypothesis that this will improve upon the current state‐of‐the‐art PSC approaches that are based on single‐layer PSNs (and thus upon the existing state‐of‐the‐art sequence and other 3D structural approaches). Indeed, we confirm this on 72 datasets spanning ~44 000 CATH and SCOPe protein domains. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
27. Nearly identical paralogs: implications for maize (Zea mays L.) genome evolution
- Author
-
Emrich, Scott J., Li, Li, Wen, Tsui-Jung, Yandeau-Nelson, Marna D., Fu, Yan, Guo, Ling, Chou, Hui-Hsien, Aluru, Srinivas, Ashlock, Daniel A., and Schnable, Patrick S.
- Subjects
Genomics -- Research ,Corn -- Research ,Corn -- Genetic aspects ,Biological sciences - Abstract
As an ancient segmental tetraploid, the maize (Zea mays L.) genome contains large numbers of paralogs that are expected to have diverged by a minimum of 10% over time. Nearly/dentical paralogs (NIPs) are defined as paralogous genes that exhibit [greater than or equal to] 98% identity. Sequence analyses of the 'gene space' of the maize inbred line B73 genome, coupled with wet lab validation, have revealed that, conservatively, at least ~1% of maize genes have a NIP, a rate substantially higher than that in Arabidopsis. In most instances, both members of maize NIP pairs are expressed and are therefore at least potentially functional. Of evolutionary significance, members of many NIP families also exhibit differential expression. The finding that some families of maize NIPs are closely linked genetically while others are genetically unlinked is consistent with multiple modes of origin. NIPs provide a mechanism for the maize genome to circumvent the inherent limitation that diploid genomes can carry at most two 'alleles' per 'locus.' As such, NIPs may have played important roles during the evolution and domestication of maize and may contribute to the success of long-term selection experiments in this important crop species.
- Published
- 2007
28. Evaluation of five ab initio gene prediction programs for the discovery of maize genes
- Author
-
Yao, Hong, Guo, Ling, Fu, Yan, Borsuk, Lisa A., Wen, Tsui-Jung, Skibbe, David S., Cui, Xiangqin, Scheffler, Brian E., Cao, Jun, Emrich, Scott J., Ashlock, Daniel A., and Schnable, Patrick S.
- Published
- 2005
- Full Text
- View/download PDF
29. Quality assessment of maize assembled genomic islands (MAGIs) and large-scale experimental verification of predicted genes
- Author
-
Fu, Yan, Emrich, Scott J., Guo, Ling, Wen, Tsui-Jung, Ashlock, Daniel A., Aluru, Srinivas, and Schnable, Patrick S.
- Subjects
Corn -- Genetic aspects ,Corn -- Research ,Science and technology - Abstract
Recent sequencing efforts have targeted the gene-rich regions of the maize (Zea mays L.) genome. We report the release of an improved assembly of maize assembled genomic islands (MAGIs). The 114,173 resulting contigs have been subjected to computational and physical quality assessments. Comparisons to the sequences of maize bacterial artificial chromosomes suggest that at least 97% (160 of 165) of MAGIs are correctly assembled. Because the rates at which junction-testing PCR primers for genomic survey sequences (90-92%) amplify genomic DNA are not significantly different from those of control primers ([approximately equal to] 91%), we conclude that a very high percentage of genic MAGIs accurately reflect the structure of the maize genome. EST alignments, ab initio gene prediction, and sequence similarity searches of the MAGIs are available at the Iowa State University MAGI web site. This assembly contains 46,688 ab initio predicted genes. The expression of almost half (628 of 1,369) of a sample of the predicted genes that lack expression evidence was validated by RT-PCR. Our analyses suggest that the maize genome contains between [approximately equal to] 33,000 and [approximately equal to] 54,000 expressed genes. Approximately 5% (32 of 628) of the maize transcripts discovered do not have detectable paralogs among maize ESTs or detectable homologs from other species in the GenBank NR nucleotide/protein database. Analyses therefore suggest that this assembly of the maize genome contains approximately 350 previously uncharacterized expressed genes. We hypothesize that these 'orphans' evolved quickly during maize evolution and/or domestication. assembly validation | gene prediction | maize genome assembly | nearly identical paralog
- Published
- 2005
30. VectorBase: improvements to a bioinformatics resource for invertebrate vector genomics
- Author
-
Megy, Karine, Emrich, Scott J., Lawson, Daniel, Campbell, David, Dialynas, Emmanuel, Hughes, Daniel S.T., Koscielny, Gautier, Louis, Christos, MacCallum, Robert M., Redmond, Seth N., Sheehan, Andrew, Topalis, Pantelis, and Wilson, Derek
- Published
- 2012
- Full Text
- View/download PDF
31. Global gene expression analysis of the shoot apical meristem of maize (Zea mays L.)
- Author
-
Ohtsu, Kazuhiro, Smith, Marianne B., Emrich, Scott J., Borsuk, Lisa A., Zhou, Ruilian, Chen, Tianle, Zhang, Xiaolan, Timmermans, Marja C. P., Beck, Jon, Buckner, Brent, Janick-Buckner, Diane, Nettleton, Dan, Scanlon, Michael J., and Schnable, Patrick S.
- Published
- 2007
32. SNP discovery via 454 transcriptome sequencing
- Author
-
Barbazuk, W. Brad, Emrich, Scott J., Chen, Hsin D., Li, Li, and Schnable, Patrick S.
- Published
- 2007
33. CHARMING: Harmonizing synonymous codon usage to replicate a desired codon usage pattern.
- Author
-
Wright, Gabriel, Rodriguez, Anabel, Li, Jun, Milenkovic, Tijana, Emrich, Scott J., and Clark, Patricia L.
- Abstract
There is a growing appreciation that synonymous codon usage, although historically regarded as phenotypically silent, can instead alter a wide range of mechanisms related to functional protein production, a term we use here to describe the net effect of transcription (mRNA synthesis), mRNA half‐life, translation (protein synthesis) and the probability of a protein folding correctly to its active, functional structure. In particular, recent discoveries have highlighted the important role that sub‐optimal codons can play in modifying co‐translational protein folding. These results have drawn increased attention to the patterns of synonymous codon usage within coding sequences, particularly in light of the discovery that these patterns can be conserved across evolution for homologous proteins. Because synonymous codon usage differs between organisms, for heterologous gene expression it can be desirable to make synonymous codon substitutions to match the codon usage pattern from the original organism in the heterologous expression host. Here we present CHARMING (for Codon HARMonizING), a robust and versatile algorithm to design mRNA sequences for heterologous gene expression and other related codon harmonization tasks. CHARMING can be run as a downloadable Python script or via a web portal at http://www.codons.org. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
34. A strategy for assembling the maize (Zea mays L.) genome
- Author
-
Emrich, Scott J., Aluru, Srinivas, Fu, Yan, Wen, Tsui-Jung, Narayanan, Mahesh, Guo, Ling, Ashlock, Daniel A., and Schnable, Patrick S.
- Published
- 2004
35. High-throughput 454 resequencing for allele discovery and recombination mapping in Plasmodium falciparum
- Author
-
Samarakoon, Upeka, Regier, Allison, Tan, Asako, Desany, Brian A, Collins, Brendan, Tan, John C, Emrich, Scott J, and Ferdig, Michael T
- Published
- 2011
- Full Text
- View/download PDF
36. Breakpoint structure of the Anopheles gambiae 2Rb chromosomal inversion
- Author
-
Costantini Carlo, Traore Sekou F, Emrich Scott J, Sharakhova Maria V, Bretz David A, Reidenbach Kyanne R, Regier Allison A, Sangaré Djibril M, Lobo Neil F, Besansky Nora J, and Collins Frank H
- Subjects
Arctic medicine. Tropical medicine ,RC955-962 ,Infectious and parasitic diseases ,RC109-216 - Abstract
Abstract Background Alternative arrangements of chromosome 2 inversions in Anopheles gambiae are important sources of population structure, and are associated with adaptation to environmental heterogeneity. The forces responsible for their origin and maintenance are incompletely understood. Molecular characterization of inversion breakpoints provides insight into how they arose, and provides the basis for development of molecular karyotyping methods useful in future studies. Methods Sequence comparison of regions near the cytological breakpoints of 2Rb allowed the molecular delineation of breakpoint boundaries. Comparisons were made between the standard 2R+b arrangement in the An. gambiae PEST reference genome and the inverted 2Rb arrangements in the An. gambiae M and S genome assemblies. Sequence differences between alternative 2Rb arrangements were exploited in the design of a PCR diagnostic assay, which was evaluated against the known chromosomal banding pattern of laboratory colonies and field-collected samples from Mali and Cameroon. Results The breakpoints of the 7.55 Mb 2Rb inversion are flanked by extensive runs of the same short (72 bp) tandemly organized sequence, which was likely responsible for chromosomal breakage and rearrangement. Application of the molecular diagnostic assay suggested that 2Rb has a single common origin in An. gambiae and its sibling species, Anopheles arabiensis, and also that the standard arrangement (2R+b) may have arisen twice through breakpoint reuse. The molecular diagnostic was reliable when applied to laboratory colonies, but its accuracy was lower in natural populations. Conclusions The complex repetitive sequence flanking the 2Rb breakpoint region may be prone to structural and sequence-level instability. The 2Rb molecular diagnostic has immediate application in studies based on laboratory colonies, but its usefulness in natural populations awaits development of complementary molecular tools.
- Published
- 2010
- Full Text
- View/download PDF
37. A statistical approach to finding overlooked genetic associations
- Author
-
Siwo Geoffrey, Rider Andrew K, Chawla Nitesh V, Ferdig Michael, and Emrich Scott J
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Complexity and noise in expression quantitative trait loci (eQTL) studies make it difficult to distinguish potential regulatory relationships among the many interactions. The predominant method of identifying eQTLs finds associations that are significant at a genome-wide level. The vast number of statistical tests carried out on these data make false negatives very likely. Corrections for multiple testing error render genome-wide eQTL techniques unable to detect modest regulatory effects. We propose an alternative method to identify eQTLs that builds on traditional approaches. In contrast to genome-wide techniques, our method determines the significance of an association between an expression trait and a locus with respect to the set of all associations to the expression trait. The use of this specific information facilitates identification of expression traits that have an expression profile that is characterized by a single exceptional association to a locus. Our approach identifies expression traits that have exceptional associations regardless of the genome-wide significance of those associations. This property facilitates the identification of possible false negatives for genome-wide significance. Further, our approach has the property of prioritizing expression traits that are affected by few strong associations. Expression traits identified by this method may warrant additional study because their expression level may be affected by targeting genes near a single locus. Results We demonstrate our method by identifying eQTL hotspots in Plasmodium falciparum (malaria) and Saccharomyces cerevisiae (yeast). We demonstrate the prioritization of traits with few strong genetic effects through Gene Ontology (GO) analysis of Yeast. Our results are strongly consistent with results gathered using genome-wide methods and identify additional hotspots and eQTLs. Conclusions New eQTLs and hotspots found with this method may represent regions of the genome or biological processes that are controlled through few relatively strong genetic interactions. These points of interest warrant experimental investigation.
- Published
- 2010
- Full Text
- View/download PDF
38. Population-level transcriptome sequencing of nonmodel organisms Erynnis propertius and Papilio zelicaon
- Author
-
Emrich Scott J, Lobo Neil F, Carmichael Rory D, Dzurisin Jason DK, O'Neil Shawn T, and Hellmann Jessica J
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Several recent studies have demonstrated the use of Roche 454 sequencing technology for de novo transcriptome analysis. Low error rates and high coverage also allow for effective SNP discovery and genetic diversity estimates. However, genetically diverse datasets, such as those sourced from natural populations, pose challenges for assembly programs and subsequent analysis. Further, estimating the effectiveness of transcript discovery using Roche 454 transcriptome data is still a difficult task. Results Using the Roche 454 FLX Titanium platform, we sequenced and assembled larval transcriptomes for two butterfly species: the Propertius duskywing, Erynnis propertius (Lepidoptera: Hesperiidae) and the Anise swallowtail, Papilio zelicaon (Lepidoptera: Papilionidae). The Expressed Sequence Tags (ESTs) generated represent a diverse sample drawn from multiple populations, developmental stages, and stress treatments. Despite this diversity, > 95% of the ESTs assembled into long (> 714 bp on average) and highly covered (> 9.6× on average) contigs. To estimate the effectiveness of transcript discovery, we compared the number of bases in the hit region of unigenes (contigs and singletons) to the length of the best match silkworm (Bombyx mori) protein--this "ortholog hit ratio" gives a close estimate on the amount of the transcript discovered relative to a model lepidopteran genome. For each species, we tested two assembly programs and two parameter sets; although CAP3 is commonly used for such data, the assemblies produced by Celera Assembler with modified parameters were chosen over those produced by CAP3 based on contig and singleton counts as well as ortholog hit ratio analysis. In the final assemblies, 1,413 E. propertius and 1,940 P. zelicaon unigenes had a ratio > 0.8; 2,866 E. propertius and 4,015 P. zelicaon unigenes had a ratio > 0.5. Conclusions Ultimately, these assemblies and SNP data will be used to generate microarrays for ecoinformatics examining climate change tolerance of different natural populations. These studies will benefit from high quality assemblies with few singletons (less than 26% of bases for each assembled transcriptome are present in unassembled singleton ESTs) and effective transcript discovery (over 6,500 of our putative orthologs cover at least 50% of the corresponding model silkworm gene).
- Published
- 2010
- Full Text
- View/download PDF
39. Detecting inversions with PCA in the presence of population structure.
- Author
-
Nowling, Ronald J., Manke, Krystal R., and Emrich, Scott J.
- Abstract
Chromosomal inversions can lead to reproductive isolation and adaptation in insects such as Drosophila melanogaster and the non-model malaria vector Anopheles gambiae. Inversions can be detected and characterized using principal component analysis (PCA) of single nucleotide polymorphisms (SNPs). To aid in developing such methods, we formed a new benchmark derived from three publicly-available insect data. We then used this benchmark to perform an extended validation of our software for inversion analysis (Asaph). Through that process, we identified and characterized several problematic test cases liable to misinterpretation that can help guide PCA-based inversion detection. Lastly, we re-analyzed the 2R chromosome arm of 150 An. gambiae and coluzzii samples and observed two inversions (2Rc and 2Rd) that were previously known but not annotated in these particular individuals. The resulting benchmark data set and methods will be useful for future inversion detection based solely on SNP data. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
40. Network analysis of synonymous codon usage.
- Author
-
Newaz, Khalique, Wright, Gabriel, Piland, Jacob, Li, Jun, Clark, Patricia L, Emrich, Scott J, and Milenković, Tijana
- Subjects
PROTEIN structure ,PROTEIN folding ,AMINO acids ,PROTEIN models ,MOTIVATION (Psychology) - Abstract
Motivation Most amino acids are encoded by multiple synonymous codons, some of which are used more rarely than others. Analyses of positions of such rare codons in protein sequences revealed that rare codons can impact co-translational protein folding and that positions of some rare codons are evolutionarily conserved. Analyses of their positions in protein 3-dimensional structures, which are richer in biochemical information than sequences alone, might further explain the role of rare codons in protein folding. Results We model protein structures as networks and use network centrality to measure the structural position of an amino acid. We first validate that amino acids buried within the structural core are network-central, and those on the surface are not. Then, we study potential differences between network centralities and thus structural positions of amino acids encoded by conserved rare, non-conserved rare and commonly used codons. We find that in 84% of proteins, the three codon categories occupy significantly different structural positions. We examine protein groups showing different codon centrality trends, i.e. different relationships between structural positions of the three codon categories. We see several cases of all proteins from our data with some structural or functional property being in the same group. Also, we see a case of all proteins in some group having the same property. Our work shows that codon usage is linked to the final protein structure and thus possibly to co-translational protein folding. Availability and implementation https://nd.edu/∼cone/CodonUsage/. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
41. Analysis of computational codon usage models and their association with translationally slow codons.
- Author
-
Wright, Gabriel, Rodriguez, Anabel, Li, Jun, Clark, Patricia L., Milenković, Tijana, and Emrich, Scott J.
- Subjects
GENETIC translation ,GENETIC code ,PROTEIN models ,FORECASTING - Abstract
Improved computational modeling of protein translation rates, including better prediction of where translational slowdowns along an mRNA sequence may occur, is critical for understanding co-translational folding. Because codons within a synonymous codon group are translated at different rates, many computational translation models rely on analyzing synonymous codons. Some models rely on genome-wide codon usage bias (CUB), believing that globally rare and common codons are the most informative of slow and fast translation, respectively. Others use the CUB observed only in highly expressed genes, which should be under selective pressure to be translated efficiently (and whose CUB may therefore be more indicative of translation rates). No prior work has analyzed these models for their ability to predict translational slowdowns. Here, we evaluate five models for their association with slowly translated positions as denoted by two independent ribosome footprint (RFP) count experiments from S. cerevisiae, because RFP data is often considered as a "ground truth" for translation rates across mRNA sequences. We show that all five considered models strongly associate with the RFP data and therefore have potential for estimating translational slowdowns. However, we also show that there is a weak correlation between RFP counts for the same genes originating from independent experiments, even when their experimental conditions are similar. This raises concerns about the efficacy of using current RFP experimental data for estimating translation rates and highlights a potential advantage of using computational models to understand translation rates instead. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
42. Erratum to :'Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection' (vol 112, pg 14936, 2015)
- Author
-
Lowenberger, Carl, Rivera Pomar, Rolando, Monteiro, Fernando A., Minx, Patrick, Spieth, John, Bernardo Carvalho, A., Panzera, Francisco, Lawson, Daniel, Torres, Andre Q., Ribeiro, Jose M. C., Sorgine, Marcos H. F., Waterhouse, Robert M., Montague, Michael J., Abad Franch, Fernando, Alves Bezerra, Michele, Amaral, Laurence R., Araujo, Helena M., Araujo, Ricardo N., Aravind, L., Atella, Georgia C., Azambuja, Patricia, Berni, Mateus, Bittencourt Cunha, Paula R., Braz, Gloria R. C., Calderon Fernandez, Gustavo, Carareto, Claudia M. A., Christensen, Mikkel B., Costa, Igor R., Costa, Samara G., Dansa, Marilvia, Daumas Filho, Carlos R. O., De Paula, Iron F., Dias, Felipe A., Dimopoulos, George, Emrich, Scott J., Esponda Behrens, Natalia, Fampa, Patricia, Fernandez Medina, Rita D., da Fonseca, Rodrigo N., Fontenele, Marcio, Fronick, Catrina, Fulton, Lucinda A., Gandara, Ana Caroline, Garcia, Eloi S., Genta, Fernando A., Giraldo Calderón, Gloria I., Gomes, Bruno, Gondim, Katia C., Granzotto, Adriana, Guarneri, Alessandra A., Guigó, Roderic, Harry, Myriam, Hughes, Daniel S. T., Jablonka, Willy, Jacquin Joly, Emmanuelle, Patricia Juarez, M., Koerich, Leonardo B., Latorre Estivalis, Jose Manuel, Lavore, Andrés, Lawrence, Gena G., Lazoski, Cristiano, Lazzari, Claudio R., Lopes, Raphael R., Lorenzo, Marcelo G., Lugon, Magda D., Majerowicz, David, Marcet, Paula L., Mariotti, Marco, Masuda, Hatisaburo, Megy, Karine, Melo, Ana C.A., Missirlis, Fanis, Mota, Theo, Noriega, Fernando G., Nouzova, Marcela, Nunes, Rodrigo D., Oliveira, Raquel L.L., Oliveira Silveira, Gilbert, Ons, Sheila, Pagola, Lucia, Paiva Silva, Gabriela O., Pascual, Agustina, Pavan, Marcio G., Pedrini, Nicolas, Peixoto, Alexandre A., Pereira, Marcos H., Pike, Andrew, Polycarpo, Carla, Prosdocimi, Francisco, Ribeiro Rodrigues, Rodrigo, Robertson, Hugh M., Salerno, Ana Paula, Salmon, Didier, Santesmasses, Didac, Schama, Renata, Seabra Junior, Eloy S., Silva Cardoso, Livia, Mesquita, Rafael D., and Vionette Amaral, Raquel J.
- Subjects
rhodnius prolixus ,Rhodnius prolixus ,genome ,hematophagy ,immunity ,Chagas disease ,maladie de chagas - Abstract
Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (similar to 702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods.
- Published
- 2016
43. Hot RAD: A Tool for Analysis of Next-Gen RAD Tag Data
- Author
-
Assour, Lauren A., LaRosa, Nicholas, and Emrich, Scott J.
- Subjects
Genomics (q-bio.GN) ,FOS: Biological sciences ,Quantitative Biology - Genomics ,Quantitative Biology - Quantitative Methods ,Quantitative Methods (q-bio.QM) - Abstract
Restriction site Associated DNA (RAD) tagging (also known as RAD-seq, etc.) is an emerging method for analyzing an organism's genome without completely sequencing it. This can be applied to a non-model organism without a reference genome, though this creates the problem of how to begin data analysis on unmapped and unannotated reads. Our program, Hot RAD, presents a straightforward and easy-to-use method to take raw Illumina data that has been RAD tagged and produce consensus contigs or sequence stacks using a distributed framework, creating a basis on which to begin analyzing an organism's DNA. The GUI (graphical user interface) element of our tool makes it easy for those not familiar with the command line to take raw sequence files and produce usable data in a timely manner.
- Published
- 2015
44. Predicting Local Inversions Using Rectangle Clustering and Representative Rectangle Prediction.
- Author
-
Zhu, Shenglong, Emrich, Scott J., and Chen, Danny Z.
- Abstract
As a specific type of structural variation, inversions are enjoying particular traction as a result of their established role in evolution. Using third-generation sequencing technology to predict inversions is growing in interest, but many such methods focus on improving sensitivity, giving rise to either too many false positives or very long running times. In this paper, we propose a new framework for inversion detection based on a combination of two novel theoretical models: rectangle clustering and representative rectangle prediction. This combination can automatically filter out false positive inversion predictions while retaining correct ones, leading to a method that has both high sensitivity and high positive prediction values (PPV). Further, this new framework can run very fast on available data. Our software can be freely obtained at https://github.com/UTbioinf/RigInv. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
45. Highly Accurate and Efficient Data-Driven Methods for Genotype Imputation.
- Author
-
Choudhury, Olivia, Chakrabarty, Ankush, and Emrich, Scott J.
- Abstract
High-throughput sequencing techniques have generated massive quantities of genotype data. Haplotype phasing has proven to be a useful and effective method for analyzing these data. However, the quality of phasing is undermined due to missing information. Imputation provides an effective means of improving the underlying genotype information. For model organisms, imputation can rely on an available reference genotype panel and a physical or genetic map. For non-model organisms, which often do not have a genotype panel, it is important to design an imputation technique that does not rely on reference data. Here, we present Accurate Data-Driven Imputation Technique (ADDIT), which is composed of two data-driven algorithms capable of handling data generated from model and non-model organisms. The non-model variant of ADDIT (referred to as ADDIT-NM) employs statistical inference methods to impute missing genotypes, whereas the model variant (referred to as ADDIT-M) leverages a supervised learning-based approach for imputation. We demonstrate that both variants of ADDIT are more accurate, faster, and require less memory than leading state-of-the-art imputation tools using model (human) and non-model (maize, apple, and grape) genotype data. Software Availability: The source code of ADDIT and test data sets are available at https://github.com/NDBL/ADDIT. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
46. Networks’ Characteristics Matter for Systems Biology
- Author
-
Rider, Andrew K., Milenković, Tijana, Siwo, Geoffrey H., Pinapati, Richard S., Emrich, Scott J., Ferdig, Michael T., and Chawla, Nitesh V.
- Subjects
Article - Abstract
A fundamental goal of systems biology is to create models that describe relationships between biological components. Networks are an increasingly popular approach to this problem. However, a scientist interested in modeling biological (e.g., gene expression) data as a network is quickly confounded by the fundamental problem: how to construct the network? It is fairly easy to construct a network, but is it the network for the problem being considered? This is an important problem with three fundamental issues: How to weight edges in the network in order to capture actual biological interactions? What is the effect of the type of biological experiment used to collect the data from which the network is constructed? How to prune the weighted edges (or what cut-off to apply)? Differences in the construction of networks could lead to different biological interpretations.
- Published
- 2014
47. Gene discovery and annotation using LCM-454 transcriptome sequencing
- Author
-
Emrich, Scott J., Barbazuk, W. Brad, Li Li, and Schnable, Patrick S.
- Subjects
Nucleotide sequencing -- Analysis ,Shoots (Botany) -- Genetic aspects ,Shoots (Botany) -- Research ,Polymerase chain reaction -- Analysis ,Genetic research ,Health - Abstract
Various methodologies, including the laser capture microdissection (LCM)-454 are used to observe and analyze the gene discovery and the transcriptome sequencing of cDNA. The analysis reveals that the coupling of such techniques is very effective, as it facilitates the discovery of rare, possibly cell-type-specific transcripts.
- Published
- 2007
48. Tools and Methods Derived from the Anopheline 16 Genome project
- Author
-
Steele, Aaron, Fontaine, Michael Christophe, Martin, Andres, Emrich, Scott J, and Fontaine lab
- Abstract
The dramatic reduction in sequencing costs has resulted in many initiatives to sequence certain organisms and populations. These initiatives aim to not only sequence and assemble genomes but also to perform a more broader analysis of the population structure. As part of the Anopheline Genome Consortium, which has a vested interest in studying anpopheline mosquitoes, we developed novel methods and tools to further the communities goals. We provide a brief description of these methods and tools as well as assess the contributions that each offers to the broader study of comparative genomics.
- Published
- 2014
49. Complex speciation process in the Anopheles gambiae species complex revealed by whole genome sequencing
- Author
-
Fontaine, Michael C., Steele, Aaron, Emrich, Scott J., Smith, Hilary A., Hahn, Matthew W., Besansky, Nora J., and Fontaine lab
- Published
- 2013
50. Adjusted likelihood-ratio test for variants with unknown genotypes.
- Author
-
Nowling, Ronald J. and Emrich, Scott J.
- Subjects
- *
LIKELIHOOD ratio tests , *GENOTYPES , *POPULATION genetics , *SINGLE nucleotide polymorphisms , *ANOPHELES gambiae - Abstract
Association tests performed with the Likelihood-Ratio Test (LR Test) can be an alternative to F S T , which is often used in population genetics to find variants of interest. Because the LR Test has several properties that could make it preferable to F S T , we propose a novel approach for modeling unknown genotypes in highly-similar species. To show the effectiveness of this LR Test approach, we apply it to single-nucleotide polymorphisms (SNPs) associated with the recent speciation of the malaria vectors Anopheles gambiae and Anopheles coluzzii and compare to F S T . [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.