47 results on '"Ruby JG"'
Search Results
2. PRICE: Software for the targeted assembly of components of (Meta) genomic sequence data
- Author
-
Derisi, Joseph, Ruby, JG, Bellare, P, and DeRisi, JL
- Abstract
Low-cost DNA sequencing technologies have expanded the role for direct nucleic acid sequencing in the analysis of genomes, transcriptomes, and the metagenomes of whole ecosystems. Human and machine comprehension of such large datasets can be simplified via
- Published
- 2013
3. Identification, characterization, and in vitro culture of highly divergent arenaviruses from boa constrictors and annulated tree boas: candidate etiological agents for snake inclusion body disease.
- Author
-
Derisi, Joseph, Stenglein, MD, Sanders, C, Kistler, AL, Ruby, JG, Franco, JY, Reavill, DR, Dunker, F, and Derisi, JL
- Abstract
Inclusion body disease (IBD) is an infectious fatal disease of snakes typified by behavioral abnormalities, wasting, and secondary infections. At a histopathological level, the disease is identified by the presence of large eosinophilic cytoplasmic inclusi
- Published
- 2012
4. The Macronuclear Genome of $\textit{Stentor coeruleus}$ Reveals Tiny Introns in a Giant Cell
- Author
-
Slabodnick, MM, Ruby, JG, Reiff, SB, Swart, EC, Gosai, S, Prabakaran, S, Witkowska, E, Larue, GE, Fisher, S, Freeman, RM, Gunawardena, J, Chu, W, Stover, NA, Gregory, BD, Nowacki, M, Derisi, J, Roy, SW, Marshall, WF, Sood, P, Prabakaran, Sudhakaran [0000-0002-6527-1085], and Apollo - University of Cambridge Repository
- Subjects
genetic code ,cell size ,splicing ,intron evolution ,heterotrichidae ,regeneration ,ploidy ,macronucleus ,ciliate ,U2 snRNA - Abstract
The giant, single-celled organism $\textit{Stentor coeruleus}$ has a long history as a model system for studying pattern formation and regeneration in single cells. $\textit{Stentor coeruleus}$ [1, 2] is a heterotrichous ciliate distantly related to familiar ciliate models, such as Tetrahymena or Paramecium. The primary distinguishing feature of $\textit{Stentor}$ is its incredible size: a single cell is 1 mm long. Early developmental biologists, including T.H. Morgan [3], were attracted to the system because of its regenerative abilities-if large portions of a cell are surgically removed, the remnant reorganizes into a normal-looking but smaller cell with correct proportionality [2, 3]. These biologists were also drawn to $\textit{Stentor}$ because it exhibits a rich repertoire of behaviors, including light avoidance, mechanosensitive contraction, food selection, and even the ability to habituate to touch, a simple form of learning usually seen in higher organisms [4]. While early microsurgical approaches demonstrated a startling array of regenerative and morphogenetic processes in this single-celled organism, $\textit{Stentor}$ was never developed as a molecular model system. We report the sequencing of the $\textit{Stentor coeruleus}$ macronuclear genome and reveal key features of the genome. First, we find that $\textit{Stentor}$ uses the standard genetic code, suggesting that ciliate-specific genetic codes arose after $\textit{Stentor}$ branched from other ciliates. We also discover that ploidy correlates with $\textit{Stentor}$'s cell size. Finally, in the $\textit{Stentor}$ genome, we discover the smallest spliceosomal introns reported for any species. The sequenced genome opens the door to molecular analysis of single-cell regeneration in $\textit{Stentor}$.
- Published
- 2017
5. Molecular insights into the biosynthesis of guadinomine: a type III secretion system inhibitor
- Author
-
Kathia Zaleta-Rivera, Chaitan Khosla, Tracy C. Holmes, May Ae, Joseph L. DeRisi, Michael A. Fischbach, Peter Skewes-Cox, Ruby Jg, Iwatsuki M, and Ōmura S
- Subjects
Molecular Conformation ,Virulence ,medicine.disease_cause ,Imidazolidines ,Biochemistry ,Streptomyces ,Catalysis ,Article ,Type three secretion system ,Vaccine Related ,Rare Diseases ,Colloid and Surface Chemistry ,Nonribosomal peptide ,Biodefense ,Polyketide synthase ,Gene cluster ,Genetics ,medicine ,2.2 Factors relating to the physical environment ,Aetiology ,Escherichia coli ,Bacterial Secretion Systems ,chemistry.chemical_classification ,biology ,Prevention ,General Chemistry ,Dipeptides ,Foodborne Illness ,biology.organism_classification ,Emerging Infectious Diseases ,Infectious Diseases ,chemistry ,Chemical Sciences ,biology.protein ,Heterologous expression ,Infection ,Biotechnology - Abstract
Guadinomines are a recently discovered family of anti-infective compounds produced by Streptomyces sp. K01-0509 with a novel mode of action. With an IC(50) of 14 nM, guadinomine B is the most potent known inhibitor of the type III secretion system (TTSS) of Gram-negative bacteria. TTSS activity is required for the virulence of many pathogenic Gram-negative bacteria including Escherichia coli , Salmonella spp., Yersinia spp., Chlamydia spp., Vibrio spp., and Pseudomonas spp. The guadinomine (gdn) biosynthetic gene cluster has been cloned and sequenced and includes 26 open reading frames spanning 51.2 kb. It encodes a chimeric multimodular polyketide synthase, a nonribosomal peptide synthetase, along with enzymes responsible for the biosynthesis of the unusual aminomalonyl-acyl carrier protein extender unit and the signature carbamoylated cyclic guanidine. Its identity was established by targeted disruption of the gene cluster as well as by heterologous expression and analysis of key enzymes in the biosynthetic pathway. Identifying the guadinomine gene cluster provides critical insight into the biosynthesis of these scarce but potentially important natural products.
- Published
- 2012
6. Assemblathon 1: A competitive assessment of de novo short read assembly methods
- Author
-
Earl, D, Bradnam, K, St. John, J, Darling, A, Lin, D, Fass, J, Yu, HOK, Buffalo, V, Zerbino, DR, Diekhans, M, Nguyen, N, Ariyaratne, PN, Sung, WK, Ning, Z, Haimel, M, Simpson, JT, Fonseca, NA, Birol, I, Docking, TR, Ho, IY, Rokhsar, DS, Chikhi, R, Lavenier, D, Chapuis, G, Naquin, D, Maillet, N, Schatz, MC, Kelley, DR, Phillippy, AM, Koren, S, Yang, SP, Wu, W, Chou, WC, Srivastava, A, Shaw, TI, Ruby, JG, Skewes-Cox, P, Betegon, M, Dimon, MT, Solovyev, V, Seledtsov, I, Kosarev, P, Vorobyev, D, Ramirez-Gonzalez, R, Leggett, R, MacLean, D, Xia, F, Luo, R, Li, Z, Xie, Y, Liu, B, Gnerre, S, MacCallum, I, Przybylski, D, Ribeiro, FJ, Sharpe, T, Hall, G, Kersey, PJ, Durbin, R, Jackman, SD, Chapman, JA, Huang, X, DeRisi, JL, Caccamo, M, Li, Y, Jaffe, DB, Green, RE, Haussler, D, Korf, I, Paten, B, Earl, D, Bradnam, K, St. John, J, Darling, A, Lin, D, Fass, J, Yu, HOK, Buffalo, V, Zerbino, DR, Diekhans, M, Nguyen, N, Ariyaratne, PN, Sung, WK, Ning, Z, Haimel, M, Simpson, JT, Fonseca, NA, Birol, I, Docking, TR, Ho, IY, Rokhsar, DS, Chikhi, R, Lavenier, D, Chapuis, G, Naquin, D, Maillet, N, Schatz, MC, Kelley, DR, Phillippy, AM, Koren, S, Yang, SP, Wu, W, Chou, WC, Srivastava, A, Shaw, TI, Ruby, JG, Skewes-Cox, P, Betegon, M, Dimon, MT, Solovyev, V, Seledtsov, I, Kosarev, P, Vorobyev, D, Ramirez-Gonzalez, R, Leggett, R, MacLean, D, Xia, F, Luo, R, Li, Z, Xie, Y, Liu, B, Gnerre, S, MacCallum, I, Przybylski, D, Ribeiro, FJ, Sharpe, T, Hall, G, Kersey, PJ, Durbin, R, Jackman, SD, Chapman, JA, Huang, X, DeRisi, JL, Caccamo, M, Li, Y, Jaffe, DB, Green, RE, Haussler, D, Korf, I, and Paten, B
- Abstract
Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/. © 2011 by Cold Spring Harbor Laboratory Press.
- Published
- 2011
7. Analysis of lifespan across Diversity Outbred mouse studies identifies multiple longevity-associated loci.
- Author
-
Mullis MN, Wright KM, Raj A, Gatti DM, Reifsnyder PC, Flurkey K, Archer JR, Robinson L, Di Francesco A, Svenson KL, Korstanje R, Harrison DE, Ruby JG, and Churchill GA
- Abstract
Lifespan is an integrative phenotype whose genetic architecture is likely to highlight multiple processes with high impact on health and aging. Here, we conduct a genetic meta-analysis of longevity in Diversity Outbred (DO) mice that includes 2,444 animals from three independently conducted lifespan studies. We identify six loci that contribute significantly to lifespan independently of diet and drug treatment, one of which also influences lifespan in a sex-dependent manner, as well as an additional locus with a diet-specific effect on lifespan. Collectively, these loci explain over half of the estimated heritable variation in lifespan across these studies and provide insight into the genetic architecture of lifespan in DO mice.
- Published
- 2024
- Full Text
- View/download PDF
8. Five years later, with double the demographic data, naked mole-rat mortality rates continue to defy Gompertzian laws by not increasing with age.
- Author
-
Ruby JG, Smith M, and Buffenstein R
- Subjects
- Animals, Mortality trends, Male, Female, Mole Rats, Longevity, Aging physiology
- Abstract
The naked mole-rat (Heterocephalus glaber) is a mouse-sized rodent species, notable for its eusociality and long lifespan. Previously, we reported that demographic aging, i.e., the exponential increase of mortality hazard that accompanies advancing age in mammals and other organisms, does not occur in naked mole-rats (Ruby et al., 2018), a finding that has potential implications for human healthy aging. The demographic data supporting that conclusion had taken over three decades to accumulate, starting with the original rearing of H. glaber in captivity. This finding was controversial since many of the animals in that study were relatively young. In the 5 years following that study, we have doubled our quantity of demographic data. Here, we re-evaluated our prior conclusions in light of these new data and found them to be not only supported but indeed strengthened. We additionally provided insight into the social dynamics of captive H. glaber with data and analyses of body weight and colony size versus mortality. Finally, we provide a phylogenetically proximal comparator in the form of lifespan data from our Damaraland mole-rat (Fukomys damarensis) colony and demographic meta-analysis of those data along with published data from Ansell's mole-rat (Fukomys anselli). We found Fukomys mortality hazard to increase gradually with age, an observation with inferences on the evolution of exceptional lifespan among mole-rats and the ecological factors that may have accompanied that evolution., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
9. 3D mouse pose from single-view video and a new dataset.
- Author
-
Hu B, Seybold B, Yang S, Sud A, Liu Y, Barron K, Cha P, Cosino M, Karlsson E, Kite J, Kolumam G, Preciado J, Zavala-Solorio J, Zhang C, Zhang X, Voorbach M, Tovcimak AE, Ruby JG, and Ross DA
- Subjects
- Animals, Mice, Extremities, Foot, Genotype, Models, Animal, Video Recording
- Abstract
We present a method to infer the 3D pose of mice, including the limbs and feet, from monocular videos. Many human clinical conditions and their corresponding animal models result in abnormal motion, and accurately measuring 3D motion at scale offers insights into health. The 3D poses improve classification of health-related attributes over 2D representations. The inferred poses are accurate enough to estimate stride length even when the feet are mostly occluded. This method could be applied as part of a continuous monitoring system to non-invasively measure animal health, as demonstrated by its use in successfully classifying animals based on age and genotype. We introduce the Mouse Pose Analysis Dataset, the first large scale video dataset of lab mice in their home cage with ground truth keypoint and behavior labels. The dataset also contains high resolution mouse CT scans, which we use to build the shape models for 3D pose reconstruction., (© 2023. Springer Nature Limited.)
- Published
- 2023
- Full Text
- View/download PDF
10. An Automated, Home-Cage, Video Monitoring-based Mouse Frailty Index Detects Age-associated Morbidity in C57BL/6 and Diversity Outbred Mice.
- Author
-
Ruby JG, Di Francesco A, Ylagan P, Luo A, Keyser R, Williams O, Spock S, Li W, Vongtharangsy N, Chatterjee S, Sloan CA, Ledogar C, Kuiper V, Kite J, Cosino M, Cha P, and Karlsson EM
- Subjects
- Animals, Mice, Humans, Aged, Collaborative Cross Mice, Mice, Inbred C57BL, Aging, Frail Elderly, Geriatric Assessment, Frailty diagnosis
- Abstract
Frailty indexes (FIs) provide quantitative measurements of nonspecific health decline and are particularly useful as longitudinal monitors of morbidity in aging studies. For mouse studies, frailty assessments can be taken noninvasively, but they require handling and direct observation that is labor-intensive to the scientist and stress inducing to the animal. Here, we implement, evaluate, and provide a refined digital FI composed entirely of computational analyses of home-cage video and compare it to manually obtained frailty scores in both C57BL/6 and genetically heterogeneous Diversity Outbred mice. We show that the frailty scores assigned by our digital index correlate with both manually obtained frailty scores and chronological age. Thus, we provide an automated tool for frailty assessment that can be collected reproducibly, at scale, without substantial labor cost., (© The Author(s) 2023. Published by Oxford University Press on behalf of The Gerontological Society of America.)
- Published
- 2023
- Full Text
- View/download PDF
11. Genetics implicates overactive osteogenesis in the development of diffuse idiopathic skeletal hyperostosis.
- Author
-
Sethi A, Ruby JG, Veras MA, Telis N, and Melamud E
- Subjects
- Male, Humans, Female, Osteogenesis genetics, Spine pathology, Absorptiometry, Photon, Hyperostosis, Diffuse Idiopathic Skeletal diagnostic imaging, Hyperostosis, Diffuse Idiopathic Skeletal genetics, Hyperostosis, Diffuse Idiopathic Skeletal complications, Osteophyte complications, Osteophyte pathology
- Abstract
Diffuse idiopathic skeletal hyperostosis (DISH) is a condition where adjacent vertebrae become fused through formation of osteophytes. The genetic and epidemiological etiology of this condition is not well understood. Here, we implemented a machine learning algorithm to assess the prevalence and severity of the pathology in ~40,000 lateral DXA scans in the UK Biobank Imaging cohort. We find that DISH is highly prevalent, above the age of 45, ~20% of men and ~8% of women having multiple osteophytes. Surprisingly, we find strong phenotypic and genetic association of DISH with increased bone mineral density and content throughout the entire skeletal system. Genetic association analysis identified ten loci associated with DISH, including multiple genes involved in bone remodeling (RUNX2, IL11, GDF5, CCDC91, NOG, and ROR2). Overall, this study describes genetics of DISH and implicates the role of overactive osteogenesis as a key driver of the pathology., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
12. Calcification of the abdominal aorta is an under-appreciated cardiovascular disease risk factor in the general population.
- Author
-
Sethi A, Taylor DL, Ruby JG, Venkataraman J, Sorokin E, Cule M, and Melamud E
- Abstract
Calcification of large arteries is a high-risk factor in the development of cardiovascular diseases, however, due to the lack of routine monitoring, the pathology remains severely under-diagnosed and prevalence in the general population is not known. We have developed a set of machine learning methods to quantitate levels of abdominal aortic calcification (AAC) in the UK Biobank imaging cohort and carried out the largest to-date analysis of genetic, biochemical, and epidemiological risk factors associated with the pathology. In a genetic association study, we identified three novel loci associated with AAC ( FGF9, NAV9 , and APOE ), and replicated a previously reported association at the TWIST1/HDAC9 locus. We find that AAC is a highly prevalent pathology, with ~ 1 in 10 adults above the age of 40 showing significant levels of hydroxyapatite build-up (Kauppila score > 3). Presentation of AAC was strongly predictive of future cardiovascular events including stenosis of precerebral arteries (HR~1.5), myocardial infarction (HR~1.3), ischemic heart disease (HR~1.3), as well as other diseases such as chronic obstructive pulmonary disease (HR~1.3). Significantly, we find that the risk for myocardial infarction from elevated AAC (HR ~1.4) was comparable to the risk of hypercholesterolemia (HR~1.4), yet most people who develop AAC are not hypercholesterolemic. Furthermore, the overwhelming majority (98%) of individuals who develop pathology do so in the absence of known pre-existing risk conditions such as chronic kidney disease and diabetes (0.6% and 2.7% respectively). Our findings indicate that despite the high cardiovascular risk, calcification of large arteries remains a largely under-diagnosed lethal condition, and there is a clear need for increased awareness and monitoring of the pathology in the general population., Competing Interests: Authors AS, JR, ES, MC, and EM are employed by Calico Life Sciences LLC. Authors DT and JV were employed by Calico Life Sciences LLC. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2022 Sethi, Taylor, Ruby, Venkataraman, Sorokin, Cule and Melamud.)
- Published
- 2022
- Full Text
- View/download PDF
13. The naked truth: a comprehensive clarification and classification of current 'myths' in naked mole-rat biology.
- Author
-
Buffenstein R, Amoroso V, Andziak B, Avdieiev S, Azpurua J, Barker AJ, Bennett NC, Brieño-Enríquez MA, Bronner GN, Coen C, Delaney MA, Dengler-Crish CM, Edrey YH, Faulkes CG, Frankel D, Friedlander G, Gibney PA, Gorbunova V, Hine C, Holmes MM, Jarvis JUM, Kawamura Y, Kutsukake N, Kenyon C, Khaled WT, Kikusui T, Kissil J, Lagestee S, Larson J, Lauer A, Lavrenchenko LA, Lee A, Levitt JB, Lewin GR, Lewis Hardell KN, Lin TD, Mason MJ, McCloskey D, McMahon M, Miura K, Mogi K, Narayan V, O'Connor TP, Okanoya K, O'Riain MJ, Park TJ, Place NJ, Podshivalova K, Pamenter ME, Pyott SJ, Reznick J, Ruby JG, Salmon AB, Santos-Sacchi J, Sarko DK, Seluanov A, Shepard A, Smith M, Storey KB, Tian X, Vice EN, Viltard M, Watarai A, Wywial E, Yamakawa M, Zemlemerova ED, Zions M, and Smith ESJ
- Subjects
- Animals, Biology, Longevity, Mole Rats
- Abstract
The naked mole-rat (Heterocephalus glaber) has fascinated zoologists for at least half a century. It has also generated considerable biomedical interest not only because of its extraordinary longevity, but also because of unusual protective features (e.g. its tolerance of variable oxygen availability), which may be pertinent to several human disease states, including ischemia/reperfusion injury and neurodegeneration. A recent article entitled 'Surprisingly long survival of premature conclusions about naked mole-rat biology' described 28 'myths' which, those authors claimed, are a 'perpetuation of beautiful, but falsified, hypotheses' and impede our understanding of this enigmatic mammal. Here, we re-examine each of these 'myths' based on evidence published in the scientific literature. Following Braude et al., we argue that these 'myths' fall into four main categories: (i) 'myths' that would be better described as oversimplifications, some of which persist solely in the popular press; (ii) 'myths' that are based on incomplete understanding, where more evidence is clearly needed; (iii) 'myths' where the accumulation of evidence over the years has led to a revision in interpretation, but where there is no significant disagreement among scientists currently working in the field; (iv) 'myths' where there is a genuine difference in opinion among active researchers, based on alternative interpretations of the available evidence. The term 'myth' is particularly inappropriate when applied to competing, evidence-based hypotheses, which form part of the normal evolution of scientific knowledge. Here, we provide a comprehensive critical review of naked mole-rat biology and attempt to clarify some of these misconceptions., (© 2021 The Authors. Biological Reviews published by John Wiley & Sons Ltd on behalf of Cambridge Philosophical Society.)
- Published
- 2022
- Full Text
- View/download PDF
14. Opportunities for new insight into aging from the naked mole-rat and other non-traditional models.
- Author
-
Buffenstein R and Ruby JG
- Subjects
- Animals, Mole Rats, Aging, Longevity
- Published
- 2021
- Full Text
- View/download PDF
15. Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual.
- Author
-
Soifer L, Fong NL, Yi N, Ireland AT, Lam I, Sooknah M, Paw JS, Peluso P, Concepcion GT, Rank D, Hastie AR, Jojic V, Ruby JG, Botstein D, and Roy MA
- Subjects
- DNA, High-Throughput Nucleotide Sequencing, Humans, Sequence Analysis, DNA, Diploidy, Genome, Human
- Abstract
In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one or the other of the homologous chromosomes. Yet chromosomal phase information is invaluable for the understanding of phenotypic trait inheritance in the cases of compound heterozygosity, allele-specific expression or cis -acting variants. Here we use a combination of tools and sequencing technologies to generate a de novo diploid assembly of the human primary cell line WI-38. First, data from PacBio single molecule sequencing and Bionano Genomics optical mapping were combined to generate an unphased assembly. Next, 10x Genomics linked reads were combined with the hybrid assembly to generate a partially phased assembly. Lastly, we developed and optimized methods to use short-read (Illumina) sequencing of flow cytometry-sorted metaphase chromosomes to provide phase information. The final genome assembly was almost fully (94%) phased with the addition of approximately 2.5-fold coverage of Illumina data from the sequenced metaphase chromosomes. The diploid nature of the final de novo genome assembly improved the resolution of structural variants between the WI-38 genome and the human reference genome. The phased WI-38 sequence data are available for browsing and download at wi38.research.calicolabs.com. Our work shows that assembling a completely phased diploid genome de novo from the DNA of a single individual is now readily achievable., (Copyright © 2020 Soifer et al.)
- Published
- 2020
- Full Text
- View/download PDF
16. A Prospective Analysis of Genetic Variants Associated with Human Lifespan.
- Author
-
Wright KM, Rand KA, Kermany A, Noto K, Curtis D, Garrigan D, Slinkov D, Dorfman I, Granka JM, Byrnes J, Myres N, Ball CA, and Ruby JG
- Subjects
- Aged, Aged, 80 and over, Apolipoproteins E genetics, Carrier Proteins genetics, Databases, Genetic, Female, Humans, Male, Nuclear Proteins genetics, Pedigree, Polymorphism, Single Nucleotide, Prospective Studies, Proto-Oncogene Proteins genetics, Genome-Wide Association Study methods, Longevity genetics
- Abstract
We present a massive investigation into the genetic basis of human lifespan. Beginning with a genome-wide association (GWA) study using a de-identified snapshot of the unique AncestryDNA database - more than 300,000 genotyped individuals linked to pedigrees of over 400,000,000 people - we mapped six genome-wide significant loci associated with parental lifespan. We compared these results to a GWA analysis of the traditional lifespan proxy trait, age, and found only one locus, APOE , to be associated with both age and lifespan. By combining the AncestryDNA results with those of an independent UK Biobank dataset, we conducted a meta-analysis of more than 650,000 individuals and identified fifteen parental lifespan-associated loci. Beyond just those significant loci, our genome-wide set of polymorphisms accounts for up to 8% of the variance in human lifespan; this value represents a large fraction of the heritability estimated from phenotypic correlations between relatives., (Copyright © 2019 Wright et al.)
- Published
- 2019
- Full Text
- View/download PDF
17. Response to comment on 'Naked mole-rat mortality rates defy Gompertzian laws by not increasing with age'.
- Author
-
Ruby JG, Smith M, and Buffenstein R
- Subjects
- Animals, Humans, Mammals, Mole Rats
- Abstract
For most adult mammals, the risk of death increases exponentially with age, an observation originally described for humans by Benjamin Gompertz. We recently performed a Kaplan-Meier survival analysis of naked mole-rats ( Heterocephalus glaber ) and concluded that their risk of death remains constant as they grow older (Ruby et al., 2018). Dammann et al. suggest incomplete historical records potentially confounded our demographic analysis (Dammann et al., 2019). In response, we applied the left-censorship technique of Kaplan and Meier to exclude all data from the historical era in which they speculate the records to be confounded. Our new analysis produced indistinguishable results from what we had previously published, and thus strongly reinforced our original conclusions., Competing Interests: JR, MS, RB This research was funded by Calico Life Sciences LLC, where all authors were employees at the time the study was conducted. The authors declare no other competing financial interests., (© 2019, Ruby et al.)
- Published
- 2019
- Full Text
- View/download PDF
18. Acoel genome reveals the regulatory landscape of whole-body regeneration.
- Author
-
Gehrke AR, Neverett E, Luo YJ, Brandt A, Ricci L, Hulett RE, Gompers A, Ruby JG, Rokhsar DS, Reddien PW, and Srivastava M
- Subjects
- Animals, Binding Sites, Chromatin metabolism, Genome, Transcriptome, Wound Healing genetics, Early Growth Response Transcription Factors metabolism, Gene Expression Regulation, Gene Regulatory Networks, Invertebrates genetics, Invertebrates physiology, Regeneration genetics
- Abstract
Whole-body regeneration is accompanied by complex transcriptomic changes, yet the chromatin regulatory landscapes that mediate this dynamic response remain unexplored. To decipher the regulatory logic that orchestrates regeneration, we sequenced the genome of the acoel worm Hofstenia miamia , a highly regenerative member of the sister lineage of other bilaterians. Epigenomic profiling revealed thousands of regeneration-responsive chromatin regions and identified dynamically bound transcription factor motifs, with the early growth response (EGR) binding site as the most variably accessible during Hofstenia regeneration. Combining egr inhibition with chromatin profiling suggests that Egr functions as a pioneer factor to directly regulate early wound-induced genes. The genetic connections inferred by this approach allowed the construction of a gene regulatory network for whole-body regeneration, enabling genomics-based comparisons of regeneration across species., (Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.)
- Published
- 2019
- Full Text
- View/download PDF
19. Estimates of the Heritability of Human Longevity Are Substantially Inflated due to Assortative Mating.
- Author
-
Ruby JG, Wright KM, Rand KA, Kermany A, Noto K, Curtis D, Varner N, Garrigan D, Slinkov D, Dorfman I, Granka JM, Byrnes J, Myres N, and Ball C
- Subjects
- Female, Humans, Male, Models, Genetic, Pedigree, Longevity genetics, Reproduction
- Abstract
Human life span is a phenotype that integrates many aspects of health and environment into a single ultimate quantity: the elapsed time between birth and death. Though it is widely believed that long life runs in families for genetic reasons, estimates of life span "heritability" are consistently low (∼15-30%). Here, we used pedigree data from Ancestry public trees, including hundreds of millions of historical persons, to estimate the heritability of human longevity. Although "nominal heritability" estimates based on correlations among genetic relatives agreed with prior literature, the majority of that correlation was also captured by correlations among nongenetic (in-law) relatives, suggestive of highly assortative mating around life span-influencing factors (genetic and/or environmental). We used structural equation modeling to account for assortative mating, and concluded that the true heritability of human longevity for birth cohorts across the 1800s and early 1900s was well below 10%, and that it has been generally overestimated due to the effect of assortative mating., (Copyright © 2018 by the Genetics Society of America.)
- Published
- 2018
- Full Text
- View/download PDF
20. Naked Mole-Rat mortality rates defy gompertzian laws by not increasing with age.
- Author
-
Ruby JG, Smith M, and Buffenstein R
- Subjects
- Animals, Mole Rats, Aging pathology, Longevity, Mortality
- Abstract
The longest-lived rodent, the naked mole-rat ( Heterocephalus glaber) , has a reported maximum lifespan of >30 years and exhibits delayed and/or attenuated age-associated physiological declines. We questioned whether these mouse-sized, eusocial rodents conform to Gompertzian mortality laws by experiencing an exponentially increasing risk of death as they get older. We compiled and analyzed a large compendium of historical naked mole-rat lifespan data with >3000 data points. Kaplan-Meier analyses revealed a substantial portion of the population to have survived at 30 years of age. Moreover, unlike all other mammals studied to date, and regardless of sex or breeding-status, the age-specific hazard of mortality did not increase with age, even at ages 25-fold past their time to reproductive maturity. This absence of hazard increase with age, in defiance of Gompertz's law, uniquely identifies the naked mole-rat as a non-aging mammal, confirming its status as an exceptional model for biogerontology., Competing Interests: JR The research was funded by Calico Life Sciences LLC, where all authors were employees at the time the study was conducted. The authors declare no other competing financial interests, MS Megan Smith: The research was funded by Calico Life Sciences LLC, where all authors were employees at the time the study was conducted. The authors declare no other competing financial interests, RB Rochelle Buffenstein: The research was funded by Calico Life Sciences LLC, where all authors were employees at the time the study was conducted. The authors declare no other competing financial interests, (© 2017, Ruby et al.)
- Published
- 2018
- Full Text
- View/download PDF
21. Correction: Correction: A Novel Rhabdovirus Associated with Acute Hemorrhagic Fever in Central Africa.
- Author
-
Grard G, Fair JN, Lee D, Slikas E, Steffen I, Muyembe JJ, Sittler T, Veeraraghavan N, Ruby JG, Wang C, Makuwa M, Mulembakani P, Tesh RB, Mazet J, Rimoin AW, Taylor T, Schneider BS, Simmons G, Delwart E, Wolfe ND, Chiu CY, and Leroy EM
- Abstract
[This corrects the article DOI: 10.1371/journal.ppat.1002924.].
- Published
- 2017
- Full Text
- View/download PDF
22. The Macronuclear Genome of Stentor coeruleus Reveals Tiny Introns in a Giant Cell.
- Author
-
Slabodnick MM, Ruby JG, Reiff SB, Swart EC, Gosai S, Prabakaran S, Witkowska E, Larue GE, Fisher S, Freeman RM Jr, Gunawardena J, Chu W, Stover NA, Gregory BD, Nowacki M, Derisi J, Roy SW, Marshall WF, and Sood P
- Subjects
- Phylogeny, Whole Genome Sequencing, Ciliophora genetics, Genome, Protozoan, Introns genetics, Spliceosomes metabolism
- Abstract
The giant, single-celled organism Stentor coeruleus has a long history as a model system for studying pattern formation and regeneration in single cells. Stentor [1, 2] is a heterotrichous ciliate distantly related to familiar ciliate models, such as Tetrahymena or Paramecium. The primary distinguishing feature of Stentor is its incredible size: a single cell is 1 mm long. Early developmental biologists, including T.H. Morgan [3], were attracted to the system because of its regenerative abilities-if large portions of a cell are surgically removed, the remnant reorganizes into a normal-looking but smaller cell with correct proportionality [2, 3]. These biologists were also drawn to Stentor because it exhibits a rich repertoire of behaviors, including light avoidance, mechanosensitive contraction, food selection, and even the ability to habituate to touch, a simple form of learning usually seen in higher organisms [4]. While early microsurgical approaches demonstrated a startling array of regenerative and morphogenetic processes in this single-celled organism, Stentor was never developed as a molecular model system. We report the sequencing of the Stentor coeruleus macronuclear genome and reveal key features of the genome. First, we find that Stentor uses the standard genetic code, suggesting that ciliate-specific genetic codes arose after Stentor branched from other ciliates. We also discover that ploidy correlates with Stentor's cell size. Finally, in the Stentor genome, we discover the smallest spliceosomal introns reported for any species. The sequenced genome opens the door to molecular analysis of single-cell regeneration in Stentor., (Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.)
- Published
- 2017
- Full Text
- View/download PDF
23. Correction: A Novel Rhabdovirus Associated with Acute Hemorrhagic Fever in Central Africa.
- Author
-
Grard G, Fair JN, Lee D, Slikas E, Steffen I, Muyembe JJ, Sittler T, Veeraraghavan N, Ruby JG, Wang C, Makuwa M, Mulembakani P, Tesh RB, Mazet J, Rimoin AW, Taylor T, Schneider BS, Simmons G, Delwart E, Wolfe ND, Chiu CY, and Leroy EM
- Published
- 2016
- Full Text
- View/download PDF
24. Adaptive regulation of testis gene expression and control of male fertility by the Drosophila hairpin RNA pathway. [Corrected].
- Author
-
Wen J, Duan H, Bejarano F, Okamura K, Fabian L, Brill JA, Bortolamiol-Becet D, Martin R, Ruby JG, and Lai EC
- Subjects
- Adaptation, Physiological genetics, Animals, Base Sequence, Biological Evolution, Drosophila melanogaster growth & development, Drosophila melanogaster metabolism, Fertility genetics, Humans, Insect Proteins antagonists & inhibitors, Insect Proteins metabolism, Male, Mitochondrial Proton-Translocating ATPases antagonists & inhibitors, Mitochondrial Proton-Translocating ATPases genetics, Mitochondrial Proton-Translocating ATPases metabolism, Molecular Sequence Data, Nucleic Acid Conformation, Protein Subunits antagonists & inhibitors, Protein Subunits genetics, Protein Subunits metabolism, RNA, Small Interfering chemistry, RNA, Small Interfering metabolism, Spermatozoa growth & development, Spermatozoa metabolism, Testis growth & development, Drosophila melanogaster genetics, Gene Expression Regulation, Developmental, Insect Proteins genetics, RNA, Small Interfering genetics, Spermatogenesis genetics, Testis metabolism
- Abstract
Although endogenous siRNAs (endo-siRNAs) have been described in many species, still little is known about their endogenous utility. Here, we show that Drosophila hairpin RNAs (hpRNAs) generate an endo-siRNA class with predominant expression in testes. Although hpRNAs are universally recently evolved, we identify highly complementary protein-coding targets for all hpRNAs. Importantly, we find broad evidence for evolutionary divergences that preferentially maintain compensatory pairing between hpRNAs and targets, serving as first evidence for adaptive selection for siRNA-mediated target regulation in metazoans. We demonstrate organismal impact of hpRNA activity, since knockout of hpRNA1 derepresses its target ATP synthase-β in testes and compromises spermatogenesis and male fertility. Moreover, we reveal surprising male-specific impact of RNAi factors on germ cell development and fertility, consistent with testis-directed function of the hpRNA pathway. Finally, the collected hpRNA loci chronicle an evolutionary timeline that reflects their origins from prospective target genes, mirroring a strategy described for plant miRNAs., (Copyright © 2015 Elsevier Inc. All rights reserved.)
- Published
- 2015
- Full Text
- View/download PDF
25. The kinase regulator mob1 acts as a patterning protein for stentor morphogenesis.
- Author
-
Slabodnick MM, Ruby JG, Dunn JG, Feldman JL, DeRisi JL, and Marshall WF
- Subjects
- Amino Acid Sequence, Animals, Cell Division, Ciliophora classification, Ciliophora metabolism, Ciliophora ultrastructure, Intracellular Signaling Peptides and Proteins antagonists & inhibitors, Intracellular Signaling Peptides and Proteins metabolism, Molecular Sequence Data, Phylogeny, Plants, Protozoan Proteins antagonists & inhibitors, Protozoan Proteins metabolism, RNA Interference, RNA, Small Interfering genetics, RNA, Small Interfering metabolism, Sequence Alignment, Sequence Homology, Amino Acid, Ciliophora genetics, Gene Expression Regulation, Intracellular Signaling Peptides and Proteins genetics, Morphogenesis genetics, Protozoan Proteins genetics, Regeneration genetics
- Abstract
Morphogenesis and pattern formation are vital processes in any organism, whether unicellular or multicellular. But in contrast to the developmental biology of plants and animals, the principles of morphogenesis and pattern formation in single cells remain largely unknown. Although all cells develop patterns, they are most obvious in ciliates; hence, we have turned to a classical unicellular model system, the giant ciliate Stentor coeruleus. Here we show that the RNA interference (RNAi) machinery is conserved in Stentor. Using RNAi, we identify the kinase coactivator Mob1--with conserved functions in cell division and morphogenesis from plants to humans-as an asymmetrically localized patterning protein required for global patterning during development and regeneration in Stentor. Our studies reopen the door for Stentor as a model regeneration system., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2014
- Full Text
- View/download PDF
26. Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.
- Author
-
Bradnam KR, Fass JN, Alexandrov A, Baranay P, Bechner M, Birol I, Boisvert S, Chapman JA, Chapuis G, Chikhi R, Chitsaz H, Chou WC, Corbeil J, Del Fabbro C, Docking TR, Durbin R, Earl D, Emrich S, Fedotov P, Fonseca NA, Ganapathy G, Gibbs RA, Gnerre S, Godzaridis E, Goldstein S, Haimel M, Hall G, Haussler D, Hiatt JB, Ho IY, Howard J, Hunt M, Jackman SD, Jaffe DB, Jarvis ED, Jiang H, Kazakov S, Kersey PJ, Kitzman JO, Knight JR, Koren S, Lam TW, Lavenier D, Laviolette F, Li Y, Li Z, Liu B, Liu Y, Luo R, Maccallum I, Macmanes MD, Maillet N, Melnikov S, Naquin D, Ning Z, Otto TD, Paten B, Paulo OS, Phillippy AM, Pina-Martins F, Place M, Przybylski D, Qin X, Qu C, Ribeiro FJ, Richards S, Rokhsar DS, Ruby JG, Scalabrin S, Schatz MC, Schwartz DC, Sergushichev A, Sharpe T, Shaw TI, Shendure J, Shi Y, Simpson JT, Song H, Tsarev F, Vezzi F, Vicedomini R, Vieira BM, Wang J, Worley KC, Yin S, Yiu SM, Yuan J, Zhang G, Zhang H, Zhou S, and Korf IF
- Abstract
Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly., Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies., Conclusions: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.
- Published
- 2013
- Full Text
- View/download PDF
27. The posttranslational modification cascade to the thiopeptide berninamycin generates linear forms and altered macrocyclic scaffolds.
- Author
-
Malcolmson SJ, Young TS, Ruby JG, Skewes-Cox P, and Walsh CT
- Subjects
- Amino Acid Sequence, Bacterial Proteins chemistry, Bacterial Proteins genetics, Macrocyclic Compounds chemistry, Molecular Sequence Data, Peptides chemistry, Peptides genetics, Peptides, Cyclic chemistry, Peptides, Cyclic genetics, Peptides, Cyclic metabolism, Protein Precursors chemistry, Protein Precursors genetics, Protein Processing, Post-Translational, Protein Structure, Secondary, Streptomyces lividans chemistry, Streptomyces lividans genetics, Thiazoles chemistry, Thiazoles metabolism, Bacterial Proteins metabolism, Macrocyclic Compounds metabolism, Peptides metabolism, Protein Precursors metabolism, Streptomyces lividans metabolism
- Abstract
Berninamycin is a member of the pyridine-containing thiopeptide class of antibiotics that undergoes massive posttranslational modifications from ribosomally generated preproteins. Berninamycin has a 2-oxazolyl-3-thiazolyl-pyridine core embedded in a 35-atom macrocycle rather than typical trithiazolylpyridine cores embedded in 26-atom and 29-atom peptide macrocycles. We describe the cloning of an 11-gene berninamycin cluster from Streptomyces bernensis UC 5144, its heterologous expression in Streptomyces lividans TK24 and Streptomyces venezuelae ATCC 10712, and detection of variant and incompletely processed scaffolds. Posttranslational maturation in S. lividans of both the wild-type berninamycin prepeptide (BerA) and also a T3A mutant generates macrocyclic compounds as well as linear variants, which have failed to form the pyridine and the macrocycle. Expression of the gene cluster in S. venezuelae generates a variant of the 35-atom skeleton of berninamycin, containing a methyloxazoline in the place of a methyloxazole within the macrocyclic framework.
- Published
- 2013
- Full Text
- View/download PDF
28. PRICE: software for the targeted assembly of components of (Meta) genomic sequence data.
- Author
-
Ruby JG, Bellare P, and Derisi JL
- Subjects
- Base Sequence, Genome, Viral genetics, Herpesvirus 8, Human genetics, Humans, Keratin-6 genetics, Keratin-6 metabolism, RNA Viruses genetics, RNA, Messenger genetics, RNA, Messenger metabolism, Sequence Analysis, DNA, Transcriptome genetics, Databases, Genetic, Metagenome genetics, Metagenomics methods, Software
- Abstract
Low-cost DNA sequencing technologies have expanded the role for direct nucleic acid sequencing in the analysis of genomes, transcriptomes, and the metagenomes of whole ecosystems. Human and machine comprehension of such large datasets can be simplified via synthesis of sequence fragments into long, contiguous blocks of sequence (contigs), but most of the progress in the field of assembly has focused on genomes in isolation rather than metagenomes. Here, we present software for paired-read iterative contig extension (PRICE), a strategy for focused assembly of particular nucleic acid species using complex metagenomic data as input. We describe the assembly strategy implemented by PRICE and provide examples of its application to the sequence of particular genes, transcripts, and virus genomes from complex multicomponent datasets, including an assembly of the BCBL-1 strain of Kaposi's sarcoma-associated herpesvirus. PRICE is open-source and available for free download (derisilab.ucsf.edu/software/price/ or sourceforge.net/projects/pricedenovo/).
- Published
- 2013
- Full Text
- View/download PDF
29. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.
- Author
-
Melters DP, Bradnam KR, Young HA, Telis N, May MR, Ruby JG, Sebra R, Peluso P, Eid J, Rank D, Garcia JF, DeRisi JL, Smith T, Tobias C, Ross-Ibarra J, Korf I, and Chan SW
- Subjects
- Animals, Base Sequence, Molecular Sequence Data, Plants genetics, Species Specificity, Centromere genetics, Evolution, Molecular, Tandem Repeat Sequences
- Abstract
Background: Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data., Results: Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution., Conclusions: While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
- Published
- 2013
- Full Text
- View/download PDF
30. Molecular insights into the biosynthesis of guadinomine: a type III secretion system inhibitor.
- Author
-
Holmes TC, May AE, Zaleta-Rivera K, Ruby JG, Skewes-Cox P, Fischbach MA, DeRisi JL, Iwatsuki M, Ōmura S, and Khosla C
- Subjects
- Dipeptides chemistry, Imidazolidines chemistry, Molecular Conformation, Streptomyces chemistry, Bacterial Secretion Systems drug effects, Dipeptides biosynthesis, Dipeptides pharmacology, Imidazolidines pharmacology, Streptomyces metabolism
- Abstract
Guadinomines are a recently discovered family of anti-infective compounds produced by Streptomyces sp. K01-0509 with a novel mode of action. With an IC(50) of 14 nM, guadinomine B is the most potent known inhibitor of the type III secretion system (TTSS) of Gram-negative bacteria. TTSS activity is required for the virulence of many pathogenic Gram-negative bacteria including Escherichia coli , Salmonella spp., Yersinia spp., Chlamydia spp., Vibrio spp., and Pseudomonas spp. The guadinomine (gdn) biosynthetic gene cluster has been cloned and sequenced and includes 26 open reading frames spanning 51.2 kb. It encodes a chimeric multimodular polyketide synthase, a nonribosomal peptide synthetase, along with enzymes responsible for the biosynthesis of the unusual aminomalonyl-acyl carrier protein extender unit and the signature carbamoylated cyclic guanidine. Its identity was established by targeted disruption of the gene cluster as well as by heterologous expression and analysis of key enzymes in the biosynthetic pathway. Identifying the guadinomine gene cluster provides critical insight into the biosynthesis of these scarce but potentially important natural products.
- Published
- 2012
- Full Text
- View/download PDF
31. Complete genome sequence of an astrovirus identified in a domestic rabbit (Oryctolagus cuniculus) with gastroenteritis.
- Author
-
Stenglein MD, Velazquez E, Greenacre C, Wilkes RP, Ruby JG, Lankton JS, Ganem D, Kennedy MA, and DeRisi JL
- Subjects
- Animals, Astroviridae isolation & purification, Astroviridae Infections veterinary, Astroviridae Infections virology, Enterocolitis veterinary, Enterocolitis virology, Feces virology, Gastroenteritis veterinary, Gastroenteritis virology, Microscopy, Electron, Molecular Sequence Data, Rabbits, Tennessee, Virus Cultivation, Astroviridae genetics, Genome, Viral, RNA, Viral genetics, Sequence Analysis, DNA
- Abstract
A colony of domestic rabbits in Tennessee, USA, experienced a high-mortality (~90%) outbreak of enterocolitis. The clinical characteristics were one to six days of lethargy, bloating, and diarrhea, followed by death. Heavy intestinal coccidial load was a consistent finding as was mucoid enteropathy with cecal impaction. Preliminary analysis by electron microscopy revealed the presence of virus-like particles in the stool of one of the affected rabbits. Analysis using the Virochip, a viral detection microarray, suggested the presence of an astrovirus, and follow-up PCR and sequence determination revealed a previously uncharacterized member of that family. Metagenomic sequencing enabled the recovery of the complete viral genome, which contains the characteristic attributes of astrovirus genomes. Attempts to propagate the virus in tissue culture have yet to succeed. Although astroviruses cause gastroenteric disease in other mammals, the pathogenicity of this virus and the relationship to this outbreak remains to be determined. This study therefore defines a viral species and a potential rabbit pathogen.
- Published
- 2012
- Full Text
- View/download PDF
32. A novel rhabdovirus associated with acute hemorrhagic fever in central Africa.
- Author
-
Grard G, Fair JN, Lee D, Slikas E, Steffen I, Muyembe JJ, Sittler T, Veeraraghavan N, Ruby JG, Wang C, Makuwa M, Mulembakani P, Tesh RB, Mazet J, Rimoin AW, Taylor T, Schneider BS, Simmons G, Delwart E, Wolfe ND, Chiu CY, and Leroy EM
- Subjects
- Adolescent, Adult, Animals, Antibodies, Viral blood, Democratic Republic of the Congo, Disease Outbreaks, Female, Genome, Viral, Hemorrhagic Fevers, Viral epidemiology, Hemorrhagic Fevers, Viral transmission, High-Throughput Nucleotide Sequencing, Humans, Male, Mice, Molecular Sequence Data, Phylogeny, Rhabdoviridae Infections epidemiology, Rhabdoviridae Infections pathology, Rhabdoviridae Infections transmission, Hemorrhagic Fevers, Viral virology, Rhabdoviridae classification, Rhabdoviridae genetics, Rhabdoviridae immunology, Rhabdoviridae isolation & purification, Rhabdoviridae Infections virology
- Abstract
Deep sequencing was used to discover a novel rhabdovirus (Bas-Congo virus, or BASV) associated with a 2009 outbreak of 3 human cases of acute hemorrhagic fever in Mangala village, Democratic Republic of Congo (DRC), Africa. The cases, presenting over a 3-week period, were characterized by abrupt disease onset, high fever, mucosal hemorrhage, and, in two patients, death within 3 days. BASV was detected in an acute serum sample from the lone survivor at a concentration of 1.09 × 10(6) RNA copies/mL, and 98.2% of the genome was subsequently de novo assembled from ≈ 140 million sequence reads. Phylogenetic analysis revealed that BASV is highly divergent and shares less than 34% amino acid identity with any other rhabdovirus. High convalescent neutralizing antibody titers of >1:1000 were detected in the survivor and an asymptomatic nurse directly caring for him, both of whom were health care workers, suggesting the potential for human-to-human transmission of BASV. The natural animal reservoir host or arthropod vector and precise mode of transmission for the virus remain unclear. BASV is an emerging human pathogen associated with acute hemorrhagic fever in Africa.
- Published
- 2012
- Full Text
- View/download PDF
33. Identification, characterization, and in vitro culture of highly divergent arenaviruses from boa constrictors and annulated tree boas: candidate etiological agents for snake inclusion body disease.
- Author
-
Stenglein MD, Sanders C, Kistler AL, Ruby JG, Franco JY, Reavill DR, Dunker F, and Derisi JL
- Subjects
- Animals, Arenavirus genetics, Arenavirus pathogenicity, Cell Line, Cytoplasm chemistry, Cytoplasm virology, Genome, Viral, Metagenome, Molecular Sequence Data, Nucleoproteins analysis, RNA, Viral genetics, Sequence Analysis, DNA, Virus Cultivation, Arenavirus classification, Arenavirus isolation & purification, Boidae virology, Inclusion Bodies, Viral virology
- Abstract
Unlabelled: Inclusion body disease (IBD) is an infectious fatal disease of snakes typified by behavioral abnormalities, wasting, and secondary infections. At a histopathological level, the disease is identified by the presence of large eosinophilic cytoplasmic inclusions in multiple tissues. To date, no virus or other pathogen has been definitively characterized or associated with the disease. Using a metagenomic approach to search for candidate etiologic agents in snakes with confirmed IBD, we identified and de novo assembled the complete genomic sequences of two viruses related to arenaviruses, and a third arenavirus-like sequence was discovered by screening an additional set of samples. A continuous boa constrictor cell line was established and used to propagate and isolate one of the viruses in culture. Viral nucleoprotein was localized and concentrated within large cytoplasmic inclusions in infected cells in culture and tissues from diseased snakes. In total, viral RNA was detected in 6/8 confirmed IBD cases and 0/18 controls. These viruses have a typical arenavirus genome organization but are highly divergent, belonging to a lineage separate from that of the Old and New World arenaviruses. Furthermore, these viruses encode envelope glycoproteins that are more similar to those of filoviruses than to those of other arenaviruses. These findings implicate these viruses as candidate etiologic agents of IBD. The presence of arenaviruses outside mammals reveals that these viruses infect an unexpectedly broad range of species and represent a new reservoir of potential human pathogens., Importance: Inclusion body disease (IBD) is a common infectious disease of captive snakes. IBD is fatal and can cause the loss of entire animal collections. The cause of the disease has remained elusive, and no treatment exists. In addition to being important to pet owners, veterinarians, breeders, zoological parks, and aquariums, the study of animal disease is significant since animals are the source of virtually every emerging infectious human disease. We searched for candidate causative agents in snakes diagnosed with IBD and found a group of novel viruses distantly related mainly to arenaviruses but also to filoviruses, both of which can cause fatal hemorrhagic fevers when transmitted from animals to humans. In addition to providing evidence that strongly suggests that these viruses cause snake IBD, this discovery reveals a new and unanticipated domain of virus biology and evolution.
- Published
- 2012
- Full Text
- View/download PDF
34. Merochlorins A-D, cyclic meroterpenoid antibiotics biosynthesized in divergent pathways with vanadium-dependent chloroperoxidases.
- Author
-
Kaysser L, Bernhardt P, Nam SJ, Loesgen S, Ruby JG, Skewes-Cox P, Jensen PR, Fenical W, and Moore BS
- Subjects
- Anti-Bacterial Agents chemistry, Chloride Peroxidase genetics, Genes, Bacterial, Models, Molecular, Multigene Family, Sesterterpenes chemistry, Sesterterpenes genetics, Sesterterpenes metabolism, Streptomyces coelicolor chemistry, Streptomyces coelicolor genetics, Streptomyces coelicolor metabolism, Anti-Bacterial Agents metabolism, Chloride Peroxidase metabolism, Streptomyces coelicolor enzymology, Vanadium metabolism
- Abstract
Meroterpenoids are mixed polyketide-terpenoid natural products with a broad range of biological activities. Herein, we present the structures of four new meroterpenoid antibiotics, merochlorins A-D, produced by the marine bacterium Streptomyces sp. strain CNH-189, which possess novel chemical skeletons unrelated to known bacterial agents. Draft genome sequencing, mutagenesis, and heterologous biosynthesis in the genome-minimized model actinomycete Streptomyces coelicolor provided the 57.6 kb merochlorin gene cluster that contains two genes encoding rare bacterial vanadium-dependent haloperoxidase (VHPO) genes. Pathway expression of two different fosmid clones that differ largely by the presence or absence of the VHPO gene mcl40 resulted in the differential biosynthesis of merochlorin C, suggesting that Mcl40 catalyzes an unprecedented 15-membered chloronium-induced macrocyclization reaction converting merochlorin D to merochlorin C.
- Published
- 2012
- Full Text
- View/download PDF
35. Assemblathon 1: a competitive assessment of de novo short read assembly methods.
- Author
-
Earl D, Bradnam K, St John J, Darling A, Lin D, Fass J, Yu HO, Buffalo V, Zerbino DR, Diekhans M, Nguyen N, Ariyaratne PN, Sung WK, Ning Z, Haimel M, Simpson JT, Fonseca NA, Birol İ, Docking TR, Ho IY, Rokhsar DS, Chikhi R, Lavenier D, Chapuis G, Naquin D, Maillet N, Schatz MC, Kelley DR, Phillippy AM, Koren S, Yang SP, Wu W, Chou WC, Srivastava A, Shaw TI, Ruby JG, Skewes-Cox P, Betegon M, Dimon MT, Solovyev V, Seledtsov I, Kosarev P, Vorobyev D, Ramirez-Gonzalez R, Leggett R, MacLean D, Xia F, Luo R, Li Z, Xie Y, Liu B, Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Yin S, Sharpe T, Hall G, Kersey PJ, Durbin R, Jackman SD, Chapman JA, Huang X, DeRisi JL, Caccamo M, Li Y, Jaffe DB, Green RE, Haussler D, Korf I, and Paten B
- Subjects
- Genome physiology, Genomics methods, Sequence Analysis, DNA methods
- Abstract
Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/.
- Published
- 2011
- Full Text
- View/download PDF
36. Transcriptome sequencing demonstrates that human papillomavirus is not active in cutaneous squamous cell carcinoma.
- Author
-
Arron ST, Ruby JG, Dybbro E, Ganem D, and Derisi JL
- Subjects
- Adult, Aged, Aged, 80 and over, Betapapillomavirus genetics, Cell Transformation, Viral genetics, DNA, Viral genetics, Female, Gene Expression Regulation, Neoplastic, Gene Expression Regulation, Viral, HeLa Cells, Human papillomavirus 16 genetics, Human papillomavirus 16 isolation & purification, Human papillomavirus 18 genetics, Human papillomavirus 18 isolation & purification, Humans, Male, Middle Aged, Papillomavirus Infections genetics, Uterine Cervical Neoplasms genetics, Uterine Cervical Neoplasms virology, Viral Load genetics, Betapapillomavirus isolation & purification, Carcinoma, Squamous Cell virology, Gene Expression Profiling, Papillomavirus Infections virology, Skin Neoplasms virology
- Abstract
β-Human papillomavirus (β-HPV) DNA is present in some cutaneous squamous cell carcinomas (cuSCCs), but no mechanism of carcinogenesis has been determined. We used ultra-high-throughput sequencing of the cancer transcriptome to assess whether papillomavirus transcripts are present in these cancers. In all, 67 cuSCC samples were assayed for β-HPV DNA by PCR, and viral loads were measured with type-specific quantitative PCR. A total of 31 SCCs were selected for whole transcriptome sequencing. Transcriptome libraries were prepared in parallel from the HPV18-positive HeLa cervical cancer cell line and HPV16-positive primary cervical and periungual SCCs. Of the tumors, 30% (20/67) were positive for β-HPV DNA, but there was no difference in β-HPV viral load between tumor and normal tissue (P=0.310). Immunosuppression and age were significantly associated with higher viral load (P=0.016 for immunosuppression; P=0.0004 for age). Transcriptome sequencing failed to identify papillomavirus expression in any of the skin tumors. In contrast, HPV16 and HPV18 mRNA transcripts were readily identified in primary cervical and periungual cancers and HeLa cells. These data demonstrate that papillomavirus mRNA expression is not a factor in the maintenance of cuSCCs.
- Published
- 2011
- Full Text
- View/download PDF
37. Formation, regulation and evolution of Caenorhabditis elegans 3'UTRs.
- Author
-
Jan CH, Friedman RC, Ruby JG, and Bartel DP
- Subjects
- AT Rich Sequence genetics, Animals, Conserved Sequence genetics, Gene Expression Profiling methods, Genes, Helminth genetics, Humans, MicroRNAs genetics, Poly A, Polyadenylation, RNA, Helminth genetics, Regulatory Sequences, Ribonucleic Acid genetics, Sequence Alignment, Sequence Analysis, RNA methods, 3' Untranslated Regions genetics, Caenorhabditis elegans genetics, Evolution, Molecular, Gene Expression Regulation genetics, High-Throughput Nucleotide Sequencing methods
- Abstract
Post-transcriptional gene regulation frequently occurs through elements in mRNA 3' untranslated regions (UTRs). Although crucial roles for 3'UTR-mediated gene regulation have been found in Caenorhabditis elegans, most C. elegans genes have lacked annotated 3'UTRs. Here we describe a high-throughput method for reliable identification of polyadenylated RNA termini, and we apply this method, called poly(A)-position profiling by sequencing (3P-Seq), to determine C. elegans 3'UTRs. Compared to standard methods also recently applied to C. elegans UTRs, 3P-Seq identified 8,580 additional UTRs while excluding thousands of shorter UTR isoforms that do not seem to be authentic. Analysis of this expanded and corrected data set suggested that the high A/U content of C. elegans 3'UTRs facilitated genome compaction, because the elements specifying cleavage and polyadenylation, which are A/U rich, can more readily emerge in A/U-rich regions. Indeed, 30% of the protein-coding genes have mRNAs with alternative, partially overlapping end regions that generate another 10,480 cleavage and polyadenylation sites that had gone largely unnoticed and represent potential evolutionary intermediates of progressive UTR shortening. Moreover, a third of the convergently transcribed genes use palindromic arrangements of bidirectional elements to specify UTRs with convergent overlap, which also contributes to genome compaction by eliminating regions between genes. Although nematode 3'UTRs have median length only one-sixth that of mammalian 3'UTRs, they have twice the density of conserved microRNA sites, in part because additional types of seed-complementary sites are preferentially conserved. These findings reveal the influence of cleavage and polyadenylation on the evolution of genome architecture and provide resources for studying post-transcriptional gene regulation.
- Published
- 2011
- Full Text
- View/download PDF
38. Temporal analysis of the honey bee microbiome reveals four novel viruses and seasonal prevalence of known viruses, Nosema, and Crithidia.
- Author
-
Runckel C, Flenniken ML, Engel JC, Ruby JG, Ganem D, Andino R, and DeRisi JL
- Subjects
- Agriculture, Animal Migration, Animals, Crithidia physiology, Molecular Sequence Data, Nosema physiology, Oligonucleotide Array Sequence Analysis, Polymerase Chain Reaction, Reproducibility of Results, Sequence Analysis, Time Factors, Bees microbiology, Bees virology, Crithidia genetics, Metagenome, Nosema genetics, Seasons
- Abstract
Honey bees (Apis mellifera) play a critical role in global food production as pollinators of numerous crops. Recently, honey bee populations in the United States, Canada, and Europe have suffered an unexplained increase in annual losses due to a phenomenon known as Colony Collapse Disorder (CCD). Epidemiological analysis of CCD is confounded by a relative dearth of bee pathogen field studies. To identify what constitutes an abnormal pathophysiological condition in a honey bee colony, it is critical to have characterized the spectrum of exogenous infectious agents in healthy hives over time. We conducted a prospective study of a large scale migratory bee keeping operation using high-frequency sampling paired with comprehensive molecular detection methods, including a custom microarray, qPCR, and ultra deep sequencing. We established seasonal incidence and abundance of known viruses, Nosema sp., Crithidia mellificae, and bacteria. Ultra deep sequence analysis further identified four novel RNA viruses, two of which were the most abundant observed components of the honey bee microbiome (∼10(11) viruses per honey bee). Our results demonstrate episodic viral incidence and distinct pathogen patterns between summer and winter time-points. Peak infection of common honey bee viruses and Nosema occurred in the summer, whereas levels of the trypanosomatid Crithidia mellificae and Lake Sinai virus 2, a novel virus, peaked in January.
- Published
- 2011
- Full Text
- View/download PDF
39. Mammalian microRNAs: experimental evaluation of novel and previously annotated genes.
- Author
-
Chiang HR, Schoenfeld LW, Ruby JG, Auyeung VC, Spies N, Baek D, Johnston WK, Russ C, Luo S, Babiarz JE, Blelloch R, Schroth GP, Nusbaum C, and Bartel DP
- Subjects
- Animals, Cell Line, Gene Expression Profiling, Humans, Inverted Repeat Sequences genetics, Mice, MicroRNAs biosynthesis, MicroRNAs metabolism, Ribonuclease III metabolism, Genes genetics, Genome genetics, MicroRNAs genetics
- Abstract
MicroRNAs (miRNAs) are small regulatory RNAs that derive from distinctive hairpin transcripts. To learn more about the miRNAs of mammals, we sequenced 60 million small RNAs from mouse brain, ovary, testes, embryonic stem cells, three embryonic stages, and whole newborns. Analysis of these sequences confirmed 398 annotated miRNA genes and identified 108 novel miRNA genes. More than 150 previously annotated miRNAs and hundreds of candidates failed to yield sequenced RNAs with miRNA-like features. Ectopically expressing these previously proposed miRNA hairpins also did not yield small RNAs, whereas ectopically expressing the confirmed and newly identified hairpins usually did yield small RNAs with the classical miRNA features, including dependence on the Drosha endonuclease for processing. These experiments, which suggest that previous estimates of conserved mammalian miRNAs were inflated, provide a substantially revised list of confidently identified murine miRNAs from which to infer the general features of mammalian miRNAs. Our analyses also revealed new aspects of miRNA biogenesis and modification, including tissue-specific strand preferences, sequential Dicer cleavage of a metazoan precursor miRNA (pre-miRNA), consequential 5' heterogeneity, newly identified instances of miRNA editing, and evidence for widespread pre-miRNA uridylation reminiscent of miRNA regulation by Lin28.
- Published
- 2010
- Full Text
- View/download PDF
40. Mouse ES cells express endogenous shRNAs, siRNAs, and other Microprocessor-independent, Dicer-dependent small RNAs.
- Author
-
Babiarz JE, Ruby JG, Wang Y, Bartel DP, and Blelloch R
- Subjects
- Animals, Base Sequence, Cells, Cultured, Computational Biology, Gene Expression Profiling, Inverted Repeat Sequences, Mice, Mice, Knockout, Molecular Sequence Data, Nucleic Acid Conformation, Oligonucleotide Array Sequence Analysis, Phenotype, RNA, Small Interfering genetics, RNA-Binding Proteins, Ribonuclease III, Sequence Homology, Nucleic Acid, Tandem Repeat Sequences, DEAD-box RNA Helicases physiology, Embryonic Stem Cells metabolism, Endoribonucleases physiology, MicroRNAs metabolism, Proteins physiology, RNA Processing, Post-Transcriptional physiology, RNA, Small Interfering metabolism
- Abstract
Canonical microRNAs (miRNAs) require two processing steps: the first by the Microprocessor, a complex of DGCR8 and Drosha, and the second by a complex of TRBP and Dicer. dgcr8Delta/Delta mouse embryonic stem cells (mESCs) have less severe phenotypes than dicer1Delta/Delta mESCs, suggesting a physiological role for Microprocessor-independent, Dicer-dependent small RNAs. To identify these small RNAs with unusual biogenesis, we performed high-throughput sequencing from wild-type, dgcr8Delta/Delta, and dicer1Delta/Delta mESCs. Several of the resulting DGCR8-independent, Dicer-dependent RNAs were noncanonical miRNAs. These derived from mirtrons and a newly identified subclass of miRNA precursors, which appears to be the endogenous counterpart of shRNAs. Our analyses also revealed endogenous siRNAs resulting from Dicer cleavage of long hairpins, the vast majority of which originated from one genomic locus with tandem, inverted short interspersed nuclear elements (SINEs). Our results extend the known diversity of mammalian small RNA-generating pathways and show that mammalian siRNAs exist in cell types other than oocytes.
- Published
- 2008
- Full Text
- View/download PDF
41. PRG-1 and 21U-RNAs interact to form the piRNA complex required for fertility in C. elegans.
- Author
-
Batista PJ, Ruby JG, Claycomb JM, Chiang R, Fahlgren N, Kasschau KD, Chaves DA, Gu W, Vasale JJ, Duan S, Conte D Jr, Luo S, Schroth GP, Carrington JC, Bartel DP, and Mello CC
- Subjects
- Animals, Argonaute Proteins, Base Sequence, Caenorhabditis elegans cytology, Caenorhabditis elegans genetics, Fertility, Gene Expression Regulation, Germ Cells cytology, Germ Cells metabolism, Models, Biological, Molecular Sequence Data, Mutation genetics, Protein Binding, RNA-Induced Silencing Complex, Regulatory Sequences, Nucleic Acid genetics, Caenorhabditis elegans metabolism, Caenorhabditis elegans Proteins metabolism, RNA, Helminth metabolism, RNA, Small Interfering metabolism
- Abstract
In metazoans, Piwi-related Argonaute proteins have been linked to germline maintenance, and to a class of germline-enriched small RNAs termed piRNAs. Here we show that an abundant class of 21 nucleotide small RNAs (21U-RNAs) are expressed in the C. elegans germline, interact with the C. elegans Piwi family member PRG-1, and depend on PRG-1 activity for their accumulation. The PRG-1 protein is expressed throughout development and localizes to nuage-like structures called P granules. Although 21U-RNA loci share a conserved upstream sequence motif, the mature 21U-RNAs are not conserved and, with few exceptions, fail to exhibit complementarity or evidence for direct regulation of other expressed sequences. Our findings demonstrate that 21U-RNAs are the piRNAs of C. elegans and link this class of small RNAs and their associated Piwi Argonaute to the maintenance of temperature-dependent fertility.
- Published
- 2008
- Full Text
- View/download PDF
42. The Drosophila hairpin RNA pathway generates endogenous short interfering RNAs.
- Author
-
Okamura K, Chung WJ, Ruby JG, Guo H, Bartel DP, and Lai EC
- Subjects
- Animals, Argonaute Proteins, Drosophila Proteins genetics, Drosophila Proteins metabolism, Drosophila melanogaster enzymology, Drosophila melanogaster metabolism, Methyltransferases metabolism, MicroRNAs biosynthesis, MicroRNAs genetics, MicroRNAs metabolism, RNA Helicases genetics, RNA Helicases metabolism, RNA, Double-Stranded chemistry, RNA, Double-Stranded genetics, RNA, Small Interfering biosynthesis, RNA, Small Interfering genetics, RNA-Binding Proteins genetics, RNA-Binding Proteins metabolism, RNA-Induced Silencing Complex genetics, RNA-Induced Silencing Complex metabolism, Ribonuclease III, Drosophila melanogaster genetics, Nucleic Acid Conformation, RNA Interference, RNA, Double-Stranded metabolism, RNA, Small Interfering metabolism
- Abstract
In contrast to microRNAs and Piwi-associated RNAs, short interfering RNAs (siRNAs) are seemingly dispensable for host-directed gene regulation in Drosophila. This notion is based on the fact that mutants lacking the core siRNA-generating enzyme Dicer-2 or the predominant siRNA effector Argonaute 2 are viable, fertile and of relatively normal morphology. Moreover, endogenous Drosophila siRNAs have not yet been identified. Here we report that siRNAs derived from long hairpin RNA genes (hpRNAs) programme Slicer complexes that can repress endogenous target transcripts. The Drosophila hpRNA pathway is a hybrid mechanism that combines canonical RNA interference factors (Dicer-2, Hen1 (known as CG12367) and Argonaute 2) with a canonical microRNA factor (Loquacious) to generate approximately 21-nucleotide siRNAs. These novel regulatory RNAs reveal unexpected complexity in the sorting of small RNAs, and open a window onto the biological usage of endogenous RNA interference in Drosophila.
- Published
- 2008
- Full Text
- View/download PDF
43. The long march: a sample preparation technique that enhances contig length and coverage by high-throughput short-read sequencing.
- Author
-
Sorber K, Chiu C, Webster D, Dimon M, Ruby JG, Hekele A, and DeRisi JL
- Subjects
- Animals, Deoxyribonucleases, Type II Site-Specific, Hepatitis B virology, Humans, Liver Failure virology, Methods, Plasmodium falciparum genetics, Restriction Mapping, Sequence Analysis, DNA standards, Contig Mapping methods, RNA, Messenger analysis, Sequence Analysis, DNA methods
- Abstract
High-throughput short-read technologies have revolutionized DNA sequencing by drastically reducing the cost per base of sequencing information. Despite producing gigabases of sequence per run, these technologies still present obstacles in resequencing and de novo assembly applications due to biased or insufficient target sequence coverage. We present here a simple sample preparation method termed the "long march" that increases both contig lengths and target sequence coverage using high-throughput short-read technologies. By incorporating a Type IIS restriction enzyme recognition motif into the sequencing primer adapter, successive rounds of restriction enzyme cleavage and adapter ligation produce a set of nested sub-libraries from the initial amplicon library. Sequence reads from these sub-libraries are offset from each other with enough overlap to aid assembly and contig extension. We demonstrate the utility of the long march in resequencing of the Plasmodium falciparum transcriptome, where the number of genomic bases covered was increased by 39%, as well as in metagenomic analysis of a serum sample from a patient with hepatitis B virus (HBV)-related acute liver failure, where the number of HBV bases covered was increased by 42%. We also offer a theoretical optimization of the long march for de novo sequence assembly.
- Published
- 2008
- Full Text
- View/download PDF
44. Evolution, biogenesis, expression, and target predictions of a substantially expanded set of Drosophila microRNAs.
- Author
-
Ruby JG, Stark A, Johnston WK, Kellis M, Bartel DP, and Lai EC
- Subjects
- Animals, Base Sequence, Computational Biology methods, Conserved Sequence, Drosophila melanogaster embryology, MicroRNAs metabolism, Molecular Sequence Data, Nucleic Acid Conformation, Drosophila melanogaster genetics, Evolution, Molecular, Gene Expression Regulation, Developmental physiology, MicroRNAs biosynthesis, MicroRNAs genetics, RNA, Messenger metabolism
- Abstract
MicroRNA (miRNA) genes give rise to small regulatory RNAs in a wide variety of organisms. We used computational methods to predict miRNAs conserved among Drosophila species and large-scale sequencing of small RNAs from Drosophila melanogaster to experimentally confirm and complement these predictions. In addition to validating 20 of our top 45 predictions for novel miRNA loci, the large-scale sequencing identified many miRNAs that had not been predicted. In total, 59 novel genes were identified, increasing our tally of confirmed fly miRNAs to 148. The large-scale sequencing also refined the identities of previously known miRNAs and provided insights into their biogenesis and expression. Many miRNAs were expressed in particular developmental contexts, with a large cohort of miRNAs expressed primarily in imaginal discs. Conserved miRNAs typically were expressed more broadly and robustly than were nonconserved miRNAs, and those conserved miRNAs with more restricted expression tended to have fewer predicted targets than those expressed more broadly. Predicted targets for the expanded set of microRNAs substantially increased and revised the miRNA-target relationships that appear conserved among the fly species. Insights were also provided into miRNA gene evolution, including evidence for emergent regulatory function deriving from the opposite arm of the miRNA hairpin, exemplified by mir-10, and even the opposite strand of the DNA, exemplified by mir-iab-4.
- Published
- 2007
- Full Text
- View/download PDF
45. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.
- Author
-
Stark A, Lin MF, Kheradpour P, Pedersen JS, Parts L, Carlson JW, Crosby MA, Rasmussen MD, Roy S, Deoras AN, Ruby JG, Brennecke J, Hodges E, Hinrichs AS, Caspi A, Paten B, Park SW, Han MV, Maeder ML, Polansky BJ, Robson BE, Aerts S, van Helden J, Hassan B, Gilbert DG, Eastman DA, Rice M, Weir M, Hahn MW, Park Y, Dewey CN, Pachter L, Kent WJ, Haussler D, Lai EC, Bartel DP, Hannon GJ, Kaufman TC, Eisen MB, Clark AG, Smith D, Celniker SE, Gelbart WM, and Kellis M
- Subjects
- Animals, Base Sequence, Binding Sites, Conserved Sequence, Drosophila Proteins genetics, Exons genetics, Gene Expression Regulation genetics, Genes, Insect genetics, MicroRNAs genetics, Molecular Sequence Data, Organ Specificity, Phylogeny, Untranslated Regions genetics, Drosophila classification, Drosophila genetics, Evolution, Molecular, Genome, Insect genetics, Genomics
- Abstract
Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or 'evolutionary signatures', dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.
- Published
- 2007
- Full Text
- View/download PDF
46. Intronic microRNA precursors that bypass Drosha processing.
- Author
-
Ruby JG, Jan CH, and Bartel DP
- Subjects
- Animals, Base Sequence, Caenorhabditis elegans enzymology, Cell Line, Drosophila melanogaster enzymology, Gene Silencing, MicroRNAs biosynthesis, Molecular Sequence Data, RNA Interference, RNA Splicing, RNA, Helminth metabolism, Caenorhabditis elegans genetics, Caenorhabditis elegans Proteins metabolism, Drosophila Proteins metabolism, Drosophila melanogaster genetics, Introns, MicroRNAs metabolism, RNA Precursors metabolism, RNA Processing, Post-Transcriptional, Ribonuclease III metabolism
- Abstract
MicroRNAs (miRNAs) are approximately 22-nucleotide endogenous RNAs that often repress the expression of complementary messenger RNAs. In animals, miRNAs derive from characteristic hairpins in primary transcripts through two sequential RNase III-mediated cleavages; Drosha cleaves near the base of the stem to liberate a approximately 60-nucleotide pre-miRNA hairpin, then Dicer cleaves near the loop to generate a miRNA:miRNA* duplex. From that duplex, the mature miRNA is incorporated into the silencing complex. Here we identify an alternative pathway for miRNA biogenesis, in which certain debranched introns mimic the structural features of pre-miRNAs to enter the miRNA-processing pathway without Drosha-mediated cleavage. We call these pre-miRNAs/introns 'mirtrons', and have identified 14 mirtrons in Drosophila melanogaster and another four in Caenorhabditis elegans (including the reclassification of mir-62). Some of these have been selectively maintained during evolution with patterns of sequence conservation suggesting important regulatory functions in the animal. The abundance of introns comparable in size to pre-miRNAs appears to have created a context favourable for the emergence of mirtrons in flies and nematodes. This suggests that other lineages with many similarly sized introns probably also have mirtrons, and that the mirtron pathway could have provided an early avenue for the emergence of miRNAs before the advent of Drosha.
- Published
- 2007
- Full Text
- View/download PDF
47. Large-scale sequencing reveals 21U-RNAs and additional microRNAs and endogenous siRNAs in C. elegans.
- Author
-
Ruby JG, Jan C, Player C, Axtell MJ, Lee W, Nusbaum C, Ge H, and Bartel DP
- Subjects
- Animals, Base Sequence, Caenorhabditis elegans Proteins metabolism, Conserved Sequence, MicroRNAs chemistry, MicroRNAs metabolism, Molecular Sequence Data, Nucleic Acid Conformation, RNA, Helminth chemistry, RNA, Helminth metabolism, RNA, Small Interfering chemistry, RNA, Small Interfering metabolism, RNA, Untranslated chemistry, RNA, Untranslated metabolism, RNA-Dependent RNA Polymerase metabolism, Sequence Analysis, RNA, Uridine metabolism, Caenorhabditis elegans genetics, MicroRNAs genetics, RNA, Helminth genetics, RNA, Small Interfering genetics, RNA, Untranslated genetics
- Abstract
We sequenced approximately 400,000 small RNAs from Caenorhabditis elegans. Another 18 microRNA (miRNA) genes were identified, thereby extending to 112 our tally of confidently identified miRNA genes in C. elegans. Also observed were thousands of endogenous siRNAs generated by RNA-directed RNA polymerases acting preferentially on transcripts associated with spermatogenesis and transposons. In addition, a third class of nematode small RNAs, called 21U-RNAs, was discovered. 21U-RNAs are precisely 21 nucleotides long, begin with a uridine 5'-monophosphate but are diverse in their remaining 20 nucleotides, and appear modified at their 3'-terminal ribose. 21U-RNAs originate from more than 5700 genomic loci dispersed in two broad regions of chromosome IV-primarily between protein-coding genes or within their introns. These loci share a large upstream motif that enables accurate prediction of additional 21U-RNAs. The motif is conserved in other nematodes, presumably because of its importance for producing these diverse, autonomously expressed, small RNAs (dasRNAs).
- Published
- 2006
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.