171 results on '"Susan E. Celniker"'
Search Results
52. Complete Genome Sequence of Enterococcus durans Oregon-R-modENCODE Strain BDGP3, a Lactic Acid Bacterium Found in the Drosophila melanogaster Gut
- Author
-
Benjamin W. Booth, Ann S. Hammonds, Soo Park, Susan E. Celniker, Kenneth H. Wan, and Charles Yu
- Subjects
0301 basic medicine ,Genetics ,Whole genome sequencing ,biology ,Strain (chemistry) ,Human Genome ,biology.organism_classification ,Microbiology ,Genome ,Enterococcus durans ,Interaction studies ,03 medical and health sciences ,030104 developmental biology ,Plasmid ,Lactic acid bacterium ,Biochemistry and Cell Biology ,Drosophila melanogaster ,Molecular Biology ,Biotechnology - Abstract
Enterococcus durans Oregon-R-modENCODE strain BDGP3 was isolated from the Drosophila melanogaster gut for functional host-microbe interaction studies. The complete genome is composed of a single circular genome of 2,983,334 bp, with a G+C content of 38%, and a single plasmid of 5,594 bp.
- Published
- 2017
53. Complete Genome Sequence of Bacillus kochii Oregon-R-modENCODE Strain BDGP4, Isolated from Drosophila melanogaster Gut
- Author
-
Soo Park, Ann S. Hammonds, Kenneth H. Wan, Susan E. Celniker, Benjamin W. Booth, and Charles Yu
- Subjects
0301 basic medicine ,Genetics ,Whole genome sequencing ,biology ,Strain (chemistry) ,Host (biology) ,Human Genome ,biology.organism_classification ,C content ,Genome ,Microbiology ,03 medical and health sciences ,030104 developmental biology ,Plasmid ,Bacillus kochii ,Biochemistry and Cell Biology ,Drosophila melanogaster ,Molecular Biology ,Biotechnology - Abstract
Bacillus kochii Oregon-R-modENCODE strain BDGP4 was isolated from the gut of Drosophila melanogaster for functional host microbial interaction studies. The complete genome comprised a single chromosomal circle of 4,557,232 bp with a G+C content of 37% and a single plasmid of 137,143 bp.
- Published
- 2017
54. Complete Genome Sequence of Lactobacillus plantarum Oregon-R-modENCODE Strain BDGP2 Isolated from Drosophila melanogaster Gut
- Author
-
Charles Yu, Soo Park, Susan E. Celniker, Ann S. Hammonds, Kenneth H. Wan, and Benjamin W. Booth
- Subjects
0301 basic medicine ,Whole genome sequencing ,Genetics ,biology ,Strain (chemistry) ,Host (biology) ,Microbial interaction ,Human Genome ,030106 microbiology ,food and beverages ,biology.organism_classification ,Microbiology ,Genome ,03 medical and health sciences ,030104 developmental biology ,Plasmid ,Biochemistry and Cell Biology ,Prokaryotes ,Drosophila melanogaster ,Molecular Biology ,Lactobacillus plantarum - Abstract
Lactobacillus plantarum Oregon-R-modENCODE strain BDGP2 was isolated from the gut of Drosophila melanogaster for functional host microbial interaction studies. The complete genome comprised a single circular genome of 3,407,160 bp, with a G+C content of 44%, and four plasmids.
- Published
- 2017
55. The ModERN Resource: Genome-Wide Binding Profiles for Hundreds of
- Author
-
Michelle M, Kudron, Alec, Victorsen, Louis, Gevirtzman, LaDeana W, Hillier, William W, Fisher, Dionne, Vafeados, Matt, Kirkey, Ann S, Hammonds, Jeffery, Gersch, Haneen, Ammouri, Martha L, Wall, Jennifer, Moran, David, Steffen, Matt, Szynkarek, Samantha, Seabrook-Sturgis, Nader, Jameel, Madhura, Kadaba, Jaeda, Patton, Robert, Terrell, Mitch, Corson, Timothy J, Durham, Soo, Park, Swapna, Samanta, Mei, Han, Jinrui, Xu, Koon-Kiu, Yan, Susan E, Celniker, Kevin P, White, Lijia, Ma, Mark, Gerstein, Valerie, Reinke, and Robert H, Waterston
- Subjects
Chromatin Immunoprecipitation ,Binding Sites ,fungi ,Models, Biological ,Communications ,Databases, Genetic ,Animals ,natural sciences ,Drosophila ,Nucleotide Motifs ,Caenorhabditis elegans ,Genome-Wide Association Study ,Protein Binding ,Transcription Factors - Abstract
To develop a catalog of regulatory sites in two major model organisms, Drosophila melanogaster and Caenorhabditis elegans, the modERN (model organism Encyclopedia of Regulatory Networks) consortium has systematically assayed the binding sites of transcription factors (TFs). Combined with data produced by our predecessor, modENCODE (Model Organism ENCyclopedia Of DNA Elements), we now have data for 262 TFs identifying 1.23 M sites in the fly genome and 217 TFs identifying 0.67 M sites in the worm genome. Because sites from different TFs are often overlapping and tightly clustered, they fall into 91,011 and 59,150 regions in the fly and worm, respectively, and these binding sites span as little as 8.7 and 5.8 Mb in the two organisms. Clusters with large numbers of sites (so-called high occupancy target, or HOT regions) predominantly associate with broadly expressed genes, whereas clusters containing sites from just a few factors are associated with genes expressed in tissue-specific patterns. All of the strains expressing GFP-tagged TFs are available at the stock centers, and the chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center and also through a simple interface (http://epic.gs.washington.edu/modERN/) that facilitates rapid accessibility of processed data sets. These data will facilitate a vast number of scientific inquiries into the function of individual TFs in key developmental, metabolic, and defense and homeostatic regulatory pathways, as well as provide a broader perspective on how individual TFs work together in local networks and globally across the life spans of these two key model organisms.
- Published
- 2017
56. Transcription Factor Networks in Drosophila melanogaster
- Author
-
Dong-Yeon Cho, Julian Mintseris, Steven P. Gygi, Matthew Slattery, Robert A. Obar, Christina Wong, Bo Zhai, Lijia Ma, David Y. Rhee, Kevin P. White, Spyros Artavanis-Tsakonas, Teresa M. Przytycka, and Susan E. Celniker
- Subjects
Genetics ,biology ,Systems biology ,fungi ,Gene regulatory network ,Notch signaling pathway ,Computational biology ,biology.organism_classification ,Interactome ,General Biochemistry, Genetics and Molecular Biology ,Article ,Protein–protein interaction ,Drosophila melanogaster ,lcsh:Biology (General) ,Animals ,Drosophila Proteins ,Wings, Animal ,Gene Regulatory Networks ,Protein Interaction Maps ,lcsh:QH301-705.5 ,Transcription factor ,Drosophila Protein ,Transcription Factors - Abstract
Specific cellular fates and functions depend on differential gene expression, which occurs primarily at the transcriptional level, controlled by complex regulatory networks of transcription factors. Transcription factors act through combinatorial interactions with other transcription factors, co-factors and chromatin-remodelling proteins. We present a study of 459 Drosophila melanogaster transcription related factors, defining protein-protein interactions using a co-affinity purification mass spectrometry methodology, representing approximately half of the established catalogue of transcription factors. We probe this network in vivo, demonstrating functional interactions for many interacting proteins testing the predictive value for our data set. Building on these analyses, we combine regulatory network inference models with physical interactions to define an integrated network, connecting combinatorial transcription factor protein interactions to the transcriptional regulatory network of the cell. We use this integrated network as a tool to connect the functional network of genetic modifiers related to mastermind, a transcriptional co-factor of the Notch pathway.
- Published
- 2014
- Full Text
- View/download PDF
57. Navigating and mining modENCODE data
- Author
-
Susan E. Celniker, Peter J. Bickel, Nathan Boley, and Kenneth H. Wan
- Subjects
DNA Replication ,Genetics ,biology ,Genome, Insect ,fungi ,DNA replication ,RNA ,Computational biology ,Genome ,Article ,Chromatin ,General Biochemistry, Genetics and Molecular Biology ,DNA binding site ,Drosophila melanogaster ,Histone ,Transcription (biology) ,Databases, Genetic ,Gene expression ,biology.protein ,Animals ,Data Mining ,Molecular Biology ,Developmental Biology - Abstract
modENCODE was a 5year NHGRI funded project (2007-2012) to map the function of every base in the genomes of worms and flies characterizing positions of modified histones and other chromatin marks, origins of DNA replication, RNA transcripts and the transcription factor binding sites that control gene expression. Here we describe the Drosophila modENCODE datasets and how best to access and use them for genome wide and individual gene studies.
- Published
- 2014
58. Genome-guided transcript assembly by integrative analysis of RNA sequence data
- Author
-
Nathan Boley, Susan E. Celniker, Kenneth H. Wan, Marcus H. Stoiber, Peter J. Bickel, James B. Brown, Roger A. Hoskins, and Benjamin W. Booth
- Subjects
Genetics ,Sequence Analysis, RNA ,Genome, Insect ,Biomedical Engineering ,Chromosome Mapping ,Molecular Sequence Annotation ,Bioengineering ,Genomics ,Computational biology ,Biology ,Applied Microbiology and Biotechnology ,Genome ,Article ,Drosophila melanogaster ,ComputingMethodologies_PATTERNRECOGNITION ,RNA Sequence ,Animals ,RNA ,Molecular Medicine ,Biotechnology - Abstract
The identification of full length transcripts entirely from short-read RNA sequencing data (RNA-seq) remains a challenge in genome annotation pipelines. Here we describe an automated pipeline for genome annotation that integrates RNA-seq and gene-boundary data sets, which we call generalized RNA integration tool, or GRIT. By applying GRIT to Drosophila melanogaster short-read RNA-seq, cap analysis of gene expression (CAGE) and poly(A)-site-seq data collected for the modENCODE project, we recover the vast majority of previously annotated transcripts and double the total number of transcripts cataloged. We find that 20% of protein coding genes encode multiple protein-localization signals, and that, in 20 day old adult fly heads, genes with multiple poly-adenylation sites are more common than genes with alternate splicing or alternate promoters. When compared to the most widely used transcript assembly tools, GRIT recovers a larger fraction of annotated transcripts at higher precision. GRIT will enable the automated generation of high-quality genome annotations without necessitating extensive manual annotation.
- Published
- 2014
59. Comparative Analysis of the Transcriptome across Distant Species
- Author
-
Gang Fang, LaDeana W. Hillier, Brenton R. Graveley, Ali Mortazavi, Norbert Perrimon, Nathan Boley, Jingyi Jessica Li, William C. Spencer, James B. Brown, Chau Huynh, Roger A. Hoskins, Mark Gerstein, Ann S. Hammonds, Sarah Djebali, Sonali Jha, Kenneth H. Wan, Cédric Howald, Raymond K. Auerbach, Chenghai Xue, Haiyan Huang, Jorg Drenkow, Elise A. Feingold, Julien Lagarde, Daifeng Wang, Dmitri D. Pervouchine, Thomas R. Gingeras, Guilin Wang, Peter Cherbas, Brent Ewing, Chao Di, Gary Saunders, Benjamin W. Booth, Joel Rozowsky, Yan Zhang, Anastasia Samsonova, Dionna M. Kasper, Cristina Sisu, Marcus H. Stoiber, Jiayu Wen, Michael O. Duff, Felix Schlesinger, Gennifer E. Merrihew, Sara Olson, Susan E. Celniker, Burak H. Alver, Chao Cheng, Gemma E. May, Alexandre Reymond, Carrie A. Davis, Alexander Dobin, Max E. Boeck, Roger P. Alexander, Michael J. Pazin, Peter J. Park, Adam Frankish, Lucy Cherbas, Zhi Lu, Kevin Y. Yip, Henry Zheng, Owen Thompson, Jing Leng, Kathie L. Watkins, Andrea Tanzer, Valerie Reinke, Rebecca McWhirter, Eric C. Lai, Steven E. Brenner, Robert H. Waterston, Koon-Kiu Yan, Masaomi Kato, Roderic Guigó, Huaien Wang, Kimberly Bell, Pnina Strasbourger, Baikang Pei, Jen Harrow, Long Hu, Chris Zaleski, Rabi Murad, Thomas C. Kaufman, Erik Ladewig, Robert R. Kitchen, Anurag Sethi, Kejia Wen, Guanjun Gao, Arif Harmanci, Megan Fastuca, Brian Oliver, Frank J. Slack, David M. Miller, Tim Hubbard, Garrett Robinson, Peter J. Good, Peter J. Bickel, Michael J. MacCoss, and Li Yang
- Subjects
RNA, Untranslated ,Caenorhabditis elegans -- Embriologia ,Genome ,Transcriptome ,Histones ,0302 clinical medicine ,Models ,Cluster Analysis ,Developmental ,Drosòfila -- Genètica ,Promoter Regions, Genetic ,Genetics ,0303 health sciences ,Multidisciplinary ,biology ,Pupa ,Untranslated ,Gene Expression Regulation, Developmental ,Chromatin ,Molecular Sequence Annotation ,Drosophila melanogaster ,Larva ,Sequence Analysis ,Biotechnology ,animal structures ,General Science & Technology ,Computational biology ,ENCODE ,Article ,Promoter Regions ,03 medical and health sciences ,Genetic ,Regulació genètica ,Animals ,Humans ,Transcriptomics ,Caenorhabditis elegans ,030304 developmental biology ,Comparative genomics ,Models, Genetic ,Phylum ,Sequence Analysis, RNA ,Gene Expression Profiling ,fungi ,Human Genome ,biology.organism_classification ,Gene expression profiling ,Gene Expression Regulation ,RNA ,Generic health relevance ,030217 neurology & neurosurgery - Abstract
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features1, 2, 3, 4, 5, 6. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a ‘universal model’ based on a single set of organism-independent parameters. In particular, this work was funded by a contract from the National Human Genome Research Institute modENCODE Project, contract U01 HG004271 and U54 HG006944, to S.E.C. (principal investigator) and P.C., T.R.G., R.A.H. and B.R.G. (co-principal investigators) with additional support from R01 GM076655 (S.E.C.) both under Department of Energy contract no. DE-AC02-05CH11231, and U54 HG007005 to B.R.G. J.B.B.’s work was supported by NHGRI K99 HG006698 and DOE DE-AC02-05CH11231. Work in P.J.B.’s group was supported by the modENCODE DAC sub award 5710003102, 1U01HG007031-01 and the ENCODE DAC 5U01HG004695-04. Work in M.B.G.’s group was supported by NIH grants HG007000 and HG007355. Work in Bloomington was supported in part by the Indiana METACyt Initiative of Indiana University, funded by an award from the Lilly Endowment, Inc. Work in E.C.L.’s group was supported by U01-HG004261 and RC2-HG005639. P.J.P. acknowledges support from the National Institutes of Health (grant no. U01HG004258). We thank the HAVANA team for providing annotation of the human reference genome, whose work is supported by National Institutes of Health (grant no. 5U54HG004555), the Wellcome Trust (grant no. WT098051). R.G. acknowledges support from the Spanish Ministry of Education (grant BIO2011-26205). We also acknowledge use of the Yale University Biomedical High Performance Computing Center. R.W.'s lab was supported by grant no. U01 HG 004263.
- Published
- 2014
60. De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing
- Author
-
Eun Lee J, Nicholas J. Loman, Susan E. Celniker, Robert K. Neely, Rob Egan, Josh Quick, Len A. Pennacchio, James B. Brown, and Marcus H. Stoiber
- Subjects
Genetics ,0303 health sciences ,Signal processing ,business.industry ,0206 medical engineering ,Sequence (biology) ,02 engineering and technology ,Computational biology ,Biology ,Genome ,Pipeline (software) ,03 medical and health sciences ,Nanopore ,chemistry.chemical_compound ,Data visualization ,chemistry ,Nanopore sequencing ,business ,020602 bioinformatics ,DNA ,030304 developmental biology - Abstract
Advances in nanopore sequencing technology have enabled investigation of the full catalogue of covalent DNA modifications. We present the first algorithm for the identification of modified nucleotides without the need for prior training data along with the open source software implementation,nanoraw.Nanorawaccurately assigns contiguous raw nanopore signal to genomic positions, enabling novel data visualization, and increasing power and accuracy for the discovery of covalently modified bases in native DNA. Ground truth case studies utilizing synthetically methylated DNA show the capacity to identify three distinct methylation marks, 4mC, 5mC, and 6mA, in seven distinct sequence contexts without any changes to the algorithm. We demonstrate quantitative reproducibility simultaneously identifying 5mC and 6mA in native E. coli across biological replicates processed in different labs. Finally we propose a pipeline for the comprehensive discovery of DNA modifications in any genome withouta prioriknowledge of their chemical identities.
- Published
- 2016
- Full Text
- View/download PDF
61. Improved mosquito genome points to population-control strategies
- Author
-
Susan E. Celniker
- Subjects
0301 basic medicine ,Whole genome sequencing ,animal structures ,Multidisciplinary ,education ,fungi ,virus diseases ,Sequence assembly ,Genomics ,Computational biology ,Aedes aegypti ,Biology ,biology.organism_classification ,Genome ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,parasitic diseases ,030216 legal & forensic medicine ,Gene ,Sequence (medicine) - Abstract
A high-quality genome sequence for the mosquito Aedes aegypti has now been assembled. The sequence will enable researchers to identify genes that could be targeted to keep mosquito populations at bay. A high-quality genome assembly for Aedes aegypti.
- Published
- 2018
62. Su2040 – Diverse Tumor Susceptibility in Collaborative Cross Mice: Identification of a New Model for Human Gastric Tumorigenesis
- Author
-
Xiaoping Zou, Jian-Hua Mao, Bo Hang, Antoine M. Snijders, Pin Wang, and Susan E. Celniker
- Subjects
0303 health sciences ,03 medical and health sciences ,0302 clinical medicine ,Hepatology ,Gastroenterology ,Cancer research ,medicine ,030211 gastroenterology & hepatology ,Identification (biology) ,Biology ,Carcinogenesis ,medicine.disease_cause ,030304 developmental biology - Published
- 2019
63. Global analysis of Drosophila Cys2-His2 zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants
- Author
-
Adam Richards, Susan E. Celniker, Jianhong Ou, Hannah Pham, Metewo Selase Enuameh, Charles Blatti, Ryan G. Christensen, Scot A. Wolfe, Majid Kazemian, Qiong Cheng, Gary D. Stormo, Jessie A. Brasefield, Victoria L. Hall, Matthew D. Basciotta, Lihua Julie Zhu, Joseph C. McNulty, Saurabh Sinha, Michael H. Brodsky, Yuna Asriyan, and Cong Zhu
- Subjects
Models, Molecular ,Protein Conformation ,Computational biology ,Biology ,Genome ,Protein structure ,Phylogenetics ,Genetics ,Animals ,Cluster Analysis ,Drosophila Proteins ,Position-Specific Scoring Matrices ,Nucleotide Motifs ,Gene ,Transcription factor ,Phylogeny ,Genetics (clinical) ,Zinc finger ,Binding Sites ,Base Sequence ,Research ,Alternative splicing ,Computational Biology ,Zinc Fingers ,Zinc finger nuclease ,Alternative Splicing ,Drosophila ,Protein Binding - Abstract
Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have characterized the DNA-binding specificities of 129 zinc finger sets from Drosophila using a bacterial one-hybrid system. This data set contains the DNA-binding specificities for at least one encoded ZFP from 70 unique genes and 23 alternate splice isoforms representing the largest set of characterized ZFPs from any organism described to date. These recognition motifs can be used to predict genomic binding sites for these factors within the fruit fly genome. Subsets of fingers from these ZFPs were characterized to define their orientation and register on their recognition sequences, thereby allowing us to define the recognition diversity within this finger set. We find that the characterized fingers can specify 47 of the 64 possible DNA triplets. To confirm the utility of our finger recognition models, we employed subsets of Drosophila fingers in combination with an existing archive of artificial zinc finger modules to create ZFPs with novel DNA-binding specificity. These hybrids of natural and artificial fingers can be used to create functional zinc finger nucleases for editing vertebrate genomes.
- Published
- 2013
64. Influence of early life exposure, host genetics and diet on the mouse gut microbiome and metabolome
- Author
-
Sarah J. Fansler, Gary H. Karpen, Young-Mo Kim, Thomas O. Metz, Darla R. Miller, Jian-Hua Mao, Sasha A. Langley, Susan E. Celniker, Cecilia Noecker, Janet K. Jansson, Antoine M. Snijders, Elhanan Borenstein, Colin J. Brislawn, Yurong Huang, Cameron P. Casey, Erika M. Zink, and James B. Brown
- Subjects
0301 basic medicine ,Microbiology (medical) ,Quantitative Trait Loci ,Immunology ,Quantitative trait locus ,Applied Microbiology and Biotechnology ,Microbiology ,Mice ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Metabolome ,Animals ,Microbiome ,Life History Traits ,Gene ,biology ,Host (biology) ,Lactobacillales ,Gastrointestinal Microbiome ,Cell Biology ,biology.organism_classification ,Acquired immune system ,Diet ,Gastrointestinal Tract ,030104 developmental biology ,030217 neurology & neurosurgery - Abstract
Although the gut microbiome plays important roles in host physiology, health and disease1, we lack understanding of the complex interplay between host genetics and early life environment on the microbial and metabolic composition of the gut. We used the genetically diverse Collaborative Cross mouse system2 to discover that early life history impacts the microbiome composition, whereas dietary changes have only a moderate effect. By contrast, the gut metabolome was shaped mostly by diet, with specific non-dietary metabolites explained by microbial metabolism. Quantitative trait analysis identified mouse genetic trait loci (QTL) that impact the abundances of specific microbes. Human orthologues of genes in the mouse QTL are implicated in gastrointestinal cancer. Additionally, genes located in mouse QTL for Lactobacillales abundance are implicated in arthritis, rheumatic disease and diabetes. Furthermore, Lactobacillales abundance was predictive of higher host T-helper cell counts, suggesting an important link between Lactobacillales and host adaptive immunity.
- Published
- 2016
65. Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks
- Author
-
Siqi Wu, Ann S. Hammonds, Susan E. Celniker, Antony Joseph, Erwin Frise, and Bin Yu
- Subjects
0301 basic medicine ,Computer science ,Stability (learning theory) ,Gene regulatory network ,sparse decomposition ,Computational biology ,computer.software_genre ,Non-negative matrix factorization ,Set (abstract data type) ,03 medical and health sciences ,Databases ,Genetic ,Models ,Databases, Genetic ,Genetics ,Animals ,Gene Regulatory Networks ,Representation (mathematics) ,Gap gene ,principal patterns ,Multidisciplinary ,Models, Genetic ,Model selection ,spatially local networks ,stability selection ,030104 developmental biology ,Drosophila melanogaster ,Gene Expression Regulation ,Scalability ,Physical Sciences ,Data mining ,spatial gene expression ,Generic health relevance ,computer - Abstract
Spatial gene expression patterns enable the detection of local covariability and are extremely useful for identifying local gene interactions during normal development. The abundance of spatial expression data in recent years has led to the modeling and analysis of regulatory networks. The inherent complexity of such data makes it a challenge to extract biological information. We developed staNMF, a method that combines a scalable implementation of nonnegative matrix factorization (NMF) with a new stability-driven model selection criterion. When applied to a set of Drosophila early embryonic spatial gene expression images, one of the largest datasets of its kind, staNMF identified 21 principal patterns (PP). Providing a compact yet biologically interpretable representation of Drosophila expression patterns, PP are comparable to a fate map generated experimentally by laser ablation and show exceptional promise as a data-driven alternative to manual annotations. Our analysis mapped genes to cell-fate programs and assigned putative biological roles to uncharacterized genes. Finally, we used the PP to generate local transcription factor regulatory networks. Spatially local correlation networks were constructed for six PP that span along the embryonic anterior-posterior axis. Using a two-tail 5% cutoff on correlation, we reproduced 10 of the 11 links in the well-studied gap gene network. The performance of PP with the Drosophila data suggests that staNMF provides informative decompositions and constitutes a useful computational lens through which to extract biological insight from complex and often noisy gene expression data.
- Published
- 2016
66. Diverse Hormone Response Networks in 41 Independent Drosophila Cell Lines
- Author
-
Ben Brown, Lucy Cherbas, Susan E. Celniker, Peter Cherbas, and Marcus H. Stoiber
- Subjects
0301 basic medicine ,Receptors, Steroid ,Ecdysone ,Transcription, Genetic ,network biology ,1.1 Normal biological development and functioning ,Cell fate determination ,Biology ,Investigations ,QH426-470 ,Cell Line ,03 medical and health sciences ,Transduction (genetics) ,chemistry.chemical_compound ,Genetic ,Underpinning research ,Receptors ,Genetics ,Animals ,Protein Isoforms ,Cluster Analysis ,2.1 Biological and endogenous factors ,ecdysone ,Aetiology ,Molecular Biology ,Transcription factor ,Steroid ,Genetics (clinical) ,Regulation of gene expression ,Gene Expression Profiling ,bioinformatics ,biology.organism_classification ,Hormones ,030104 developmental biology ,Nuclear receptor ,chemistry ,Gene Expression Regulation ,Drosophila ,Steroids ,Drosophila melanogaster ,RNA-seq ,Ecdysone receptor ,Transcriptome ,transcription ,Transcription Factors ,Signal Transduction - Abstract
Steroid hormones induce cascades of gene activation and repression with transformative effects on cell fate . Steroid transduction plays a major role in the development and physiology of nearly all metazoan species, and in the progression of the most common forms of cancer. Despite the paramount importance of steroids in developmental and translational biology, a complete map of transcriptional response has not been developed for any hormone . In the case of 20-hydroxyecdysone (ecdysone) in Drosophila melanogaster, these trajectories range from apoptosis to immortalization. We mapped the ecdysone transduction network in a cohort of 41 cell lines, the largest such atlas yet assembled. We found that the early transcriptional response mirrors the distinctiveness of physiological origins: genes respond in restricted patterns, conditional on the expression levels of dozens of transcription factors. Only a small cohort of genes is constitutively modulated independent of initial cell state. Ecdysone-responsive genes tend to organize into directional same-stranded units, with consecutive genes induced from the same strand. Here, we identify half of the ecdysone receptor heterodimer as the primary rate-limiting step in the response, and find that initial receptor isoform levels modulate the activated cohort of target transcription factors. This atlas of steroid response reveals organizing principles of gene regulation by a model type II nuclear receptor and lays the foundation for comprehensive and predictive understanding of the ecdysone transduction network in the fruit fly.
- Published
- 2016
67. Frontiers in RNA biology: Advances from a small fly Drosophila melanogaster
- Author
-
James B. Brown and Susan E. Celniker
- Subjects
Rna processing ,biology ,RNA ,Computational biology ,biology.organism_classification ,General Biochemistry, Genetics and Molecular Biology ,Molecular machine ,RNA Isoforms ,RNA splicing ,Genetics ,Biochemistry and Cell Biology ,Drosophila melanogaster ,Function (biology) ,Ribonucleoprotein - Abstract
Author(s): Brown, JB; Celniker, SE | Abstract: In this article, we discuss emerging frontiers in RNA biology from a historical perspective. The field is currently undergoing yet another transformative expansion. RNA-seq has revealed that splicing, and, more generally, RNA processing is far more complex than expected, and the mechanisms of regulation are correspondingly sophisticated. Our understanding of the molecular machines involved in RNA metabolism is incomplete and derives from small sample sizes. Even if we manage to complete a catalogue of molecular species, RNA isoforms and the ribonucleoprotein complexes that drive their genesis, the horizons of molecular dynamics and cell-type-specific processing mechanisms await. This is an exciting time to enter into the study of RNA biology; analytical tools, wet and dry, are advancing rapidly, and each new measurement modality brings into view another new function or activity of versatile RNA. Since the dawn of sequence-based RNA biology, we have come a long way.
- Published
- 2016
68. Automated protein-DNA interaction screening of Drosophila regulatory elements
- Author
-
Bart Deplancke, Korneel Hens, Julien Bryois, Susan E. Celniker, Antonina Iagovitina, Patrick Callaerts, Andreas Massouras, Alina Isakova, and Jean-Daniel Feuz
- Subjects
Microfluidics ,Sine-Oculis ,Network ,Twin-Of-Eyeless ,Biology ,Gene ,Biochemistry ,Trans-regulatory element ,Article ,Database ,Automation ,Open Reading Frames ,03 medical and health sciences ,0302 clinical medicine ,Two-Hybrid System Techniques ,Melanogaster ,Sequence ,Animals ,Drosophila Proteins ,Protein–DNA interaction ,Regulatory Elements, Transcriptional ,Enhancer ,Molecular Biology ,Transcription factor ,030304 developmental biology ,Visual-System ,Regulation of gene expression ,Genetics ,0303 health sciences ,Binding Sites ,REDfly ,Reproducibility of Results ,DNA ,Cell Biology ,High-Throughput Screening Assays ,DNA-Binding Proteins ,Drosophila melanogaster ,030217 neurology & neurosurgery ,Drosophila Protein ,Transcription Factors ,Biotechnology - Abstract
Drosophila melanogaster has one of the best characterized metazoan genomes in terms of functionally annotated regulatory elements. To explore how these elements contribute to gene regulation in the context of gene regulatory networks, we need convenient tools to identify the proteins that bind to them. Here, we present the development and validation of a highly automated protein-DNA interaction detection method, enabling the high-throughput yeast one-hybrid-based screening of DNA elements versus an array of full-length, sequence-verified clones containing 647 (over 85%) of predicted Drosophila transcription factors (TFs). Using six well-characterized regulatory elements (82 bp – 1kb), we identified 33 TF-DNA interactions of which 27 are novel. To simultaneously validate these interactions and locate their binding sites of involved TFs, we implemented a novel microfluidics-based approach that enables us to conduct hundreds of gel shift-like assays at once, thus allowing the retrieval of DNA occupancy data for each TF throughout the respective target DNA elements. Finally, we biologically validate several interactions and specifically identify two novel regulators of sine oculis gene expression and hence eye development.
- Published
- 2011
69. A Protein Complex Network of Drosophila melanogaster
- Author
-
Spyros Artavanis-Tsakonas, David Y. Rhee, Pujita Vaidya, Mark Stapleton, Chapman N. Beekman, Christina Wong, Steven P. Gygi, Emily McKillip, K. G. Guruharsha, Robert A. Obar, Julian Mintseris, Bayan Parsa, Susan E. Celniker, Kenneth H. Wan, Namita Vaidya, Xiao Chen, Joseph W. Carlson, Jean François Rual, Saumini Shah, K. VijayRaghavan, Charles Yu, Bhaveen Kapadia, Odise Cenaj, and Bo Zhai
- Subjects
Proteomics ,Proteasome Endopeptidase Complex ,Computational biology ,Article ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,0302 clinical medicine ,Protein Interaction Mapping ,Animals ,Drosophila Proteins ,Proteasome endopeptidase complex ,Gene ,030304 developmental biology ,Genetics ,0303 health sciences ,biology ,Biochemistry, Genetics and Molecular Biology(all) ,A protein ,Complex network ,biology.organism_classification ,Drosophila melanogaster ,Proteome ,SNARE Proteins ,030217 neurology & neurosurgery ,Drosophila Protein - Abstract
SummaryDetermining the composition of protein complexes is an essential step toward understanding the cell as an integrated system. Using coaffinity purification coupled to mass spectrometry analysis, we examined protein associations involving nearly 5,000 individual, FLAG-HA epitope-tagged Drosophila proteins. Stringent analysis of these data, based on a statistical framework designed to define individual protein-protein interactions, led to the generation of a Drosophila protein interaction map (DPiM) encompassing 556 protein complexes. The high quality of the DPiM and its usefulness as a paradigm for metazoan proteomes are apparent from the recovery of many known complexes, significant enrichment for shared functional attributes, and validation in human cells. The DPiM defines potential novel members for several important protein complexes and assigns functional links to 586 protein-coding genes lacking previous experimental annotation. The DPiM represents, to our knowledge, the largest metazoan protein complex map and provides a valuable resource for analysis of protein complex evolution.
- Published
- 2011
70. Characterization of MtnE, the fifth metallothionein member in Drosophila
- Author
-
Susan E. Celniker, Oleg Georgiev, Lilit Atanesyan, Viola Günther, Walter Schaffner, University of Zurich, and Schaffner, W
- Subjects
Male ,1303 Biochemistry ,Transcription, Genetic ,Molecular Sequence Data ,chemistry.chemical_element ,Response Elements ,Metal-regulatory transcription factor 1 ,Biochemistry ,Metal regulatory transcription factor 1 ,Inorganic Chemistry ,Transcription (biology) ,Metals, Heavy ,Animals ,Drosophila Proteins ,Gene family ,Metallothionein ,Amino Acid Sequence ,Cadmium toxicit ,Enhancer ,Gene ,Peptide sequence ,Cadmium ,Base Sequence ,Metallothionein E ,Drosophila ,Metal responsive transcription factor ,Metal-response element ,1604 Inorganic Chemistry ,Chemistry ,Molecular biology ,10124 Institute of Molecular Life Sciences ,Cell biology ,DNA-Binding Proteins ,Drosophila melanogaster ,Gene Expression Regulation ,570 Life sciences ,biology ,Female ,Transcription Factors - Abstract
JBIC Journal of Biological Inorganic Chemistry, 16 (7), ISSN:0949-8257, ISSN:1432-1327
- Published
- 2011
71. The transcriptional diversity of 25 Drosophila cell lines
- Author
-
Chris Zaleski, Brenton R. Graveley, Wei Lin, Thomas R. Gingeras, Peter Cherbas, Srinka Ghosh, Philipp Kapranov, Alexander Dobin, Joseph W. Carlson, Lucy Cherbas, Li Yang, Carrie A. Davis, Yi Zou, Johnny Roberts, Justen Andrews, Kim Bell, Michael O. Duff, Roger A. Hoskins, Jeong Hyeon Choi, Jane M. Landolin, Laura Langton, Michael R. Brent, Jacqueline Dumais, Dayu Zhang, Anastasia Samsonova, Marijke J. van Baren, Aaron Tenney, Norbert Perrimon, Haixu Tang, Brian D. Eads, Aarron T. Willingham, Thomas C. Kaufman, and Susan E. Celniker
- Subjects
Male ,Resource ,Transcription, Genetic ,Molecular Sequence Data ,Cell Line ,Exon ,Transcription (biology) ,Gene expression ,Genetics ,Animals ,Cluster Analysis ,FlyBase : A Database of Drosophila Genes & Genomes ,Gene ,Genetics (clinical) ,biology ,Microarray analysis techniques ,Gene Expression Profiling ,Genetic Variation ,Exons ,biology.organism_classification ,Gene expression profiling ,Drosophila melanogaster ,Female ,Signal Transduction ,Transcription Factors - Abstract
Drosophila melanogaster cell lines are important resources for cell biologists. Here, we catalog the expression of exons, genes, and unannotated transcriptional signals for 25 lines. Unannotated transcription is substantial (typically 19% of euchro- matic signal). Conservatively, we identify 1405 novel transcribed regions; 684 of these appear to be new exons of neighboring, oftendistant, genes. Sixty-four percent of genes areexpressed detectably in atleastone line, but only 21%are detected in all lines. Each cell line expresses, on average, 5885 genes, including a common set of 3109. Expression levels vary over several orders of magnitude. Major signaling pathways are well represented: most differentiation pathways are ''off'' andsurvival/growth pathways''on.''Roughly 50%ofthe genesexpressed by eachlinearenot part ofthe commonset, and these show considerable individuality. Thirty-one percent are expressed at a higher level in at least one cell line than in any single developmental stage, suggesting that each line is enriched for genes characteristic of small sets of cells. Most re- markable is that imaginal disc-derived lines can generally be assigned, on the basis of expression, to small territories within developing discs. These mappings reveal unexpected stability of even fine-grained spatial determination. No two cell lines show identical transcription factor expression. We conclude that each line has retained features of an individual founder cell superimposed on a common ''cell line'' gene expression pattern. (Supplemental material is available for this article. The data from this study have been submitted to the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under accession nos. GSE15596, GSE16269-GSE16290, GSE16321- GSE16322, GSE16325, and GSE18040. All of the microarray data, RNA-seq data, and expression scores for genes and exons are available from the Data Coordination Center of modENCODE (http://modencode.org), and much of the data are also available from the Drosophila Genomic Resources Center (https://dgrc.cgb.indiana.edu/) and FlyBase (http:// flybase.org/).)
- Published
- 2010
72. Genome-wide analysis of promoter architecture in Drosophila melanogaster
- Author
-
Hazuki Takahashi, Timo Lassmann, Jane M. Landolin, Joseph W. Carlson, Kenneth H. Wan, Jeremy E. Sandler, Charles Yu, Justen Andrews, Thomas C. Kaufman, Benjamin W. Booth, Peter J. Bickel, Piero Carninci, Dayu Zhang, Li Yang, James B. Brown, Brenton R. Graveley, Nathan Boley, Roger A. Hoskins, and Susan E. Celniker
- Subjects
Expressed Sequence Tags ,Genetics ,Regulation of gene expression ,Expressed sequence tag ,Gene Expression Profiling ,Research ,Genome, Insect ,Mammalian promoter database ,Chromosome Mapping ,Computational Biology ,Promoter ,Biology ,Cap analysis gene expression ,Drosophila melanogaster ,Gene Expression Regulation ,Gene expression ,Animals ,Transcription Initiation Site ,Promoter Regions, Genetic ,3' Untranslated Regions ,Gene ,Transcription factor ,Genetics (clinical) ,Genome-Wide Association Study - Abstract
Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.
- Published
- 2010
73. The developmental transcriptome of Drosophila melanogaster
- Author
-
Roger A. Hoskins, Peter Cherbas, Brian B. Tuch, Lucy Cherbas, Michael R. Brent, Wei Lin, Laura Langton, Dayu Zhang, James B. Brown, Thomas R. Gingeras, Aarron T. Willingham, Alexander Dobin, Brenton R. Graveley, Susan E. Celniker, Nathan Boley, Carlo G. Artieri, Steven E. Brenner, Nicolas R. Mattiuzzo, Carrie A. Davis, Marco Blanchette, Jeremy E. Sandler, David Sturgill, Sandrine Dudoit, Brian Oliver, Benjamin W. Booth, Peter J. Bickel, Renhua Li, Yu Zhang, Norbert Perrimon, Phil Kapranov, David Scott Miller, Ann S. Hammonds, Marijke J. van Baren, Yi Zou, Richard E. Green, Joseph W. Carlson, Lichun Jiang, Jane M. Landolin, Chris Zaleski, Li Yang, Kenneth H. Wan, Thomas C. Kaufman, Brian D. Eads, John H. Malone, Angela N. Brooks, Michael O. Duff, and Justen Andrews
- Subjects
Genetics ,0303 health sciences ,Multidisciplinary ,biology ,Computational biology ,biology.organism_classification ,Genome ,Article ,Transcriptome ,03 medical and health sciences ,Exon ,0302 clinical medicine ,RNA editing ,Genetic model ,RNA splicing ,Drosophila melanogaster ,Gene ,030217 neurology & neurosurgery ,030304 developmental biology - Abstract
Drosophila melanogaster is one of the most well studied genetic model organisms, nonetheless its genome still contains unannotated coding and non-coding genes, transcripts, exons, and RNA editing sites. Full discovery and annotation are prerequisites for understanding how the regulation of transcription, splicing, and RNA editing directs development of this complex organism. We used RNA-Seq, tiling microarrays, and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. Together, these data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development.
- Published
- 2010
74. SF3B1 Gene Expression in Erythroid Cells Is Regulated By Intron Retention Via a Posttranscriptional Mechanism Involving Cryptic Exons Proposed to Function As Splicing Decoys
- Author
-
Richard Weiszmann, Benjamin W. Booth, Gene W. Yeo, Susan E. Celniker, Marilyn Parra, James B. Brown, and John G. Conboy
- Subjects
RNA Splicing Factors ,U2AF2 ,Immunology ,Intron ,RNA-binding protein ,Cell Biology ,Hematology ,Biology ,Biochemistry ,Cell biology ,Exon ,RNA splicing ,Decoy ,Minigene - Abstract
Proper expression of the MDS-disease gene, SF3B1, ensures appropriate pre-mRNA splicing in erythroid progenitors and during terminal erythropoiesis. We previously showed that the SF3B1 gene is post-transcriptionally regulated in a differentiation stage-specific manner by intron retention (IR), such that ~50% of its transcripts in mature erythroblasts retain intron 4. Based on new mechanistic studies, we propose a model in which mostly unannotated and noncoding exons within intron 4 function as splicing decoys; i.e., they promote retention of intron 4 by interacting with, and blocking splice sites of, the adjacent exons 4 and 5. A total of six putative decoy exons were revealed via RT-PCR and RNA-seq analysis of RNA from erythroblasts treated with inhibitors of nonsense-mediated decay. That decoy exons have IR-promoting activity is suggested by several criteria. First, the frequency of interaction between constitutive exons 4 and 5 and putative decoy exons within intron 4, measured by the abundance of splice junctions in RNA-seq read data, is temporally correlated with levels of intron 4 retention during terminal erythropoiesis. Both IR and decoy splice junctions were low in early stage erythroblasts and much higher in mature erythroblasts. Second, selected decoy exons exhibited IR-promoting activity in the context of minigene splicing reporters expressing the exon 3-6 region of SF3B1 in transfected K562 cells. The wild type minigene reproduced the intron-specific retention phenotype, since it was fully spliced at introns 3 and 5 but exhibited substantial retention of intron 4, whereas deletion of decoy exon 4e, or mutation of its splice sites, substantially decreased IR. Third, RBP (RNA binding protein) cross-linking data from K562 cells show that 3' splice site factors including U2AF1 and U2AF2 can bind specifically to 3' splice sites of intron 4's decoy exons. Finally, several experiments showed that IR-promoting activity of decoy exons is a more general phenomenon that likely governs IR in other erythroid genes. We observed not only that SF3B1 intron 4 decoy exons could promote IR in heterologous contexts, but also that predicted decoy exons from other erythroblast transcripts could promote IR in the SF3B1 minigene. Apart from this experimental data, comparative genomics revealed that the SF3B1 decoy exons are extremely conserved among vertebrate genomes, with two of the exons being essentially identical from fish to humans. Together this data supports the hypothesis that a subset of up-regulated IR events in late erythroblasts are controlled by decoy exons that block productive splicing at the flanking exons. We propose that regulated IR is an important post-transcriptional mechanism for adjusting cellular splicing capacity during terminal erythropoiesis by regulating expression of key splicing factors such as SF3B1. Disclosures No relevant conflicts of interest to declare.
- Published
- 2017
75. Unlocking the secrets of the genome
- Author
-
Laura A.L. Dillon, Fabio Piano, Jason D. Lieb, Gary H. Karpen, Michael Snyder, Kevin P. White, David M. MacAlpine, Robert H. Waterston, Gos Micklem, Mark Gerstein, Kristin C. Gunsalus, Manolis Kellis, Eric C. Lai, Steven Henikoff, Lincoln Stein, Susan E. Celniker, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, and Kellis, Manolis
- Subjects
Genetics ,Genome ,Multidisciplinary ,biology ,fungi ,Genomics ,Computational biology ,biology.organism_classification ,Article ,Drosophila melanogaster ,Drosophilidae ,Human Genome Project ,Animals ,Humans ,sense organs ,Caenorhabditis elegans ,Functional genomics - Abstract
The primary objective of the Human Genome Project was to produce high-quality sequences not just for the human genome but also for those of the chief model organisms: Escherichia coli, yeast (Saccharomyces cerevisiae), worm (Caenorhabditis elegans), fly (Drosophila melanogaster) and mouse (Mus musculus). Free access to the resultant data has prompted much biological research, including development of a map of common human genetic variants (the International HapMap Project)1, expression profiling of healthy and diseased cells2 and in-depth studies of many individual genes. These genome sequences have enabled researchers to carry out genetic and functional genomic studies not previously possible, revealing new biological insights with broad relevance across the animal kingdom 3, 4.
- Published
- 2009
76. Determination of gene expression patterns using high-throughput RNA in situ hybridization to whole-mount Drosophila embryos
- Author
-
Richard Weiszmann, Ann S. Hammonds, and Susan E. Celniker
- Subjects
Embryo, Nonmammalian ,animal structures ,Embryonic Development ,Dot blot ,In situ hybridization ,Biology ,Polymerase Chain Reaction ,Article ,General Biochemistry, Genetics and Molecular Biology ,Embryo Culture Techniques ,Transcription (biology) ,Gene expression ,Animals ,Cloning, Molecular ,In Situ Hybridization ,Gene Expression Profiling ,Hybridization probe ,Gene Expression Regulation, Developmental ,RNA ,RNA Probes ,Molecular biology ,Antisense RNA ,genomic DNA ,embryonic structures ,Drosophila - Abstract
We describe a high-throughput protocol for RNA in situ hybridization (ISH) to Drosophila embryos in 96-well format. cDNA or genomic DNA templates are amplified by PCR and then digoxigenin-labeled ribonucleotides are incorporated into anti-sense RNA probes by in vitro transcription. The quality of each probe is evaluated prior to in situ hybridization using a RNA Probe Quantification (dot blot) assay. RNA probes are hybridized to fixed, mixed-staged Drosophila embryos in 96-well plates. The resulting stained embryos can be examined and photographed immediately or stored at 4°C for later analysis. Starting with fixed, staged embryos, the protocol takes 6 days from probe template production through hybridization. Preparation of fixed embryos requires a minimum of two weeks to collect embryos representing all stages. The method has been used to determine the expression patterns of over 6000 genes throughout embryogenesis.
- Published
- 2009
77. Tools for neuroanatomy and neurogenetics in Drosophila
- Author
-
Christopher J. Mungall, Todd R. Laverty, Barret D. Pfeiffer, James T. Kadonaga, Ann S. Hammonds, Michael B. Eisen, Chris Q. Doe, Gerald M. Rubin, Arnim Jenett, Susan E. Celniker, Audra Scully, Rob Svirskas, Sima Misra, Christine Murphy, Teri T.B. Ngo, Kenneth H. Wan, and Joseph W. Carlson
- Subjects
Biomedical Research ,Transgene ,Genes, Insect ,Computational biology ,Biology ,Animals, Genetically Modified ,Transcription (biology) ,Gene expression ,medicine ,Animals ,Enhancer ,Gene ,Neurons ,Recombination, Genetic ,Genetics ,Multidisciplinary ,Neurosciences ,Brain ,Gene Expression Regulation, Developmental ,Promoter ,Biological Sciences ,biology.organism_classification ,Drosophila melanogaster ,Enhancer Elements, Genetic ,medicine.anatomical_structure ,Neuroanatomy - Abstract
We demonstrate the feasibility of generating thousands of transgenic Drosophila melanogaster lines in which the expression of an exogenous gene is reproducibly directed to distinct small subsets of cells in the adult brain. We expect the expression patterns produced by the collection of 5,000 lines that we are currently generating to encompass all neurons in the brain in a variety of intersecting patterns. Overlapping 3-kb DNA fragments from the flanking noncoding and intronic regions of genes thought to have patterned expression in the adult brain were inserted into a defined genomic location by site-specific recombination. These fragments were then assayed for their ability to function as transcriptional enhancers in conjunction with a synthetic core promoter designed to work with a wide variety of enhancer types. An analysis of 44 fragments from four genes found that >80% drive expression patterns in the brain; the observed patterns were, on average, comprised of D. melanogaster genome contains >50,000 enhancers and that multiple enhancers drive distinct subsets of expression of a gene in each tissue and developmental stage. We expect that these lines will be valuable tools for neuroanatomy as well as for the elucidation of neuronal circuits and information flow in the fly brain.
- Published
- 2008
78. Drosophila telomeric retrotransposons derived from an ancestral element that was recruited to replace telomerase
- Author
-
Beatriz de Pablos, Rosario Planelló, José P. Abad, Susan E. Celniker, Maria Mendez-Lago, and Alfredo Villasante
- Subjects
Telomerase ,animal structures ,Retroelements ,Molecular Sequence Data ,Retrotransposon ,Evolution, Molecular ,Transposition (music) ,Species Specificity ,Phylogenetics ,Genetics ,Animals ,ORFS ,Drosophila (subgenus) ,Phylogeny ,Genetics (clinical) ,Genome ,biology ,fungi ,Telomere ,biology.organism_classification ,Drosophila melanogaster ,12 Drosophila Genomes/Letter ,Drosophila - Abstract
Drosophila telomeres do not have arrays of simple telomerase-generated G-rich repeats. Instead, Drosophila maintains its telomeres by occasional transposition of specific non-long terminal repeat (non-LTR) retrotransposons to chromosome ends. The genus Drosophila provides a superb model system for comparative telomere analysis. Here we present an evolutionary study of Drosophila telomeric elements to ascertain the significance of telomeric retrotransposons (TRs) in the maintenance of Drosophila telomeres. PCR and in silico surveys in the sibling species of Drosophila melanogaster and in more distantly related species show that multiple TRs maintain telomeres in Drosophila. In addition to TRs with two open reading frames (ORFs) capable of autonomous transposition, there are deleted telomeric retrotransposons that have lost their ORF2, which we refer to as half telomeric-retrotransposons (HTRs). The phylogenetic relationship among these telomeric elements is congruent with the phylogeny of the species, suggesting that they have been vertically inherited from a common ancestor. Our results suggest that an existing non-LTR retrotransposon was recruited to perform the cellular function of telomere maintenance.
- Published
- 2007
79. Identification of genetic factors that modify motor performance and body weight using Collaborative Cross mice
- Author
-
Janet K. Jansson, Michael Hang, Susan E. Celniker, Yurong Huang, James B. Brown, Sasha A. Langley, Antoine M. Snijders, Kristofer E. Bouchard, Gary H. Karpen, and Jian-Hua Mao
- Subjects
Genotype ,Genetic Linkage ,Quantitative Trait Loci ,Inbred Strains ,Mice, Inbred Strains ,Motor Activity ,Quantitative trait locus ,Biology ,Article ,Mice ,03 medical and health sciences ,0302 clinical medicine ,Inbred strain ,Risk Factors ,Genetic linkage ,Genetics ,medicine ,Animals ,Humans ,2.1 Biological and endogenous factors ,Obesity ,Aetiology ,Behavioural genetics ,Nutrition ,030304 developmental biology ,0303 health sciences ,Multidisciplinary ,Prevention ,Body Weight ,Human Genome ,Neurosciences ,Chromosome Mapping ,medicine.disease ,Phenotype ,Other Physical Sciences ,Cohort ,Biochemistry and Cell Biology ,030217 neurology & neurosurgery - Abstract
Evidence has emerged that suggests a link between motor deficits, obesity and many neurological disorders. However, the contributing genetic risk factors are poorly understood. Here we used the Collaborative Cross (CC), a large panel of newly inbred mice that captures 90% of the known variation among laboratory mice, to identify the genetic loci controlling rotarod performance and its relationship with body weight in a cohort of 365 mice across 16 CC strains. Body weight and rotarod performance varied widely across CC strains and were significantly negatively correlated. Genetic linkage analysis identified 14 loci that were associated with body weight. However, 45 loci affected rotarod performance, seven of which were also associated with body weight, suggesting a strong link at the genetic level. Lastly, we show that genes identified in this study overlap significantly with those related to neurological disorders and obesity found in human GWA studies. In conclusion, our results provide a genetic framework for studies of the connection between body weight, the central nervous system and behavior.
- Published
- 2015
80. Genome-wide identification of zero nucleotide recursive splicing in Drosophila
- Author
-
Alex M. Plocik, Michael O. Duff, Brenton R. Graveley, Sara Olson, Xintao Wei, Mohan Bolisetty, Susan E. Celniker, Sandra C. Garrett, and Ahmad Osman
- Subjects
Male ,animal structures ,General Science & Technology ,Cells ,RNA Splicing ,Genome, Insect ,Exonic splicing enhancer ,Genes, Insect ,Biology ,Article ,03 medical and health sciences ,Exon ,Splicing factor ,0302 clinical medicine ,Splicing Factor U2AF ,Genetics ,Animals ,Humans ,Ultrabithorax ,Cells, Cultured ,030304 developmental biology ,0303 health sciences ,Multidisciplinary ,Splice site mutation ,Cultured ,Genome ,Base Sequence ,Nucleotides ,Human Genome ,Intron ,Nuclear Proteins ,Reproducibility of Results ,Exons ,Introns ,Drosophila melanogaster ,Ribonucleoproteins ,Genes ,RNA splicing ,Female ,RNA Splice Sites ,Insect ,030217 neurology & neurosurgery - Abstract
© 2015 Macmillan Publishers Limited. All rights reserved. Recursive splicing is a process in which large introns are removed in multiple steps by re-splicing at ratchet points - 5′ splice sites recreated after splicing1. Recursive splicing was first identified in the Drosophila Ultrabithorax (Ubx) gene and only three additional Drosophila genes have since been experimentally shown to undergo recursive splicing. Here we identify 197 zero nucleotide exon ratchet points in 130 introns of 115 Drosophila genes from total RNA sequencing data generated from developmental time points, dissected tissues and cultured cells. The sequential nature of recursive splicing was confirmed by identification of lariat introns generated by splicing to and from the ratchet points. We also show that recursive splicing is a constitutive process, that depletion of U2AF inhibits recursive splicing, and that the sequence and function of ratchet points are evolutionarily conserved in Drosophila. Finally, we identify four recursively spliced human genes, one of which is also recursively spliced in Drosophila. Together, these results indicate that recursive splicing is commonly used in Drosophila, occurs in humans, and provides insight into the mechanisms by which some large introns are removed.
- Published
- 2015
81. Regulation of alternative splicing in Drosophila by 56 RNA binding proteins
- Author
-
Brenton R. Graveley, Steven E. Brenner, Li Yang, Jeremy E. Sandler, Mohan Bolisetty, Angela N. Brooks, Jane M. Landolin, Gemma E. May, Michael O. Duff, Susan E. Celniker, Benjamin W. Booth, and Ken Wan
- Subjects
Genetics ,Resource ,TATA-Binding Protein Associated Factors ,Sequence Analysis, RNA ,Alternative splicing ,Intron ,Exonic splicing enhancer ,RNA-Binding Proteins ,RNA-binding protein ,Exons ,Biology ,Heterogeneous-Nuclear Ribonucleoproteins ,Cell biology ,Exon ,Splicing factor ,Alternative Splicing ,SR protein ,RNA splicing ,RNA Precursors ,Animals ,Drosophila Proteins ,Drosophila ,RNA Interference ,Genetics (clinical) - Abstract
Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected the splicing of pre-mRNAs encoding other splicing regulators. This large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.
- Published
- 2015
82. The Release 6 reference sequence of the Drosophila melanogaster genome
- Author
-
Ivonne Mendez, Maria Mendez-Lago, Alfredo Villasante, Lidiya V. Boldyreva, Joseph W. Carlson, Kenneth H. Wan, Jacqueline E. Schein, Benjamin W. Booth, Reed A. George, Evgeniya N. Andreyeva, Gary H. Karpen, Patrizio Dimitri, Maria Carmela Accardo, A. Bernardo Carvalho, Robert Svirskas, Marco A. Marra, Soo Park, Martin Krzywinski, Elisabetta Damia, Olga V. Demakova, Giovanni Messina, Beatriz de Pablos, Igor F. Zhimulev, Gerald M. Rubin, Roger A. Hoskins, Samuel E. Galle, Susan E. Celniker, Barret D. Pfeiffer, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Científico e Tecnológico (Brasil), Fondazione Cenci Bolognetti, University of California, Ministry of Education and Science of the Russian Federation, Ministerio de Economía y Competitividad (España), Hoskins, R, Carlson, J, Wan, K, Park, S, Mendez, I, Galle, S, Booth, B, Pfeiffer, B, George, R, Svirskas, R, Krzywinski, M, Schein, J, Accardo, M, Damia, E, Messina, G, Mendez-Lago, M, De Pablos, B, Demakova, O, Andreyeva, E, Boldyreva, L, Marra, M, Carvalho, A, Dimitri, P, Villasante, A, Zhimulev, I, Rubin, G, Karpen, G, and Celniker, S
- Subjects
Resource ,Chromosomes, Artificial, Bacterial ,Sequence analysis ,Bioinformatics ,Restriction Mapping ,Molecular Sequence Data ,Drosophila, Genome ,Sequence assembly ,Computational biology ,Biology ,Genome ,Medical and Health Sciences ,Contig Mapping ,Chromosomes ,Fluorescence ,03 medical and health sciences ,0302 clinical medicine ,Restriction map ,Genetics ,Animals ,In Situ Hybridization, Fluorescence ,Genetics (clinical) ,In Situ Hybridization ,030304 developmental biology ,Sequence (medicine) ,Polytene Chromosomes ,0303 health sciences ,Human Genome ,Bacterial ,Chromosome Mapping ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Genome project ,Biological Sciences ,Drosophila melanogaster ,Artificial ,Generic health relevance ,030217 neurology & neurosurgery ,Reference genome ,Biotechnology - Abstract
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads., This work was supported by NIH grants P50 HG00750 (G.M.R.), R01 HG00747 (G.H.K.), and R01 HG002673 (S.E.C.) and performed under U.S. Department of Energy Contracts DE-AC0376SF00098 and DE-AC02-05CH11231, University of California. I.F.Z. was supported by grant 13-04-40137 from the Russian Federation; E.N.A. was supported by grant 12-04-00874-a from the Russian Federation; P.D. was supported by a grant from the Instituto Pasteur-Fondazione Cenci Bolognetti; A.V. was supported by Ministerio de Economía y Competitividad grant BFU2011-30295-C02-01; and A.B.C. was supported by NIH grant R01 GM064590 and grants from Fundaçao de Amparo a Pesquisa do Estado do Rio de Janeiro (FAPERJ) and the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq).
- Published
- 2015
83. Lessons from modENCODE
- Author
-
Susan E. Celniker and James B. Brown
- Subjects
replication ,Databases, Factual ,1.1 Normal biological development and functioning ,ved/biology.organism_classification_rank.species ,Genome, Insect ,Computational biology ,ENCODE ,Genome ,Transcriptome ,Databases ,Underpinning research ,Helminth ,Genetics ,Animals ,2.1 Biological and endogenous factors ,Aetiology ,Model organism ,Caenorhabditis elegans ,Molecular Biology ,Genetics (clinical) ,Factual ,Genomic organization ,Genetics & Heredity ,Genome, Helminth ,Evolutionary Biology ,Binding Sites ,biology ,epigenetics ,ved/biology ,Human Genome ,Epigenome ,Genomics ,biology.organism_classification ,Chromatin ,DNA-Binding Proteins ,Drosophila melanogaster ,Generic health relevance ,regulation of gene expression ,transcription ,Insect ,Law ,Biotechnology - Abstract
Copyright © 2015 by Annual Reviews. All rights reserved. The modENCODE (Model Organism Encyclopedia of DNA Elements) Consortium aimed to map functional elements-including transcripts, chromatin marks, regulatory factor binding sites, and origins of DNA replication-in the model organisms Drosophila melanogaster and Caenorhabditis elegans. During its five-year span, the consortium conducted more than 2,000 genome-wide assays in developmentally staged animals, dissected tissues, and homogeneous cell lines. Analysis of these data sets provided foundational insights into genome, epigenome, and transcriptome structure and the evolutionary turnover of regulatory pathways. These studies facilitated a comparative analysis with similar data types produced by the ENCODE Consortium for human cells. Genome organization differs drastically in these distant species, and yet quantitative relationships among chromatin state, transcription, and cotranscriptional RNA processing are deeply conserved. Of the many biological discoveries of the modENCODE Consortium, we highlight insights that emerged from integrative studies. We focus on operational and scientific lessons that may aid future projects of similar scale or aims in other, emerging model systems.
- Published
- 2015
84. RNA editing in Drosophila melanogaster: New targets and functional consequences
- Author
-
Susan E. Celniker, Joseph W. Carlson, and Mark Stapleton
- Subjects
DNA, Complementary ,Receptors, Peptide ,Adenosine Deaminase ,Molecular Sequence Data ,Genes, Insect ,Biology ,Ion Channels ,Report ,Animals ,Drosophila Proteins ,Eye Proteins ,Molecular Biology ,Gene ,DNA Primers ,Genetics ,Membrane Glycoproteins ,Base Sequence ,Reverse Transcriptase Polymerase Chain Reaction ,Life Sciences ,RNA-Binding Proteins ,RNA ,Sequence Analysis, DNA ,Actin cytoskeleton ,biology.organism_classification ,Cell biology ,Cytoskeletal Proteins ,Drosophila melanogaster ,Ion homeostasis ,RNA editing ,ADAR ,RNA Editing ,Drosophila Protein ,Signal Transduction - Abstract
Adenosine deaminases that act on RNA [adenosine deaminase, RNA specific (ADAR)] catalyze the site-specific conversion of adenosine to inosine in primary mRNA transcripts. These re-coding events affect coding potential, splice sites, and stability of mature mRNAs. ADAR is an essential gene, and studies in mouse, Caenorhabditis elegans, and Drosophila suggest that its primary function is to modify adult behavior by altering signaling components in the nervous system. By comparing the sequence of isogenic cDNAs to genomic DNA, we have identified and experimentally verified 27 new targets of Drosophila ADAR. Our analyses led us to identify new classes of genes whose transcripts are targets of ADAR, including components of the actin cytoskeleton and genes involved in ion homeostasis and signal transduction. Our results indicate that editing in Drosophila increases the diversity of the proteome, and does so in a manner that has direct functional consequences on protein function.
- Published
- 2006
85. Genomic organization of the Drosophila telomere retrotransposable elements
- Author
-
Janet A. George, P. Gregory DeBaryshe, K L Traverse, Mary Lou Pardue, and Susan E. Celniker
- Subjects
Genetics ,Letter ,animal structures ,Polytene chromosome ,Retroelements ,Euchromatin ,Sequence analysis ,Heterochromatin ,Computational Biology ,Telomeric heterochromatin ,Retrotransposon ,Genomics ,Genome project ,Telomere ,Biology ,Blotting, Southern ,Drosophila melanogaster ,Animals ,Genetics (clinical) ,Genomic organization - Abstract
Telomeres of the genus Drosophila, like those of other eukaryotes, are made up of tandem arrays of nucleotide repeats copied from an RNA template. The distinctive feature of Drosophila telomeres lies in the sequence repeats themselves. In all other studied organisms, telomere repeats are very short simple sequences, which do not code for proteins. In contrast, the Drosophila telomere repeats are two retrotransposable elements, HeT-A and TART (see Pardue and DeBaryshe 2003). (In Drosophila melanogaster there are also a few copies of Tahre, an element combining sequences of HeT-A and TART [Abad et al. 2004b]. Tahre has not yet been reported in other Drosophila species.) The Drosophila telomere elements are 2 or 3 orders of magnitude larger than telomere repeats in other organisms, and they encode proteins used for their retrotransposition. HeT-A, TART, and Tahre are non–long terminal repeat (non- LTR) retrotransposons, and the proteins they encode are closely related to the proteins encoded by a group of non-LTR retrotransposons that are abundant in Drosophila genomes. However, HeTA, TART, and Tahre are distinguished from these other retrotransposons in two important ways: First, HeT-A, TART, and Tahre transpose specifically to chromosome ends, apparently identifying ends by some feature other than DNA sequence. In situ hybridization experiments have not detected these elements in euchromatic regions except when they have bound to the end of a chromosome that has broken in euchromatin (Traverse and Pardue 1988; Biessmann et al. 1990b). Other Drosophila retrotransposons transpose to many sites in euchromatin but have not been found in clones derived from telo- mere HeT-A/TART arrays. Second, while most retrotransposons contain very little DNA that does not code for proteins, HeT-A, TART, and Tahre have 3 untranslated regions (3 UTRs) that make up nearly half their sequence. As is typical of UTRs, the sequence of these regions evolves rapidly. Nonetheless, HeT-A and TART arrays maintain the strand composition bias seen on other telomeres—the sense strand is always A+C-rich (Danilevskaya et al. 1998), like telomerase template sequences (Henderson 1995). In addition, HeT-A 3 UTRs have a pattern of irregularly spaced A-rich regions in every Drosophila species studied (Danilevskaya et al. 1998; Casacuberta and Pardue 2003). This combination of unusual, but well-defined, chromosomal distribution and unusual sequence organization suggests that these features are related. The possibility of such a relationship is further emphasized by the fact that fragments of their 3 UTR sequence have been found not only in telomeric heterochromatin but also in other heterochromatic regions of the genome (Danilevskaya et al. 1991, 1993; Losada et al. 1999; Casacuberta and Pardue 2002). Such an association could result if these unusual sequences are positively selected for survival in heterochromatin, are deleterious if transposed into euchromatin, or both. The implications of this apparent relationship between chromosomal distribution and HeT-A/TART sequences can be investigated effectively only now as we begin to have a better idea of exactly how well our current limited information reflects the true distribution of these sequences in the genome. The D. melanogaster euchromatic genome has been completely sequenced (Adams et al. 2000; Celniker et al. 2002), but the heterochromatic portion presents challenges for sequencing that are only slowly being overcome. Therefore, most available HeT-A and TART sequences have come from small fragments, either subcloned or amplified by PCR. There is little evidence for the exact genomic site from which these fragments originated; most of what we do know has been deduced from in situ hybridization to polytene chromosomes. In these giant chromosomes, in situ localization of cloned sequences is complicated by cross- hybridization to the many copies of HeT-A and TART and, at least sometimes, by under-replication of some heterochromatic sequences. For example, neither Y chromosomes nor pericentric satellite sequences are detectably polytenized (Gall et al. 1971). The replication status of other heterochromatic sequences is less well determined. If HeT-A and TART sequences are part of the set that is greatly under-replicated in salivary glands, then polytene chromosomes might not give a complete picture of the localization of these sequences. The Drosophila Heterochromatin Genome Project is extending sequence of the D. melanogaster genome (stock 2057) into heterochromatic regions (Hoskins et al. 2002). Specifically, there is now assembled sequence extending into the telomere on the right end of chromosome 4 (4R) and the left end of the X chromosome (XL). These assemblies give the first detailed view of telomere structure in D. melanogaster. The following analysis of these sequences confirms that the chromosomal distribution of HeT-A and TART differs sharply from that of other retrotransposons in ways that are consistent with their roles at the telo- mere. They also provide new insight into the production and turnover of telomere arrays. The results of this sequence analysis, in conjunction with quantitative hybridization measurements, allowed us to determine the magnitude and range of variation of telomere sequence in flies of different genotypes and in established cell lines. Previous molecular characterization of eukaryotic telomeres produced the unexpected finding that average lengths of telomerase- maintained telomeres fluctuate around equilibrium values specified by the genotype, tissue, and environment. Our measurements of Drosophila telomeres suggest that, despite their unusual retrotransposon mechanism, these telomeres have a similarly dynamic behavior.
- Published
- 2006
86. Chromatin looping mediates boundary element promoter interactions
- Author
-
Robert A. Drewell and Susan E. Celniker
- Subjects
Homeodomain Proteins ,Regulation of gene expression ,Genetics ,Models, Genetic ,biology ,Genes, Insect ,Chromatin Assembly and Disassembly ,biology.organism_classification ,General Biochemistry, Genetics and Molecular Biology ,Chromatin ,Cell biology ,Animals, Genetically Modified ,Drosophila melanogaster ,Gene Expression Regulation ,Gene expression ,Animals ,Drosophila Proteins ,Promoter Regions, Genetic ,Enhancer ,Gene ,Function (biology) ,Drosophila Protein - Abstract
One facet of the control of gene expression is long-range promoter regulation by distant enhancers. It is an important component of the regulation of genes that control metazoan development and has been appreciated for some time but the molecular mechanisms underlying this regulation have remained poorly understood. A recent study by Cleard and colleagues1 reports the first in vivo evidence of chromatin looping and boundary element promoter interaction. Specifically, they studied the function of a boundary element within the cis-regulatory region of the Abdominal-B (Abd-B) gene of Drosophila melanogaster.
- Published
- 2006
87. Identification and analysis of U5 snRNA variants in Drosophila
- Author
-
Donald C. Rio, Enbo Ma, Susan E. Celniker, Dennis J. Lullo, Jennifer A. Doudna, and Li Chen
- Subjects
Spliceosome ,RNA Splicing ,Molecular Sequence Data ,Prp24 ,Biology ,Autoantigens ,Protein Structure, Secondary ,snRNP Core Proteins ,RNA, Small Nuclear ,Animals ,Protein Isoforms ,snRNP ,Amino Acid Sequence ,Cloning, Molecular ,Letter to the Editor ,Molecular Biology ,Ribonucleoprotein ,Genetics ,Binding Sites ,Base Sequence ,SnRNP Core Proteins ,Gene Expression Profiling ,Genetic Variation ,RNA ,Ribonucleoproteins, Small Nuclear ,RNA splicing ,Spliceosomes ,Nucleic Acid Conformation ,Drosophila ,Small nuclear RNA - Abstract
Distinct isoforms of spliceosomal RNAs may be involved in regulating pre-messenger RNA splicing in eukaryotic cells. During a large-scale effort to identify small noncoding RNAs in Drosophila, we isolated a U5 snRNA-like molecule containing a 5′ segment identical to that of the canonical (major) U5 snRNA but with a variant Sm binding site and a distinct 3′ hairpin sequence. Based on this finding, another six similar U5 snRNA-like sequences were identified within the Drosophila genome by sequence similarity to the invariant loop in the 5′ half of U5. Interestingly, although all of these variants are expressed in vivo, each shows a distinct temporal expression profile during Drosophila development, and one is expressed primarily in fly heads. The presence of these U5 snRNA variants within RNP particles suggests their role in splicing and implies a possible connection to regulation of developmental and tissue-specific gene expression.
- Published
- 2005
88. Identification of putative noncoding polyadenylated transcripts in Drosophila melanogaster
- Author
-
Sima Misra, Martha Evans-Holm, Susan E. Celniker, Gina Dailey, Jonathan L. Tupy, Christian W. Siebel, Gerald M. Rubin, and Adina M. Bailey
- Subjects
DNA, Complementary ,RNA, Untranslated ,Transcription, Genetic ,Genes, Insect ,Genomics ,Polyadenylation ,Genome ,Conserved sequence ,Open Reading Frames ,Intergenic region ,Species Specificity ,Complementary DNA ,Animals ,RNA, Messenger ,Gene ,Conserved Sequence ,In Situ Hybridization ,Oligonucleotide Array Sequence Analysis ,Genetics ,Multidisciplinary ,biology ,Reverse Transcriptase Polymerase Chain Reaction ,Computational Biology ,Biological Sciences ,Blotting, Northern ,biology.organism_classification ,Drosophila melanogaster ,Protein Biosynthesis ,Drosophila ,Sequence motif - Abstract
Analysis of EST and cDNA collections from a number of metazoan species has identified genes encoding long polyadenylated transcripts that do not contain ORFs of lengths typical for protein-encoding mRNAs. Noncoding functions of such polyadenylated transcripts have been elucidated in only a few examples. The corresponding genes neither contain hallmark sequence motifs nor appear to have been conserved across phyla. Thus, it is impossible to systematically identify new members of this class of gene by using sequence homology and traditional gene-finding algorithms that depend on protein-coding potential. Consequently, even their approximate number has not been established for any metazoan genome. We curated polyadenylated transcripts with limited protein-coding capacity from intergenic regions of the Drosophila melanogaster genome. We used RT-PCR assays, hybridization to RNA blots and whole-mount embryos, and computational analyses to characterize candidate transcripts. We verify the structures and expression of 17 distinct, likely non-protein-coding polyadenylated transcripts. We show that the expression of many of these transcripts is conserved in other Drosophila species, indicating that they have important biological functions.
- Published
- 2005
89. A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome
- Author
-
Mark Yandell, Sima Misra, Adina M. Bailey, Martha Evans-Holm, Gerald M. Rubin, ShengQiang Shu, Susan E. Celniker, and Colin Wiel
- Subjects
Genetics ,Genome ,Multidisciplinary ,Models, Genetic ,Reverse Transcriptase Polymerase Chain Reaction ,Gene number ,Molecular Sequence Data ,Reproducibility of Results ,Computational gene ,Gene Annotation ,Computational biology ,Biological Sciences ,Biology ,biology.organism_classification ,Polymerase Chain Reaction ,Drosophila melanogaster ,Melanogaster ,Animals ,Drosophila Proteins ,Gene ,Drosophila Protein ,DNA Primers - Abstract
Five years after the completion of the sequence of the Drosophila melanogaster genome, the number of protein-coding genes it contains remains a matter of debate; the number of computational gene predictions greatly exceeds the number of validated gene annotations. We have assembled a collection of >10,000 gene predictions that do not overlap existing gene annotations and have developed a process for their validation that allows us to efficiently prioritize and experimentally validate predictions from various sources by sequencing RT-PCR products to confirm gene structures. Our data provide experimental evidence for 122 protein-coding genes. Our analyses suggest that the entire collection of predictions contains only ≈700 additional protein-coding genes. Although we cannot rule out the discovery of genes with unusual features that make them refractory to existing methods, our results suggest that the D. melanogaster genome contains ≈14,000 protein-coding genes.
- Published
- 2005
90. [Untitled]
- Author
-
Gerald M. Rubin, Susan E. Celniker, Roger A. Hoskins, Eugene W. Myers, Granger G. Sutton, Mark Raymond Adams, Joseph W. Carlson, Andrew G. Clark, Maria D. Vibranovski, and A. Bernardo Carvalho
- Subjects
Genetics ,Expressed sequence tag ,Euchromatin ,Contig ,Shotgun sequencing ,Heterochromatin ,food and beverages ,Genomics ,Plant Science ,General Medicine ,Genome project ,Computational biology ,Biology ,Genome ,Insect Science ,Animal Science and Zoology - Abstract
Whole genome shotgun assemblies have proven remarkably successful in reconstructing the bulk of euchromatic genes, with the only limit appearing to be determined by the sequencing depth. For genes imbedded in heterochromatin, however, the low cloning efficiency of repetitive sequences, combined with the computational challenges, demand that additional clues be used to annotate the sequences. One approach that has proven very successful in identifying protein coding genes in Y-linked heterochromatin of Drosophila melanogaster has been to make a BLASTable database of the small, unmapped contigs and fragments leftover at the end of a shotgun assembly, and to attempt to capture these by blasting with an appropriate query sequence. This approach often yields a staggered alignment of contigs from the unmapped set to the query sequence, as though the disjoint contigs represent small portions of the gene. Further inspection frequently shows that the contigs are broken by very large, heterochromatic introns. Methods of this sort are being expanded to make best use of all available clues to determine which unmapped contigs are associated with genes. These include use of EST libraries, and, in the case of the Y chromosome, testing of male specific genes and reduced shotgun depth of relevant contigs. It appears much more hopeful than anyone would have imagined that whole genome shotgun assemblies can recover the great bulk of even heterochromatic genes.
- Published
- 2003
91. Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome
- Author
-
Pavel Tomancak, Barret D. Pfeiffer, Michael B. Eisen, Gerald M. Rubin, Yutaka Nibu, Michael Levine, Susan E. Celniker, and Benjamin P. Berman
- Subjects
Genetics ,Binding Sites ,Multidisciplinary ,Models, Genetic ,Gene Expression Regulation, Developmental ,Genes, Insect ,Genomics ,Biological Sciences ,Biology ,Genome ,DNA binding site ,Drosophila melanogaster ,Enhancer Elements, Genetic ,Genes, Regulator ,Animals ,Binding site ,Enhancer ,Gene ,Transcription factor ,Algorithms ,Gap gene ,Transcription Factors ,Cis-regulatory module - Abstract
A major challenge in interpreting genome sequences is understanding how the genome encodes the information that specifies when and where a gene will be expressed. The first step in this process is the identification of regions of the genome that contain regulatory information. In higher eukaryotes, this cis-regulatory information is organized into modular units [cis-regulatory modules (CRMs)] of a few hundred base pairs. A common feature of these cis-regulatory modules is the presence of multiple binding sites for multiple transcription factors. Here, we evaluate the extent to which the tendency for transcription factor binding sites to be clustered can be used as the basis for the computational identification of cis-regulatory modules. By using published DNA binding specificity data for five transcription factors active in the early Drosophila embryo, we identified genomic regions containing unusually high concentrations of predicted binding sites for these factors. A significant fraction of these binding site clusters overlap known CRMs that are regulated by these factors. In addition, many of the remaining clusters are adjacent to genes expressed in a pattern characteristic of genes regulated by these factors. We tested one of the newly identified clusters, mapping upstream of the gap gene giant (gt) , and show that it acts as an enhancer that recapitulates the posterior expression pattern of gt.
- Published
- 2002
92. Extensive cross-regulation of post-transcriptional regulatory networks in Drosophila
- Author
-
Robert A. Obar, Michael O. Duff, Jan Manent, Spyros Artavanis-Tsakonas, Sara Olson, Peter J. Bickel, Susan E. Celniker, K. G. Guruharsha, James B. Brown, Brenton R. Graveley, Gemma E. May, and Marcus H. Stoiber
- Subjects
Genetics ,Polyadenylation ,Sequence Analysis, RNA ,RNA Splicing ,Research ,Intron ,RNA ,RNA-Binding Proteins ,RNA-binding protein ,Biology ,Heterogeneous ribonucleoprotein particle ,Transfection ,Heterogeneous-Nuclear Ribonucleoproteins ,Cell biology ,Drosophila melanogaster ,Gene Expression Regulation ,RNA splicing ,Animals ,Drosophila Proteins ,RNA, Messenger ,Genetics (clinical) ,Drosophila Protein ,Ribonucleoprotein - Abstract
In eukaryotic cells, RNAs exist as ribonucleoprotein particles (RNPs). Despite the importance of these complexes in many biological processes, including splicing, polyadenylation, stability, transportation, localization, and translation, their compositions are largely unknown. We affinity-purified 20 distinct RNA-binding proteins (RBPs) from cultured Drosophila melanogaster cells under native conditions and identified both the RNA and protein compositions of these RNP complexes. We identified “high occupancy target” (HOT) RNAs that interact with the majority of the RBPs we surveyed. HOT RNAs encode components of the nonsense-mediated decay and splicing machinery, as well as RNA-binding and translation initiation proteins. The RNP complexes contain proteins and mRNAs involved in RNA binding and post-transcriptional regulation. Genes with the capacity to produce hundreds of mRNA isoforms, ultracomplex genes, interact extensively with heterogeneous nuclear ribonuclear proteins (hnRNPs). Our data are consistent with a model in which subsets of RNPs include mRNA and protein products from the same gene, indicating the widespread existence of auto-regulatory RNPs. From the simultaneous acquisition and integrative analysis of protein and RNA constituents of RNPs, we identify extensive cross-regulatory and hierarchical interactions in post-transcriptional control.
- Published
- 2014
93. DNA copy number evolution in Drosophila cell lines
- Author
-
Lucy Cherbas, Alissa M. Resch, Susan E. Celniker, Brenton R. Graveley, Maria Patrizia Somma, Peter Cherbas, Teresa M. Przytycka, Brian Oliver, C. Joel McManus, Gemma E. May, Sara K. Powell, Lijun Zhan, Dong-Yeon Cho, Hangnoh Lee, Dayu Zhang, David M. MacAlpine, Justen Andrews, Matthew L. Eaton, Maurizio Gatti, and Fioranna Renda
- Subjects
Male ,Cell Survival ,Gene Dosage ,Biology ,medicine.disease_cause ,Genome ,Gene dosage ,Cell Line ,Evolution, Molecular ,Tissue Culture Techniques ,medicine ,Animals ,Drosophila Proteins ,Selection, Genetic ,Gene ,Genetics ,genic imbalance ,Sex Chromosomes ,Dosage compensation ,Research ,Structural rearrangements of the genome ,Genetic Variation ,Receptor Protein-Tyrosine Kinases ,Correction ,DNA ,Sequence Analysis, DNA ,Phenotype ,MicroRNAs ,Drosophila melanogaster ,Cancer cell ,Drosophila ,Female ,Genetic Fitness ,sense organs ,Carcinogenesis ,Immortalised cell line - Abstract
Background Structural rearrangements of the genome resulting in genic imbalance due to copy number change are often deleterious at the organismal level, but are common in immortalized cell lines and tumors, where they may be an advantage to cells. In order to explore the biological consequences of copy number changes in the Drosophila genome, we resequenced the genomes of 19 tissue-culture cell lines and generated RNA-Seq profiles. Results Our work revealed dramatic duplications and deletions in all cell lines. We found three lines of evidence indicating that copy number changes were due to selection during tissue culture. First, we found that copy numbers correlated to maintain stoichiometric balance in protein complexes and biochemical pathways, consistent with the gene balance hypothesis. Second, while most copy number changes were cell line-specific, we identified some copy number changes shared by many of the independent cell lines. These included dramatic recurrence of increased copy number of the PDGF/VEGF receptor, which is also over-expressed in many cancer cells, and of bantam, an anti-apoptosis miRNA. Third, even when copy number changes seemed distinct between lines, there was strong evidence that they supported a common phenotypic outcome. For example, we found that proto-oncogenes were over-represented in one cell line (S2-DRSC), whereas tumor suppressor genes were under-represented in another (Kc167). Conclusion Our study illustrates how genome structure changes may contribute to selection of cell lines in vitro. This has implications for other cell-level natural selection progressions, including tumorigenesis. Electronic supplementary material The online version of this article (doi:10.1186/gb-2014-15-8-r70) contains supplementary material, which is available to authorized users.
- Published
- 2014
94. Comparative validation of the D. melanogaster modENCODE transcriptome annotation
- Author
-
Brian Oliver, Peter C. FitzGerald, Fiona Ongeri, Joel Atallah, Roger A. Hoskins, Teresa M. Przytycka, David C. Plachetzki, Mingmei Li, Ling Ling Pu, Sandra L. Lee, Divya Kalra, Piero Carninci, Nicolas R. Mattiuzzo, Tittu Mathew, Harold E. Smith, Emily Clough, Anthony R. Fletcher, Susan E. Celniker, Karin Kionte, Charles Vinson, Michael O. Duff, Abhijit Dasgupta, Crystal B. Warner, Stephen Richards, Sai Gubbala, James B. Brown, Xiaoyan Zou, Huaiyang Jiang, Ana Maria Suzuki, Artyom Kopp, Kim C. Worley, Yuan Qing Wu, Yi Han, Fabio Piano, Christie Kovar, Paul W. Sternberg, Lora Perales, Olga Barmina, Donna M. Muzny, Manolis Kellis, Joy Jayaseelan, Zhen-Xia Chen, Kerstin P. Blankenburg, Carlo G. Artieri, Michael B. Eisen, Peter J. Bickel, Steven E. Scherer, Mala Munidasa, Nehad Saada, David H. A. Fitch, Peter Cherbas, James D. Malley, David Sturgill, Jiaxin Qu, Asher D. Cutter, Garrett Robinson, John H. Malone, Richard A. Gibbs, Soo Park, Brenton R. Graveley, Yoo-Ah Kim, Rebecca Thornton, Nathan Boley, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, and Kellis, Manolis
- Subjects
Male ,Genome, Insect ,Genome ,Medical and Health Sciences ,Melanogaster ,Cluster Analysis ,Promoter Regions, Genetic ,Genetics (clinical) ,Phylogeny ,Genetics ,biology ,Genome project ,Exons ,Biological Sciences ,Molecular Sequence Annotation ,Drosophila melanogaster ,Female ,Transcription Initiation Site ,Biotechnology ,Resource ,Evolution ,Bioinformatics ,RNA Splicing ,Computational biology ,Vertebrate and Genome Annotation Project ,Evolution, Molecular ,Promoter Regions ,Genetic ,Animals ,Humans ,Position-Specific Scoring Matrices ,Nucleotide Motifs ,Comparative genomics ,Gene Expression Profiling ,Human Genome ,Reproducibility of Results ,Computational Biology ,Molecular ,biology.organism_classification ,Gene expression profiling ,RNA Splice Sites ,RNA Editing ,Generic health relevance ,Transcriptome ,Insect - Abstract
Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phylogeny by generating draft genomes for eight new species. For comparative transcriptome analysis, we generated mRNA expression profiles on 81 samples from multiple tissues and developmental stages of 15 Drosophila species, and we performed cap analysis of gene expression in D. melanogaster and D. pseudoobscura. We also describe conservation of four distinct core promoter structures composed of combinations of elements at three positions. Overall, each type of genomic feature shows a characteristic divergence rate relative to neutral models, highlighting the value of multispecies alignment in annotating a target genome that should prove useful in the annotation of other high priority genomes, especially human and other mammalian genomes that are rich in noncoding sequences. We report that the vast majority of elements in the annotation are evolutionarily conserved, indicating that the annotation will be an important springboard for functional genetic testing by the Drosophila community., National Institutes of Health (U.S.) (NIDDK (DK015600-18)), National Institutes of Health (U.S.) (Extramural program (1ROIGM082843)), National Institutes of Health (U.S.) (Extramural program (U01HB004271))
- Published
- 2014
95. Genome-wide Identification of Zero Nucleotide Recursive Splicing inDrosophila
- Author
-
Sara Olson, Michael O. Duff, Ahmad Osman, Xintao Wei, Brenton R. Graveley, Mohan Bolisetty, Susan E. Celniker, and Alex M. Plocik
- Subjects
0303 health sciences ,Intron ,Genomics ,Computational biology ,Biology ,Genome ,03 medical and health sciences ,Exon ,0302 clinical medicine ,RNA splicing ,splice ,Gene ,030217 neurology & neurosurgery ,Ultrabithorax ,030304 developmental biology - Abstract
Recursive splicing is a process in which large introns are removed in multiple steps by resplicing at ratchet points - 5? splice sites recreated after splicing. Recursive splicing was first identified in the Drosophila Ultrabithorax (Ubx) gene and only three additional Drosophila genes have since been experimentally shown to undergo recursive splicing. Here, we identify 196 zero nucleotide exon ratchet points in 130 introns of 115 Drosophila genes from total RNA sequencing data generated from developmental time points, dissected tissues, and cultured cells. Recursive splicing events were identified by splice junctions that map to annotated 5? splice sites and unannotated intronic 3? splice sites, the presence of the sequence AG/GT at the 3? splice site, and a 5? to 3? gradient of decreasing RNA-Seq read density indicative of co-transcriptional splicing. The sequential nature of recursive splicing was confirmed by identification of lariat introns generated by splicing to and from the ratchet points. We also show that recursive splicing is a constitutive process, and that the sequence and function of ratchet points are evolutionarily conserved. Together these results indicate that recursive splicing is commonly used in Drosophila and provides insight into the mechanisms by which some introns are removed.
- Published
- 2014
- Full Text
- View/download PDF
96. Spatial expression of transcription factors in Drosophila embryonic organ development
- Author
-
Christopher A. Bristow, Manolis Kellis, Richard Weiszmann, Susan E. Celniker, William W. Fisher, Siqi Wu, Volker Hartenstein, Erwin Frise, Ann S. Hammonds, Bin Yu, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Bristow, Christopher A., and Kellis, Manolis
- Subjects
Central Nervous System ,Bioinformatics ,Response element ,genetic processes ,Gene regulatory network ,Context (language use) ,Computational biology ,Biology ,03 medical and health sciences ,0302 clinical medicine ,Information and Computing Sciences ,Genetics ,Animals ,Drosophila Proteins ,natural sciences ,Developmental ,Gene Regulatory Networks ,Gene ,Transcription factor ,In Situ Hybridization ,030304 developmental biology ,0303 health sciences ,Research ,Gene Expression Profiling ,fungi ,Gene Expression Regulation, Developmental ,Biological Sciences ,Gene expression profiling ,Drosophila melanogaster ,Spatiotemporal gene expression ,Gene Expression Regulation ,Organ Specificity ,030217 neurology & neurosurgery ,Environmental Sciences ,Transcription Factors ,Biotechnology - Abstract
Background: Site-specific transcription factors (TFs) bind DNA regulatory elements to control expression of target genes, forming the core of gene regulatory networks. Despite decades of research, most studies focus on only a small number of TFs and the roles of many remain unknown. Results: We present a systematic characterization of spatiotemporal gene expression patterns for all known or predicted Drosophila TFs throughout embryogenesis, the first such comprehensive study for any metazoan animal. We generated RNA expression patterns for all 708 TFs by in situ hybridization, annotated the patterns using an anatomical controlled vocabulary, and analyzed TF expression in the context of organ system development. Nearly all TFs are expressed during embryogenesis and more than half are specifically expressed in the central nervous system. Compared to other genes, TFs are enriched early in the development of most organ systems, and throughout the development of the nervous system. Of the 535 TFs with spatially restricted expression, 79% are dynamically expressed in multiple organ systems while 21% show single-organ specificity. Of those expressed in multiple organ systems, 77 TFs are restricted to a single organ system either early or late in development. Expression patterns for 354 TFs are characterized for the first time in this study. Conclusions: We produced a reference TF dataset for the investigation of gene regulatory networks in embryogenesis, and gained insight into the expression dynamics of the full complement of TFs controlling the development of each organ system., National Institutes of Health (U.S.). Ruth L. Kirschstein National Research Service Award
- Published
- 2013
97. Quantitative Analysis of the Drosophila Segmentation Regulatory Network Using Pattern Generating Potentials
- Author
-
Majid Kazemian, Charles Blatti, Adam Richards, Michael McCutchan, Noriko Wakabayashi-Ito, Ann S. Hammonds, Susan E. Celniker, Sudhir Kumar, Scot A. Wolfe, Michael H. Brodsky, and Saurabh Sinha
- Subjects
0303 health sciences ,03 medical and health sciences ,0302 clinical medicine ,General Immunology and Microbiology ,QH301-705.5 ,General Neuroscience ,Correction ,Biology (General) ,General Agricultural and Biological Sciences ,030217 neurology & neurosurgery ,General Biochemistry, Genetics and Molecular Biology ,030304 developmental biology - Published
- 2013
98. An Extracellular Interactome of Immunoglobulin and LRR Proteins Reveals Receptor-Ligand Networks
- Author
-
Richard Weiszmann, Susan E. Celniker, Catharine L. Eastman, Deepa Waghray, Engin Özkan, Robert A. Carrillo, Karl G. Johnson, K. Christopher Garcia, and Kai Zinn
- Subjects
Subfamily ,Molecular Sequence Data ,Immunoglobulins ,Receptors, Cell Surface ,Biology ,Leucine-Rich Repeat Proteins ,Ligands ,Interactome ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,0302 clinical medicine ,Protein structure ,Cell surface receptor ,Extracellular ,Animals ,Drosophila Proteins ,Amino Acid Sequence ,Protein Interaction Maps ,Phylogeny ,030304 developmental biology ,Genetics ,0303 health sciences ,Biochemistry, Genetics and Molecular Biology(all) ,Proteins ,Fibronectins ,Protein Structure, Tertiary ,3. Good health ,Cell biology ,Fibronectin ,Drosophila melanogaster ,biology.protein ,Immunoglobulin superfamily ,Sequence Alignment ,030217 neurology & neurosurgery ,Drosophila Protein - Abstract
SummaryExtracellular domains of cell surface receptors and ligands mediate cell-cell communication, adhesion, and initiation of signaling events, but most existing protein-protein “interactome” data sets lack information for extracellular interactions. We probed interactions between receptor extracellular domains, focusing on a set of 202 proteins composed of the Drosophila melanogaster immunoglobulin superfamily (IgSF), fibronectin type III (FnIII), and leucine-rich repeat (LRR) families, which are known to be important in neuronal and developmental functions. Out of 20,503 candidate protein pairs tested, we observed 106 interactions, 83 of which were previously unknown. We “deorphanized” the 20 member subfamily of defective-in-proboscis-response IgSF proteins, showing that they selectively interact with an 11 member subfamily of previously uncharacterized IgSF proteins. Both subfamilies interact with a single common “orphan” LRR protein. We also observed interactions between Hedgehog and EGFR pathway components. Several of these interactions could be visualized in live-dissected embryos, demonstrating that this approach can identify physiologically relevant receptor-ligand pairs.
- Published
- 2013
99. Global Patterns of Tissue-Specific Alternative Polyadenylation in Drosophila
- Author
-
Michael O. Duff, Eric C. Lai, Susan E. Celniker, Peter Cherbas, Joe Carlson, Pedro Miura, Justen Andrews, James B. Brown, Robert C. Eisman, Dayu Zhang, Solomon Shenker, Jakub Orzechowski Westholm, Peter Smibert, Gemma E. May, Brenton R. Graveley, Brian D. Eads, and Thomas C. Kaufman
- Subjects
Male ,0106 biological sciences ,Untranslated region ,Embryo, Nonmammalian ,Polyadenylation ,Molecular Sequence Data ,Genes, Insect ,RNA-binding protein ,Biology ,01 natural sciences ,Article ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,0302 clinical medicine ,Testis ,Gene expression ,Animals ,Protein Isoforms ,RNA, Messenger ,Nucleotide Motifs ,lcsh:QH301-705.5 ,3' Untranslated Regions ,Gene ,Conserved Sequence ,In Situ Hybridization ,030304 developmental biology ,Neurons ,Genetics ,0303 health sciences ,Base Sequence ,Sequence Analysis, RNA ,Three prime untranslated region ,Gene Expression Regulation, Developmental ,Reproducibility of Results ,RNA ,Blotting, Northern ,biology.organism_classification ,DNA-Binding Proteins ,Drosophila melanogaster ,lcsh:Biology (General) ,Organ Specificity ,Poly A ,Transcriptome ,030217 neurology & neurosurgery ,010606 plant biology & botany - Abstract
SummaryWe analyzed the usage and consequences of alternative cleavage and polyadenylation (APA) in Drosophila melanogaster by using >1 billion reads of stranded mRNA-seq across a variety of dissected tissues. Beyond demonstrating that a majority of fly transcripts are subject to APA, we observed broad trends for 3′ untranslated region (UTR) shortening in the testis and lengthening in the central nervous system (CNS); the latter included hundreds of unannotated extensions ranging up to 18 kb. Extensive northern analyses validated the accumulation of full-length neural extended transcripts, and in situ hybridization indicated their spatial restriction to the CNS. Genes encoding RNA binding proteins (RBPs) and transcription factors were preferentially subject to 3′ UTR extensions. Motif analysis indicated enrichment of miRNA and RBP sites in the neural extensions, and their termini were enriched in canonical cis elements that promote cleavage and polyadenylation. Altogether, we reveal broad tissue-specific patterns of APA in Drosophila and transcripts with unprecedented 3′ UTR length in the nervous system.
- Published
- 2013
- Full Text
- View/download PDF
100. Development of the Indirect Flight Muscle Attachment Sites inDrosophila:Role of the PS Integrins and thestripeGene
- Author
-
K. VijayRaghavan, Susan E. Celniker, and Joyce Fernandes
- Subjects
Integrins ,Integrin ,Neuromuscular Junction ,Genes, Insect ,Muscle Development ,Muscle attachment ,Myocyte ,Animals ,Drosophila Proteins ,Molecular Biology ,biology ,Epidermis (botany) ,Myogenesis ,Muscles ,Metamorphosis, Biological ,Anatomy ,Cell Biology ,biology.organism_classification ,Immunohistochemistry ,Cell biology ,DNA-Binding Proteins ,Imaginal disc ,Drosophila melanogaster ,Flight, Animal ,Larva ,biology.protein ,Integrin alpha Chains ,Drosophila Protein ,Transcription Factors ,Developmental Biology - Abstract
Using markers that are expressed at muscle attachment sites, we have examined the early pupal development (first 36 hr) of Indirect Flight Muscle (IFM) attachments in the fruit fly Drosophila melanogaster. Expression of the Drosophila homologs of vertebrate integrins, the Position-Specific (PS) antigens, is known to differentially mark epidermal (PS1alpha) and muscle (PS2alpha) components of the developing IFM attachment sites. During myogenesis, PS2alpha is detected transiently in imaginal myoblasts that fuse with persistent larval muscles to give rise to the Dorsal Longitudinal Muscles (DLMs), but not in myoblasts that fuse de novo to give rise to the Dorso Ventral Muscles. The integrins are not expressed at attachment sites when the muscle fibers first make their appearance (12-20 hr). Following muscle-epidermal contact, PS1 and PS2 are detected at muscle attachment sites. PS1 expression is at the muscle ends and also in the long epidermal processes that connect the developing muscle fibers to their sites of attachment in the epidermis, while PS2 expression is restricted to the muscle ends. Epidermal cells that will contribute to the adult attachment sites are defined as early as the third larval instar. Both anterior and posterior sites of attachment of the IFMs are marked by the expression of reporter beta-galactosidase activity in a P-element line B14.0, which is an insertion at the stripe locus. B14.0 (stripe) is seen in distinct domains in the wing and leg imaginal discs which give rise to the thoracic cuticle. The expression is maintained during pupal development. The B14.0 (stripe) expressing epidermal cells contact the developing muscle fibers, leading to the formation of the myotendon junction. We show that the dorsal and ventral attachment sites of one group of IFMs, the DVMs arise from two different imaginal discs (wing and leg, respectively), which may explain the differential effect of mutations such as bendless on these muscles. Attachment sites for the other group of IFMs, the DLMs, on the other hand, arise from one imaginal disc (wing). B14.0 (stripe) expression defines epidermal cells of the adult attachment sites and is likely to function during early events leading to the formation of muscle-epithelial contacts. The PS integrins are detected at later stages, suggesting a role in the stabilization and maturation of the muscle-epidermal contacts into myotendon junctions.
- Published
- 1996
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.