79 results on '"Gaspin C"'
Search Results
2. Mapping rRNA 2’-O-methylations and identification of C/D snoRNAs in Arabidopsis thaliana plants
- Author
-
Azevedo-Favory, J., primary, Gaspin, C., additional, Ayadi, L., additional, Montacié, C., additional, Marchand, V., additional, Jobet, E., additional, Rompais, M., additional, Carapito, C., additional, Motorin, Y., additional, and Sáez-Vásquez, J., additional
- Published
- 2021
- Full Text
- View/download PDF
3. Genome sequence of the plant pathogen Ralstonia solanacearum
- Author
-
Salanoubat, M., Genin, S., Artiguenave, F., Gouzy, J., Mangenot, S., Arlat, M., Billault, A., Brottier, P., Camus, J. C., Cattolico, L., Chandler, M., Choisne, N., Claudel-Renard, C., Cunnac, S., Demange, N., Gaspin, C., Lavie, M., Moisan, A., Robert, C., Saurin, W., Schiex, T., Siguier, P., Thebault, P., Whalen, M., Wincker, P., Levy, M., Weissenbach, J., and Boucher, C. A.
- Subjects
Environmental issues ,Science and technology ,Zoology and wildlife conservation - Abstract
Author(s): M. Salanoubat [1, 2]; S. Genin [2, 3]; F. Artiguenave [1, 4]; J. Gouzy [3]; S. Mangenot [1]; M. Arlat [3]; A. Billault [5]; P. Brottier [1]; J. C. [...]
- Published
- 2002
- Full Text
- View/download PDF
4. Searching RNA motifs and their intermolecular contacts with constraint networks
- Author
-
Thébault, P., de Givry, S., Schiex, T., and Gaspin, C.
- Published
- 2006
5. Oak genome reveals facets of long lifespan
- Author
-
Plomion, C., Aury, J.-M., Amselem, J., Leroy, T., Murat, F., Duplessis, S., Faye, S., Francillonne, N., Labadie, K., Le Provost, G., Lesur, I., Bartholomé, J., Faivre-Rampant, P., Kohler, A., Leplé, J.-C., Chantret, N., Chen, J., Diévart, A., Alaeitabar, T., Barbe, V., Belser, C., Bergès, H., Bodénès, C., Bogeat-Triboulot, M.-B., Bouffaud, Marie-Lara, Brachi, B., Chancerel, E., Cohen, D., Couloux, A., Da Silva, C., Dossat, C., Ehrenmann, F., Gaspin, C., Grima-Pettenati, J., Guichoux, E., Hecker, A., Herrmann, Sylvie, Hugueney, P., Hummel, I., Klopp, C., Lalanne, C., Lascoux, M., Lasserre, E., Lemainque, A., Desprez-Loustau, M.-L., Luyten, I., Madoui, M.-A., Mangenot, S., Marchal, C., Maumus, F., Mercier, J., Michotey, C., Panaud, O., Picault, N., Rouhier, N., Rué, O., Rustenholz, C., Salin, F., Soler, M., Tarkka, Mika, Velt, A., Zanne, A.E., Martin, F., Wincker, P., Quesneville, H., Kremer, A., Salse, J., Plomion, C., Aury, J.-M., Amselem, J., Leroy, T., Murat, F., Duplessis, S., Faye, S., Francillonne, N., Labadie, K., Le Provost, G., Lesur, I., Bartholomé, J., Faivre-Rampant, P., Kohler, A., Leplé, J.-C., Chantret, N., Chen, J., Diévart, A., Alaeitabar, T., Barbe, V., Belser, C., Bergès, H., Bodénès, C., Bogeat-Triboulot, M.-B., Bouffaud, Marie-Lara, Brachi, B., Chancerel, E., Cohen, D., Couloux, A., Da Silva, C., Dossat, C., Ehrenmann, F., Gaspin, C., Grima-Pettenati, J., Guichoux, E., Hecker, A., Herrmann, Sylvie, Hugueney, P., Hummel, I., Klopp, C., Lalanne, C., Lascoux, M., Lasserre, E., Lemainque, A., Desprez-Loustau, M.-L., Luyten, I., Madoui, M.-A., Mangenot, S., Marchal, C., Maumus, F., Mercier, J., Michotey, C., Panaud, O., Picault, N., Rouhier, N., Rué, O., Rustenholz, C., Salin, F., Soler, M., Tarkka, Mika, Velt, A., Zanne, A.E., Martin, F., Wincker, P., Quesneville, H., Kremer, A., and Salse, J.
- Abstract
Oaks are an important part of our natural and cultural heritage. Not only are they ubiquitous in our most common landscapes1 but they have also supplied human societies with invaluable services, including food and shelter, since prehistoric times2. With 450 species spread throughout Asia, Europe and America3, oaks constitute a critical global renewable resource. The longevity of oaks (several hundred years) probably underlies their emblematic cultural and historical importance. Such long-lived sessile organisms must persist in the face of a wide range of abiotic and biotic threats over their lifespans. We investigated the genomic features associated with such a long lifespan by sequencing, assembling and annotating the oak genome. We then used the growing number of whole-genome sequences for plants (including tree and herbaceous species) to investigate the parallel evolution of genomic characteristics potentially underpinning tree longevity. A further consequence of the long lifespan of trees is their accumulation of somatic mutations during mitotic divisions of stem cells present in the shoot apical meristems. Empirical4 and modelling5 approaches have shown that intra-organismal genetic heterogeneity can be selected for6 and provides direct fitness benefits in the arms race with short-lived pests and pathogens through a patchwork of intra-organismal phenotypes7. However, there is no clear proof that large-statured trees consist of a genetic mosaic of clonally distinct cell lineages within and between branches. Through this case study of oak, we demonstrate the accumulation and transmission of somatic mutations and the expansion of disease-resistance gene families in trees.
- Published
- 2018
6. The Cm56 tRNA modification in archaea is catalyzed either by a specific 2 '-O-methylase or a
- Author
-
Renalier, M.H., Joseph, N., Gaspin, C., Thébault, P., Mougin, A., Laboratoire de Biométrie et Biologie Evolutive - UMR 5558 (LBBE), Université Claude Bernard Lyon 1 (UCBL), and Université de Lyon-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-VetAgro Sup - Institut national d'enseignement supérieur et de recherche en alimentation, santé animale, sciences agronomiques et de l'environnement (VAS)-Centre National de la Recherche Scientifique (CNRS)
- Subjects
[SDV.OT]Life Sciences [q-bio]/Other [q-bio.OT] - Published
- 2005
7. Bounds Arc Consistency for Weighted CSPs
- Author
-
Zytnicki, M., primary, Gaspin, C., additional, De Givry, S., additional, and Schiex, T., additional
- Published
- 2009
- Full Text
- View/download PDF
8. RNA Structure Prediction
- Author
-
Westhof, E., primary and Gaspin, C., additional
- Published
- 2002
- Full Text
- View/download PDF
9. DRAWNA: A program for drawing schematic views of nucleic acids
- Author
-
Massire, C., primary, Gaspin, C., additional, and Westhof, E., additional
- Published
- 1994
- Full Text
- View/download PDF
10. A global state driven cooling schema for simulated annealing used with Boltzman machine
- Author
-
Bouju, A., primary, Bourret, P., additional, and Gaspin, C., additional
- Published
- 1990
- Full Text
- View/download PDF
11. Automatic translation of constraints for solving optimization problems by neural networks
- Author
-
Gaspin, C., primary
- Published
- 1990
- Full Text
- View/download PDF
12. GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts
- Author
-
Aubourg, S., Brunaud, W., Bruyere, C., Cock, M., Cooke, R., Cottet, A., Couloux, A., Dehais, P., Deleage, G., Duclert, A., Echeverria, M., Eschbach, A., Falconet, D., Filippi, G., Gaspin, C., Geourjon, C., Grienenberger, Jm, Houlne, G., Jamet, E., Lechauve, F., Leleu, O., Leroy, P., Mache, R., Meyer, C., Negrutiu, L., Orsini, V., Peyretaillade, E., Pommier, C., Jeroen Raes, Risler, Jl, Riviere, S., Rombauts, S., Rouze, P., Schneider, M., Schwob, P., Small, I., Soumayet-Kampetenga, G., Stankovski, D., Toffano, C., Tognolli, M., Caboche, M., and Lecharny, A.
13. Identification of CRISPR and riboswitch related RNAs among novel noncoding RNAs of the euryarchaeon Pyrococcus abyssi
- Author
-
Carpousis Agamemnon J, Brucato Nicolas, Rinaldi Dana, Moisan Annick, Phok Kounthéa, Gaspin Christine, and Clouet-d'Orval Béatrice
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Noncoding RNA (ncRNA) has been recognized as an important regulator of gene expression networks in Bacteria and Eucaryota. Little is known about ncRNA in thermococcal archaea except for the eukaryotic-like C/D and H/ACA modification guide RNAs. Results Using a combination of in silico and experimental approaches, we identified and characterized novel P. abyssi ncRNAs transcribed from 12 intergenic regions, ten of which are conserved throughout the Thermococcales. Several of them accumulate in the late-exponential phase of growth. Analysis of the genomic context and sequence conservation amongst related thermococcal species revealed two novel P. abyssi ncRNA families. The CRISPR family is comprised of crRNAs expressed from two of the four P. abyssi CRISPR cassettes. The 5'UTR derived family includes four conserved ncRNAs, two of which have features similar to known bacterial riboswitches. Several of the novel ncRNAs have sequence similarities to orphan OrfB transposase elements. Based on RNA secondary structure predictions and experimental results, we show that three of the twelve ncRNAs include Kink-turn RNA motifs, arguing for a biological role of these ncRNAs in the cell. Furthermore, our results show that several of the ncRNAs are subjected to processing events by enzymes that remain to be identified and characterized. Conclusions This work proposes a revised annotation of CRISPR loci in P. abyssi and expands our knowledge of ncRNAs in the Thermococcales, thus providing a starting point for studies needed to elucidate their biological function.
- Published
- 2011
- Full Text
- View/download PDF
14. Distribution of short interstitial telomere motifs in two plant genomes: putative origin and function
- Author
-
Lescure Bernard, Rami Jean-François, and Gaspin Christine
- Subjects
Botany ,QK1-989 - Abstract
Abstract Background Short interstitial telomere motifs (telo boxes) are short sequences identical to plant telomere repeat units. They are observed within the 5' region of several genes over-expressed in cycling cells. In synergy with various cis-acting elements, these motifs participate in the activation of expression. Here, we have analysed the distribution of telo boxes within Arabidopsis thaliana and Oryza sativa genomes and their association with genes involved in the biogenesis of the translational apparatus. Results Our analysis showed that the distribution of the telo box (AAACCCTA) in different genomic regions of A. thaliana and O. sativa is not random. As is also the case for plant microsatellites, they are preferentially located in the 5' flanking regions of genes, mainly within the 5' UTR, and distributed as a gradient along the direction of transcription. As previously reported in Arabidopsis, a conserved topological association of telo boxes with site II or TEF cis-acting elements is observed in almost all promoters of genes encoding ribosomal proteins in O. sativa. Such a conserved promoter organization can be found in other genes involved in the biogenesis of the translational machinery including rRNA processing proteins and snoRNAs. Strikingly, the association of telo boxes with site II motifs or TEF boxes is conserved in promoters of genes harbouring snoRNA clusters nested within an intron as well as in the 5' flanking regions of non-intronic snoRNA genes. Thus, the search for associations between telo boxes and site II motifs or TEF box in plant genomes could provide a useful tool for characterizing new cryptic RNA pol II promoters. Conclusions The data reported in this work support the model previously proposed for the spreading of telo boxes within plant genomes and provide new insights into a putative process for the acquisition of microsatellites in plants. The association of telo boxes with site II or TEF cis-acting elements appears to be an essential feature of plant genes involved in the biogenesis of ribosomes and clearly indicates that most plant snoRNAs are RNA pol II products.
- Published
- 2010
- Full Text
- View/download PDF
15. RNomics and Modomics in the halophilic archaea Haloferax volcanii: identification of RNA modification genes
- Author
-
Decatur Wayne A, Marck Christian, Gaspin Christine, Grosjean Henri, and de Crécy-Lagard Valérie
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Naturally occurring RNAs contain numerous enzymatically altered nucleosides. Differences in RNA populations (RNomics) and pattern of RNA modifications (Modomics) depends on the organism analyzed and are two of the criteria that distinguish the three kingdoms of life. If the genomic sequences of the RNA molecules can be derived from whole genome sequence information, the modification profile cannot and requires or direct sequencing of the RNAs or predictive methods base on the presence or absence of the modifications genes. Results By employing a comparative genomics approach, we predicted almost all of the genes coding for the t+rRNA modification enzymes in the mesophilic moderate halophile Haloferax volcanii. These encode both guide RNAs and enzymes. Some are orthologous to previously identified genes in Archaea, Bacteria or in Saccharomyces cerevisiae, but several are original predictions. Conclusion The number of modifications in t+rRNAs in the halophilic archaeon is surprisingly low when compared with other Archaea or Bacteria, particularly the hyperthermophilic organisms. This may result from the specific lifestyle of halophiles that require high intracellular salt concentration for survival. This salt content could allow RNA to maintain its functional structural integrity with fewer modifications. We predict that the few modifications present must be particularly important for decoding, accuracy of translation or are modifications that cannot be functionally replaced by the electrostatic interactions provided by the surrounding salt-ions. This analysis also guides future experimental validation work aiming to complete the understanding of the function of RNA modifications in Archaeal translation.
- Published
- 2008
- Full Text
- View/download PDF
16. LeARN: a platform for detecting, clustering and annotating non-coding RNAs
- Author
-
Schiex Thomas, Gaspin Christine, Noirot Céline, and Gouzy Jérôme
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs) in a very crude way, allowing neither the edition of the secondary structures nor the clustering of ncRNA genes into families which are crucial for appropriate annotation of these molecules. Results LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation. Conclusion This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, that detect independent ncRNA occurrences, and public ncRNA repositories, that do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN
- Published
- 2008
- Full Text
- View/download PDF
17. A neural based approach of constraints satisfaction problem.
- Author
-
Bourret, P. and Gaspin, C.
- Published
- 1992
- Full Text
- View/download PDF
18. ESSA: an integrated and interactive computer tool for analysing RNA secondary structure
- Author
-
Chetouani, F., Monestié, P., Thébault, P., Gaspin, C., and Michot, B.
- Abstract
With ESSA, we propose an approach of RNA secondary structure analysis based on extensive viewing within a friendly graphical interface. This computer program is organized around the display of folding models produced by two complementary methods suitable to draw long RNA molecules. Any feature of interest can be managed directly on the display and highlighted by a rich combination of colours and symbols with emphasis given to structural probe accessibilities. ESSA also includes a word searching procedure allowing easy visual identification of structural features even complex and degenerated. Analysis functions make it possible to calculate the thermodynamic stability of any part of a folding using several models and compare homologous aligned RNA both in primary and secondary structure. The predictive capacities of ESSA which brings together the experimental, thermodynamic and comparative methods, are increased by coupling it with a program dedicated to RNA folding prediction based on constraints management and propagation. The potentialities of ESSA are illustrated by the identification of a possible tertiary motif in the LSU rRNA and the visualization of a pseudoknot in S15 mRNA.
- Published
- 1997
- Full Text
- View/download PDF
19. Massive detection of cryptic recessive genetic defects in dairy cattle mining millions of life histories.
- Author
-
Besnard F, Guintard A, Grohs C, Guzylack-Piriou L, Cano M, Escouflaire C, Hozé C, Leclerc H, Buronfosse T, Dutheil L, Jourdain J, Barbat A, Fritz S, Deloche MC, Remot A, Gaussères B, Clément A, Bouchier M, Contat E, Relun A, Plassard V, Rivière J, Péchoux C, Vilotte M, Eche C, Kuchly C, Charles M, Boulling A, Viard G, Minéry S, Barbey S, Birbes C, Danchin-Burge C, Launay F, Mattalia S, Allais-Bonnet A, Ravary B, Millemann Y, Guatteo R, Klopp C, Gaspin C, Iampietro C, Donnadieu C, Milan D, Arcangioli MA, Boussaha M, Foucras G, Boichard D, and Capitan A
- Subjects
- Animals, Cattle, Data Mining, Cattle Diseases genetics, Haplotypes, Genes, Recessive
- Abstract
Background: Dairy cattle breeds are populations of limited effective size, subject to recurrent outbreaks of recessive defects that are commonly studied using positional cloning. However, this strategy, based on the observation of animals with characteristic features, may overlook a number of conditions, such as immune or metabolic genetic disorders, which may be confused with pathologies of environmental etiology., Results: We present a data mining framework specifically designed to detect recessive defects in livestock that have been previously missed due to a lack of specific signs, incomplete penetrance, or incomplete linkage disequilibrium. This approach leverages the massive data generated by genomic selection. Its basic principle is to compare the observed and expected numbers of homozygotes for sliding haplotypes in animals with different life histories. Within three cattle breeds, we report 33 new loci responsible for increased risk of juvenile mortality and present a series of validations based on large-scale genotyping, clinical examination, and functional studies for candidate variants affecting the NOA1, RFC5, and ITGB7 genes. In particular, we describe disorders associated with NOA1 and RFC5 mutations for the first time in vertebrates., Conclusions: The discovery of these many new defects will help to characterize the genetic basis of inbreeding depression, while their management will improve animal welfare and reduce losses to the industry., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
20. Asterics: a simple tool for the ExploRation and Integration of omiCS data.
- Author
-
Maigné É, Noirot C, Henry J, Adu Kesewaah Y, Badin L, Déjean S, Guilmineau C, Krebs A, Mathevet F, Segalini A, Thomassin L, Colongo D, Gaspin C, Liaubet L, and Vialaneix N
- Subjects
- Workflow, Software
- Abstract
Background: The rapid development of omics acquisition techniques has induced the production of a large volume of heterogeneous and multi-level omics datasets, which require specific and sometimes complex analyses to obtain relevant biological information. Here, we present ASTERICS (version 2.5), a publicly available web interface for the analyses of omics datasets., Results: ASTERICS is designed to make both standard and complex exploratory and integration analysis workflows easily available to biologists and to provide high quality interactive plots. Special care has been taken to provide a comprehensive documentation of the implemented analyses and to guide users toward sound analysis choices regarding some specific omics data. Data and analyses are organized in a comprehensive graphical workflow within ASTERICS workspace to facilitate the understanding of successive data editions and analyses leading to a given result., Conclusion: ASTERICS provides an easy to use platform for omics data exploration and integration. The modular organization of its open source code makes it easy to incorporate new workflows and analyses by external contributors. ASTERICS is available at https://asterics.miat.inrae.fr and can also be deployed using provided docker images., (© 2023. BioMed Central Ltd., part of Springer Nature.)
- Published
- 2023
- Full Text
- View/download PDF
21. A Bos taurus sequencing methods benchmark for assembly, haplotyping, and variant calling.
- Author
-
Eché C, Iampietro C, Birbes C, Dréau A, Kuchly C, Di Franco A, Klopp C, Faraut T, Djebali S, Castinel A, Zytnicki M, Denis E, Boussaha M, Grohs C, Boichard D, Gaspin C, Milan D, and Donnadieu C
- Subjects
- Animals, Cattle, Female, Benchmarking, Genome, Sequence Analysis, DNA, Genomics, High-Throughput Nucleotide Sequencing
- Abstract
Inspired by the production of reference data sets in the Genome in a Bottle project, we sequenced one Charolais heifer with different technologies: Illumina paired-end, Oxford Nanopore, Pacific Biosciences (HiFi and CLR), 10X Genomics linked-reads, and Hi-C. In order to generate haplotypic assemblies, we also sequenced both parents with short reads. From these data, we built two haplotyped trio high quality reference genomes and a consensus assembly, using up-to-date software packages. The assemblies obtained using PacBio HiFi reaches a size of 3.2 Gb, which is significantly larger than the 2.7 Gb ARS-UCD1.2 reference. The BUSCO score of the consensus assembly reaches a completeness of 95.8%, among highly conserved mammal genes. We also identified 35,866 structural variants larger than 50 base pairs. This assembly is a contribution to the bovine pangenome for the "Charolais" breed. These datasets will prove to be useful resources enabling the community to gain additional insight on sequencing technologies for applications such as SNP, indel or structural variant calling, and de novo assembly., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
22. MicroRNA Target Identification: Revisiting Accessibility and Seed Anchoring.
- Author
-
Homberg N, Galvão Ferrarini M, Gaspin C, and Sagot MF
- Subjects
- MicroRNAs genetics, MicroRNAs metabolism, RNA, Messenger genetics, Gene Expression Regulation
- Abstract
By pairing to messenger RNAs (mRNAs for short), microRNAs (miRNAs) regulate gene expression in animals and plants. Accurately identifying which mRNAs interact with a given miRNA and the precise location of the interaction sites is crucial to reaching a more complete view of the regulatory network of an organism. Only a few experimental approaches, however, allow the identification of both within a single experiment. Computational predictions of miRNA-mRNA interactions thus remain generally the first step used, despite their drawback of a high rate of false-positive predictions. The major computational approaches available rely on a diversity of features, among which anchoring the miRNA seed and measuring mRNA accessibility are the key ones, with the first being universally used, while the use of the second remains controversial. Revisiting the importance of each is the aim of this paper, which uses Cross-Linking, Ligation, And Sequencing of Hybrids (CLASH) datasets to achieve this goal. Contrary to what might be expected, the results are more ambiguous regarding the use of the seed match as a feature, while accessibility appears to be a feature worth considering, indicating that, at least under some conditions, it may favour anchoring by miRNAs.
- Published
- 2023
- Full Text
- View/download PDF
23. srnaMapper: an optimal mapping tool for sRNA-Seq reads.
- Author
-
Zytnicki M and Gaspin C
- Subjects
- Sequence Analysis, RNA methods, RNA, Small Interfering, RNA, Transfer, High-Throughput Nucleotide Sequencing methods, MicroRNAs
- Abstract
Background: Sequencing is the key method to study the impact of short RNAs, which include micro RNAs, tRNA-derived RNAs, and piwi-interacting RNA, among others. The first step to make use of these reads is to map them to a genome. Existing mapping tools have been developed for long RNAs in mind, and, so far, no tool has been conceived for short RNAs. However, short RNAs have several distinctive features which make them different from messenger RNAs: they are shorter, they are often redundant, they can be produced by duplicated loci, and they may be edited at their ends., Results: In this work, we present a new tool, srnaMapper, that exhaustively maps these reads with all these features in mind, and is most efficient when applied to reads no longer than 50 base pairs. We show, on several datasets, that srnaMapper is very efficient considering computation time and edition error handling: it retrieves all the hits, with arbitrary number of errors, in time comparable with non-exhaustive tools., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
24. Generation of full-length circular RNA libraries for Oxford Nanopore long-read sequencing.
- Author
-
Fuchs S, Babin L, Andraos E, Bessiere C, Willier S, Schulte JH, Gaspin C, and Meggetto F
- Subjects
- High-Throughput Nucleotide Sequencing methods, Humans, RNA genetics, Sequence Analysis, RNA methods, Nanopores, RNA, Circular
- Abstract
Circular RNA (circRNA) is a noncoding RNA class with important implications for gene expression regulation, mostly by interaction with other RNA species or RNA-binding proteins. While the commonly applied short-read Illumina RNA-sequencing techniques can be used to detect circRNAs, their full sequence is not revealed. However, the complete sequence information is needed to analyze potential interactions and thus the mechanism of action of circRNAs. Here, we present an improved protocol to enrich and sequence full-length circRNAs by using the Oxford Nanopore long-read sequencing platform. The protocol involves an enrichment of lowly abundant circRNAs by exonuclease treatment and negative selection of linear RNAs. Then, a cDNA library is created and amplified by PCR. This protocol provides enough material for several sequencing runs. The library is used as input for ligation-based sequencing together with native barcoding. Stringent quality control of the libraries is ensured by a combination of Qubit, Fragment Analyzer and qRT-PCR. Multiplexing of up to 4 libraries yields in total more than 1-2 Million reads per library, of which 1-2% are circRNA-specific reads with >99% of them full-length. The protocol works well with human cancer cell lines. We further provide suggestions for the bioinformatic analysis of the created data, as well as the limitations of our approach together with recommendations for troubleshooting and interpretation. Taken together, this protocol enables reliable full-length analysis of circRNAs, a noncoding RNA type involved in a growing number of physiologic and pathologic conditions. Metadata Associated content. https://dx.doi.org/10.17504/protocols.io.rm7vzy8r4lx1/v2., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2022
- Full Text
- View/download PDF
25. Phylodynamic study of the conserved RNA structure encompassing the hemagglutinin cleavage site encoding region of H5 and H7 low pathogenic avian influenza viruses.
- Author
-
Dupré G, Hoede C, Figueroa T, Bessière P, Bertagnoli S, Ducatez M, Gaspin C, and Volmer R
- Abstract
Highly pathogenic avian influenza viruses (HPAIVs) evolve from low pathogenic avian influenza viruses (LPAIVs) of the H5 and H7 subtypes. This evolution is characterized by the acquisition of a multi-basic cleavage site (MBCS) motif in the hemagglutinin (HA) that leads to an extended viral tropism and severe disease in poultry. One key unanswered question is whether the risk of transition to HPAIVs is similar for all LPAIVs H5 or H7 strains, or whether specific determinants in the HA sequence of some H5 or H7 LPAIV strains correlate with a higher risk of transition to HPAIVs. Here, we determined if specific features of the conserved RNA stem-loop located at the HA cleavage site-encoding region could be detected along the LPAIV to HPAIV evolutionary pathway. Analysis of the thermodynamic stability of the predicted RNA structures showed no specific patterns common to HA sequences leading to HPAIVs and distinct from those remaining LPAIVs. However, RNA structure clustering analysis revealed that most of the American lineage ancestors leading to H7 emergences via recombination shared the same viral RNA (vRNA) structure topology at the HA1/HA2 boundary region. Our study thus identified predicted secondary RNA structures present in the HA of H7 viruses, which could promote genetic recombination and acquisition of a multibasic cleavage site motif (MBCS)., (© The Author(s) 2021. Published by Oxford University Press.)
- Published
- 2021
- Full Text
- View/download PDF
26. mmannot: How to improve small-RNA annotation?
- Author
-
Zytnicki M and Gaspin C
- Subjects
- Genomics, High-Throughput Nucleotide Sequencing, MicroRNAs genetics, Molecular Sequence Annotation, RNA, Small Untranslated genetics, Sequence Analysis, RNA methods, Software
- Abstract
High-throughput sequencing makes it possible to provide the genome-wide distribution of small non coding RNAs in a single experiment, and contributed greatly to the identification and understanding of these RNAs in the last decade. Small non coding RNAs gather a wide collection of classes, such as microRNAs, tRNA-derived fragments, small nucleolar RNAs and small nuclear RNAs, to name a few. As usual in RNA-seq studies, the sequencing step is followed by a feature quantification step: when a genome is available, the reads are aligned to the genome, their genomic positions are compared to the already available annotations, and the corresponding features are quantified. However, problem arises when many reads map at several positions and while different strategies exist to circumvent this problem, all of them are biased. In this article, we present a new strategy that compares all the reads that map at several positions, and their annotations when available. In many cases, all the hits co-localize with the same feature annotation (a duplicated miRNA or a duplicated gene, for instance). When different annotations exist for a given read, we propose to merge existing features and provide the counts for the merged features. This new strategy has been implemented in a tool, mmannot, freely available at https://github.com/mzytnicki/mmannot., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
27. Multiple Origins and Specific Evolution of CRISPR/Cas9 Systems in Minimal Bacteria ( Mollicutes ).
- Author
-
Ipoutcha T, Tsarmpopoulos I, Talenton V, Gaspin C, Moisan A, Walker CA, Brownlie J, Blanchard A, Thebault P, and Sirand-Pugnet P
- Abstract
CRISPR/Cas systems provide adaptive defense mechanisms against invading nucleic acids in prokaryotes. Because of its interest as a genetic tool, the Type II CRISPR/Cas9 system from Streptococcus pyogenes has been extensively studied. It includes the Cas9 endonuclease that is dependent on a dual-guide RNA made of a tracrRNA and a crRNA. Target recognition relies on crRNA annealing and the presence of a protospacer adjacent motif (PAM). Mollicutes are currently the bacteria with the smallest genome in which CRISPR/Cas systems have been reported. Many of them are pathogenic to humans and animals (mycoplasmas and ureaplasmas) or plants (phytoplasmas and some spiroplasmas). A global survey was conducted to identify and compare CRISPR/Cas systems found in the genome of these minimal bacteria. Complete or degraded systems classified as Type II-A and less frequently as Type II-C were found in the genome of 21 out of 52 representative mollicutes species. Phylogenetic reconstructions predicted a common origin of all CRISPR/Cas systems of mycoplasmas and at least two origins were suggested for spiroplasmas systems. Cas9 in mollicutes were structurally related to the S. aureus Cas9 except the PI domain involved in the interaction with the PAM, suggesting various PAM might be recognized by Cas9 of different mollicutes. Structure of the predicted crRNA/tracrRNA hybrids was conserved and showed typical stem-loop structures pairing the Direct Repeat part of crRNAs with the 5' region of tracrRNAs. Most mollicutes crRNA/tracrRNAs showed G + C% significantly higher than the genome, suggesting a selective pressure for maintaining stability of these secondary structures. Examples of CRISPR spacers matching with mollicutes phages were found, including the textbook case of Mycoplasma cynos strain C142 having no prophage sequence but a CRISPR/Cas system with spacers targeting prophage sequences that were found in the genome of another M. cynos strain that is devoid of a CRISPR system. Despite their small genome size, mollicutes have maintained protective means against invading DNAs, including restriction/modification and CRISPR/Cas systems. The apparent lack of CRISPR/Cas systems in several groups of species including main pathogens of humans, ruminants, and plants suggests different evolutionary routes or a lower risk of phage infection in specific ecological niches., (Copyright © 2019 Ipoutcha, Tsarmpopoulos, Talenton, Gaspin, Moisan, Walker, Brownlie, Blanchard, Thebault and Sirand-Pugnet.)
- Published
- 2019
- Full Text
- View/download PDF
28. Draft Genome Sequence of Tubulinosema ratisbonensis, a Microsporidian Species Infecting the Model Organism Drosophila melanogaster.
- Author
-
Polonais V, Niehus S, Wawrzyniak I, Franchet A, Gaspin C, Belkorchia A, Reichstadt M, Belser C, Labadie K, Couloux A, Delbac F, Peyretaillade E, and Ferrandon D
- Abstract
We present the draft genome sequence of Tubulinosema ratisbonensis , a microsporidium species infecting Drosophila melanogaster A total of 3,013 protein-encoding genes and an array of transposable elements were identified. This work represents a necessary step to develop a novel model of host-parasite relationships using the highly tractable genetic model D. melanogaster ., (Copyright © 2019 Polonais et al.)
- Published
- 2019
- Full Text
- View/download PDF
29. Oak genome reveals facets of long lifespan.
- Author
-
Plomion C, Aury JM, Amselem J, Leroy T, Murat F, Duplessis S, Faye S, Francillonne N, Labadie K, Le Provost G, Lesur I, Bartholomé J, Faivre-Rampant P, Kohler A, Leplé JC, Chantret N, Chen J, Diévart A, Alaeitabar T, Barbe V, Belser C, Bergès H, Bodénès C, Bogeat-Triboulot MB, Bouffaud ML, Brachi B, Chancerel E, Cohen D, Couloux A, Da Silva C, Dossat C, Ehrenmann F, Gaspin C, Grima-Pettenati J, Guichoux E, Hecker A, Herrmann S, Hugueney P, Hummel I, Klopp C, Lalanne C, Lascoux M, Lasserre E, Lemainque A, Desprez-Loustau ML, Luyten I, Madoui MA, Mangenot S, Marchal C, Maumus F, Mercier J, Michotey C, Panaud O, Picault N, Rouhier N, Rué O, Rustenholz C, Salin F, Soler M, Tarkka M, Velt A, Zanne AE, Martin F, Wincker P, Quesneville H, Kremer A, and Salse J
- Subjects
- Biological Evolution, DNA, Plant genetics, Genetic Variation genetics, Longevity genetics, Mutation, Phylogeny, Sequence Analysis, DNA, Genome, Plant genetics, Quercus genetics
- Abstract
Oaks are an important part of our natural and cultural heritage. Not only are they ubiquitous in our most common landscapes
1 but they have also supplied human societies with invaluable services, including food and shelter, since prehistoric times2 . With 450 species spread throughout Asia, Europe and America3 , oaks constitute a critical global renewable resource. The longevity of oaks (several hundred years) probably underlies their emblematic cultural and historical importance. Such long-lived sessile organisms must persist in the face of a wide range of abiotic and biotic threats over their lifespans. We investigated the genomic features associated with such a long lifespan by sequencing, assembling and annotating the oak genome. We then used the growing number of whole-genome sequences for plants (including tree and herbaceous species) to investigate the parallel evolution of genomic characteristics potentially underpinning tree longevity. A further consequence of the long lifespan of trees is their accumulation of somatic mutations during mitotic divisions of stem cells present in the shoot apical meristems. Empirical4 and modelling5 approaches have shown that intra-organismal genetic heterogeneity can be selected for6 and provides direct fitness benefits in the arms race with short-lived pests and pathogens through a patchwork of intra-organismal phenotypes7 . However, there is no clear proof that large-statured trees consist of a genetic mosaic of clonally distinct cell lineages within and between branches. Through this case study of oak, we demonstrate the accumulation and transmission of somatic mutations and the expansion of disease-resistance gene families in trees.- Published
- 2018
- Full Text
- View/download PDF
30. Large-Scale Measurement of mRNA Degradation in Escherichia coli: To Delay or Not to Delay.
- Author
-
Laguerre S, González I, Nouaille S, Moisan A, Villa-Vialaneix N, Gaspin C, Bouvier M, Carpousis AJ, Cocaign-Bousquet M, and Girbal L
- Subjects
- RNA Stability genetics, Transcription, Genetic genetics, Escherichia coli genetics, RNA Stability physiology, RNA, Bacterial metabolism, RNA, Messenger metabolism
- Abstract
In this study, we compared different computational methods used for genome-wide determination of mRNA half-lives in Escherichia coli with a special focus on the impact on considering a delay before the onset of mRNA decay after transcription arrest. A wide variety of datasets were analyzed coming from different technical methods for mRNA quantification (microarrays, RNA-seq, and RT-qPCR) and different bacterial growth conditions. The exponential decay of mRNA levels was fitted using both linear and exponential models and with or without a delay. We showed that for all the models, independently of mRNA quantification methods and growth conditions, ignoring the delay resulted in only a modest overestimation of the half-life. For approximately 80% of the mRNAs, differences in mRNA half-life values were less than 34s. The correlation between half-lives estimated with and without a delay was extremely high. However, the slope of the linear regression between the half-lives with and without a delay tended to decrease with the delay. For the few mRNAs for which taking into account the delay influenced the estimated half-life, the impact was dependent on the model and the growth condition. The smallest impact was obtained for the linear model., (© 2018 Elsevier Inc. All rights reserved.)
- Published
- 2018
- Full Text
- View/download PDF
31. Unraveling the evolution and coevolution of small regulatory RNAs and coding genes in Listeria.
- Author
-
Cerutti F, Mallet L, Painset A, Hoede C, Moisan A, Bécavin C, Duval M, Dussurget O, Cossart P, Gaspin C, and Chiapello H
- Subjects
- Gene Regulatory Networks, Genes, Bacterial, Genome, Bacterial, Listeria pathogenicity, Bacterial Proteins genetics, Evolution, Molecular, Listeria genetics, RNA, Small Untranslated genetics
- Abstract
Background: Small regulatory RNAs (sRNAs) are widely found in bacteria and play key roles in many important physiological and adaptation processes. Studying their evolution and screening for events of coevolution with other genomic features is a powerful way to better understand their origin and assess a common functional or adaptive relationship between them. However, evolution and coevolution of sRNAs with coding genes have been sparsely investigated in bacterial pathogens., Results: We designed a robust and generic phylogenomics approach that detects correlated evolution between sRNAs and protein-coding genes using their observed and inferred patterns of presence-absence in a set of annotated genomes. We applied this approach on 79 complete genomes of the Listeria genus and identified fifty-two accessory sRNAs, of which most were present in the Listeria common ancestor and lost during Listeria evolution. We detected significant coevolution between 23 sRNA and 52 coding genes and inferred the Listeria sRNA-coding genes coevolution network. We characterized a main hub of 12 sRNAs that coevolved with genes encoding cell wall proteins and virulence factors. Among them, an sRNA specific to L. monocytogenes species, rli133, coevolved with genes involved either in pathogenicity or in interaction with host cells, possibly acting as a direct negative post-transcriptional regulation., Conclusions: Our approach allowed the identification of candidate sRNAs potentially involved in pathogenicity and host interaction, consistent with recent findings on known pathogenicity actors. We highlight four sRNAs coevolving with seven internalin genes, some of which being important virulence factors in Listeria.
- Published
- 2017
- Full Text
- View/download PDF
32. Complete Genome Sequence and Annotation of the Staphylococcus aureus Strain HG001.
- Author
-
Caldelari I, Chane-Woon-Ming B, Noirot C, Moreau K, Romby P, Gaspin C, and Marzi S
- Abstract
Staphylococcus aureus is an opportunistic Gram-positive pathogen responsible for a wide range of infections from minor skin abscesses to life-threatening diseases. Here, we report the draft genome assembly and current annotation of the HG001 strain, a derivative of the RN1 (NCT8325) strain with restored rbsU (a positive activator of SigB)., (Copyright © 2017 Caldelari et al.)
- Published
- 2017
- Full Text
- View/download PDF
33. Comparative genomics of microsporidian genomes reveals a minimal non-coding RNA set and new insights for transcription in minimal eukaryotic genomes.
- Author
-
Belkorchia A, Pombert JF, Polonais V, Parisot N, Delbac F, Brugère JF, Peyret P, Gaspin C, and Peyretaillade E
- Subjects
- Base Sequence, Computer Simulation, Genomics, RNA, Small Nuclear metabolism, RNA, Small Nucleolar metabolism, Encephalitozoon genetics, Genome, Fungal, Nosema genetics, RNA, Untranslated metabolism, Transcription, Genetic
- Abstract
Microsporidia are ubiquitous intracellular pathogens whose opportunistic nature led to their increased recognition with the rise of the AIDS pandemic. As the RNA world was largely unexplored in this parasitic lineage, we developed a dedicated in silico methodology to carry out exhaustive identification of ncRNAs across the Encephalitozoon and Nosema genera. Thus, the previously missing U1 small nuclear RNA (snRNA) and small nucleolar RNAs (snoRNAs) targeting only the LSU rRNA were highlighted and were further validated using 5' and 3'RACE-PCR experiments. Overall, the 15 ncRNAs that were found shared between Encephalitozoon and Nosema spp. may represent the minimal core set required for parasitic life. Interestingly, the systematic presence of a CCC- or GGG-like motif in 5' of all ncRNA and mRNA gene transcripts regardless of the RNA polymerase involved suggests that the RNA polymerase machineries in microsporidia species could use common factors. Our data provide additional insights in accordance with the simplification processes observed in these reduce genomes and underline the usefulness of sequencing closely related species to help identify highly divergent ncRNAs in these parasites., (© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.)
- Published
- 2017
- Full Text
- View/download PDF
34. Listeriomics: an Interactive Web Platform for Systems Biology of Listeria .
- Author
-
Bécavin C, Koutero M, Tchitchek N, Cerutti F, Lechat P, Maillet N, Hoede C, Chiapello H, Gaspin C, and Cossart P
- Abstract
As for many model organisms, the amount of Listeria omics data produced has recently increased exponentially. There are now >80 published complete Listeria genomes, around 350 different transcriptomic data sets, and 25 proteomic data sets available. The analysis of these data sets through a systems biology approach and the generation of tools for biologists to browse these various data are a challenge for bioinformaticians. We have developed a web-based platform, named Listeriomics, that integrates different tools for omics data analyses, i.e., (i) an interactive genome viewer to display gene expression arrays, tiling arrays, and sequencing data sets along with proteomics and genomics data sets; (ii) an expression and protein atlas that connects every gene, small RNA, antisense RNA, or protein with the most relevant omics data; (iii) a specific tool for exploring protein conservation through the Listeria phylogenomic tree; and (iv) a coexpression network tool for the discovery of potential new regulations. Our platform integrates all the complete Listeria species genomes, transcriptomes, and proteomes published to date. This website allows navigation among all these data sets with enriched metadata in a user-friendly format and can be used as a central database for systems biology analysis. IMPORTANCE In the last decades, Listeria has become a key model organism for the study of host-pathogen interactions, noncoding RNA regulation, and bacterial adaptation to stress. To study these mechanisms, several genomics, transcriptomics, and proteomics data sets have been produced. We have developed Listeriomics, an interactive web platform to browse and correlate these heterogeneous sources of information. Our website will allow listeriologists and microbiologists to decipher key regulation mechanism by using a systems biology approach.
- Published
- 2017
- Full Text
- View/download PDF
35. Characterization of an extensive rainbow trout miRNA transcriptome by next generation sequencing.
- Author
-
Juanchich A, Bardou P, Rué O, Gabillard JC, Gaspin C, Bobe J, and Guiguen Y
- Subjects
- Animals, High-Throughput Nucleotide Sequencing, Sequence Analysis, RNA, MicroRNAs genetics, Oncorhynchus mykiss genetics, Transcriptome
- Abstract
Background: MicroRNAs (miRNAs) have emerged as important post-transcriptional regulators of gene expression in a wide variety of physiological processes. They can control both temporal and spatial gene expression and are believed to regulate 30 to 70% of the genes. Data are however limited for fish species, with only 9 out of the 30,000 fish species present in miRBase. The aim of the current study was to discover and characterize rainbow trout (Oncorhynchus mykiss) miRNAs in a large number of tissues using next-generation sequencing in order to provide an extensive repertoire of rainbow trout miRNAs., Results: A total of 38 different samples corresponding to 16 different tissues or organs were individually sequenced and analyzed independently in order to identify a large number of miRNAs with high confidence. This led to the identification of 2946 miRNA loci in the rainbow trout genome, including 445 already known miRNAs. Differential expression analysis was performed in order to identify miRNAs exhibiting specific or preferential expression among the 16 analyzed tissues. In most cases, miRNAs exhibit a specific pattern of expression in only a few tissues. The expression data from sRNA sequencing were confirmed by RT-qPCR. In addition, novel miRNAs are described in rainbow trout that had not been previously reported in other species., Conclusion: This study represents the first characterization of rainbow trout miRNA transcriptome from a wide variety of tissue and sets an extensive repertoire of rainbow trout miRNAs. It provides a starting point for future studies aimed at understanding the roles of miRNAs in major physiological process such as growth, reproduction or adaptation to stress. These rainbow trout miRNAs repertoire provide a novel resource to advance genomic research in salmonid species.
- Published
- 2016
- Full Text
- View/download PDF
36. Jflow: a workflow management system for web applications.
- Author
-
Mariette J, Escudié F, Bardou P, Nabihoudine I, Noirot C, Trotard MS, Gaspin C, and Klopp C
- Subjects
- Databases, Factual, Humans, Workflow, Computational Biology methods, Database Management Systems, Information Storage and Retrieval, Internet, Software
- Abstract
Summary: Biologists produce large data sets and are in demand of rich and simple web portals in which they can upload and analyze their files. Providing such tools requires to mask the complexity induced by the needed High Performance Computing (HPC) environment. The connection between interface and computing infrastructure is usually specific to each portal. With Jflow, we introduce a Workflow Management System (WMS), composed of jQuery plug-ins which can easily be embedded in any web application and a Python library providing all requested features to setup, run and monitor workflows., Availability and Implementation: Jflow is available under the GNU General Public License (GPL) at http://bioinfo.genotoul.fr/jflow. The package is coming with full documentation, quick start and a running test portal., Contact: Jerome.Mariette@toulouse.inra.fr., (© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2016
- Full Text
- View/download PDF
37. Decoding the oak genome: public release of sequence data, assembly, annotation and publication strategies.
- Author
-
Plomion C, Aury JM, Amselem J, Alaeitabar T, Barbe V, Belser C, Bergès H, Bodénès C, Boudet N, Boury C, Canaguier A, Couloux A, Da Silva C, Duplessis S, Ehrenmann F, Estrada-Mairey B, Fouteau S, Francillonne N, Gaspin C, Guichard C, Klopp C, Labadie K, Lalanne C, Le Clainche I, Leplé JC, Le Provost G, Leroy T, Lesur I, Martin F, Mercier J, Michotey C, Murat F, Salin F, Steinbach D, Faivre-Rampant P, Wincker P, Salse J, Quesneville H, and Kremer A
- Subjects
- Models, Genetic, Molecular Sequence Annotation, Phylogeny, Quercus classification, Sequence Analysis, DNA, Genome, Plant, Quercus genetics
- Abstract
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole-genome shotgun (WGS) approach, without the use of costly and time-consuming methods, such as fosmid or BAC clone-based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS-FLX technology were assembled into contigs and combined with shorter Illumina sequence reads from paired-end and mate-pair libraries of different insert sizes, to build scaffolds. Errors were corrected and gaps filled with Illumina paired-end reads and contaminants detected, resulting in a total of 17,910 scaffolds (>2 kb) corresponding to 1.34 Gb. Fifty per cent of the assembly was accounted for by 1468 scaffolds (N50 of 260 kb). Initial comparison with the phylogenetically related Prunus persica gene model indicated that genes for 84.6% of the proteins present in peach (mean protein coverage of 90.5%) were present in our assembly. The second and third steps in this project are genome annotation and the assignment of scaffolds to the oak genetic linkage map. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement, the oak genome data have been released into public sequence repositories in advance of publication. In this presubmission paper, the oak genome consortium describes its principal lines of work and future directions for analyses of the nature, function and evolution of the oak genome., (© 2015 John Wiley & Sons Ltd.)
- Published
- 2016
- Full Text
- View/download PDF
38. The Prediction and Validation of Small CDSs Expand the Gene Repertoire of the Smallest Known Eukaryotic Genomes.
- Author
-
Belkorchia A, Gasc C, Polonais V, Parisot N, Gallois N, Ribière C, Lerat E, Gaspin C, Pombert JF, Peyret P, and Peyretaillade E
- Subjects
- Base Sequence, DNA, Intergenic genetics, Molecular Sequence Annotation, Molecular Sequence Data, Phylogeny, Reproducibility of Results, Encephalitozoon genetics, Genes, Fungal genetics, Genome Size, Genomics, Open Reading Frames genetics
- Abstract
The proper prediction of the gene catalogue of an organism is essential to obtain a representative snapshot of its overall lifestyle, especially when it is not amenable to culturing. Microsporidia are obligate intracellular, sometimes hard to culture, eukaryotic parasites known to infect members of every animal phylum. To date, sequencing and annotation of microsporidian genomes have revealed a poor gene complement with highly reduced gene sizes. In the present paper, we investigated whether such gene sizes may have induced biases for the methodologies used for genome annotation, with an emphasis on small coding sequence (CDS) gene prediction. Using better delineated intergenic regions from four Encephalitozoon genomes, we predicted de novo new small CDSs with sizes ranging from 78 to 255 bp (median 168) and corroborated these predictions by RACE-PCR experiments in Encephalitozoon cuniculi. Most of the newly found genes are present in other distantly related microsporidian species, suggesting their biological relevance. The present study provides a better framework for annotating microsporidian genomes and to train and evaluate new computational methods dedicated at detecting ultra-small genes in various organisms.
- Published
- 2015
- Full Text
- View/download PDF
39. Advantages of mixing bioinformatics and visualization approaches for analyzing sRNA-mediated regulatory bacterial networks.
- Author
-
Thébault P, Bourqui R, Benchimol W, Gaspin C, Sirand-Pugnet P, Uricaru R, and Dutour I
- Subjects
- Computational Biology, Gene Regulatory Networks, RNA, Bacterial physiology
- Abstract
The revolution in high-throughput sequencing technologies has enabled the acquisition of gigabytes of RNA sequences in many different conditions and has highlighted an unexpected number of small RNAs (sRNAs) in bacteria. Ongoing exploitation of these data enables numerous applications for investigating bacterial transacting sRNA-mediated regulation networks. Focusing on sRNAs that regulate mRNA translation in trans, recent works have noted several sRNA-based regulatory pathways that are essential for key cellular processes. Although the number of known bacterial sRNAs is increasing, the experimental validation of their interactions with mRNA targets remains challenging and involves expensive and time-consuming experimental strategies. Hence, bioinformatics is crucial for selecting and prioritizing candidates before designing any experimental work. However, current software for target prediction produces a prohibitive number of candidates because of the lack of biological knowledge regarding the rules governing sRNA-mRNA interactions. Therefore, there is a real need to develop new approaches to help biologists focus on the most promising predicted sRNA-mRNA interactions. In this perspective, this review aims at presenting the advantages of mixing bioinformatics and visualization approaches for analyzing predicted sRNA-mediated regulatory bacterial networks., (© The Author 2014. Published by Oxford University Press.)
- Published
- 2015
- Full Text
- View/download PDF
40. Mirinho: An efficient and general plant and animal pre-miRNA predictor for genomic and deep sequencing data.
- Author
-
Higashi S, Fournier C, Gautier C, Gaspin C, and Sagot MF
- Subjects
- Algorithms, Animals, Base Pairing, Base Sequence, Genomics methods, Molecular Sequence Data, Sequence Homology, Nucleic Acid, Arabidopsis genetics, Genome, High-Throughput Nucleotide Sequencing methods, Insecta genetics, MicroRNAs genetics, Sequence Analysis, RNA methods, Software
- Abstract
Background: Several methods exist for the prediction of precursor miRNAs (pre-miRNAs) in genomic or sRNA-seq (small RNA sequences) data produced by NGS (Next Generation Sequencing). One key information used for this task is the characteristic hairpin structure adopted by pre-miRNAs, that in general are identified using RNA folders whose complexity is cubic in the size of the input. The vast majority of pre-miRNA predictors then rely on further information learned from previously validated miRNAs from the same or a closely related genome for the final prediction of new miRNAs. With this paper, we wished to address three main issues. The first was methodological and aimed at obtaining a more time-efficient predictor, however without losing in accuracy which represented a second issue. We indeed aimed at better predicting miRNAs at a genome scale, but also from sRNAseq data where in some cases, notably of plants, the current folding methods often infer the wrong structure. The third issue is related to the fact that it is important to rely as little as possible on previously recorded examples of miRNAs. We therefore also sought a method that is less dependent on previous miRNA records., Results: As concerns the first and second issues, we present a novel alternative to a classical folder based on a thermodynamic Nearest-Neighbour (NN) model for computing the free energy and predicting the classical hairpin structure of a pre-miRNA. We show that the free energies thus computed correlate well with those of RNAFOLD. This novel method, called MIRINHO, has quadratic instead of cubic complexity and is much more efficient also in practice. When applied to sRNAseq data of plants, it gives in general better results than classical folders. On the third issue, we show that MIRINHO, which uses as only knowledge the length of the loops and stem-arms and the free energy of the pre-miRNA hairpin, compares well with algorithms that require more information. The results, obtained with different datasets, are indeed similar to those of other approaches with which such a comparison was possible. These needed to be publicly available softwares that could be used on a large input. In some cases, MIRINHO is even better in terms of sensitivity or precision., Conclusion: We provide a simpler and much faster method with very reasonable sensitivity and precision, which can be applied without special adaptation to the prediction of both animal and plant pre-miRNAs, using as input either genomic sequences or sRNA-seq data.
- Published
- 2015
- Full Text
- View/download PDF
41. Genome-wide investigation of mRNA lifetime determinants in Escherichia coli cells cultured at different growth rates.
- Author
-
Esquerré T, Moisan A, Chiapello H, Arike L, Vilu R, Gaspin C, Cocaign-Bousquet M, and Girbal L
- Subjects
- 5' Untranslated Regions, Base Composition, Base Sequence, Codon metabolism, Escherichia coli growth & development, Escherichia coli Proteins chemistry, Escherichia coli Proteins metabolism, Half-Life, Models, Biological, Open Reading Frames genetics, RNA Stability, Escherichia coli genetics, Genome, Bacterial, RNA, Messenger metabolism
- Abstract
Background: Changes to mRNA lifetime adjust mRNA concentration, facilitating the adaptation of growth rate to changes in growth conditions. However, the mechanisms regulating mRNA lifetime are poorly understood at the genome-wide scale and have not been investigated in bacteria growing at different rates., Results: We used linear covariance models and the best model selected according to the Akaike information criterion to identify and rank intrinsic and extrinsic general transcript parameters correlated with mRNA lifetime, using mRNA half-life datasets for E. coli, obtained at four growth rates. The principal parameter correlated with mRNA stability was mRNA concentration, the mRNAs most concentrated in the cells being the least stable. However, sequence-related features (codon adaptation index (CAI), ORF length, GC content, polycistronic mRNA), gene function and essentiality also affected mRNA lifetime at all growth rates. We also identified sequence motifs within the 5'UTRs potentially related to mRNA stability. Growth rate-dependent effects were confined to particular functional categories (e.g. carbohydrate and nucleotide metabolism). Finally, mRNA stability was less strongly correlated with the amount of protein produced than mRNA concentration and CAI., Conclusions: This study provides the most complete genome-wide analysis to date of the general factors correlated with mRNA lifetime in E. coli. We have generalized for the entire population of transcripts or excluded determinants previously defined as regulators of stability for some particular mRNAs and identified new, unexpected general indicators. These results will pave the way for discussions of the underlying mechanisms and their interaction with the growth physiology of bacteria.
- Published
- 2015
- Full Text
- View/download PDF
42. Structural and functional partitioning of bread wheat chromosome 3B.
- Author
-
Choulet F, Alberti A, Theil S, Glover N, Barbe V, Daron J, Pingault L, Sourdille P, Couloux A, Paux E, Leroy P, Mangenot S, Guilhot N, Le Gouis J, Balfourier F, Alaux M, Jamilloux V, Poulain J, Durand C, Bellec A, Gaspin C, Safar J, Dolezel J, Rogers J, Vandepoele K, Aury JM, Mayer K, Berges H, Quesneville H, Wincker P, and Feuillet C
- Subjects
- Bread, Chromosome Segregation, Chromosomes, Plant genetics, DNA Transposable Elements, Meiosis, Plant Proteins genetics, Polyploidy, Pseudogenes, Recombination, Genetic, Triticum cytology, Chromosomes, Plant physiology, Triticum genetics
- Abstract
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits., (Copyright © 2014, American Association for the Advancement of Science.)
- Published
- 2014
- Full Text
- View/download PDF
43. The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates.
- Author
-
Berthelot C, Brunet F, Chalopin D, Juanchich A, Bernard M, Noël B, Bento P, Da Silva C, Labadie K, Alberti A, Aury JM, Louis A, Dehais P, Bardou P, Montfort J, Klopp C, Cabau C, Gaspin C, Thorgaard GH, Boussaha M, Quillet E, Guyomard R, Galiana D, Bobe J, Volff JN, Genêt C, Wincker P, Jaillon O, Roest Crollius H, and Guiguen Y
- Subjects
- Animals, Gene Duplication genetics, Evolution, Molecular, Oncorhynchus mykiss genetics, Vertebrates genetics
- Abstract
Vertebrate evolution has been shaped by several rounds of whole-genome duplications (WGDs) that are often suggested to be associated with adaptive radiations and evolutionary innovations. Due to an additional round of WGD, the rainbow trout genome offers a unique opportunity to investigate the early evolutionary fate of a duplicated vertebrate genome. Here we show that after 100 million years of evolution the two ancestral subgenomes have remained extremely collinear, despite the loss of half of the duplicated protein-coding genes, mostly through pseudogenization. In striking contrast is the fate of miRNA genes that have almost all been retained as duplicated copies. The slow and stepwise rediploidization process characterized here challenges the current hypothesis that WGD is followed by massive and rapid genomic reorganizations and gene deletions.
- Published
- 2014
- Full Text
- View/download PDF
44. Diversity of CRISPR systems in the euryarchaeal Pyrococcales.
- Author
-
Norais C, Moisan A, Gaspin C, and Clouet-d'Orval B
- Subjects
- Archaeal Viruses genetics, Archaeal Viruses physiology, Gene Expression Regulation, Archaeal, Gene Transfer, Horizontal, Genome, Archaeal, Phylogeny, CRISPR-Cas Systems, Clustered Regularly Interspaced Short Palindromic Repeats, Pyrococcus abyssi genetics, Pyrococcus abyssi virology, Thermococcales genetics, Thermococcales virology
- Abstract
Pyrococcales are members of the order Thermococcales, a group of hyperthermophilic euryarchaea that are frequently found in deep sea hydrothermal vents. Infectious genetic elements, such as plasmids and viruses, remain a threat even in this remote environment and these microorganisms have developed several ways to fight their genetic invaders. Among these are the recently discovered CRISPR systems. In this review, we have combined and condensed available information on genetic elements infecting the Thermococcales and on the multiple CRISPR systems found in the Pyrococcales to fight them. Their organization and mode of action will be presented with emphasis on the Type III-B system that is the only CRISPR system known to target RNA molecules in a process reminiscent of RNA interference. The intriguing case of Pyrococcus abyssi, which is among the rare strains to present a CRISPR system devoid of the universal cas1 and cas2 genes, is also discussed.
- Published
- 2013
- Full Text
- View/download PDF
45. RNAspace.org: An integrated environment for the prediction, annotation, and analysis of ncRNA.
- Author
-
Cros MJ, de Monte A, Mariette J, Bardou P, Grenier-Boley B, Gautheret D, Touzet H, and Gaspin C
- Subjects
- Base Sequence, Genome, Nucleic Acid Conformation, RNA, Untranslated chemistry, Software, Databases, Nucleic Acid, Internet, RNA, Untranslated analysis
- Abstract
The annotation of noncoding RNA genes remains a major bottleneck in genome sequencing projects. Most genome sequences released today still come with sets of tRNAs and rRNAs as the only annotated RNA elements, ignoring hundreds of other RNA families. We have developed a web environment that is dedicated to noncoding RNA (ncRNA) prediction, annotation, and analysis and allows users to run a variety of tools in an integrated and flexible manner. This environment offers complementary ncRNA gene finders and a set of tools for the comparison, visualization, editing, and export of ncRNA candidates. Predictions can be filtered according to a large set of characteristics. Based on this environment, we created a public website located at http://RNAspace.org. It accepts genomic sequences up to 5 Mb, which permits for an online annotation of a complete bacterial genome or a small eukaryotic chromosome. The project is hosted as a Source Forge project (http://rnaspace.sourceforge.net/).
- Published
- 2011
- Full Text
- View/download PDF
46. Identification of CRISPR and riboswitch related RNAs among novel noncoding RNAs of the euryarchaeon Pyrococcus abyssi.
- Author
-
Phok K, Moisan A, Rinaldi D, Brucato N, Carpousis AJ, Gaspin C, and Clouet-d'Orval B
- Subjects
- 5' Untranslated Regions genetics, Base Sequence, Conserved Sequence, DNA, Intergenic genetics, Genome, Archaeal genetics, Reverse Transcriptase Polymerase Chain Reaction, Species Specificity, Genetic Loci genetics, Pyrococcus abyssi genetics, RNA, Archaeal genetics, Riboswitch genetics
- Abstract
Background: Noncoding RNA (ncRNA) has been recognized as an important regulator of gene expression networks in Bacteria and Eucaryota. Little is known about ncRNA in thermococcal archaea except for the eukaryotic-like C/D and H/ACA modification guide RNAs., Results: Using a combination of in silico and experimental approaches, we identified and characterized novel P. abyssi ncRNAs transcribed from 12 intergenic regions, ten of which are conserved throughout the Thermococcales. Several of them accumulate in the late-exponential phase of growth. Analysis of the genomic context and sequence conservation amongst related thermococcal species revealed two novel P. abyssi ncRNA families. The CRISPR family is comprised of crRNAs expressed from two of the four P. abyssi CRISPR cassettes. The 5'UTR derived family includes four conserved ncRNAs, two of which have features similar to known bacterial riboswitches. Several of the novel ncRNAs have sequence similarities to orphan OrfB transposase elements. Based on RNA secondary structure predictions and experimental results, we show that three of the twelve ncRNAs include Kink-turn RNA motifs, arguing for a biological role of these ncRNAs in the cell. Furthermore, our results show that several of the ncRNAs are subjected to processing events by enzymes that remain to be identified and characterized., Conclusions: This work proposes a revised annotation of CRISPR loci in P. abyssi and expands our knowledge of ncRNAs in the Thermococcales, thus providing a starting point for studies needed to elucidate their biological function.
- Published
- 2011
- Full Text
- View/download PDF
47. Distribution of short interstitial telomere motifs in two plant genomes: putative origin and function.
- Author
-
Gaspin C, Rami JF, and Lescure B
- Subjects
- 3' Untranslated Regions genetics, 5' Flanking Region genetics, 5' Untranslated Regions genetics, Base Sequence, Gene Expression Regulation, Plant, Genes, Plant genetics, Molecular Sequence Data, Promoter Regions, Genetic genetics, Regulatory Sequences, Nucleic Acid genetics, Ribosomal Proteins genetics, Arabidopsis genetics, Genome, Plant genetics, Oryza genetics, Telomere genetics
- Abstract
Background: Short interstitial telomere motifs (telo boxes) are short sequences identical to plant telomere repeat units. They are observed within the 5' region of several genes over-expressed in cycling cells. In synergy with various cis-acting elements, these motifs participate in the activation of expression. Here, we have analysed the distribution of telo boxes within Arabidopsis thaliana and Oryza sativa genomes and their association with genes involved in the biogenesis of the translational apparatus., Results: Our analysis showed that the distribution of the telo box (AAACCCTA) in different genomic regions of A. thaliana and O. sativa is not random. As is also the case for plant microsatellites, they are preferentially located in the 5' flanking regions of genes, mainly within the 5' UTR, and distributed as a gradient along the direction of transcription. As previously reported in Arabidopsis, a conserved topological association of telo boxes with site II or TEF cis-acting elements is observed in almost all promoters of genes encoding ribosomal proteins in O. sativa. Such a conserved promoter organization can be found in other genes involved in the biogenesis of the translational machinery including rRNA processing proteins and snoRNAs. Strikingly, the association of telo boxes with site II motifs or TEF boxes is conserved in promoters of genes harbouring snoRNA clusters nested within an intron as well as in the 5' flanking regions of non-intronic snoRNA genes. Thus, the search for associations between telo boxes and site II motifs or TEF box in plant genomes could provide a useful tool for characterizing new cryptic RNA pol II promoters., Conclusions: The data reported in this work support the model previously proposed for the spreading of telo boxes within plant genomes and provide new insights into a putative process for the acquisition of microsatellites in plants. The association of telo boxes with site II or TEF cis-acting elements appears to be an essential feature of plant genes involved in the biogenesis of ribosomes and clearly indicates that most plant snoRNAs are RNA pol II products.
- Published
- 2010
- Full Text
- View/download PDF
48. Cartography of methicillin-resistant S. aureus transcripts: detection, orientation and temporal expression during growth phase and stress conditions.
- Author
-
Beaume M, Hernandez D, Farinelli L, Deluen C, Linder P, Gaspin C, Romby P, Schrenzel J, and Francois P
- Subjects
- Base Sequence, Conserved Sequence, DNA, Complementary genetics, Gene Expression Profiling, Genome, Bacterial genetics, High-Throughput Screening Assays, Molecular Sequence Data, Nucleic Acid Conformation, RNA, Antisense genetics, RNA, Antisense metabolism, RNA, Bacterial chemistry, RNA, Bacterial classification, RNA, Bacterial genetics, RNA, Messenger genetics, RNA, Messenger metabolism, Reproducibility of Results, Reverse Transcriptase Polymerase Chain Reaction, Sequence Analysis, RNA, Time Factors, Gene Expression Regulation, Bacterial, Methicillin-Resistant Staphylococcus aureus genetics, Methicillin-Resistant Staphylococcus aureus growth & development, Stress, Physiological genetics
- Abstract
Background: Staphylococcus aureus is a versatile bacterial opportunist responsible for a wide spectrum of infections. The severity of these infections is highly variable and depends on multiple parameters including the genome content of the bacterium as well as the condition of the infected host. Clinically and epidemiologically, S. aureus shows a particular capacity to survive and adapt to drastic environmental changes including the presence of numerous antimicrobial agents. Mechanisms triggering this adaptation remain largely unknown despite important research efforts. Most studies evaluating gene content have so far neglected to analyze the so-called intergenic regions as well as potential antisense RNA molecules., Principal Findings: Using high-throughput sequencing technology, we performed an inventory of the whole transcriptome of S. aureus strain N315. In addition to the annotated transcription units, we identified more than 195 small transcribed regions, in the chromosome and the plasmid of S. aureus strain N315. The coding strand of each transcript was identified and structural analysis enabled classification of all discovered transcripts. RNA purified at four time-points during the growth phase of the bacterium allowed us to define the temporal expression of such transcripts. A selection of 26 transcripts of interest dispersed along the intergenic regions was assessed for expression changes in the presence of various stress conditions including pH, temperature, oxidative shocks and growth in a stringent medium. Most of these transcripts showed expression patterns specific for the defined stress conditions that we tested., Conclusions: These RNA molecules potentially represent important effectors of S. aureus adaptation and more generally could support some of the epidemiological characteristics of the bacterium.
- Published
- 2010
- Full Text
- View/download PDF
49. A search for small noncoding RNAs in Staphylococcus aureus reveals a conserved sequence motif for regulation.
- Author
-
Geissmann T, Chevalier C, Cros MJ, Boisset S, Fechter P, Noirot C, Schrenzel J, François P, Vandenesch F, Gaspin C, and Romby P
- Subjects
- Bacillus subtilis genetics, Bacillus subtilis metabolism, Base Sequence, Computational Biology, Conserved Sequence, Gene Expression Profiling, Molecular Sequence Data, Proteomics, RNA Stability, RNA, Messenger metabolism, RNA, Untranslated genetics, RNA, Untranslated metabolism, Transcription, Genetic, Gene Expression Regulation, Bacterial, RNA, Untranslated chemistry, Staphylococcus aureus genetics
- Abstract
Bioinformatic analysis of the intergenic regions of Staphylococcus aureus predicted multiple regulatory regions. From this analysis, we characterized 11 novel noncoding RNAs (RsaA-K) that are expressed in several S. aureus strains under different experimental conditions. Many of them accumulate in the late-exponential phase of growth. All ncRNAs are stable and their expression is Hfq-independent. The transcription of several of them is regulated by the alternative sigma B factor (RsaA, D and F) while the expression of RsaE is agrA-dependent. Six of these ncRNAs are specific to S. aureus, four are conserved in other Staphylococci, and RsaE is also present in Bacillaceae. Transcriptomic and proteomic analysis indicated that RsaE regulates the synthesis of proteins involved in various metabolic pathways. Phylogenetic analysis combined with RNA structure probing, searches for RsaE-mRNA base pairing, and toeprinting assays indicate that a conserved and unpaired UCCC sequence motif of RsaE binds to target mRNAs and prevents the formation of the ribosomal initiation complex. This study unexpectedly shows that most of the novel ncRNAs carry the conserved C-rich motif, suggesting that they are members of a class of ncRNAs that target mRNAs by a shared mechanism.
- Published
- 2009
- Full Text
- View/download PDF
50. RNomics and Modomics in the halophilic archaea Haloferax volcanii: identification of RNA modification genes.
- Author
-
Grosjean H, Gaspin C, Marck C, Decatur WA, and de Crécy-Lagard V
- Subjects
- Genes, rRNA, Protein Biosynthesis, RNA, Ribosomal, 5S genetics, RNA, Transfer genetics, Genomics, Haloferax volcanii genetics, RNA, Archaeal genetics
- Abstract
Background: Naturally occurring RNAs contain numerous enzymatically altered nucleosides. Differences in RNA populations (RNomics) and pattern of RNA modifications (Modomics) depends on the organism analyzed and are two of the criteria that distinguish the three kingdoms of life. If the genomic sequences of the RNA molecules can be derived from whole genome sequence information, the modification profile cannot and requires or direct sequencing of the RNAs or predictive methods base on the presence or absence of the modifications genes., Results: By employing a comparative genomics approach, we predicted almost all of the genes coding for the t+rRNA modification enzymes in the mesophilic moderate halophile Haloferax volcanii. These encode both guide RNAs and enzymes. Some are orthologous to previously identified genes in Archaea, Bacteria or in Saccharomyces cerevisiae, but several are original predictions., Conclusion: The number of modifications in t+rRNAs in the halophilic archaeon is surprisingly low when compared with other Archaea or Bacteria, particularly the hyperthermophilic organisms. This may result from the specific lifestyle of halophiles that require high intracellular salt concentration for survival. This salt content could allow RNA to maintain its functional structural integrity with fewer modifications. We predict that the few modifications present must be particularly important for decoding, accuracy of translation or are modifications that cannot be functionally replaced by the electrostatic interactions provided by the surrounding salt-ions. This analysis also guides future experimental validation work aiming to complete the understanding of the function of RNA modifications in Archaeal translation.
- Published
- 2008
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.