40 results on '"Foissac S"'
Search Results
2. Extensive functional genomics information from early developmental time points for pig and chicken
- Author
-
Acloque, H., Harrison, P.W., Lakhal, W., Martin, F., Archibald, A.L., Beinat, M., Davey, M., Djebali, S., Foissac, S., Guizard, S., Guyomar, C., Kuo, R., Kurylo, C., Madsen, O., Miedzinska, K., Mongellaz, M., Smith, J., Sokolov, A., de Vos, J., Giuffra, E., Watson, M., Acloque, H., Harrison, P.W., Lakhal, W., Martin, F., Archibald, A.L., Beinat, M., Davey, M., Djebali, S., Foissac, S., Guizard, S., Guyomar, C., Kuo, R., Kurylo, C., Madsen, O., Miedzinska, K., Mongellaz, M., Smith, J., Sokolov, A., de Vos, J., Giuffra, E., and Watson, M.
- Abstract
The global Functional Annotation of Farm Animal Genomes initiative (FAANG) aims to improve animal breeding by improved genomic prediction via integration of functional genomics information. The GENESWitCH project has produced extensive functional genomics information for a variety of important tissues at early embryonic timepoints for both chickens and pigs. These datasets will be integrated to produce both tissue and time-point specific transcript, gene, and regulatory annotation for both species. In this paper, we describe the aims of the project, and the initial release of both raw and processed data.
- Published
- 2022
3. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project
- Author
-
Tosser-Klopp, G., Zhou, H., Palti, Y., Nanduri, B., Tixier-Boichard, M., Silverstein, J., Plastow, G. S., Sarropoulou, E., Brauning, R., Zhao, S., Rohrer, G. A., Elsik, C. G., Cheng, H. H., Giuffra, E., Notredame, C., Khatib, H., Vilkki, J., Couldrey, C., Archibald, A. L., Tellam, R. L., Schmidt, C. J., Reecy, J. M., Clarke, L., Huang, L. S., Groenen, M. A., McEwan, J. C., Burt, D. W., Kim, H., Bottema, C. D., Kijas, J. W., Dalrymple, B. P., White, S. N., Burgess, S. C., Hayes, B. J., McCarthy, F. M., Moore, S., Foissac, S., Lunney, J. K., Andersson, L., Tuggle, C. K., and Casas, E.
- Published
- 2015
- Full Text
- View/download PDF
4. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project : open letter
- Author
-
Archibald, A.L., Bottema, C.D., Brauning, R., Burgess, S.C., Burt, D.W., Casas, E., Cheng, H.H., Clarke, L., Couldrey, C., Dalrymple, B.P., Elsik, C.G., Foissac, S., Giuffra, E., Groenen, M.A.M., Hayes, B.J., Huang, L.S., Khatib, H., Kijas, J.W., Kim, H., Lunney, J.K., McCarthy, F.M., McEwan, J., Moore, S., Nanduri, B., Notredame, C., Palti, Y., Plastow, G.S., Reecy, J.M., Rohrer, G., Sarropoulou, E., Schmidt, C.J., Silverstein, J., Tellam, R.L., Tixier-Boichard, M., Tosser-klopp, G., Tuggle, C.K., Vilkki, J., White, S.N., Zhao, S., and Zhou, H.
- Subjects
WIAS ,Life Science ,Fokkerij en Genomica ,Animal Breeding and Genomics - Abstract
We describe the organization of a nascent international effort, the Functional Annotation of Animal Genomes (FAANG) project, whose aim is to produce comprehensive maps of functional elements in the genomes of domesticated animal species.
- Published
- 2015
5. Chromatin accessibility in the liver and circulating immune cells of pigs, goats and chickens.
- Author
-
Giuffra, E., Munyard, Kylie, Goubil, A., Vincent-Naulleau, S., Esquerre, D., Djebali, S., Foissac, S., Giuffra, E., Munyard, Kylie, Goubil, A., Vincent-Naulleau, S., Esquerre, D., Djebali, S., and Foissac, S.
- Published
- 2016
6. P3038 Chromatin accessibility in the liver and circulating immune cells of pigs, goats and chickens
- Author
-
Giuffra, E., primary, Munyard, K. A., additional, Goubil, A., additional, Vincent-Naulleau, S., additional, Esquerré, D., additional, Djebali, S., additional, and Foissac, S., additional
- Published
- 2016
- Full Text
- View/download PDF
7. P8001 3D nuclear positioning of IGF2 alleles and trans interactions with imprinted genes
- Author
-
Lahbib-Mansais, Y., primary, Marimon, M. Marti, additional, Voillet, V., additional, Mompart, F., additional, Riquet, J., additional, Foissac, S., additional, Robelin, D., additional, Acloque, H., additional, Liaubet, L., additional, Bouissou-Matet Yerle, M., additional, Billon, Y., additional, and Villa-Vialaneix, N., additional
- Published
- 2016
- Full Text
- View/download PDF
8. ASTALAVISTA: dynamic and flexible analysis of alternative splicing events in custom gene datasets
- Author
-
Foissac, S., primary and Sammeth, M., additional
- Published
- 2007
- Full Text
- View/download PDF
9. EUGENE'HOM: a generic similarity-based gene finder using multiple homologous sequences
- Author
-
Foissac, S., primary
- Published
- 2003
- Full Text
- View/download PDF
10. Evidence for transcript networks composed of chimeric RNAs in human cells
- Author
-
Universitat de Barcelona, Djebali, S., Lagarde, J., Kapranov, P., Lacroix, V., Borel, C., Mudge, J.M., Howald, C., Foissac, S., Ucla, C., Chrast, J., Ribeca, P., Martin, D., Murray, R.R., Yang, X., Ghamsari, L., Lin, C., Bell, I., Dumais, E., Gelpi Buchaca, Josep Lluís, Orozco López, Modesto, Universitat de Barcelona, Djebali, S., Lagarde, J., Kapranov, P., Lacroix, V., Borel, C., Mudge, J.M., Howald, C., Foissac, S., Ucla, C., Chrast, J., Ribeca, P., Martin, D., Murray, R.R., Yang, X., Ghamsari, L., Lin, C., Bell, I., Dumais, E., Gelpi Buchaca, Josep Lluís, and Orozco López, Modesto
- Abstract
The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5′ and 3′ transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network.
11. P8001 3D nuclear positioning of IGF2 alleles and transinteractions with imprinted genes
- Author
-
Lahbib-Mansais, Y., Marimon, M. Marti, Voillet, V., Mompart, F., Riquet, J., Foissac, S., Robelin, D., Acloque, H., Liaubet, L., Bouissou-Matet Yerle, M., Billon, Y., and Villa-Vialaneix, N.
- Abstract
To explore the relationship between gene activity and nuclear position, genomic imprinting leading to parental-specific expression offers a good model. In one cell, it is possible to compare the nuclear environment of the two alleles for a given locus and search for a potential correlation between their nuclear position and expression status. Using 3D RNA-DNA fluorescence in situ hybridization (FISH) in porcine fetal liver cells, we focused on the imprinted region of Insulin-like growth factor 2(IGF2), a paternally expressed gene located on porcine chromosome 2. We investigated the interchromosomal interactions implicating IGF2. Through a 2D FISH screening, imprinted genes from the Imprinted Gene Network (Varrault et al., 2006) were tested for interactions in liver cells. The locus DLK1/MEG3showed the highest rate of colocalization with IGF2. By 3D RNA-DNA FISH combined to confocal microscopy, we demonstrated a preferential implication of the expressed paternal IGF2allele in a transassociation with DLK1/MEG3region (chromosome 7). We showed that this colocalization occurs also in fetal muscle and demonstrated that it occurs preferentially between the expressed IGF2, DLK1, and MEG3alleles. We are extending this analysis through an interdisciplinary approach to develop large functional mappingstudies focused on the mechanisms involved in the transcriptional regulation of genes expressed in muscle during late fetal development of pig. From a transcriptomic analysis carried on fetal muscle of two extreme genetic lines to study maturity, we identified 2000 genes differentially expressed that characterize its establishment (Voillet et al., 2014). We are now constructing, by in silico processes, networks of co-regulated genes with IGF2as starting point. We are also developing a Hi-C approach to construct interaction maps on a genome-wide scale. A set of key genes belonging to these networks and interaction maps will be selected to study by 3D FISH their position in the nuclear space in cells of the two genotypes, and to determine if co-regulated genes implicated in the same biological function co-localize in the nucleus. These data should allow us to determine if these interactions are genotype and expression pattern dependent. This will open interesting questions to study the possible link between nuclear architecture and control of gene expression in muscle in an animal model for which extreme genotypes for maturity at birth are available.
- Published
- 2016
- Full Text
- View/download PDF
12. Integrating alternative splicing detection into gene prediction
- Author
-
Schiex Thomas and Foissac Sylvain
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders. Results We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGÈNE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage). Conclusions This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.
- Published
- 2005
- Full Text
- View/download PDF
13. An integrated encyclopedia of DNA elements in the human genome
- Author
-
Robert Altshuler, Laura Elnitski, Michael Anaya, Alec Victorsen, Deborah Winter, Javier Herrero, Katherine Varley, Andrea Sboner, Oscar Junhong Luo, Marco Mariotti, Cristina Sisu, Mike Kay, Timothy Dreszer, Jane Loveland, Alexandra Bignell, Ewan Birney, Tim @timjph Hubbard, Kuljeet Sandhu, Eric Haugen, Chris Gunter, Alexej Abyzov, Lucas Ward, Georgi Marinov, Michael Pazin, Thomas Gingeras, Alexander Dobin, Kimberly Foss, Xianjun Dong, Benoit Miotto, Piotr Mieczkowski, Cedric Notredame, Andrew Berry, Shawn Gillespie, Axel Visel, Shawn Levy, Richard Sandstrom, Jose M Gonzalez, Melissa Fullwood, Timo Lassmann, Michael Tress, Julien Lagarde, Kevin Yip, Leslie Adams, Sylvain Foissac, Bronwen Aken, Piero Carninci, Suganthi Balasubramanian, Andrea Tanzer, Sarah Djebali, Michael Hoffman, Gloria Despacio-Reyes, Peter Park, Felix Kokocinski, Katherine Fisher-Aylor, Juan M Vaquerizas, Peggy Farnham, Patrick Collins, Amonida Zadissa, Pedro Ferreira, Philippe Batut, Michael Snyder, Electra Tapanari, Adam Frankish, Paul Flicek, AMARTYA SANYAL, Tyler Alioto, Giovanni Bussotti, Laurence Meyer, Jingyi Jessica Li, Matthew Blow, Tristan FRUM, Roger Alexander, Rory Johnson, Charles Steward, Meizhen Zheng, Margus Lukk, Ross Hardison, Claire Davidson, Gary Saunders, Alan Boyle, Luiz Penalva, Rajinder Kaul, Lazaro Centanin, Florencia Pauli Behn, Thomas Derrien, Nathan Sheffield, Toby Hunt, Eric Nguyen, Jeff Vierstra, Konrad Karczewski, Kimberly Bell, Yanbao Yu, Hagen U Tilgner, James Taylor, Balázs Bánfai, Catherine Snow, Benjamin Vernot, Stephan Kirchmaier, Michael Sammeth, Steven Wilder, Angelika Merkel, Joanna Mieczkowska, Guoliang Li, Wei Lin, Jennifer Harrow, Thomas Oliver Auer, Daniel Barrell, Eddie Park, Alvis Brazma, Hazuki Takahashi, Nathan Johnson, Daniel Sobral, Terry Furey, Alexandre Reymond, Jonathan Mudge, Anshul Kundaje, Jose Rodriguez, Akshay Bhinge, James Gilbert, Jakub Karczewski, Venkat Malladi, Troy Whitfield, Orion Buske, Ian Dunham, Jennifer Moran, Joachim Wittbrodt, Charles B. Epstein, Laurens Wilming, Jason Gertz, Joshua Akey, Joel Rozowsky, Laboratoire de Génétique Cellulaire (LGC), Ecole Nationale Vétérinaire de Toulouse (ENVT), Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Recherche Agronomique (INRA), National Human Genome Research Institute (NHGRI), Institut National de la Recherche Agronomique (INRA)-Ecole Nationale Vétérinaire de Toulouse (ENVT), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées, Antonarakis, Stylianos, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Altshuler, Robert Charles, Ernst, Jason, Kellis, Manolis, Kheradpour, Pouya, Ward, Lucas D., Eaton, Matthew Lucas, Hendrix, David A., Jungreis, Irwin, Lin, Michael F., Washietl, Stefan, Lists of participants and their affiliations appear at the end of the paper and in the 'Collaboration/Projet' field., The Consortium is funded by grants from the NHGRI as follows: production grants: U54HG004570 (B. E. Bernstein), U01HG004695 (E. Birney), U54HG004563 (G. E. Crawford), U54HG004557 (T. R. Gingeras), U54HG004555 (T. J. Hubbard), U41HG004568 (W. J. Kent), U54HG004576 (R. M. Myers), U54HG004558 (M. Snyder), U54HG004592 (J. A. Stamatoyannopoulos). Pilot grants: R01HG003143 (J. Dekker), RC2HG005591 and R01HG003700 (M. C. Giddings), R01HG004456-03 (Y. Ruan), U01HG004571 (S. A. Tenenbaum), U01HG004561 (Z. Weng), RC2HG005679 (K. P. White). This project was supported in part by American Recovery and Reinvestment Act (ARRA) funds from the NHGRI through grants U54HG004570, U54HG004563, U41HG004568, U54HG004592, R01HG003143, RC2HG005591, R01HG003541,U01HG004561,RC2HG005679andR01HG003988(L. Pennacchio). In addition, work from NHGRI Groups was supported by the Intramural Research Program of the NHGRI (L. Elnitski, ZIAHG200323, E. H. Margulies, ZIAHG200341). Research in the Pennachio laboratory was performed at Lawrence Berkeley National Laboratory and at the United States Department of Energy Joint Genome Institute, Department of Energy Contract DE-AC02-05CH11231, University of California., Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, Epstein CB, Frietze S, Harrow J, Kaul R, Khatun J, Lajoie BR, Landt SG, Lee BK, Pauli F, Rosenbloom KR, Sabo P, Safi A, Sanyal A, Shoresh N, Simon JM, Song L, Trinklein ND, Altshuler RC, Birney E, Brown JB, Cheng C, Djebali S, Dong X, Dunham I, Ernst J, Furey TS, Gerstein M, Giardine B, Greven M, Hardison RC, Harris RS, Herrero J, Hoffman MM, Iyer S, Kellis M, Khatun J, Kheradpour P, Kundaje A, Lassmann T, Li Q, Lin X, Marinov GK, Merkel A, Mortazavi A, Parker SC, Reddy TE, Rozowsky J, Schlesinger F, Thurman RE, Wang J, Ward LD, Whitfield TW, Wilder SP, Wu W, Xi HS, Yip KY, Zhuang J, Pazin MJ, Lowdon RF, Dillon LA, Adams LB, Kelly CJ, Zhang J, Wexler JR, Green ED, Good PJ, Feingold EA, Bernstein BE, Birney E, Crawford GE, Dekker J, Elnitski L, Farnham PJ, Gerstein M, Giddings MC, Gingeras TR, Green ED, Guigó R, Hardison RC, Hubbard TJ, Kellis M, Kent W, Lieb JD, Margulies EH, Myers RM, Snyder M, Stamatoyannopoulos JA, Tenenbaum SA, Weng Z, White KP, Wold B, Khatun J, Yu Y, Wrobel J, Risk BA, Gunawardena HP, Kuiper HC, Maier CW, Xie L, Chen X, Giddings MC, Bernstein BE, Epstein CB, Shoresh N, Ernst J, Kheradpour P, Mikkelsen TS, Gillespie S, Goren A, Ram O, Zhang X, Wang L, Issner R, Coyne MJ, Durham T, Ku M, Truong T, Ward LD, Altshuler RC, Eaton ML, Kellis M, Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi A, Tanzer A, Lagarde J, Lin W, Schlesinger F, Xue C, Marinov GK, Khatun J, Williams BA, Zaleski C, Rozowsky J, Röder M, Kokocinski F, Abdelhamid RF, Alioto T, Antoshechkin I, Baer MT, Batut P, Bell I, Bell K, Chakrabortty S, Chen X, Chrast J, Curado J, Derrien T, Drenkow J, Dumais E, Dumais J, Duttagupta R, Fastuca M, Fejes-Toth K, Ferreira P, Foissac S, Fullwood MJ, Gao H, Gonzalez D, Gordon A, Gunawardena HP, Howald C, Jha S, Johnson R, Kapranov P, King B, Kingswood C, Li G, Luo OJ, Park E, Preall JB, Presaud K, Ribeca P, Risk BA, Robyr D, Ruan X, Sammeth M, Sandhu KS, Schaeffer L, See LH, Shahab A, Skancke J, Suzuki AM, Takahashi H, Tilgner H, Trout D, Walters N, Wang H, Wrobel J, Yu Y, Hayashizaki Y, Harrow J, Gerstein M, Hubbard TJ, Reymond A, Antonarakis SE, Hannon GJ, Giddings MC, Ruan Y, Wold B, Carninci P, Guigó R, Gingeras TR, Rosenbloom KR, Sloan CA, Learned K, Malladi VS, Wong MC, Barber GP, Cline MS, Dreszer TR, Heitner SG, Karolchik D, Kent W, Kirkup VM, Meyer LR, Long JC, Maddren M, Raney BJ, Furey TS, Song L, Grasfeder LL, Giresi PG, Lee BK, Battenhouse A, Sheffield NC, Simon JM, Showers KA, Safi A, London D, Bhinge AA, Shestak C, Schaner MR, Kim SK, Zhang ZZ, Mieczkowski PA, Mieczkowska JO, Liu Z, McDaniell RM, Ni Y, Rashid NU, Kim MJ, Adar S, Zhang Z, Wang T, Winter D, Keefe D, Birney E, Iyer VR, Lieb JD, Crawford GE, Li G, Sandhu KS, Zheng M, Wang P, Luo OJ, Shahab A, Fullwood MJ, Ruan X, Ruan Y, Myers RM, Pauli F, Williams BA, Gertz J, Marinov GK, Reddy TE, Vielmetter J, Partridge E, Trout D, Varley KE, Gasper C, Bansal A, Pepke S, Jain P, Amrhein H, Bowling KM, Anaya M, Cross MK, King B, Muratet MA, Antoshechkin I, Newberry KM, McCue K, Nesmith AS, Fisher-Aylor KI, Pusey B, DeSalvo G, Parker SL, Balasubramanian S, Davis NS, Meadows SK, Eggleston T, Gunter C, Newberry J, Levy SE, Absher DM, Mortazavi A, Wong WH, Wold B, Blow MJ, Visel A, Pennachio LA, Elnitski L, Margulies EH, Parker SC, Petrykowska HM, Abyzov A, Aken B, Barrell D, Barson G, Berry A, Bignell A, Boychenko V, Bussotti G, Chrast J, Davidson C, Derrien T, Despacio-Reyes G, Diekhans M, Ezkurdia I, Frankish A, Gilbert J, Gonzalez JM, Griffiths E, Harte R, Hendrix DA, Howald C, Hunt T, Jungreis I, Kay M, Khurana E, Kokocinski F, Leng J, Lin MF, Loveland J, Lu Z, Manthravadi D, Mariotti M, Mudge J, Mukherjee G, Notredame C, Pei B, Rodriguez JM, Saunders G, Sboner A, Searle S, Sisu C, Snow C, Steward C, Tanzer A, Tapanari E, Tress ML, van Baren MJ, Walters N, Washietl S, Wilming L, Zadissa A, Zhang Z, Brent M, Haussler D, Kellis M, Valencia A, Gerstein M, Reymond A, Guigó R, Harrow J, Hubbard TJ, Landt SG, Frietze S, Abyzov A, Addleman N, Alexander RP, Auerbach RK, Balasubramanian S, Bettinger K, Bhardwaj N, Boyle AP, Cao AR, Cayting P, Charos A, Cheng Y, Cheng C, Eastman C, Euskirchen G, Fleming JD, Grubert F, Habegger L, Hariharan M, Harmanci A, Iyengar S, Jin VX, Karczewski KJ, Kasowski M, Lacroute P, Lam H, Lamarre-Vincent N, Leng J, Lian J, Lindahl-Allen M, Min R, Miotto B, Monahan H, Moqtaderi Z, Mu XJ, O'Geen H, Ouyang Z, Patacsil D, Pei B, Raha D, Ramirez L, Reed B, Rozowsky J, Sboner A, Shi M, Sisu C, Slifer T, Witt H, Wu L, Xu X, Yan KK, Yang X, Yip KY, Zhang Z, Struhl K, Weissman SM, Gerstein M, Farnham PJ, Snyder M, Tenenbaum SA, Penalva LO, Doyle F, Karmakar S, Landt SG, Bhanvadia RR, Choudhury A, Domanus M, Ma L, Moran J, Patacsil D, Slifer T, Victorsen A, Yang X, Snyder M, Auer T, Centanin L, Eichenlaub M, Gruhl F, Heermann S, Hoeckendorf B, Inoue D, Kellner T, Kirchmaier S, Mueller C, Reinhardt R, Schertel L, Schneider S, Sinn R, Wittbrodt B, Wittbrodt J, Weng Z, Whitfield TW, Wang J, Collins PJ, Aldred SF, Trinklein ND, Partridge EC, Myers RM, Dekker J, Jain G, Lajoie BR, Sanyal A, Balasundaram G, Bates DL, Byron R, Canfield TK, Diegel MJ, Dunn D, Ebersol AK, Frum T, Garg K, Gist E, Hansen R, Boatman L, Haugen E, Humbert R, Jain G, Johnson AK, Johnson EM, Kutyavin TV, Lajoie BR, Lee K, Lotakis D, Maurano MT, Neph SJ, Neri FV, Nguyen ED, Qu H, Reynolds AP, Roach V, Rynes E, Sabo P, Sanchez ME, Sandstrom RS, Sanyal A, Shafer AO, Stergachis AB, Thomas S, Thurman RE, Vernot B, Vierstra J, Vong S, Wang H, Weaver MA, Yan Y, Zhang M, Akey JM, Bender M, Dorschner MO, Groudine M, MacCoss MJ, Navas P, Stamatoyannopoulos G, Kaul R, Dekker J, Stamatoyannopoulos JA, Dunham I, Beal K, Brazma A, Flicek P, Herrero J, Johnson N, Keefe D, Lukk M, Luscombe NM, Sobral D, Vaquerizas JM, Wilder SP, Batzoglou S, Sidow A, Hussami N, Kyriazopoulou-Panagiotopoulou S, Libbrecht MW, Schaub MA, Kundaje A, Hardison RC, Miller W, Giardine B, Harris RS, Wu W, Bickel PJ, Banfai B, Boley NP, Brown JB, Huang H, Li Q, Li JJ, Noble WS, Bilmes JA, Buske OJ, Hoffman MM, Sahu AD, Kharchenko PV, Park PJ, Baker D, Taylor J, Weng Z, Iyer S, Dong X, Greven M, Lin X, Wang J, Xi HS, Zhuang J, Gerstein M, Alexander RP, Balasubramanian S, Cheng C, Harmanci A, Lochovsky L, Min R, Mu XJ, Rozowsky J, Yan KK, Yip KY, Birney E., and Miotto, Benoit
- Subjects
Encyclopedias as Topic ,[SDV]Life Sciences [q-bio] ,DNA Footprinting ,Genoma humà ,Binding Sites/genetics ,Histones/chemistry/metabolism ,0302 clinical medicine ,Exons/genetics ,ddc:576.5 ,0303 health sciences ,Multidisciplinary ,[SDV.MHEP] Life Sciences [q-bio]/Human health and pathology ,[SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,DNA-Binding Proteins/metabolism ,region ,Chemistry ,Genetic Predisposition to Disease/genetics ,Genomics ,Polymorphism, Single Nucleotide/genetics ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,Neoplasms/genetics ,Chromatin ,Cell biology ,in vivo ,Genetic Variation/genetics ,030220 oncology & carcinogenesis ,Deoxyribonuclease I/metabolism ,Proteins/genetics ,transcription factor-binding ,chromosome conformation capture ,DNA Methylation/genetics ,Chromosomes, Human/genetics/metabolism ,Chromatin Immunoprecipitation ,Mammals/genetics ,DNA/genetics ,determinant ,Article ,03 medical and health sciences ,map ,Animals ,Humans ,Transcription Factors/metabolism ,Alleles ,mouse ,030304 developmental biology ,Transcription, Genetic/genetics ,Chromatin/genetics/metabolism ,Sequence Analysis, RNA ,human cell ,Molecular Sequence Annotation ,Regulatory Sequences, Nucleic Acid/genetics ,Promoter Regions, Genetic/genetics ,DNA binding site ,Genòmica ,Genome, Human/genetics ,chromatin ,[SDV.MHEP]Life Sciences [q-bio]/Human health and pathology ,Genètica ,Genome-Wide Association Study - Abstract
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research. The Consortium is funded by grants from the NHGRI as follows: production grants: U54HG004570 (B. E. Bernstein); U01HG004695 (E. Birney); U54HG004563 (G. E. Crawford); U54HG004557 (T. R. Gingeras); U54HG004555 (T. J. Hubbard); U41HG004568 /n(W. J. Kent); U54HG004576 (R. M. Myers); U54HG004558 (M. Snyder);/nU54HG004592 (J. A. Stamatoyannopoulos). Pilot grants: R01HG003143 (J. Dekker); RC2HG005591 and R01HG003700 (M. C. Giddings); R01HG004456-03 (Y. Ruan); U01HG004571 (S. A. Tenenbaum); U01HG004561 (Z. Weng); RC2HG005679 (K. P. White). This project was supported in part by American Recovery and/nReinvestment Act (ARRA) funds from the NHGRI through grants U54HG004570, U54HG004563, U41HG004568, U54HG004592, R01HG003143, RC2HG005591,R01HG003541, U01HG004561, RC2HG005679andR01HG003988(L. Pennacchio). In addition, work from NHGRI Groups was supported by the Intramural Research/nProgram of the NHGRI (L. Elnitski, ZIAHG200323; E. H. Margulies, ZIAHG200341). Research in the Pennachio laboratory was performed at Lawrence Berkeley National Laboratory and at the United States Department of Energy Joint Genome Institute, Department of Energy Contract DE-AC02-05CH11231, University of California.
- Published
- 2012
14. Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues.
- Author
-
Degalez F, Charles M, Foissac S, Zhou H, Guan D, Fang L, Klopp C, Allain C, Lagoutte L, Lecerf F, Acloque H, Giuffra E, Pitel F, and Lagarrigue S
- Subjects
- Animals, Humans, Chickens genetics, Chickens metabolism, Transcriptome, Molecular Sequence Annotation, Sequence Analysis, RNA, RNA, Long Noncoding genetics, RNA, Long Noncoding metabolism
- Abstract
Gene atlases for livestock are steadily improving thanks to new genome assemblies and new expression data improving the gene annotation. However, gene content varies across databases due to differences in RNA sequencing data and bioinformatics pipelines, especially for long non-coding RNAs (lncRNAs) which have higher tissue and developmental specificity and are harder to consistently identify compared to protein coding genes (PCGs). As done previously in 2020 for chicken assemblies galgal5 and GRCg6a, we provide a new gene atlas, lncRNA-enriched, for the latest GRCg7b chicken assembly, integrating "NCBI RefSeq", "EMBL-EBI Ensembl/GENCODE" reference annotations and other resources such as FAANG and NONCODE. As a result, the number of PCGs increases from 18,022 (RefSeq) and 17,007 (Ensembl) to 24,102, and that of lncRNAs from 5789 (RefSeq) and 11,944 (Ensembl) to 44,428. Using 1400 public RNA-seq transcriptome representing 47 tissues, we provided expression evidence for 35,257 (79%) lncRNAs and 22,468 (93%) PCGs, supporting the relevance of this atlas. Further characterization including tissue-specificity, sex-differential expression and gene configurations are provided. We also identified conserved miRNA-hosting genes with human counterparts, suggesting common function. The annotated atlas is available at gega.sigenae.org., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
15. Cell specification and functional interactions in the pig blastocyst inferred from single-cell transcriptomics and uterine fluids proteomics.
- Author
-
Dufour A, Kurylo C, Stöckl JB, Laloë D, Bailly Y, Manceau P, Martins F, Turhan AG, Ferchaud S, Pain B, Fröhlich T, Foissac S, Artus J, and Acloque H
- Subjects
- Pregnancy, Humans, Female, Swine, Mice, Animals, Embryo Implantation physiology, Embryonic Development genetics, Gene Expression Profiling, Proteomics, Blastocyst metabolism
- Abstract
The embryonic development of the pig comprises a long in utero pre- and peri-implantation development, which dramatically differs from mice and humans. During this peri-implantation period, a complex series of paracrine signals establishes an intimate dialogue between the embryo and the uterus. To better understand the biology of the pig blastocyst during this period, we generated a large dataset of single-cell RNAseq from early and hatched blastocysts, spheroid and ovoid conceptus and proteomic datasets from corresponding uterine fluids. Our results confirm the molecular specificity and functionality of the three main cell populations. We also discovered two previously unknown subpopulations of the trophectoderm, one characterised by the expression of LRP2, which could represent progenitor cells, and the other, expressing pro-apoptotic markers, which could correspond to the Rauber's layer. Our work provides new insights into the biology of these populations, their reciprocal functional interactions, and the molecular dialogue with the maternal uterine environment., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
16. TAGADA: a scalable pipeline to improve genome annotations with RNA-seq data.
- Author
-
Kurylo C, Guyomar C, Foissac S, and Djebali S
- Abstract
Genome annotation plays a crucial role in providing comprehensive catalog of genes and transcripts for a particular species. As research projects generate new transcriptome data worldwide, integrating this information into existing annotations becomes essential. However, most bioinformatics pipelines are limited in their ability to effectively and consistently update annotations using new RNA-seq data. Here we introduce TAGADA, an RNA-seq pipeline for Transcripts And Genes Assembly, Deconvolution, and Analysis. Given a genomic sequence, a reference annotation and RNA-seq reads, TAGADA enhances existing gene models by generating an improved annotation. It also computes expression values for both the reference and novel annotation, identifies long non-coding transcripts (lncRNAs), and provides a comprehensive quality control report. Developed using Nextflow DSL2, TAGADA offers user-friendly functionalities and ensures reproducibility across different computing platforms through its containerized environment. In this study, we demonstrate the efficacy of TAGADA using RNA-seq data from the GENE-SWiTCH project alongside chicken and pig genome annotations as references. Results indicate that TAGADA can substantially increase the number of annotated transcripts by approximately [Formula: see text] in these species. Furthermore, we illustrate how TAGADA can integrate Illumina NovaSeq short reads with PacBio Iso-Seq long reads, showcasing its versatility. TAGADA is available at github.com/FAANG/analysis-TAGADA., (© The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
- Published
- 2023
- Full Text
- View/download PDF
17. Enhancer/gene relationships: Need for more reliable genome-wide reference sets.
- Author
-
Hoellinger T, Mestre C, Aschard H, Le Goff W, Foissac S, Faraut T, and Djebali S
- Abstract
Differences in cells' functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2023 Hoellinger, Mestre, Aschard, Le Goff, Foissac, Faraut and Djebali.)
- Published
- 2023
- Full Text
- View/download PDF
18. Major Reorganization of Chromosome Conformation During Muscle Development in Pig.
- Author
-
Marti-Marimon M, Vialaneix N, Lahbib-Mansais Y, Zytnicki M, Camut S, Robelin D, Yerle-Bouissou M, and Foissac S
- Abstract
The spatial organization of the genome in the nucleus plays a crucial role in eukaryotic cell functions, yet little is known about chromatin structure variations during late fetal development in mammals. We performed in situ high-throughput chromosome conformation capture (Hi-C) sequencing of DNA from muscle samples of pig fetuses at two late stages of gestation. Comparative analysis of the resulting Hi-C interaction matrices between both groups showed widespread differences of different types. First, we discovered a complex landscape of stable and group-specific Topologically Associating Domains (TADs). Investigating the nuclear partition of the chromatin into transcriptionally active and inactive compartments, we observed a genome-wide fragmentation of these compartments between 90 and 110 days of gestation. Also, we identified and characterized the distribution of differential cis - and trans -pairwise interactions. In particular, trans -interactions at chromosome extremities revealed a mechanism of telomere clustering further confirmed by 3D Fluorescence in situ Hybridization (FISH). Altogether, we report major variations of the three-dimensional genome conformation during muscle development in pig, involving several levels of chromatin remodeling and structural regulation., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Marti-Marimon, Vialaneix, Lahbib-Mansais, Zytnicki, Camut, Robelin, Yerle-Bouissou and Foissac.)
- Published
- 2021
- Full Text
- View/download PDF
19. RNA-Seq Data for Reliable SNP Detection and Genotype Calling: Interest for Coding Variant Characterization and Cis -Regulation Analysis by Allele-Specific Expression in Livestock Species.
- Author
-
Jehl F, Degalez F, Bernard M, Lecerf F, Lagoutte L, Désert C, Coulée M, Bouchez O, Leroux S, Abasht B, Tixier-Boichard M, Bed'hom B, Burlot T, Gourichon D, Bardou P, Acloque H, Foissac S, Djebali S, Giuffra E, Zerjal T, Pitel F, Klopp C, and Lagarrigue S
- Abstract
In addition to their common usages to study gene expression, RNA-seq data accumulated over the last 10 years are a yet-unexploited resource of SNPs in numerous individuals from different populations. SNP detection by RNA-seq is particularly interesting for livestock species since whole genome sequencing is expensive and exome sequencing tools are unavailable. These SNPs detected in expressed regions can be used to characterize variants affecting protein functions, and to study cis -regulated genes by analyzing allele-specific expression (ASE) in the tissue of interest. However, gene expression can be highly variable, and filters for SNP detection using the popular GATK toolkit are not yet standardized, making SNP detection and genotype calling by RNA-seq a challenging endeavor. We compared SNP calling results using GATK suggested filters, on two chicken populations for which both RNA-seq and DNA-seq data were available for the same samples of the same tissue. We showed, in expressed regions, a RNA-seq precision of 91% (SNPs detected by RNA-seq and shared by DNA-seq) and we characterized the remaining 9% of SNPs. We then studied the genotype (GT) obtained by RNA-seq and the impact of two factors (GT call-rate and read number per GT) on the concordance of GT with DNA-seq; we proposed thresholds for them leading to a 95% concordance. Applying these thresholds to 767 multi-tissue RNA-seq of 382 birds of 11 chicken populations, we found 9.5 M SNPs in total, of which ∼550,000 SNPs per tissue and population with a reliable GT (call rate ≥ 50%) and among them, ∼340,000 with a MAF ≥ 10%. We showed that such RNA-seq data from one tissue can be used to ( i ) detect SNPs with a strong predicted impact on proteins, despite their scarcity in each population (16,307 SIFT deleterious missenses and 590 stop-gained), ( ii ) study, on a large scale, cis -regulations of gene expression, with ∼81% of protein-coding and 68% of long non-coding genes (TPM ≥ 1) that can be analyzed for ASE, and with ∼29% of them that were cis -regulated, and ( iii ) analyze population genetic using such SNPs located in expressed regions. This work shows that RNA-seq data can be used with good confidence to detect SNPs and associated GT within various populations and used them for different analyses as GTEx studies., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Jehl, Degalez, Bernard, Lecerf, Lagoutte, Désert, Coulée, Bouchez, Leroux, Abasht, Tixier-Boichard, Bed’hom, Burlot, Gourichon, Bardou, Acloque, Foissac, Djebali, Giuffra, Zerjal, Pitel, Klopp and Lagarrigue.)
- Published
- 2021
- Full Text
- View/download PDF
20. Author Correction: An integrative atlas of chicken long non-coding genes and their annotations across 25 tissues.
- Author
-
Jehl F, Muret K, Bernard M, Boutin M, Lagoutte L, Désert C, Dehais P, Esquerré D, Acloque H, Giuffra E, Djebali S, Foissac S, Derrien T, Pitel F, Zerjal T, Klopp C, and Lagarrigue S
- Published
- 2021
- Full Text
- View/download PDF
21. An integrative atlas of chicken long non-coding genes and their annotations across 25 tissues.
- Author
-
Jehl F, Muret K, Bernard M, Boutin M, Lagoutte L, Désert C, Dehais P, Esquerré D, Acloque H, Giuffra E, Djebali S, Foissac S, Derrien T, Pitel F, Zerjal T, Klopp C, and Lagarrigue S
- Subjects
- Animals, Atlases as Topic, Avian Proteins genetics, Gene Expression Profiling, Gene Expression Regulation, Gene Regulatory Networks, MicroRNAs genetics, Organ Specificity, Sequence Analysis, RNA, Tissue Distribution, Chickens genetics, Computational Biology methods, Molecular Sequence Annotation methods, RNA, Long Noncoding genetics
- Abstract
Long non-coding RNAs (LNC) regulate numerous biological processes. In contrast to human, the identification of LNC in farm species, like chicken, is still lacunar. We propose a catalogue of 52,075 chicken genes enriched in LNC ( http://www.fragencode.org/ ), built from the Ensembl reference extended using novel LNC modelled here from 364 RNA-seq and LNC from four public databases. The Ensembl reference grew from 4,643 to 30,084 LNC, of which 59% and 41% with expression ≥ 0.5 and ≥ 1 TPM respectively. Characterization of these LNC relatively to the closest protein coding genes (PCG) revealed that 79% of LNC are in intergenic regions, as in other species. Expression analysis across 25 tissues revealed an enrichment of co-expressed LNC:PCG pairs, suggesting co-regulation and/or co-function. As expected LNC were more tissue-specific than PCG (25% vs. 10%). Similarly to human, 16% of chicken LNC hosted one or more miRNA. We highlighted a new chicken LNC, hosting miR155, conserved in human, highly expressed in immune tissues like miR155, and correlated with immunity-related PCG in both species. Among LNC:PCG pairs tissue-specific in the same tissue, we revealed an enrichment of divergent pairs with the PCG coding transcription factors, as for example LHX5, HXD3 and TBX4, in both human and chicken.
- Published
- 2020
- Full Text
- View/download PDF
22. Multi-species annotation of transcriptome and chromatin structure in domesticated animals.
- Author
-
Foissac S, Djebali S, Munyard K, Vialaneix N, Rau A, Muret K, Esquerré D, Zytnicki M, Derrien T, Bardou P, Blanc F, Cabau C, Crisci E, Dhorne-Pollet S, Drouet F, Faraut T, Gonzalez I, Goubil A, Lacroix-Lamandé S, Laurent F, Marthey S, Marti-Marimon M, Momal-Leisenring R, Mompart F, Quéré P, Robelin D, Cristobal MS, Tosser-Klopp G, Vincent-Naulleau S, Fabre S, Pinard-Van der Laan MH, Klopp C, Tixier-Boichard M, Acloque H, Lagarrigue S, and Giuffra E
- Subjects
- Animals, Cattle, Chickens, Goats, Phylogeny, Sus scrofa, Animals, Domestic genetics, Chromatin genetics, Molecular Sequence Annotation, Transcriptome
- Abstract
Background: Comparative genomics studies are central in identifying the coding and non-coding elements associated with complex traits, and the functional annotation of genomes is a critical step to decipher the genotype-to-phenotype relationships in livestock animals. As part of the Functional Annotation of Animal Genomes (FAANG) action, the FR-AgENCODE project aimed to create reference functional maps of domesticated animals by profiling the landscape of transcription (RNA-seq), chromatin accessibility (ATAC-seq) and conformation (Hi-C) in species representing ruminants (cattle, goat), monogastrics (pig) and birds (chicken), using three target samples related to metabolism (liver) and immunity (CD4+ and CD8+ T cells)., Results: RNA-seq assays considerably extended the available catalog of annotated transcripts and identified differentially expressed genes with unknown function, including new syntenic lncRNAs. ATAC-seq highlighted an enrichment for transcription factor binding sites in differentially accessible regions of the chromatin. Comparative analyses revealed a core set of conserved regulatory regions across species. Topologically associating domains (TADs) and epigenetic A/B compartments annotated from Hi-C data were consistent with RNA-seq and ATAC-seq data. Multi-species comparisons showed that conserved TAD boundaries had stronger insulation properties than species-specific ones and that the genomic distribution of orthologous genes in A/B compartments was significantly conserved across species., Conclusions: We report the first multi-species and multi-assay genome annotation results obtained by a FAANG project. Beyond the generation of reference annotations and the confirmation of previous findings on model animals, the integrative analysis of data from multiple assays and species sheds a new light on the multi-scale selective pressure shaping genome organization from birds to mammals. Overall, these results emphasize the value of FAANG for research on domesticated animals and reinforces the importance of future meta-analyses of the reference datasets being generated by this community on different species.
- Published
- 2019
- Full Text
- View/download PDF
23. Identification of a t(3;4)(p1.3;q1.5) translocation breakpoint in pigs using somatic cell hybrid mapping and high-resolution mate-pair sequencing.
- Author
-
Fève K, Foissac S, Pinton A, Mompart F, Esquerré D, Faraut T, Yerle M, and Riquet J
- Subjects
- Animals, Swine, ADAMTS Proteins genetics, Chromosome Mapping, Chromosomes, Mammalian genetics, High-Throughput Nucleotide Sequencing, Models, Genetic, Translocation, Genetic
- Abstract
Reciprocal translocations are the most frequently occurring constitutional structural rearrangements in mammalian genomes. In phenotypically normal pigs, an incidence of 1/200 is estimated for such rearrangements. Even if constitutional translocations do not necessarily induce defects and diseases, they are responsible for significant economic losses in domestic animals due to reproduction failures. Over the last 30 years, advances in molecular and cytogenetic technologies have led to major improvements in the resolution of the characterization of translocation events. Characterization of translocation breakpoints helps to decipher the mechanisms that lead to such rearrangements and the functions of the genes that are involved in the translocation. Here, we describe the fine characterization of a reciprocal translocation t(3;4) (p1.3;q1.5) detected in a pig line. The breakpoint was identified at the base-pair level using a positional cloning and chromosome walking strategy in somatic cell hybrids that were generated from an animal that carries this translocation. We show that this translocation occurs within the ADAMTSL4 gene and results in a loss of expression in homozygous carriers. In addition, by taking this translocation as a model, we used a whole-genome next-generation mate-pair sequencing approach on pooled individuals to evaluate this strategy for high-throughput screening of structural rearrangements.
- Published
- 2017
- Full Text
- View/download PDF
24. Long noncoding RNA repertoire in chicken liver and adipose tissue.
- Author
-
Muret K, Klopp C, Wucher V, Esquerré D, Legeai F, Lecerf F, Désert C, Boutin M, Jehl F, Acloque H, Giuffra E, Djebali S, Foissac S, Derrien T, and Lagarrigue S
- Subjects
- Animals, Chickens metabolism, Conserved Sequence, Evolution, Molecular, Gene Expression Profiling, Gene Expression Regulation, Genome, Genotype, Humans, Lipid Metabolism genetics, Open Reading Frames, Organ Specificity, Phenotype, Quantitative Trait Loci, RNA, Antisense, RNA, Long Noncoding chemistry, RNA, Messenger genetics, Adipose Tissue metabolism, Chickens genetics, Liver metabolism, RNA, Long Noncoding genetics, Transcriptome
- Abstract
Background: Improving functional annotation of the chicken genome is a key challenge in bridging the gap between genotype and phenotype. Among all transcribed regions, long noncoding RNAs (lncRNAs) are a major component of the transcriptome and its regulation, and whole-transcriptome sequencing (RNA-Seq) has greatly improved their identification and characterization. We performed an extensive profiling of the lncRNA transcriptome in the chicken liver and adipose tissue by RNA-Seq. We focused on these two tissues because of their importance in various economical traits for which energy storage and mobilization play key roles and also because of their high cell homogeneity. To predict lncRNAs, we used a recently developed tool called FEELnc, which also classifies them with respect to their distance and strand orientation to the closest protein-coding genes. Moreover, to confidently identify the genes/transcripts expressed in each tissue (a complex task for weakly expressed molecules such as lncRNAs), we probed a particularly large number of biological replicates (16 per tissue) compared to common multi-tissue studies with a larger set of tissues but less sampling., Results: We predicted 2193 lncRNA genes, among which 1670 were robustly expressed across replicates in the liver and/or adipose tissue and which were classified into 1493 intergenic and 177 intragenic lncRNAs located between and within protein-coding genes, respectively. We observed similar structural features between chickens and mammals, with strong synteny conservation but without sequence conservation. As previously reported, we confirm that lncRNAs have a lower and more tissue-specific expression than mRNAs. Finally, we showed that adjacent lncRNA-mRNA genes in divergent orientation have a higher co-expression level when separated by less than 1 kb compared to more distant divergent pairs. Among these, we highlighted for the first time a novel lncRNA candidate involved in lipid metabolism, lnc_DHCR24, which is highly correlated with the DHCR24 gene that encodes a key enzyme of cholesterol biosynthesis., Conclusions: We provide a comprehensive lncRNA repertoire in the chicken liver and adipose tissue, which shows interesting patterns of co-expression between mRNAs and lncRNAs. It contributes to improving the structural and functional annotation of the chicken genome and provides a basis for further studies on energy storage and mobilization traits in the chicken.
- Published
- 2017
- Full Text
- View/download PDF
25. Bioinformatics Pipeline for Transcriptome Sequencing Analysis.
- Author
-
Djebali S, Wucher V, Foissac S, Hitte C, Corre E, and Derrien T
- Subjects
- High-Throughput Nucleotide Sequencing, Humans, K562 Cells, Sequence Analysis, RNA, Workflow, Computational Biology methods, Gene Expression Profiling methods
- Abstract
The development of High Throughput Sequencing (HTS) for RNA profiling (RNA-seq) has shed light on the diversity of transcriptomes. While RNA-seq is becoming a de facto standard for monitoring the population of expressed transcripts in a given condition at a specific time, processing the huge amount of data it generates requires dedicated bioinformatics programs. Here, we describe a standard bioinformatics protocol using state-of-the-art tools, the STAR mapper to align reads onto a reference genome, Cufflinks to reconstruct the transcriptome, and RSEM to quantify expression levels of genes and transcripts. We present the workflow using human transcriptome sequencing data from two biological replicates of the K562 cell line produced as part of the ENCODE3 project.
- Published
- 2017
- Full Text
- View/download PDF
26. Erratum to: Bioinformatics Pipeline for Transcriptome Sequencing Analysis.
- Author
-
Djebali S, Wucher V, Foissac S, Hitte C, Corre E, and Derrien T
- Published
- 2017
- Full Text
- View/download PDF
27. Modeling of autosomal-dominant retinitis pigmentosa in Caenorhabditis elegans uncovers a nexus between global impaired functioning of certain splicing factors and cell type-specific apoptosis.
- Author
-
Rubio-Peña K, Fontrodona L, Aristizábal-Corrales D, Torres S, Cornes E, García-Rodríguez FJ, Serrat X, González-Knowles D, Foissac S, Porta-De-La-Riva M, and Cerón J
- Subjects
- Animals, Ataxia Telangiectasia Mutated Proteins genetics, Ataxia Telangiectasia Mutated Proteins metabolism, Caenorhabditis elegans, Caenorhabditis elegans Proteins genetics, Caenorhabditis elegans Proteins metabolism, Genes, Dominant, Organ Specificity, RNA Interference, RNA Splicing, Repressor Proteins genetics, Repressor Proteins metabolism, Retinitis Pigmentosa pathology, Ribonucleoprotein, U4-U6 Small Nuclear genetics, Ribonucleoprotein, U5 Small Nuclear genetics, Apoptosis, Retinitis Pigmentosa genetics
- Abstract
Retinitis pigmentosa (RP) is a rare genetic disease that causes gradual blindness through retinal degeneration. Intriguingly, seven of the 24 genes identified as responsible for the autosomal-dominant form (adRP) are ubiquitous spliceosome components whose impairment causes disease only in the retina. The fact that these proteins are essential in all organisms hampers genetic, genomic, and physiological studies, but we addressed these difficulties by using RNAi in Caenorhabditis elegans. Our study of worm phenotypes produced by RNAi of splicing-related adRP (s-adRP) genes functionally distinguishes between components of U4 and U5 snRNP complexes, because knockdown of U5 proteins produces a stronger phenotype. RNA-seq analyses of worms where s-adRP genes were partially inactivated by RNAi, revealed mild intron retention in developing animals but not in adults, suggesting a positive correlation between intron retention and transcriptional activity. Interestingly, RNAi of s-adRP genes produces an increase in the expression of atl-1 (homolog of human ATR), which is normally activated in response to replicative stress and certain DNA-damaging agents. The up-regulation of atl-1 correlates with the ectopic expression of the pro-apoptotic gene egl-1 and apoptosis in hypodermal cells, which produce the cuticle, but not in other cell types. Our model in C. elegans resembles s-adRP in two aspects: The phenotype caused by global knockdown of s-adRP genes is cell type-specific and associated with high transcriptional activity. Finally, along with a reduced production of mature transcripts, we propose a model in which the retina-specific cell death in s-adRP patients can be induced through genomic instability., (© 2015 Rubio-Peña et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.)
- Published
- 2015
- Full Text
- View/download PDF
28. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project.
- Author
-
Andersson L, Archibald AL, Bottema CD, Brauning R, Burgess SC, Burt DW, Casas E, Cheng HH, Clarke L, Couldrey C, Dalrymple BP, Elsik CG, Foissac S, Giuffra E, Groenen MA, Hayes BJ, Huang LS, Khatib H, Kijas JW, Kim H, Lunney JK, McCarthy FM, McEwan JC, Moore S, Nanduri B, Notredame C, Palti Y, Plastow GS, Reecy JM, Rohrer GA, Sarropoulou E, Schmidt CJ, Silverstein J, Tellam RL, Tixier-Boichard M, Tosser-Klopp G, Tuggle CK, Vilkki J, White SN, Zhao S, and Zhou H
- Subjects
- Animals, Databases, Genetic, Genomics, Software, Animals, Domestic genetics, Genome, Molecular Sequence Annotation
- Abstract
We describe the organization of a nascent international effort, the Functional Annotation of Animal Genomes (FAANG) project, whose aim is to produce comprehensive maps of functional elements in the genomes of domesticated animal species.
- Published
- 2015
- Full Text
- View/download PDF
29. Analysis of alternative splicing events in custom gene datasets by AStalavista.
- Author
-
Foissac S and Sammeth M
- Subjects
- Alternative Splicing genetics, High-Throughput Nucleotide Sequencing, Humans, Transcriptome, Alternative Splicing physiology, Computational Biology methods
- Abstract
Alternative splicing (AS) is a eukaryotic principle to derive more than one RNA product from transcribed genes by removing distinct subsets of introns from a premature polymer. We know today that this process is highly regulated and makes up a large part of the differences between species, cell types, and states. The key to compare AS across different genes or organisms is to tokenize the AS phenomenon into atomary units, so-called AS events. These events then usually are grouped by common patterns to investigate the underlying molecular mechanisms that drive their regulation. However, attempts to decompose loci with AS observations into events are often hampered by applying a limited set of a priori defined event patterns which are not capable to describe all AS configurations and therefore cannot decompose the phenomenon exhaustively. In this chapter, we describe working scenarios of AStalavista, a computational method that reports all AS events reflected by transcript annotations. We show how to practically employ AStalavista to study AS variation in complex transcriptomes, as characterized by the human GENCODE annotation. Our examples demonstrate how the inherent and universal AStalavista paradigm allows for an automatic delineation of AS events in custom gene datasets. Additionally, we sketch an example of an AStalavista use case including next-generation sequencing data (RNA-Seq) to enrich the landscape of discovered AS events.
- Published
- 2015
- Full Text
- View/download PDF
30. An overview of gene expression dynamics during early ovarian folliculogenesis: specificity of follicular compartments and bi-directional dialog.
- Author
-
Bonnet A, Cabau C, Bouchez O, Sarry J, Marsaud N, Foissac S, Woloszyn F, Mulsant P, and Mandon-Pepin B
- Subjects
- Animals, Cell Communication genetics, Cluster Analysis, Computational Biology, Female, Granulosa Cells metabolism, Humans, Molecular Sequence Annotation, Oocytes metabolism, Organ Specificity genetics, Reproducibility of Results, Signal Transduction, Transcription Factors genetics, Transcription Factors metabolism, Gene Expression Regulation, Ovarian Follicle physiology, Transcriptome
- Abstract
Background: Successful early folliculogenesis is crucial for female reproductive function. It requires appropriate gene specific expression of the different types of ovarian cells at different developmental stages. To date, most gene expression studies on the ovary were conducted in rodents and did not distinguish the type of cell. In mono-ovulating species, few studies have addressed gene expression profiles and mainly concerned human oocytes., Results: We used a laser capture microdissection method combined with RNA-seq technology to explore the transcriptome in oocytes and granulosa cells (GCs) during development of the sheep ovarian follicle. We first documented the expression profile of 15 349 genes, then focused on the 5 129 genes showing differential expression between oocytes and GCs. Enriched functional categories such as oocyte meiotic arrest and GC steroid synthesis reflect two distinct cell fates. We identified the implication of GC signal transduction pathways such as SHH, WNT and RHO GTPase. In addition, signaling pathways (VEGF, NOTCH, IGF1, etc.) and GC transzonal projections suggest the existence of complex cell-cell interactions. Finally, we highlighted several transcription regulators and specifically expressed genes that likely play an important role in early folliculogenesis., Conclusions: To our knowledge, this is the first comprehensive exploration of transcriptomes derived from in vivo oocytes and GCs at key stages in early follicular development in sheep. Collectively, our data advance our understanding of early folliculogenesis in mono-ovulating species and will be a valuable resource for unraveling human ovarian dysfunction such as premature ovarian failure (POF).
- Published
- 2013
- Full Text
- View/download PDF
31. Landscape of transcription in human cells.
- Author
-
Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi A, Tanzer A, Lagarde J, Lin W, Schlesinger F, Xue C, Marinov GK, Khatun J, Williams BA, Zaleski C, Rozowsky J, Röder M, Kokocinski F, Abdelhamid RF, Alioto T, Antoshechkin I, Baer MT, Bar NS, Batut P, Bell K, Bell I, Chakrabortty S, Chen X, Chrast J, Curado J, Derrien T, Drenkow J, Dumais E, Dumais J, Duttagupta R, Falconnet E, Fastuca M, Fejes-Toth K, Ferreira P, Foissac S, Fullwood MJ, Gao H, Gonzalez D, Gordon A, Gunawardena H, Howald C, Jha S, Johnson R, Kapranov P, King B, Kingswood C, Luo OJ, Park E, Persaud K, Preall JB, Ribeca P, Risk B, Robyr D, Sammeth M, Schaffer L, See LH, Shahab A, Skancke J, Suzuki AM, Takahashi H, Tilgner H, Trout D, Walters N, Wang H, Wrobel J, Yu Y, Ruan X, Hayashizaki Y, Harrow J, Gerstein M, Hubbard T, Reymond A, Antonarakis SE, Hannon G, Giddings MC, Ruan Y, Wold B, Carninci P, Guigó R, and Gingeras TR
- Subjects
- Alleles, Cell Line, DNA, Intergenic genetics, Enhancer Elements, Genetic, Exons genetics, Gene Expression Profiling, Genes genetics, Genomics, Humans, Polyadenylation genetics, Protein Isoforms genetics, RNA biosynthesis, RNA genetics, RNA Editing genetics, RNA Splicing genetics, Repetitive Sequences, Nucleic Acid genetics, Sequence Analysis, RNA, DNA genetics, Encyclopedias as Topic, Genome, Human genetics, Molecular Sequence Annotation, Regulatory Sequences, Nucleic Acid genetics, Transcription, Genetic genetics, Transcriptome genetics
- Abstract
Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene.
- Published
- 2012
- Full Text
- View/download PDF
32. Evidence for transcript networks composed of chimeric RNAs in human cells.
- Author
-
Djebali S, Lagarde J, Kapranov P, Lacroix V, Borel C, Mudge JM, Howald C, Foissac S, Ucla C, Chrast J, Ribeca P, Martin D, Murray RR, Yang X, Ghamsari L, Lin C, Bell I, Dumais E, Drenkow J, Tress ML, Gelpí JL, Orozco M, Valencia A, van Berkum NL, Lajoie BR, Vidal M, Stamatoyannopoulos J, Batut P, Dobin A, Harrow J, Hubbard T, Dekker J, Frankish A, Salehi-Ashtiani K, Reymond A, Antonarakis SE, Guigó R, and Gingeras TR
- Subjects
- Algorithms, Chimerin Proteins chemistry, Chimerin Proteins genetics, Chromosomes, Human, Pair 1 genetics, Female, Gene Expression Profiling, Gene Regulatory Networks genetics, Humans, Male, Microarray Analysis methods, Models, Biological, Nucleic Acid Amplification Techniques methods, RNA genetics, RNA Isoforms chemistry, RNA Isoforms genetics, RNA Isoforms metabolism, Transcription, Genetic genetics, Validation Studies as Topic, Cells metabolism, Gene Regulatory Networks physiology, RNA physiology, Transcriptome physiology
- Abstract
The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5' and 3' transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network.
- Published
- 2012
- Full Text
- View/download PDF
33. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation.
- Author
-
Ozsolak F, Kapranov P, Foissac S, Kim SW, Fishilevich E, Monaghan AP, John B, and Milos PM
- Subjects
- Humans, Polyadenylation, Saccharomyces cerevisiae genetics, Saccharomyces cerevisiae metabolism, Gene Expression Profiling, Liver metabolism, Poly A analysis, Sequence Analysis, RNA methods
- Abstract
The emerging discoveries on the link between polyadenylation and disease states underline the need to fully characterize genome-wide polyadenylation states. Here, we report comprehensive maps of global polyadenylation events in human and yeast generated using refinements to the Direct RNA Sequencing technology. This direct approach provides a quantitative view of genome-wide polyadenylation states in a strand-specific manner and requires only attomole RNA quantities. The polyadenylation profiles revealed an abundance of unannotated polyadenylation sites, alternative polyadenylation patterns, and regulatory element-associated poly(A)(+) RNAs. We observed differences in sequence composition surrounding canonical and noncanonical human polyadenylation sites, suggesting novel noncoding RNA-specific polyadenylation mechanisms in humans. Furthermore, we observed the correlation level between sense and antisense transcripts to depend on gene expression levels, supporting the view that overlapping transcription from opposite strands may play a regulatory role. Our data provide a comprehensive view of the polyadenylation state and overlapping transcription., (Copyright © 2010 Elsevier Inc. All rights reserved.)
- Published
- 2010
- Full Text
- View/download PDF
34. New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism.
- Author
-
Kapranov P, Ozsolak F, Kim SW, Foissac S, Lipson D, Hart C, Roels S, Borel C, Antonarakis SE, Monaghan AP, John B, and Milos PM
- Subjects
- Base Sequence, HeLa Cells, Humans, Models, Genetic, Nucleotides genetics, Poly A genetics, Poly A metabolism, Poly U genetics, Poly U metabolism, RNA biosynthesis, RNA genetics, RNA, Antisense classification, RNA, Antisense genetics, RNA, Antisense metabolism, Templates, Genetic, Genes genetics, RNA classification, RNA metabolism
- Abstract
Small (<200 nucleotide) RNA (sRNA) profiling of human cells using various technologies demonstrates unexpected complexity of sRNAs with hundreds of thousands of sRNA species present. Genetic and in vitro studies show that these RNAs are not merely degradation products of longer transcripts but could indeed have a function. Furthermore, profiling of RNAs, including the sRNAs, can reveal not only novel transcripts, but also make clear predictions about the existence and properties of novel biochemical pathways operating in a cell. For example, sRNA profiling in human cells indicated the existence of an unknown capping mechanism operating on cleaved RNA, a biochemical component of which was later identified. Here we show that human cells contain a novel type of sRNA that has non-genomically encoded 5' poly(U) tails. The presence of these RNAs at the termini of genes, specifically at the very 3' ends of known mRNAs, strongly argues for the presence of a yet uncharacterized endogenous biochemical pathway in cells that can copy RNA. We show that this pathway can operate on multiple genes, with specific enrichment towards transcript-encoding components of the translational machinery. Finally, we show that genes are also flanked by sense, 3' polyadenylated sRNAs that are likely to be capped.
- Published
- 2010
- Full Text
- View/download PDF
35. A general definition and nomenclature for alternative splicing events.
- Author
-
Sammeth M, Foissac S, and Guigó R
- Subjects
- Animals, Bees genetics, Caenorhabditis elegans genetics, Cattle, Chickens genetics, Cluster Analysis, Databases, Genetic, Dogs, Drosophila melanogaster genetics, Gene Expression Profiling classification, Gene Expression Profiling methods, Genomics methods, Humans, Mice, Pan troglodytes genetics, Rats, Sequence Analysis, RNA, Species Specificity, Terminology as Topic, Xenopus genetics, Zebrafish genetics, Alternative Splicing, RNA Splice Sites genetics, Software
- Abstract
Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells is one of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenon contributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora of different transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify the different types of reflected splicing variation. In this work, we present a general definition of the AS event along with a notation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assigns a specific "AS code" to every possible pattern of splicing variation. On the basis of this definition and the corresponding codes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of AS events in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversity across genes, chromosomes, and species. Our analysis reveals that a substantial part--in human more than a quarter-of the observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate and to compare the AS landscape of different reference annotation sets in human and in other metazoan species and found that proportions of AS events change substantially depending on the annotation protocol, species-specific attributes, and coding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conduct specific studies investigating the occurrence, impact, and regulation of AS.
- Published
- 2008
- Full Text
- View/download PDF
36. Efficient targeted transcript discovery via array-based normalization of RACE libraries.
- Author
-
Djebali S, Kapranov P, Foissac S, Lagarde J, Reymond A, Ucla C, Wyss C, Drenkow J, Dumais E, Murray RR, Lin C, Szeto D, Denoeud F, Calvo M, Frankish A, Harrow J, Makrythanasis P, Vidal M, Salehi-Ashtiani K, Antonarakis SE, Gingeras TR, and Guigó R
- Subjects
- Alternative Splicing, Chromosomes, Human, Pair 21 genetics, Chromosomes, Human, Pair 22 genetics, Cloning, Molecular, Exons, Genome, Human, Humans, Molecular Sequence Data, Oligonucleotide Array Sequence Analysis methods, Protein Isoforms genetics, Reverse Transcriptase Polymerase Chain Reaction, Transcription, Genetic, DNA, Complementary genetics, Gene Expression Profiling methods, Gene Library, Nucleic Acid Amplification Techniques methods, RNA genetics
- Abstract
Rapid amplification of cDNA ends (RACE) is a widely used approach for transcript identification. Random clone selection from the RACE mixture, however, is an ineffective sampling strategy if the dynamic range of transcript abundances is large. To improve sampling efficiency of human transcripts, we hybridized the products of the RACE reaction onto tiling arrays and used the detected exons to delineate a series of reverse-transcriptase (RT)-PCRs, through which the original RACE transcript population was segregated into simpler transcript populations. We independently cloned the products and sequenced randomly selected clones. This approach, RACEarray, is superior to direct cloning and sequencing of RACE products because it specifically targets new transcripts and often results in overall normalization of transcript abundance. We show theoretically and experimentally that this strategy leads indeed to efficient sampling of new transcripts, and we investigated multiplexing the strategy by pooling RACE reactions from multiple interrogated loci before hybridization.
- Published
- 2008
- Full Text
- View/download PDF
37. A combinatorial code for CPE-mediated translational control.
- Author
-
Piqué M, López JM, Foissac S, Guigó R, and Méndez R
- Subjects
- 3' Untranslated Regions genetics, Animals, Cyclin B genetics, Cyclin B metabolism, Cytoplasm metabolism, Humans, Meiosis, Mice, Mutagenesis, Oocytes metabolism, Progesterone pharmacology, RNA, Messenger, Stored metabolism, RNA-Binding Proteins metabolism, Transcription Factors metabolism, Xenopus Proteins metabolism, Xenopus laevis, mRNA Cleavage and Polyadenylation Factors metabolism, 3' Untranslated Regions metabolism, Gene Expression Regulation, Polyadenylation drug effects, Protein Biosynthesis, RNA 3' Polyadenylation Signals
- Abstract
Cytoplasmic polyadenylation plays a key role in the translational control of mRNAs driving biological processes such as gametogenesis, cell-cycle progression, and synaptic plasticity. What determines the distinct time of polyadenylation and extent of translational control of a given mRNA, however, is poorly understood. The polyadenylation-regulated translation is controlled by the cytoplasmic polyadenylation element (CPE) and its binding protein, CPEB, which can assemble both translational repression or activation complexes. Using a combination of mutagenesis and experimental validation of genome-wide computational predictions, we show that the number and relative position of two elements, the CPE and the Pumilio-binding element, with respect to the polyadenylation signal define a combinatorial code that determines whether an mRNA will be translationally repressed by CPEB, as well as the extent and time of cytoplasmic polyadenylation-dependent translational activation.
- Published
- 2008
- Full Text
- View/download PDF
38. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.
- Author
-
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SC, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CW, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JN, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PI, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VV, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, and de Jong PJ
- Subjects
- Chromatin genetics, Chromatin metabolism, Chromatin Immunoprecipitation, Conserved Sequence genetics, DNA Replication, Evolution, Molecular, Exons genetics, Genetic Variation genetics, Heterozygote, Histones metabolism, Humans, Pilot Projects, Protein Binding, RNA, Messenger genetics, RNA, Untranslated genetics, Transcription Factors metabolism, Transcription Initiation Site, Genome, Human genetics, Genomics, Regulatory Sequences, Nucleic Acid genetics, Transcription, Genetic genetics
- Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
- Published
- 2007
- Full Text
- View/download PDF
39. Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions.
- Author
-
Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, Alioto T, Manzano C, Chrast J, Dike S, Wyss C, Henrichsen CN, Holroyd N, Dickson MC, Taylor R, Hance Z, Foissac S, Myers RM, Rogers J, Hubbard T, Harrow J, Guigó R, Gingeras TR, Antonarakis SE, and Reymond A
- Subjects
- DNA, Complementary genetics, Human Genome Project, Humans, Open Reading Frames, Chromosome Mapping, Exons, Genome, Human, Promoter Regions, Genetic, Quantitative Trait Loci, Transcription, Genetic physiology
- Abstract
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.
- Published
- 2007
- Full Text
- View/download PDF
40. Integrating alternative splicing detection into gene prediction.
- Author
-
Foissac S and Schiex T
- Subjects
- Algorithms, Arabidopsis genetics, Codon, Computer Graphics, DNA, Complementary metabolism, Databases, Nucleic Acid, Databases, Protein, Exons, Expressed Sequence Tags, Gene Expression Profiling, Genes, Plant, Genome, Genome, Human, Genomics, Humans, Introns, Models, Genetic, RNA Splice Sites, Sequence Alignment, Sequence Analysis, Protein, Sequence Analysis, RNA, Software, Transcription, Genetic, User-Computer Interface, Alternative Splicing, Databases, Genetic, Proteomics methods
- Abstract
Background: Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders., Results: We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage)., Conclusions: This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.