59 results on '"Kuhn RM"'
Search Results
2. Towards an evidence-based process for the clinical interpretation of copy number variation.
- Author
-
Riggs, ER, Church, DM, Hanson, K, Horner, VL, Kaminsky, EB, Kuhn, RM, Wain, KE, Williams, ES, Aradhya, S, Kearney, HM, Ledbetter, DH, South, ST, Thorland, EC, and Martin, CL
- Subjects
EVIDENCE-based medicine ,MEDICAL decision making ,GENETIC testing ,MICROARRAY technology ,GENOMES ,PHENOTYPES ,MEDICAL genetics - Abstract
Riggs ER, Church DM, Hanson K, Horner VL, Kaminsky EB, Kuhn RM, Wain KE, Williams ES, Aradhya S, Kearney HM, Ledbetter DH, South ST, Thorland EC, Martin CL. Towards an evidence-based process for the clinical interpretation of copy number variation. The evidence-based review (EBR) process has been widely used to develop standards for medical decision-making and to explore complex clinical questions. This approach can be applied to genetic tests, such as chromosomal microarrays, in order to assist in the clinical interpretation of certain copy number variants (CNVs), particularly those that are rare, and guide array design for optimal clinical utility. To address these issues, the International Standards for Cytogenomic Arrays Consortium has established an EBR Work Group charged with building a framework to systematically assess the potential clinical relevance of CNVs throughout the genome. This group has developed a rating system enumerating the evidence supporting or refuting dosage sensitivity for individual genes and regions that considers the following criteria: number of causative mutations reported; patterns of inheritance; consistency of phenotype; evidence from large-scale case-control studies; mutational mechanisms; data from public genome variation databases; and expert consensus opinion. The system is designed to be dynamic in nature, with regions being reevaluated periodically to incorporate emerging evidence. The evidence collected will be displayed within a publically available database, and can be used in part to inform clinical laboratory CNV interpretations as well as to guide array design. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
3. A User's Guide to the Encyclopedia of DNA Elements (ENCODE)
- Author
-
Zhi Lu, Giltae Song, Troy W. Whitfield, Vishwanath R. Iyer, Teresa Vales, Angelika Merkel, Max Libbrecht, David Haussler, Ting Wang, Kristen Lee, Lingyun Song, Richard M. Myers, Alfonso Valencia, Rachel A. Harte, Xiaoqin Xu, Lucas D. Ward, Hazuki Takahashi, Nathan C. Sheffield, Thomas Derrien, Georgi K. Marinov, Eric D. Nguyen, Bernard B. Suh, Brian J. Raney, Richard Sandstrom, Thomas D. Tullius, Benoit Miotto, Alexander Dobin, Youhan Xu, Lukas Habegger, Ian Dunham, Brian A. Risk, Paul G. Giresi, Morgan C. Giddings, Hualin Xi, Anshul Kundaje, Robert S. Harris, Devin Absher, Peter J. Bickel, Yanbao Yu, Browen Aken, Colin Kingswood, Bryan R. Lajoie, Peter J. Good, Katrina Learned, Laura Elnitski, Shirley Pepke, Brandon King, Piero Carninci, Xinqiong Yang, Ghia Euskirchen, Kathryn Beal, Christelle Borel, Michael Muratet, Robert L. Grossman, David G. Knowles, Zarmik Moqtaderi, Veronika Boychenko, Steven P. Wilder, Michael L. Tress, Florencia Pauli, Alan P. Boyle, Andrea Tanzer, Philipp Kapranov, Serafim Batzoglou, Audra K. Johnson, Jun Neri, Nitin Bhardwaj, Elise A. Feingold, Venkat S. Malladi, Michael M. Hoffman, William Stafford Noble, Andrea Sboner, Mark Gerstein, Stephanie L. Parker, Jacqueline Dumais, Felix Schlesinger, Deborah R. Winter, Randall H. Brown, Thanh Truong, Rebecca F. Lowdon, Paolo Ribeca, Brooke Rhead, Peggy J. Farnham, Krista Thibeault, Terrence S. Furey, Donna Karolchik, Alec Victorsen, Xiaoan Ruan, Rehab F. Abdelhamid, Amy S. Nesmith, Jing Wang, Nicholas M. Luscombe, Alina R. Cao, Diane Trout, Teri Slifer, Peter E. Newburger, Cricket A. Sloan, Dimitra Lotakis, Stephen M. J. Searle, Ali Mortazavi, Alexandra Bignell, Alex Reynolds, Orion J. Buske, Chris Zaleski, Theresa K. Canfield, Ian Bell, Jin Lian, Vanessa K. Swing, Katalin Toth Fejes, Catherine Ucla, Robert E. Thurman, Jacqueline Chrast, Wei Lin, Tim Hubbard, Gary Saunders, Minyi Shi, Vihra Sotirova, Sherman M. Weissman, Jason D. Lieb, Richard Humbert, Kevin M. Bowling, Assaf Gordon, Tarjei S. Mikkelsen, Jing Leng, Thomas R. Gingeras, Fabian Grubert, Nader Jameel, Jost Vielmetter, Hannah Monahan, Preti Jain, Lindsay L. Waite, Tony Shafer, Joel Rozowsky, Michael Coyne, Brian Reed, M. Kay, Harsha P. Gunawardena, Ross C. Hardison, Gavin Sherlock, Alexandra Charos, Joseph D. Fleming, Ann S. Zweig, Jason Gertz, Rajinder Kaul, Xianjun Dong, Alexandre Reymond, Carrie A. Davis, Haiyan Huang, Chao Cheng, Marco Mariotti, Phil Lacroute, Jason A. Dilocker, Kenneth McCue, R. Robilotto, Stylianos E. Antonarakis, Sridar V. Chittur, Justin Jee, Barbara J. Wold, Sudipto K. Chakrabortty, Erica Dumais, Amartya Sanyal, Nathan Boley, Tianyuan Wang, Julien Lagarde, Anthony Kirilusha, Jonathan B. Preall, Kevin Roberts, Erika Giste, Hugo Y. K. Lam, Alvis Brazma, Gregory J. Hannon, Eric Rynes, Philippe Batut, Kevin Struhl, Margus Lukk, Manching Ku, Suganthi Balasubramanian, Sonali Jha, Jorg Drenkow, W. James Kent, Michael Snyder, Jie Wang, Anna Battenhouse, Charles B. Epstein, Rami Rauch, Christopher Shestak, John A. Stamatoyannopoulos, Gaurab Mukherjee, Cédric Howald, Tanya Kutyavin, Huaien Wang, Scott A. Tenenbaum, Wan Ting Poh, Kate R. Rosenbloom, Manolis Kellis, Pauline A. Fujita, Linfeng Wu, Anita Bansal, Molly Weaver, Linda L. Grasfeder, Peter J. Sabo, Qiang Li, Melissa S. Cline, Robert M. Kuhn, Darin London, Seth Frietze, Atif Shahab, Shane Neph, Damian Keefe, James B. Brown, Mark Diekhans, Webb Miller, Katherine Aylor Fisher, Jiang Du, Hadar H. Sheffer, Sarah Djebali, Frank Doyle, Nathan Lamarre-Vincent, Chia-Lin Wei, Laura A.L. Dillon, Jennifer Harrow, Robert C. Altshuler, Tyler Alioto, Raymond K. Auerbach, Adam Frankish, Rebekka O. Sprouse, Patrick J. Collins, E. Christopher Partridge, Zheng Liu, Yoichiro Shibata, Elliott H. Margulies, Abigail K. Ebersol, Kimberly A. Showers, Eric D. Green, Krishna M. Roskin, Job Dekker, Barbara N. Pusey, Ekta Khurana, Gilberto DeSalvo, Yijun Ruan, Hao Wang, Jainab Khatun, Henriette O'Geen, Alexej Abyzov, Brian Williams, Ryan M. McDaniell, Maya Kasowski, Manoj Hariharan, Felix Kokocinski, Gloria Despacio-Reyes, Zhancheng Zhang, Subhradip Karmakar, Ewan Birney, Koon-Kiu Yan, Xian Chen, Shinny Vong, Daniel Sobral, Nick Bild, Seul K.C. Kim, Timo Lassmann, Li Wang, Minerva E. Sanchez, Vaughan Roach, Theodore Gibson, Stephen C. J. Parker, Michael F. Lin, Patrick A. Navas, Laurence R. Meyer, Luiz O. F. Penalva, Bradley E. Bernstein, Kevin P. White, Emilie Aït Yahya Graison, Juan M. Vaquerizas, Sushma Iyengar, Kimberly M. Newberry, Akshay Bhinge, Xiaolan Zhang, Kim Bell, Yoshihide Hayashizaki, Lucas Lochovsky, Noam Shoresh, Hagen Tilgner, Philip Cayting, Dorrelyn Patacsil, Timothy E. Reddy, Eric Haugen, Katherine E. Varley, M. van Baren, Nathan D. Trinklein, Bum Kyu Lee, Tristan Frum, Marianne Lindahl-Allen, Timothy Durham, Roderic Guigó, Christopher W. Maier, Micha Sammeth, Debasish Raha, Timothy R. Dreszer, Benedict Paten, Robbyn Issner, Michael R. Brent, Kevin Y. Yip, Kim Blahnik, Jason Ernst, Zhiping Weng, Henry Amrhein, Arend Sidow, Javier Herrero, Hui Gao, Stephen G. Landt, Pouya Kheradpour, Galt P. Barber, Gregory E. Crawford, Toby Hunt, HudsonAlpha Institute for Biotechnology [Huntsville, AL], ENCODE Project Consortium : Myers RM, Stamatoyannopoulos J, Snyder M, Dunham I, Hardison RC, Bernstein BE, Gingeras TR, Kent WJ, Birney E, Wold B, Crawford GE, Bernstein BE, Epstein CB, Shoresh N, Ernst J, Mikkelsen TS, Kheradpour P, Zhang X, Wang L, Issner R, Coyne MJ, Durham T, Ku M, Truong T, Ward LD, Altshuler RC, Lin MF, Kellis M, Gingeras TR, Davis CA, Kapranov P, Dobin A, Zaleski C, Schlesinger F, Batut P, Chakrabortty S, Jha S, Lin W, Drenkow J, Wang H, Bell K, Gao H, Bell I, Dumais E, Dumais J, Antonarakis SE, Ucla C, Borel C, Guigo R, Djebali S, Lagarde J, Kingswood C, Ribeca P, Sammeth M, Alioto T, Merkel A, Tilgner H, Carninci P, Hayashizaki Y, Lassmann T, Takahashi H, Abdelhamid RF, Hannon G, Fejes-Toth K, Preall J, Gordon A, Sotirova V, Reymond A, Howald C, Graison E, Chrast J, Ruan Y, Ruan X, Shahab A, Ting Poh W, Wei CL, Crawford GE, Furey TS, Boyle AP, Sheffield NC, Song L, Shibata Y, Vales T, Winter D, Zhang Z, London D, Wang T, Birney E, Keefe D, Iyer VR, Lee BK, McDaniell RM, Liu Z, Battenhouse A, Bhinge AA, Lieb JD, Grasfeder LL, Showers KA, Giresi PG, Kim SK, Shestak C, Myers RM, Pauli F, Reddy TE, Gertz J, Partridge EC, Jain P, Sprouse RO, Bansal A, Pusey B, Muratet MA, Varley KE, Bowling KM, Newberry KM, Nesmith AS, Dilocker JA, Parker SL, Waite LL, Thibeault K, Roberts K, Absher DM, Wold B, Mortazavi A, Williams B, Marinov G, Trout D, Pepke S, King B, McCue K, Kirilusha A, DeSalvo G, Fisher-Aylor K, Amrhein H, Vielmetter J, Sherlock G, Sidow A, Batzoglou S, Rauch R, Kundaje A, Libbrecht M, Margulies EH, Parker SC, Elnitski L, Green ED, Hubbard T, Harrow J, Searle S, Kokocinski F, Aken B, Frankish A, Hunt T, Despacio-Reyes G, Kay M, Mukherjee G, Bignell A, Saunders G, Boychenko V, Van Baren M, Brown RH, Khurana E, Balasubramanian S, Zhang Z, Lam H, Cayting P, Robilotto R, Lu Z, Guigo R, Derrien T, Tanzer A, Knowles DG, Mariotti M, James Kent W, Haussler D, Harte R, Diekhans M, Kellis M, Lin M, Kheradpour P, Ernst J, Reymond A, Howald C, Graison EA, Chrast J, Tress M, Rodriguez JM, Snyder M, Landt SG, Raha D, Shi M, Euskirchen G, Grubert F, Kasowski M, Lian J, Cayting P, Lacroute P, Xu Y, Monahan H, Patacsil D, Slifer T, Yang X, Charos A, Reed B, Wu L, Auerbach RK, Habegger L, Hariharan M, Rozowsky J, Abyzov A, Weissman SM, Gerstein M, Struhl K, Lamarre-Vincent N, Lindahl-Allen M, Miotto B, Moqtaderi Z, Fleming JD, Newburger P, Farnham PJ, Frietze S, O'Geen H, Xu X, Blahnik KR, Cao AR, Iyengar S, Stamatoyannopoulos JA, Kaul R, Thurman RE, Wang H, Navas PA, Sandstrom R, Sabo PJ, Weaver M, Canfield T, Lee K, Neph S, Roach V, Reynolds A, Johnson A, Rynes E, Giste E, Vong S, Neri J, Frum T, Johnson EM, Nguyen ED, Ebersol AK, Sanchez ME, Sheffer HH, Lotakis D, Haugen E, Humbert R, Kutyavin T, Shafer T, Dekker J, Lajoie BR, Sanyal A, James Kent W, Rosenbloom KR, Dreszer TR, Raney BJ, Barber GP, Meyer LR, Sloan CA, Malladi VS, Cline MS, Learned K, Swing VK, Zweig AS, Rhead B, Fujita PA, Roskin K, Karolchik D, Kuhn RM, Haussler D, Birney E, Dunham I, Wilder SP, Keefe D, Sobral D, Herrero J, Beal K, Lukk M, Brazma A, Vaquerizas JM, Luscombe NM, Bickel PJ, Boley N, Brown JB, Li Q, Huang H, Gerstein M, Habegger L, Sboner A, Rozowsky J, Auerbach RK, Yip KY, Cheng C, Yan KK, Bhardwaj N, Wang J, Lochovsky L, Jee J, Gibson T, Leng J, Du J, Hardison RC, Harris RS, Song G, Miller W, Haussler D, Roskin K, Suh B, Wang T, Paten B, Noble WS, Hoffman MM, Buske OJ, Weng Z, Dong X, Wang J, Xi H, Tenenbaum SA, Doyle F, Penalva LO, Chittur S, Tullius TD, Parker SC, White KP, Karmakar S, Victorsen A, Jameel N, Bild N, Grossman RL, Snyder M, Landt SG, Yang X, Patacsil D, Slifer T, Dekker J, Lajoie BR, Sanyal A, Weng Z, Whitfield TW, Wang J, Collins PJ, Trinklein ND, Partridge EC, Myers RM, Giddings MC, Chen X, Khatun J, Maier C, Yu Y, Gunawardena H, Risk B, Feingold EA, Lowdon RF, Dillon LA, Good PJ, Harrow J, Searle S., Becker, Peter B, Broad Institute of MIT and Harvard, Lincoln Laboratory, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Department of Physics, Kellis, Manolis, Epstein, Charles B., Bernstein, Bradley E., Shoresh, Noam, Ernst, Jason, Mikkelsen, Tarjei Sigurd, Kheradpour, Pouya, Zhang, Xiaolan, Wang, Li, Issner, Robbyn, Coyne, Michael J., Durham, Timothy, Ku, Manching, Truong, Thanh, Ward, Lucas D., Altshuler, Robert Charles, Lin, Michael F., ENCODE Project Consortium, Antonarakis, Stylianos, and Miotto, Benoit
- Subjects
RNA, Messenger/genetics ,[SDV]Life Sciences [q-bio] ,Messenger ,Genoma humà ,Genome ,Medical and Health Sciences ,0302 clinical medicine ,Models ,ddc:576.5 ,Biology (General) ,Conserved Sequence ,Genetics ,0303 health sciences ,General Neuroscience ,RNA-Binding Proteins ,Genomics ,Biological Sciences ,Chromatin ,3. Good health ,[SDV] Life Sciences [q-bio] ,DNA-Binding Proteins ,Gene Components ,030220 oncology & carcinogenesis ,DNA methylation ,Encyclopedia ,HIV/AIDS ,Proteïnes de la sang -- Aspectes genètics ,General Agricultural and Biological Sciences ,Databases, Nucleic Acid ,Human ,Research Article ,Quality Control ,Process (engineering) ,QH301-705.5 ,1.1 Normal biological development and functioning ,Computational biology ,Biology ,ENCODE ,General Biochemistry, Genetics and Molecular Biology ,Chromatin/metabolism ,Vaccine Related ,03 medical and health sciences ,Databases ,Genetic ,Underpinning research ,Humans ,RNA, Messenger ,RNA-Binding Proteins/genetics/metabolism ,Vaccine Related (AIDS) ,Gene ,030304 developmental biology ,Internet ,General Immunology and Microbiology ,Nucleic Acid ,Agricultural and Veterinary Sciences ,Base Sequence ,Models, Genetic ,Genome, Human ,Prevention ,Human Genome ,Computational Biology ,DNA Methylation ,ENCODE Project Consortium ,Gene Expression Regulation ,DNA-Binding Proteins/genetics/metabolism ,RNA ,Human genome ,Immunization ,Generic health relevance ,Developmental Biology - Abstract
The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome., National Human Genome Research Institute (U.S.), National Institutes of Health (U.S.)
- Published
- 2011
4. Advancing and Integrating the Cusp Concept to Understand Behavioral Repertoire Dynamics.
- Author
-
Becker AM, Kuhn RM, and Pinkelman SE
- Abstract
The behavioral repertoire grows and develops through a lifetime in a manner intricately dependent on bidirectional connections between its current form and the shaping environment. Behavior analysis has discovered many of the key relationships that occur between repertoire elements that govern this constant metamorphosis, including the behavioral cusp: an event that triggers contact with new behavioral contingencies. The current literature already suggests possible integration of the behavioral cusp and related concepts into a wider understanding of behavioural development and cumulative learning. Here we share an attempted step in that progression: an approach to an in-depth characterization of the features and connections underlying cusp variety. We sketch this approach on the basis of differential involvement of contingency terms; the relevance to the cusp of environmental context, accompanying repertoire, or response properties; the connections of particular cusps to other behavioral principles, processes, or concepts; the involvement of co-evolving social repertoires undergoing mutual influence; and the ability of cusps to direct the repertoire either toward desired contingencies or away from a growth-stifling repertoire. We discuss the implications of the schema for expanded applied considerations, the programming of unique cusps, and the need for incorporating cultural context into the cusp. We hope that this schema could be a starting point, subject to empirical refinement, leading to an expanded understanding of repertoire interconnectivity and ontogenetic evolution., (© Association for Behavior Analysis International 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.)
- Published
- 2024
- Full Text
- View/download PDF
5. The UCSC Genome Browser database: 2024 update.
- Author
-
Raney BJ, Barber GP, Benet-Pagès A, Casper J, Clawson H, Cline MS, Diekhans M, Fischer C, Navarro Gonzalez J, Hickey G, Hinrichs AS, Kuhn RM, Lee BT, Lee CM, Le Mercier P, Miga KH, Nassar LR, Nejad P, Paten B, Perez G, Schmelter D, Speir ML, Wick BD, Zweig AS, Haussler D, Kent WJ, and Haeussler M
- Subjects
- Animals, Humans, Mice, Genome, Human, Genome, Viral, Internet, Molecular Sequence Annotation, Software, Databases, Genetic, Genomics, RNA, Viral
- Abstract
The UCSC Genome Browser (https://genome.ucsc.edu) is a web-based genomic visualization and analysis tool that serves data to over 7,000 distinct users per day worldwide. It provides annotation data on thousands of genome assemblies, ranging from human to SARS-CoV2. This year, we have introduced new data from the Human Pangenome Reference Consortium and on viral genomes including SARS-CoV2. We have added 1,200 new genomes to our GenArk genome system, increasing the overall diversity of our genomic representation. We have added support for nine new user-contributed track hubs to our public hub system. Additionally, we have released 29 new tracks on the human genome and 11 new tracks on the mouse genome. Collectively, these new features expand both the breadth and depth of the genomic knowledge that we share publicly with users worldwide., (© The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2024
- Full Text
- View/download PDF
6. GenArk: towards a million UCSC genome browsers.
- Author
-
Clawson H, Lee BT, Raney BJ, Barber GP, Casper J, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee CM, Nassar LR, Perez G, Wick B, Schmelter D, Speir ML, Armstrong J, Zweig AS, Kuhn RM, Kirilenko BM, Hiller M, Haussler D, Kent WJ, and Haeussler M
- Subjects
- Genomics, Archives, Nucleic Acid Amplification Techniques, Databases, Genetic, Internet, Software, Genome
- Abstract
Interactive graphical genome browsers are essential tools in genomics, but they do not contain all the recent genome assemblies. We create Genome Archive (GenArk) collection of UCSC Genome Browsers from NCBI assemblies. Built on our established track hub system, this enables fast visualization of annotations. Assemblies come with gene models, repeat masks, BLAT, and in silico PCR. Users can add annotations via track hubs and custom tracks. We can bulk-import third-party resources, demonstrated with TOGA and Ensembl gene models for hundreds of assemblies.Three thousand two hundred sixty-nine GenArk assemblies are listed at https://hgdownload.soe.ucsc.edu/hubs/ and can be searched for on the Genome Browser gateway page., (© 2023. BioMed Central Ltd., part of Springer Nature.)
- Published
- 2023
- Full Text
- View/download PDF
7. The UCSC Genome Browser database: 2023 update.
- Author
-
Nassar LR, Barber GP, Benet-Pagès A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee BT, Lee CM, Muthuraman P, Nguy B, Pereira T, Nejad P, Perez G, Raney BJ, Schmelter D, Speir ML, Wick BD, Zweig AS, Haussler D, Kuhn RM, Haeussler M, and Kent WJ
- Subjects
- Humans, COVID-19 epidemiology, COVID-19 genetics, Internet, Phylogeny, SARS-CoV-2 genetics, Software, Web Browser, Databases, Genetic, Genomics methods
- Abstract
The UCSC Genome Browser (https://genome.ucsc.edu) is an omics data consolidator, graphical viewer, and general bioinformatics resource that continues to serve the community as it enters its 23rd year. This year has seen an emphasis in clinical data, with new tracks and an expanded Recommended Track Sets feature on hg38 as well as the addition of a single cell track group. SARS-CoV-2 continues to remain a focus, with regular annotation updates to the browser and continued curation of our phylogenetic sequence placing tool, hgPhyloPlace, whose tree has now reached over 12M sequences. Our GenArk resource has also grown, offering over 2500 hubs and a system for users to request any absent assemblies. We have expanded our bigBarChart display type and created new ways to visualize data via bigRmsk and dynseq display. Displaying custom annotations is now easier due to our chromAlias system which eliminates the requirement for renaming sequence names to the UCSC standard. Users involved in data generation may also be interested in our new tools and trackDb settings which facilitate the creation and display of their custom annotations., (© The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2023
- Full Text
- View/download PDF
8. Variant interpretation: UCSC Genome Browser Recommended Track Sets.
- Author
-
Benet-Pagès A, Rosenbloom KR, Nassar LR, Lee CM, Raney BJ, Clawson H, Schmelter D, Casper J, Gonzalez JN, Perez G, Lee BT, Zweig AS, Kent WJ, Haeussler M, and Kuhn RM
- Subjects
- DNA Copy Number Variations, Genome, Human genetics, Genomics, Humans, Internet, Databases, Genetic, Software
- Abstract
The UCSC Genome Browser has been an important tool for genomics and clinical genetics since the sequence of the human genome was first released in 2000. As it has grown in scope to display more types of data it has also grown more complicated. The data, which are dispersed at many locations worldwide, are collected into one view on the Browser, where the graphical interface presents the data in one location. This supports the expertise of the researcher to interpret variants in the genome. Because the analysis of single nucleotide variants and copy number variants require interpretation of data at very different genomic scales, different data resources are required. We present here several Recommended Track Sets designed to facilitate the interpretation of variants in the clinic, offering quick access to datasets relevant to the appropriate scale., (© 2022 Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
9. Development of a Freeze-Dried CRISPR-Cas12 Sensor for Detecting Wolbachia in the Secondary Science Classroom.
- Author
-
Rybnicky GA, Dixon RA, Kuhn RM, Karim AS, and Jewett MC
- Subjects
- Biotechnology, CRISPR-Cas Systems genetics, Gene Editing, Genetic Engineering, Humans, Synthetic Biology education, Wolbachia genetics
- Abstract
Training the future synthetic biology workforce requires the opportunity for students to be exposed to biotechnology concepts and activities in secondary education. Detecting Wolbachia bacteria in arthropods using polymerase chain reaction (PCR) has become a common way for secondary students to investigate and apply recombinant DNA technology in the science classroom. Despite this important activity, cutting-edge biotechnologies such as clustered regularly interspaced short palindromic repeat (CRISPR)-based diagnostics have yet to be widely implemented in the classroom. To address this gap, we present a freeze-dried CRISPR-Cas12 sensing reaction to complement traditional recombinant DNA technology education and teach synthetic biology concepts. The reactions accurately detect Wolbachia from arthropod-derived PCR samples in under 2 h and can be stored at room temperature for over a month without appreciable degradation. The reactions are easy-to-use and cost less than $40 to implement for a classroom of 22 students including the cost of reusable equipment. We see these freeze-dried CRISPR-Cas12 reactions as an accessible way to incorporate synthetic biology education into the existing biology curriculum, which will expand biology educational opportunities in science, technology, engineering, and mathematics.
- Published
- 2022
- Full Text
- View/download PDF
10. The UCSC Genome Browser database: 2022 update.
- Author
-
Lee BT, Barber GP, Benet-Pagès A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee CM, Muthuraman P, Nassar LR, Nguy B, Pereira T, Perez G, Raney BJ, Rosenbloom KR, Schmelter D, Speir ML, Wick BD, Zweig AS, Haussler D, Kuhn RM, Haeussler M, and Kent WJ
- Subjects
- Animals, Genome, Human, Humans, Phylogeny, Polymerase Chain Reaction, SARS-CoV-2 genetics, User-Computer Interface, Exome Sequencing, Databases, Genetic, Web Browser
- Abstract
The UCSC Genome Browser, https://genome.ucsc.edu, is a graphical viewer for exploring genome annotations. The website provides integrated tools for visualizing, comparing, analyzing, and sharing both publicly available and user-generated genomic datasets. Data highlights this year include a collection of easily accessible public hub assemblies on new organisms, now featuring BLAT alignment and PCR capabilities, and new and updated clinical tracks (gnomAD, DECIPHER, CADD, REVEL). We introduced a new Track Sets feature and enhanced variant displays to aid in the interpretation of clinical data. We also added a tool to rapidly place new SARS-CoV-2 genomes in a global phylogenetic tree enabling researchers to view the context of emerging mutations in our SARS-CoV-2 Genome Browser. Other new software focuses on usability features, including more informative mouseover displays and new fonts., (© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2022
- Full Text
- View/download PDF
11. The UCSC Genome Browser database: 2021 update.
- Author
-
Navarro Gonzalez J, Zweig AS, Speir ML, Schmelter D, Rosenbloom KR, Raney BJ, Powell CC, Nassar LR, Maulding ND, Lee CM, Lee BT, Hinrichs AS, Fyfe AC, Fernandes JD, Diekhans M, Clawson H, Casper J, Benet-Pagès A, Barber GP, Haussler D, Kuhn RM, Haeussler M, and Kent WJ
- Subjects
- Animals, COVID-19 epidemiology, COVID-19 virology, Data Curation methods, Epidemics, Humans, Internet, Mice, Molecular Sequence Annotation methods, SARS-CoV-2 physiology, Software, COVID-19 prevention & control, Computational Biology methods, Databases, Genetic, Genome genetics, Genomics methods, SARS-CoV-2 genetics
- Abstract
For more than two decades, the UCSC Genome Browser database (https://genome.ucsc.edu) has provided high-quality genomics data visualization and genome annotations to the research community. As the field of genomics grows and more data become available, new modes of display are required to accommodate new technologies. New features released this past year include a Hi-C heatmap display, a phased family trio display for VCF files, and various track visualization improvements. Striving to keep data up-to-date, new updates to gene annotations include GENCODE Genes, NCBI RefSeq Genes, and Ensembl Genes. New data tracks added for human and mouse genomes include the ENCODE registry of candidate cis-regulatory elements, promoters from the Eukaryotic Promoter Database, and NCBI RefSeq Select and Matched Annotation from NCBI and EMBL-EBI (MANE). Within weeks of learning about the outbreak of coronavirus, UCSC released a genome browser, with detailed annotation tracks, for the SARS-CoV-2 RNA reference assembly., (© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2021
- Full Text
- View/download PDF
12. UCSC Genome Browser enters 20th year.
- Author
-
Lee CM, Barber GP, Casper J, Clawson H, Diekhans M, Gonzalez JN, Hinrichs AS, Lee BT, Nassar LR, Powell CC, Raney BJ, Rosenbloom KR, Schmelter D, Speir ML, Zweig AS, Haussler D, Haeussler M, Kuhn RM, and Kent WJ
- Subjects
- Genomics, Humans, Internet, Databases, Genetic, Genome, Human, Software
- Abstract
The University of California Santa Cruz Genome Browser website (https://genome.ucsc.edu) enters its 20th year of providing high-quality genomics data visualization and genome annotations to the research community. In the past year, we have added a new option to our web BLAT tool that allows search against all genomes, a single-cell expression viewer (https://cells.ucsc.edu), a 'lollipop' plot display mode for high-density variation data, a RESTful API for data extraction and a custom-track backup feature. New datasets include Tabula Muris single-cell expression data, GeneHancer regulatory annotations, The Cancer Genome Atlas Pan-Cancer variants, Genome Reference Consortium Patch sequences, new ENCODE transcription factor binding site peaks and clusters, the Database of Genomic Variants Gold Standard Variants, Genomenon Mastermind variants and three new multi-species alignment tracks., (© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2020
- Full Text
- View/download PDF
13. The UCSC Genome Browser database: 2019 update.
- Author
-
Haeussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, Lee CM, Lee BT, Hinrichs AS, Gonzalez JN, Gibson D, Diekhans M, Clawson H, Casper J, Barber GP, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- Animals, Chromosome Mapping, Genome, Human genetics, Humans, Molecular Sequence Annotation, Web Browser, Databases, Genetic, Genome genetics, Genomics, Software
- Abstract
The UCSC Genome Browser (https://genome.ucsc.edu) is a graphical viewer for exploring genome annotations. For almost two decades, the Browser has provided visualization tools for genetics and molecular biology and continues to add new data and features. This year, we added a new tool that lets users interactively arrange existing graphing tracks into new groups. Other software additions include new formats for chromosome interactions, a ChIP-Seq peak display for track hubs and improved support for HGVS. On the annotation side, we have added gnomAD, TCGA expression, RefSeq Functional elements, GTEx eQTLs, CRISPR Guides, SNPpedia and created a 30-way primate alignment on the human genome. Nine assemblies now have RefSeq-mapped gene models.
- Published
- 2019
- Full Text
- View/download PDF
14. The UCSC Genome Browser database: 2018 update.
- Author
-
Casper J, Zweig AS, Villarreal C, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, Lee CM, Lee BT, Karolchik D, Hinrichs AS, Haeussler M, Guruvadoo L, Navarro Gonzalez J, Gibson D, Fiddes IT, Eisenhart C, Diekhans M, Clawson H, Barber GP, Armstrong J, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- CRISPR-Cas Systems, Data Display, Gene Regulatory Networks, Genome, Human, Humans, Molecular Sequence Annotation, Terminology as Topic, User-Computer Interface, Databases, Genetic, Genome, Web Browser
- Abstract
The UCSC Genome Browser (https://genome.ucsc.edu) provides a web interface for exploring annotated genome assemblies. The assemblies and annotation tracks are updated on an ongoing basis-12 assemblies and more than 28 tracks were added in the past year. Two recent additions are a display of CRISPR/Cas9 guide sequences and an interactive navigator for gene interactions. Other upgrades from the past year include a command-line version of the Variant Annotation Integrator, support for Human Genome Variation Society variant nomenclature input and output, and a revised highlighting tool that now supports multiple simultaneous regions and colors., (© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2018
- Full Text
- View/download PDF
15. The UCSC Genome Browser database: 2017 update.
- Author
-
Tyner C, Barber GP, Casper J, Clawson H, Diekhans M, Eisenhart C, Fischer CM, Gibson D, Gonzalez JN, Guruvadoo L, Haeussler M, Heitner S, Hinrichs AS, Karolchik D, Lee BT, Lee CM, Nejad P, Raney BJ, Rosenbloom KR, Speir ML, Villarreal C, Vivian J, Zweig AS, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- Animals, Computational Biology methods, Genome, Genomics methods, Humans, Molecular Sequence Annotation, Software, Databases, Genetic, Search Engine, Web Browser
- Abstract
Since its 2001 debut, the University of California, Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu/) team has provided continuous support to the international genomics and biomedical communities through a web-based, open source platform designed for the fast, scalable display of sequence alignments and annotations landscaped against a vast collection of quality reference genome assemblies. The browser's publicly accessible databases are the backbone of a rich, integrated bioinformatics tool suite that includes a graphical interface for data queries and downloads, alignment programs, command-line utilities and more. This year's highlights include newly designed home and gateway pages; a new 'multi-region' track display configuration for exon-only, gene-only and custom regions visualization; new genome browsers for three species (brown kiwi, crab-eating macaque and Malayan flying lemur); eight updated genome assemblies; extended support for new data types such as CRAM, RNA-seq expression data and long-range chromatin interaction pairs; and the unveiling of a new supported mirror site in Japan., (© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2017
- Full Text
- View/download PDF
16. UCSC Data Integrator and Variant Annotation Integrator.
- Author
-
Hinrichs AS, Raney BJ, Speir ML, Rhead B, Casper J, Karolchik D, Kuhn RM, Rosenbloom KR, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Animals, Databases, Genetic, Genomics, Humans, Internet, Genome, Software
- Abstract
Unlabelled: Two new tools on the UCSC Genome Browser web site provide improved ways of combining information from multiple datasets, optionally including the user's own custom track data and/or data from track hubs. The Data Integrator combines columns from multiple data tracks, showing all items from the first track along with overlapping items from the other tracks. The Variant Annotation Integrator is tailored to adding functional annotations to variant calls; it offers a more restricted set of underlying data tracks but adds predictions of each variant's consequences for any overlapping or nearby gene transcript. When available, it optionally adds additional annotations including effect prediction scores from dbNSFP for missense mutations, ENCODE regulatory summary tracks and conservation scores., Availability and Implementation: The web tools are freely available at http://genome.ucsc.edu/ and the underlying database is available for download at http://hgdownload.cse.ucsc.edu/ The software (written in C and Javascript) is available from https://genome-store.ucsc.edu/ and is freely available for academic and non-profit usage; commercial users must obtain a license., Contact: angie@soe.ucsc.edu, Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author 2016. Published by Oxford University Press.)
- Published
- 2016
- Full Text
- View/download PDF
17. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace.
- Author
-
Qu K, Garamszegi S, Wu F, Thorvaldsdottir H, Liefeld T, Ocana M, Borges-Rivera D, Pochet N, Robinson JT, Demchak B, Hull T, Ben-Artzi G, Blankenberg D, Barber GP, Lee BT, Kuhn RM, Nekrutenko A, Segal E, Ideker T, Reich M, Regev A, Chang HY, and Mesirov JP
- Subjects
- Data Mining, Humans, Internet, Systems Integration, Algorithms, Chromosome Mapping methods, Computational Biology methods, Databases, Genetic, Genome, Human genetics, Software
- Abstract
Complex biomedical analyses require the use of multiple software tools in concert and remain challenging for much of the biomedical research community. We introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource that currently supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate integrative analysis by non-programmers, it offers a growing set of 'recipes', short workflows to guide investigators through high-utility analysis tasks.
- Published
- 2016
- Full Text
- View/download PDF
18. The UCSC Genome Browser database: 2016 update.
- Author
-
Speir ML, Zweig AS, Rosenbloom KR, Raney BJ, Paten B, Nejad P, Lee BT, Learned K, Karolchik D, Hinrichs AS, Heitner S, Harte RA, Haeussler M, Guruvadoo L, Fujita PA, Eisenhart C, Diekhans M, Clawson H, Casper J, Barber GP, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- Animals, Disease genetics, Genes, Genome, Humans, Mice, Molecular Sequence Annotation, Software, Databases, Genetic, Genomics
- Abstract
For the past 15 years, the UCSC Genome Browser (http://genome.ucsc.edu/) has served the international research community by offering an integrated platform for viewing and analyzing information from a large database of genome assemblies and their associated annotations. The UCSC Genome Browser has been under continuous development since its inception with new data sets and software features added frequently. Some release highlights of this year include new and updated genome browsers for various assemblies, including bonobo and zebrafish; new gene annotation sets; improvements to track and assembly hub support; and a new interactive tool, the "Data Integrator", for intersecting data from multiple tracks. We have greatly expanded the data sets available on the most recent human assembly, hg38/GRCh38, to include updated gene prediction sets from GENCODE, more phenotype- and disease-associated variants from ClinVar and ClinGen, more genomic regulatory data, and a new multiple genome alignment., (© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2016
- Full Text
- View/download PDF
19. The UCSC Genome Browser database: 2015 update.
- Author
-
Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H, Diekhans M, Dreszer TR, Fujita PA, Guruvadoo L, Haeussler M, Harte RA, Heitner S, Hickey G, Hinrichs AS, Hubley R, Karolchik D, Learned K, Lee BT, Li CH, Miga KH, Nguyen N, Paten B, Raney BJ, Smit AF, Speir ML, Zweig AS, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- Animals, Cricetinae, Dogs, Ebolavirus genetics, Gene Expression, Genome, Internet, Mice, Molecular Sequence Annotation, Phenotype, Rats, Software, Databases, Nucleic Acid, Genomics
- Abstract
Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), 'mined the web' for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2015
- Full Text
- View/download PDF
20. The UCSC Genome Browser: What Every Molecular Biologist Should Know.
- Author
-
Mangan ME, Williams JM, Kuhn RM, and Lathe WC 3rd
- Subjects
- Computational Biology methods, Databases, Nucleic Acid, Genome, Web Browser
- Abstract
Electronic data resources can enable molecular biologists to quickly get information from around the world that a decade ago would have been buried in papers scattered throughout the library. The ability to access, query, and display these data makes benchwork much more efficient and drives new discoveries. Increasingly, mastery of software resources and corresponding data repositories is required to fully explore the volume of data generated in biomedical and agricultural research, because only small amounts of data are actually found in traditional publications. The UCSC Genome Browser provides a wealth of data and tools that advance understanding of genomic context for many species, enable detailed analysis of data, and provide the ability to interrogate regions of interest across disparate data sets from a wide variety of sources. Researchers can also supplement the standard display with their own data to query and share this with others. Effective use of these resources has become crucial to biological research today, and this unit describes some practical applications of the UCSC Genome Browser., (Copyright © 2014 John Wiley & Sons, Inc.)
- Published
- 2014
- Full Text
- View/download PDF
21. The UCSC Genome Browser database: 2014 update.
- Author
-
Karolchik D, Barber GP, Casper J, Clawson H, Cline MS, Diekhans M, Dreszer TR, Fujita PA, Guruvadoo L, Haeussler M, Harte RA, Heitner S, Hinrichs AS, Learned K, Lee BT, Li CH, Raney BJ, Rhead B, Rosenbloom KR, Sloan CA, Speir ML, Zweig AS, Haussler D, Kuhn RM, and Kent WJ
- Subjects
- Alleles, Animals, Genome, Human, Humans, Internet, Mice, Molecular Sequence Annotation, Polymorphism, Single Nucleotide, Sequence Alignment, Software, Databases, Genetic, Genome, Genomics
- Abstract
The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a large collection of organisms, primarily vertebrates, with an emphasis on the human and mouse genomes. The Browser's web-based tools provide an integrated environment for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic data sets. As of September 2013, the database contained genomic sequence and a basic set of annotation 'tracks' for ∼90 organisms. Significant new annotations include a 60-species multiple alignment conservation track on the mouse, updated UCSC Genes tracks for human and mouse, and several new sets of variation and ENCODE data. New software tools include a Variant Annotation Integrator that returns predicted functional effects of a set of variants uploaded as a custom track, an extension to UCSC Genes that displays haplotype alleles for protein-coding genes and an expansion of data hubs that includes the capability to display remotely hosted user-provided assembly sequence in addition to annotation data. To improve European access, we have added a Genome Browser mirror (http://genome-euro.ucsc.edu) hosted at Bielefeld University in Germany.
- Published
- 2014
- Full Text
- View/download PDF
22. The UCSC genome browser and associated tools.
- Author
-
Kuhn RM, Haussler D, and Kent WJ
- Subjects
- Animals, Computational Biology, Computer Graphics, Data Display, Data Mining, Humans, Internet, Search Engine, Sequence Alignment, Databases, Genetic statistics & numerical data, Genomics statistics & numerical data, Information Storage and Retrieval, Software
- Abstract
The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting.
- Published
- 2013
- Full Text
- View/download PDF
23. The UCSC Genome Browser database: extensions and updates 2013.
- Author
-
Meyer LR, Zweig AS, Hinrichs AS, Karolchik D, Kuhn RM, Wong M, Sloan CA, Rosenbloom KR, Roe G, Rhead B, Raney BJ, Pohl A, Malladi VS, Li CH, Lee BT, Learned K, Kirkup V, Hsu F, Heitner S, Harte RA, Haeussler M, Guruvadoo L, Goldman M, Giardine BM, Fujita PA, Dreszer TR, Diekhans M, Cline MS, Clawson H, Barber GP, Haussler D, and Kent WJ
- Subjects
- Animals, Genome, Human, Humans, Internet, Mice, Molecular Sequence Annotation, Software, Databases, Genetic, Genomics
- Abstract
The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic datasets. As of September 2012, genomic sequence and a basic set of annotation 'tracks' are provided for 63 organisms, including 26 mammals, 13 non-mammal vertebrates, 3 invertebrate deuterostomes, 13 insects, 6 worms, yeast and sea hare. In the past year 19 new genome assemblies have been added, and we anticipate releasing another 28 in early 2013. Further, a large number of annotation tracks have been either added, updated by contributors or remapped to the latest human reference genome. Among these are an updated UCSC Genes track for human and mouse assemblies. We have also introduced several features to improve usability, including new navigation menus. This article provides an update to the UCSC Genome Browser database, which has been previously featured in the Database issue of this journal.
- Published
- 2013
- Full Text
- View/download PDF
24. ENCODE data in the UCSC Genome Browser: year 5 update.
- Author
-
Rosenbloom KR, Sloan CA, Malladi VS, Dreszer TR, Learned K, Kirkup VM, Wong MC, Maddren M, Fang R, Heitner SG, Lee BT, Barber GP, Harte RA, Diekhans M, Long JC, Wilder SP, Zweig AS, Karolchik D, Kuhn RM, Haussler D, and Kent WJ
- Subjects
- Animals, Humans, Internet, Mice, Software, Databases, Genetic, Genome, Human, Genomics
- Abstract
The Encyclopedia of DNA Elements (ENCODE), http://encodeproject.org, has completed its fifth year of scientific collaboration to create a comprehensive catalog of functional elements in the human genome, and its third year of investigations in the mouse genome. Since the last report in this journal, the ENCODE human data repertoire has grown by 898 new experiments (totaling 2886), accompanied by a major integrative analysis. In the mouse genome, results from 404 new experiments became available this year, increasing the total to 583, collected during the course of the project. The University of California, Santa Cruz, makes this data available on the public Genome Browser http://genome.ucsc.edu for visual browsing and data mining. Download of raw and processed data files are all supported. The ENCODE portal provides specialized tools and information about the ENCODE data sets.
- Published
- 2013
- Full Text
- View/download PDF
25. Diagnostic interpretation of array data using public databases and internet sources.
- Author
-
de Leeuw N, Dijkhuizen T, Hehir-Kwa JY, Carter NP, Feuk L, Firth HV, Kuhn RM, Ledbetter DH, Martin CL, van Ravenswaaij-Arts CM, Scherer SW, Shams S, Van Vooren S, Sijmons R, Swertz M, and Hastings R
- Subjects
- Genetic Variation, Genome, Human, Humans, Oligonucleotide Array Sequence Analysis, Polymorphism, Single Nucleotide, Search Engine, DNA Copy Number Variations, Databases, Genetic, Diagnostic Tests, Routine, Internet, Software
- Abstract
The range of commercially available array platforms and analysis software packages is expanding and their utility is improving, making reliable detection of copy-number variants (CNVs) relatively straightforward. Reliable interpretation of CNV data, however, is often difficult and requires expertise. With our knowledge of the human genome growing rapidly, applications for array testing continuously broadening, and the resolution of CNV detection increasing, this leads to great complexity in interpreting what can be daunting data. Correct CNV interpretation and optimal use of the genotype information provided by single-nucleotide polymorphism probes on an array depends largely on knowledge present in various resources. In addition to the availability of host laboratories' own datasets and national registries, there are several public databases and Internet resources with genotype and phenotype information that can be used for array data interpretation. With so many resources now available, it is important to know which are fit-for-purpose in a diagnostic setting. We summarize the characteristics of the most commonly used Internet databases and resources, and propose a general data interpretation strategy that can be used for comparative hybridization, comparative intensity, and genotype-based array data., Competing Interests: Statement: The authors declare no conflict of interest.
- Published
- 2012
- Full Text
- View/download PDF
26. ENCODE whole-genome data in the UCSC Genome Browser: update 2012.
- Author
-
Rosenbloom KR, Dreszer TR, Long JC, Malladi VS, Sloan CA, Raney BJ, Cline MS, Karolchik D, Barber GP, Clawson H, Diekhans M, Fujita PA, Goldman M, Gravell RC, Harte RA, Hinrichs AS, Kirkup VM, Kuhn RM, Learned K, Maddren M, Meyer LR, Pohl A, Rhead B, Wong MC, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Animals, Humans, Internet, Molecular Sequence Annotation, Software, Databases, Nucleic Acid, Genome, Genome, Human, Mice genetics
- Abstract
The Encyclopedia of DNA Elements (ENCODE) Consortium is entering its 5th year of production-level effort generating high-quality whole-genome functional annotations of the human genome. The past year has brought the ENCODE compendium of functional elements to critical mass, with a diverse set of 27 biochemical assays now covering 200 distinct human cell types. Within the mouse genome, which has been under study by ENCODE groups for the past 2 years, 37 cell types have been assayed. Over 2000 individual experiments have been completed and submitted to the Data Coordination Center for public use. UCSC makes this data available on the quality-reviewed public Genome Browser (http://genome.ucsc.edu) and on an early-access Preview Browser (http://genome-preview.ucsc.edu). Visual browsing, data mining and download of raw and processed data files are all supported. An ENCODE portal (http://encodeproject.org) provides specialized tools and information about the ENCODE data sets.
- Published
- 2012
- Full Text
- View/download PDF
27. The UCSC Genome Browser database: extensions and updates 2011.
- Author
-
Dreszer TR, Karolchik D, Zweig AS, Hinrichs AS, Raney BJ, Kuhn RM, Meyer LR, Wong M, Sloan CA, Rosenbloom KR, Roe G, Rhead B, Pohl A, Malladi VS, Li CH, Learned K, Kirkup V, Hsu F, Harte RA, Guruvadoo L, Goldman M, Giardine BM, Fujita PA, Diekhans M, Cline MS, Clawson H, Barber GP, Haussler D, and James Kent W
- Subjects
- Animals, Disease genetics, Genome, Human, Genomics, Humans, Internet, Molecular Sequence Annotation, Phenotype, Databases, Nucleic Acid, Genome
- Abstract
The University of California Santa Cruz Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic data sets. In the past year, the local database has been updated with four new species assemblies, and we anticipate another four will be released by the end of 2011. Further, a large number of annotation tracks have been either added, updated by contributors, or remapped to the latest human reference genome. Among these are new phenotype and disease annotations, UCSC genes, and a major dbSNP update, which required new visualization methods. Growing beyond the local database, this year we have introduced 'track data hubs', which allow the Genome Browser to provide access to remotely located sets of annotations. This feature is designed to significantly extend the number and variety of annotation tracks that are publicly available for visualization and analysis from within our site. We have also introduced several usability features including track search and a context-sensitive menu of options available with a right-click anywhere on the Browser's image.
- Published
- 2012
- Full Text
- View/download PDF
28. The Human Epigenome Browser at Washington University.
- Author
-
Zhou X, Maricque B, Xie M, Li D, Sundaram V, Martin EA, Koebbe BC, Nielsen C, Hirst M, Farnham P, Kuhn RM, Zhu J, Smirnov I, Kent WJ, Haussler D, Madden PA, Costello JF, and Wang T
- Subjects
- Humans, Research Personnel, Washington, Databases, Genetic, Epigenomics, Internet, Universities
- Published
- 2011
- Full Text
- View/download PDF
29. ENCODE whole-genome data in the UCSC genome browser (2011 update).
- Author
-
Raney BJ, Cline MS, Rosenbloom KR, Dreszer TR, Learned K, Barber GP, Meyer LR, Sloan CA, Malladi VS, Roskin KM, Suh BB, Hinrichs AS, Clawson H, Zweig AS, Kirkup V, Fujita PA, Rhead B, Smith KE, Pohl A, Kuhn RM, Karolchik D, Haussler D, and Kent WJ
- Subjects
- Gene Expression Regulation, Genomics, Humans, Internet, Software, User-Computer Interface, Databases, Genetic, Genome, Human
- Abstract
The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.
- Published
- 2011
- Full Text
- View/download PDF
30. The UCSC Cancer Genomics Browser: update 2011.
- Author
-
Sanborn JZ, Benz SC, Craft B, Szeto C, Kober KM, Meyer L, Vaske CJ, Goldman M, Smith KE, Kuhn RM, Karolchik D, Kent WJ, Stuart JM, Haussler D, and Zhu J
- Subjects
- DNA Copy Number Variations, Gene Expression, Genome, Human, Humans, Internet, Neoplasms metabolism, Neoplasms pathology, Software, Databases, Genetic, Genomics, Neoplasms genetics
- Abstract
The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu) comprises a suite of web-based tools to integrate, visualize and analyze cancer genomics and clinical data. The browser displays whole-genome views of genome-wide experimental measurements for multiple samples alongside their associated clinical information. Multiple data sets can be viewed simultaneously as coordinated 'heatmap tracks' to compare across studies or different data modalities. Users can order, filter, aggregate, classify and display data interactively based on any given feature set including clinical features, annotated biological pathways and user-contributed collections of genes. Integrated standard statistical tools provide dynamic quantitative analysis within all available data sets. The browser hosts a growing body of publicly available cancer genomics data from a variety of cancer types, including data generated from the Cancer Genome Atlas project. Multiple consortiums use the browser on confidential prepublication data enabled by private installations. Many new features have been added, including the hgMicroscope tumor image viewer, hgSignature for real-time genomic signature evaluation on any browser track, and 'PARADIGM' pathway tracks to display integrative pathway activities. The browser is integrated with the UCSC Genome Browser; thus inheriting and integrating the Genome Browser's rich set of human biology and genetics data that enhances the interpretability of the cancer genomics data.
- Published
- 2011
- Full Text
- View/download PDF
31. The UCSC Genome Browser database: update 2011.
- Author
-
Fujita PA, Rhead B, Zweig AS, Hinrichs AS, Karolchik D, Cline MS, Goldman M, Barber GP, Clawson H, Coelho A, Diekhans M, Dreszer TR, Giardine BM, Harte RA, Hillman-Jackson J, Hsu F, Kirkup V, Kuhn RM, Learned K, Li CH, Meyer LR, Pohl A, Raney BJ, Rosenbloom KR, Smith KE, Haussler D, and Kent WJ
- Subjects
- Animals, Disease genetics, Genes, Genome, Human, Hominidae genetics, Humans, Internet, Molecular Sequence Annotation, Phenotype, RNA Editing, Software, Databases, Genetic, Genomics
- Abstract
The University of California, Santa Cruz Genome Browser (http://genome.ucsc.edu) offers online access to a database of genomic sequence and annotation data for a wide variety of organisms. The Browser also has many tools for visualizing, comparing and analyzing both publicly available and user-generated genomic data sets, aligning sequences and uploading user data. Among the features released this year are a gene search tool and annotation track drag-reorder functionality as well as support for BAM and BigWig/BigBed file formats. New display enhancements include overlay of multiple wiggle tracks through use of transparent coloring, options for displaying transformed wiggle data, a 'mean+whiskers' windowing function for display of wiggle data at high zoom levels, and more color schemes for microarray data. New data highlights include seven new genome assemblies, a Neandertal genome data portal, phenotype and disease association data, a human RNA editing track, and a zebrafish Conservation track. We also describe updates to existing tracks.
- Published
- 2011
- Full Text
- View/download PDF
32. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies.
- Author
-
Miller DT, Adam MP, Aradhya S, Biesecker LG, Brothman AR, Carter NP, Church DM, Crolla JA, Eichler EE, Epstein CJ, Faucett WA, Feuk L, Friedman JM, Hamosh A, Jackson L, Kaminsky EB, Kok K, Krantz ID, Kuhn RM, Lee C, Ostell JM, Rosenberg C, Scherer SW, Spinner NB, Stavropoulos DJ, Tepperberg JH, Thorland EC, Vermeesch JR, Waggoner DJ, Watson MS, Martin CL, and Ledbetter DH
- Subjects
- Child, Chromosome Banding, Humans, Karyotyping, Chromosome Disorders genetics, Congenital Abnormalities genetics, Developmental Disabilities genetics
- Abstract
Chromosomal microarray (CMA) is increasingly utilized for genetic testing of individuals with unexplained developmental delay/intellectual disability (DD/ID), autism spectrum disorders (ASD), or multiple congenital anomalies (MCA). Performing CMA and G-banded karyotyping on every patient substantially increases the total cost of genetic testing. The International Standard Cytogenomic Array (ISCA) Consortium held two international workshops and conducted a literature review of 33 studies, including 21,698 patients tested by CMA. We provide an evidence-based summary of clinical cytogenetic testing comparing CMA to G-banded karyotyping with respect to technical advantages and limitations, diagnostic yield for various types of chromosomal aberrations, and issues that affect test interpretation. CMA offers a much higher diagnostic yield (15%-20%) for genetic testing of individuals with unexplained DD/ID, ASD, or MCA than a G-banded karyotype ( approximately 3%, excluding Down syndrome and other recognizable chromosomal syndromes), primarily because of its higher sensitivity for submicroscopic deletions and duplications. Truly balanced rearrangements and low-level mosaicism are generally not detectable by arrays, but these are relatively infrequent causes of abnormal phenotypes in this population (<1%). Available evidence strongly supports the use of CMA in place of G-banded karyotyping as the first-tier cytogenetic diagnostic test for patients with DD/ID, ASD, or MCA. G-banded karyotype analysis should be reserved for patients with obvious chromosomal syndromes (e.g., Down syndrome), a family history of chromosomal rearrangement, or a history of multiple miscarriages., (Copyright (c) 2010 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2010
- Full Text
- View/download PDF
33. ENCODE whole-genome data in the UCSC Genome Browser.
- Author
-
Rosenbloom KR, Dreszer TR, Pheasant M, Barber GP, Meyer LR, Pohl A, Raney BJ, Wang T, Hinrichs AS, Zweig AS, Fujita PA, Learned K, Rhead B, Smith KE, Kuhn RM, Karolchik D, Haussler D, and Kent WJ
- Subjects
- Animals, Cell Line, Tumor, Computational Biology trends, Genomics, Humans, Information Storage and Retrieval methods, Internet, Mice, Sequence Alignment, Software, Computational Biology methods, Databases, Genetic, Databases, Nucleic Acid, Genome, Human
- Abstract
The Encyclopedia of DNA Elements (ENCODE) project is an international consortium of investigators funded to analyze the human genome with the goal of producing a comprehensive catalog of functional elements. The ENCODE Data Coordination Center at The University of California, Santa Cruz (UCSC) is the primary repository for experimental results generated by ENCODE investigators. These results are captured in the UCSC Genome Bioinformatics database and download server for visualization and data mining via the UCSC Genome Browser and companion tools (Rhead et al. The UCSC Genome Browser Database: update 2010, in this issue). The ENCODE web portal at UCSC (http://encodeproject.org or http://genome.ucsc.edu/ENCODE) provides information about the ENCODE data and convenient links for access.
- Published
- 2010
- Full Text
- View/download PDF
34. The UCSC Genome Browser database: update 2010.
- Author
-
Rhead B, Karolchik D, Kuhn RM, Hinrichs AS, Zweig AS, Fujita PA, Diekhans M, Smith KE, Rosenbloom KR, Raney BJ, Pohl A, Pheasant M, Meyer LR, Learned K, Hsu F, Hillman-Jackson J, Harte RA, Giardine B, Dreszer TR, Clawson H, Barber GP, Haussler D, and Kent WJ
- Subjects
- Animals, Computational Biology trends, Genetic Variation, Genome, Fungal, Genomics, Humans, Information Storage and Retrieval methods, Internet, Invertebrates, Models, Molecular, Phenotype, Software, Computational Biology methods, Databases, Genetic, Databases, Nucleic Acid, Genome
- Abstract
The University of California, Santa Cruz (UCSC) Genome Browser website (http://genome.ucsc.edu/) provides a large database of publicly available sequence and annotation data along with an integrated tool set for examining and comparing the genomes of organisms, aligning sequence to genomes, and displaying and sharing users' own annotation data. As of September 2009, genomic sequence and a basic set of annotation 'tracks' are provided for 47 organisms, including 14 mammals, 10 non-mammal vertebrates, 3 invertebrate deuterostomes, 13 insects, 6 worms and a yeast. New data highlights this year include an updated human genome browser, a 44-species multiple sequence alignment track, improved variation and phenotype tracks and 16 new genome-wide ENCODE tracks. New features include drag-and-zoom navigation, a Wiki track for user-added annotations, new custom track formats for large datasets (bigBed and bigWig), a new multiple alignment output tool, links to variation and protein structure tools, in silico PCR utility enhancements, and improved track configuration tools.
- Published
- 2010
- Full Text
- View/download PDF
35. The UCSC genome browser: what every molecular biologist should know.
- Author
-
Mangan ME, Williams JM, Kuhn RM, and Lathe WC 3rd
- Subjects
- Computer Graphics, Databases, Genetic, User-Computer Interface, Computational Biology methods, Genomics methods, Software
- Abstract
Electronic data resources can enable molecular biologists to query and display many useful features that make benchwork more efficient and drive new discoveries. The UCSC Genome Browser provides a wealth of data and tools that advance one's understanding of genomic context for many species, enable detailed understanding of data, and provide the ability to interrogate regions of interest. Researchers can also supplement the standard display with their own data to query and share with others. Effective use of these resources has become crucial to biological research today, and this unit describes some practical applications of the UCSC Genome Browser.
- Published
- 2009
- Full Text
- View/download PDF
36. The UCSC Cancer Genomics Browser.
- Author
-
Zhu J, Sanborn JZ, Benz S, Szeto C, Hsu F, Kuhn RM, Karolchik D, Archie J, Lenburg ME, Esserman LJ, Kent WJ, Haussler D, and Wang T
- Subjects
- Humans, Biomarkers, Tumor genetics, Chromosome Mapping methods, DNA, Neoplasm genetics, Gene Expression Profiling methods, Neoplasm Proteins genetics, Neoplasms genetics, Software, User-Computer Interface
- Published
- 2009
- Full Text
- View/download PDF
37. The UCSC Genome Browser Database: update 2009.
- Author
-
Kuhn RM, Karolchik D, Zweig AS, Wang T, Smith KE, Rosenbloom KR, Rhead B, Raney BJ, Pohl A, Pheasant M, Meyer L, Hsu F, Hinrichs AS, Harte RA, Giardine B, Fujita P, Diekhans M, Dreszer T, Clawson H, Barber GP, Haussler D, and Kent WJ
- Subjects
- Animals, Chromosome Mapping, Computer Graphics, Gene Expression, Genetic Variation, Humans, RNA, Messenger chemistry, Software, User-Computer Interface, Databases, Nucleic Acid, Genomics
- Abstract
The UCSC Genome Browser Database (GBD, http://genome.ucsc.edu) is a publicly available collection of genome assembly sequence data and integrated annotations for a large number of organisms, including extensive comparative-genomic resources. In the past year, 13 new genome assemblies have been added, including two important primate species, orangutan and marmoset, bringing the total to 46 assemblies for 24 different vertebrates and 39 assemblies for 22 different invertebrate animals. The GBD datasets may be viewed graphically with the UCSC Genome Browser, which uses a coordinate-based display system allowing users to juxtapose a wide variety of data. These data include all mRNAs from GenBank mapped to all organisms, RefSeq alignments, gene predictions, regulatory elements, gene expression data, repeats, SNPs and other variation data, as well as pairwise and multiple-genome alignments. A variety of other bioinformatics tools are also provided, including BLAT, the Table Browser, the Gene Sorter, the Proteome Browser, VisiGene and Genome Graphs.
- Published
- 2009
- Full Text
- View/download PDF
38. UCSC genome browser tutorial.
- Author
-
Zweig AS, Karolchik D, Kuhn RM, Haussler D, and Kent WJ
- Subjects
- Base Sequence, California, Molecular Sequence Data, Universities, Computational Biology methods, Genome, Internet, Sequence Analysis, DNA methods, Software
- Abstract
The University of California Santa Cruz (UCSC) Genome Bioinformatics website consists of a suite of free, open-source, on-line tools that can be used to browse, analyze, and query genomic data. These tools are available to anyone who has an Internet browser and an interest in genomics. The website provides a quick and easy-to-use visual display of genomic data. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. Many of the annotation tracks are submitted by scientists worldwide; the others are computed by the UCSC Genome Bioinformatics group from publicly available sequence data. It also allows users to upload and display their own experimental results or annotation sets by creating a custom track. The suite of tools, downloadable data files, and links to documentation and other information can be found at http://genome.ucsc.edu/.
- Published
- 2008
- Full Text
- View/download PDF
39. The UCSC Genome Browser Database: 2008 update.
- Author
-
Karolchik D, Kuhn RM, Baertsch R, Barber GP, Clawson H, Diekhans M, Giardine B, Harte RA, Hinrichs AS, Hsu F, Kober KM, Miller W, Pedersen JS, Pohl A, Raney BJ, Rhead B, Rosenbloom KR, Smith KE, Stanke M, Thakkapallayil A, Trumbower H, Wang T, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Animals, Computer Graphics, Genetic Variation, Humans, Internet, Invertebrates genetics, Sequence Alignment, User-Computer Interface, Vertebrates genetics, Databases, Nucleic Acid, Genomics
- Abstract
The University of California, Santa Cruz, Genome Browser Database (GBD) provides integrated sequence and annotation data for a large collection of vertebrate and model organism genomes. Seventeen new assemblies have been added to the database in the past year, for a total coverage of 19 vertebrate and 21 invertebrate species as of September 2007. For each assembly, the GBD contains a collection of annotation data aligned to the genomic sequence. Highlights of this year's additions include a 28-species human-based vertebrate conservation annotation, an enhanced UCSC Genes set, and more human variation, MGC, and ENCODE data. The database is optimized for fast interactive performance with a set of web-based tools that may be used to view, manipulate, filter and download the annotation data. New toolset features include the Genome Graphs tool for displaying genome-wide data sets, session saving and sharing, better custom track management, expanded Genome Browser configuration options and a Genome Browser wiki site. The downloadable GBD data, the companion Genome Browser toolset and links to documentation and related information can be found at: http://genome.ucsc.edu/.
- Published
- 2008
- Full Text
- View/download PDF
40. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.
- Author
-
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SC, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CW, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JN, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PI, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VV, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, and de Jong PJ
- Subjects
- Chromatin genetics, Chromatin metabolism, Chromatin Immunoprecipitation, Conserved Sequence genetics, DNA Replication, Evolution, Molecular, Exons genetics, Genetic Variation genetics, Heterozygote, Histones metabolism, Humans, Pilot Projects, Protein Binding, RNA, Messenger genetics, RNA, Untranslated genetics, Transcription Factors metabolism, Transcription Initiation Site, Genome, Human genetics, Genomics, Regulatory Sequences, Nucleic Acid genetics, Transcription, Genetic genetics
- Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
- Published
- 2007
- Full Text
- View/download PDF
41. Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.
- Author
-
Margulies EH, Cooper GM, Asimenos G, Thomas DJ, Dewey CN, Siepel A, Birney E, Keefe D, Schwartz AS, Hou M, Taylor J, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Brown JB, Bickel P, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Stone EA, Rosenbloom KR, Kent WJ, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VV, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Hinrichs A, Trumbower H, Clawson H, Zweig A, Kuhn RM, Barber G, Harte R, Karolchik D, Field MA, Moore RA, Matthewson CA, Schein JE, Marra MA, Antonarakis SE, Batzoglou S, Goldman N, Hardison R, Haussler D, Miller W, Pachter L, Green ED, and Sidow A
- Subjects
- Animals, Human Genome Project, Humans, Evolution, Molecular, Genome, Human, Mammals genetics, Open Reading Frames, Phylogeny, Sequence Alignment
- Abstract
A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization.
- Published
- 2007
- Full Text
- View/download PDF
42. Evolutionary and biomedical insights from the rhesus macaque genome.
- Author
-
Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, Venter JC, Wilson RK, Batzer MA, Bustamante CD, Eichler EE, Hahn MW, Hardison RC, Makova KD, Miller W, Milosavljevic A, Palermo RE, Siepel A, Sikela JM, Attaway T, Bell S, Bernard KE, Buhay CJ, Chandrabose MN, Dao M, Davis C, Delehaunty KD, Ding Y, Dinh HH, Dugan-Rocha S, Fulton LA, Gabisi RA, Garner TT, Godfrey J, Hawes AC, Hernandez J, Hines S, Holder M, Hume J, Jhangiani SN, Joshi V, Khan ZM, Kirkness EF, Cree A, Fowler RG, Lee S, Lewis LR, Li Z, Liu YS, Moore SM, Muzny D, Nazareth LV, Ngo DN, Okwuonu GO, Pai G, Parker D, Paul HA, Pfannkoch C, Pohl CS, Rogers YH, Ruiz SJ, Sabo A, Santibanez J, Schneider BW, Smith SM, Sodergren E, Svatek AF, Utterback TR, Vattathil S, Warren W, White CS, Chinwalla AT, Feng Y, Halpern AL, Hillier LW, Huang X, Minx P, Nelson JO, Pepin KH, Qin X, Sutton GG, Venter E, Walenz BP, Wallis JW, Worley KC, Yang SP, Jones SM, Marra MA, Rocchi M, Schein JE, Baertsch R, Clarke L, Csürös M, Glasscock J, Harris RA, Havlak P, Jackson AR, Jiang H, Liu Y, Messina DN, Shen Y, Song HX, Wylie T, Zhang L, Birney E, Han K, Konkel MK, Lee J, Smit AF, Ullmer B, Wang H, Xing J, Burhans R, Cheng Z, Karro JE, Ma J, Raney B, She X, Cox MJ, Demuth JP, Dumas LJ, Han SG, Hopkins J, Karimpour-Fard A, Kim YH, Pollack JR, Vinar T, Addo-Quaye C, Degenhardt J, Denby A, Hubisz MJ, Indap A, Kosiol C, Lahn BT, Lawson HA, Marklein A, Nielsen R, Vallender EJ, Clark AG, Ferguson B, Hernandez RD, Hirani K, Kehrer-Sawatzki H, Kolb J, Patil S, Pu LL, Ren Y, Smith DG, Wheeler DA, Schenck I, Ball EV, Chen R, Cooper DN, Giardine B, Hsu F, Kent WJ, Lesk A, Nelson DL, O'brien WE, Prüfer K, Stenson PD, Wallace JC, Ke H, Liu XM, Wang P, Xiang AP, Yang F, Barber GP, Haussler D, Karolchik D, Kern AD, Kuhn RM, Smith KE, and Zwieg AS
- Subjects
- Animals, Biomedical Research, Female, Gene Duplication, Gene Rearrangement, Genetic Diseases, Inborn, Genetic Variation, Humans, Male, Multigene Family, Mutation, Pan troglodytes genetics, Sequence Analysis, DNA, Species Specificity, Evolution, Molecular, Genome, Macaca mulatta genetics
- Abstract
The rhesus macaque (Macaca mulatta) is an abundant primate species that diverged from the ancestors of Homo sapiens about 25 million years ago. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used nonhuman primate in basic and applied biomedical research. We determined the genome sequence of an Indian-origin Macaca mulatta female and compared the data with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families. A comparison of sequences from individual animals was used to investigate their underlying genetic diversity. The complete description of the macaque genome blueprint enhances the utility of this animal model for biomedical research and improves our understanding of the basic biology of the species.
- Published
- 2007
- Full Text
- View/download PDF
43. The UCSC genome browser database: update 2007.
- Author
-
Kuhn RM, Karolchik D, Zweig AS, Trumbower H, Thomas DJ, Thakkapallayil A, Sugnet CW, Stanke M, Smith KE, Siepel A, Rosenbloom KR, Rhead B, Raney BJ, Pohl A, Pedersen JS, Hsu F, Hinrichs AS, Harte RA, Diekhans M, Clawson H, Bejerano G, Barber GP, Baertsch R, Haussler D, and Kent WJ
- Subjects
- Animals, Base Sequence, Cattle, Computer Graphics, Conserved Sequence, Genome, Human, Humans, Internet, Linkage Disequilibrium, Mice, Open Reading Frames, Polymorphism, Single Nucleotide, Rats, Regulatory Sequences, Nucleic Acid, User-Computer Interface, Databases, Genetic, Genomics
- Abstract
The University of California, Santa Cruz Genome Browser Database contains, as of September 2006, sequence and annotation data for the genomes of 13 vertebrate and 19 invertebrate species. The Genome Browser displays a wide variety of annotations at all scales from the single nucleotide level up to a full chromosome and includes assembly data, genes and gene predictions, mRNA and EST alignments, and comparative genomics, regulation, expression and variation data. The database is optimized for fast interactive performance with web tools that provide powerful visualization and querying capabilities for mining the data. In the past year, 22 new assemblies and several new sets of human variation annotation have been released. New features include VisiGene, a fully integrated in situ hybridization image browser; phyloGif, for drawing evolutionary tree diagrams; a redesigned Custom Track feature; an expanded SNP annotation track; and many new display options. The Genome Browser, other tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.
- Published
- 2007
- Full Text
- View/download PDF
44. The ENCODE Project at UC Santa Cruz.
- Author
-
Thomas DJ, Rosenbloom KR, Clawson H, Hinrichs AS, Trumbower H, Raney BJ, Karolchik D, Barber GP, Harte RA, Hillman-Jackson J, Kuhn RM, Rhead BL, Smith KE, Thakkapallayil A, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Base Sequence, Humans, Internet, Sequence Alignment, Software, User-Computer Interface, Databases, Nucleic Acid, Genome, Human, Genomics
- Abstract
The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.
- Published
- 2007
- Full Text
- View/download PDF
45. Comparative genomic analysis using the UCSC genome browser.
- Author
-
Karolchik D, Bejerano G, Hinrichs AS, Kuhn RM, Miller W, Rosenbloom KR, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Animals, Evolution, Molecular, Humans, Databases, Genetic, Genomics, User-Computer Interface
- Abstract
Comparative analysis of DNA sequence from multiple species can provide insights into the function and evolutionary processes that shape genomes. The University of California Santa Cruz (UCSC) Genome Bioinformatics group has developed several tools and methodologies in its study of comparative genomics, many of which have been incorporated into the UCSC Genome Browser (http://genome.ucsc.edu), an easy-to-use online tool for browsing genomic data and aligned annotation "tracks" in a single window. The comparative genomics annotations in the browser include pairwise alignments, which aid in the identification of orthologous regions between species, and conservation tracks that show measures of evolutionary conservation among sets of multiply aligned species, highlighting regions of the genome that may be functionally important. A related tool, the UCSC Table Browser, provides a simple interface for querying, analyzing, and downloading the data underlying the Genome Browser annotation tracks. Here, we describe a procedure for examining a genomic region of interest in the Genome Browser, analyzing characteristics of the region, filtering the data, and downloading data sets for further study.
- Published
- 2007
- Full Text
- View/download PDF
46. Variation resources at UC Santa Cruz.
- Author
-
Thomas DJ, Trumbower H, Kern AD, Rhead BL, Kuhn RM, Haussler D, and Kent WJ
- Subjects
- Alleles, Animals, Gene Frequency, Genomics, Genotype, Humans, Internet, Linkage Disequilibrium, Mice, Rats, Recombination, Genetic, Sequence Alignment, User-Computer Interface, Databases, Nucleic Acid, Polymorphism, Single Nucleotide
- Abstract
The variation resources within the University of California Santa Cruz Genome Browser include polymorphism data drawn from public collections and analyses of these data, along with their display in the context of other genomic annotations. Primary data from dbSNP is included for many organisms, with added information including genomic alleles and orthologous alleles for closely related organisms. Display filtering and coloring is available by variant type, functional class or other annotations. Annotation of potential errors is highlighted and a genomic alignment of the variant's flanking sequence is displayed. HapMap allele frequencies and linkage disequilibrium (LD) are available for each HapMap population, along with non-human primate alleles. The browsing and analysis tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.
- Published
- 2007
- Full Text
- View/download PDF
47. The UCSC Known Genes.
- Author
-
Hsu F, Kent WJ, Clawson H, Kuhn RM, Diekhans M, and Haussler D
- Subjects
- Base Sequence, California, Molecular Sequence Data, Proteome metabolism, Universities, User-Computer Interface, Chromosome Mapping methods, Database Management Systems, Databases, Protein, Information Storage and Retrieval methods, Proteome chemistry, Proteome genetics, RNA, Messenger genetics
- Abstract
The University of California Santa Cruz (UCSC) Known Genes dataset is constructed by a fully automated process, based on protein data from Swiss-Prot/TrEMBL (UniProt) and the associated mRNA data from Genbank. The detailed steps of this process are described. Extensive cross-references from this dataset to other genomic and proteomic data were constructed. For each known gene, a details page is provided containing rich information about the gene, together with extensive links to other relevant genomic, proteomic and pathway data. As of July 2005, the UCSC Known Genes are available for human, mouse and rat genomes. The Known Genes serves as a foundation to support several key programs: the Genome Browser, Proteome Browser, Gene Sorter and Table Browser offered at the UCSC website. All the associated data files and program source code are also available. They can be accessed at http://genome.ucsc.edu. The genomic coverage of UCSC Known Genes, RefSeq, Ensembl Genes, H-Invitational and CCDS is analyzed. Although UCSC Known Genes offers the highest genomic and CDS coverage among major human and mouse gene sets, more detailed analysis suggests all of them could be further improved.
- Published
- 2006
- Full Text
- View/download PDF
48. The UCSC Genome Browser Database: update 2006.
- Author
-
Hinrichs AS, Karolchik D, Baertsch R, Barber GP, Bejerano G, Clawson H, Diekhans M, Furey TS, Harte RA, Hsu F, Hillman-Jackson J, Kuhn RM, Pedersen JS, Pohl A, Raney BJ, Rosenbloom KR, Siepel A, Smith KE, Sugnet CW, Sultan-Qurraie A, Thomas DJ, Trumbower H, Weber RJ, Weirauch M, Zweig AS, Haussler D, and Kent WJ
- Subjects
- Amino Acid Sequence, Animals, California, Computer Graphics, Dogs, Gene Expression, Genes, Humans, Internet, Mice, Polymorphism, Single Nucleotide, Proteins chemistry, Proteins genetics, Proteins metabolism, Proteomics, Rats, Sequence Alignment, Software, User-Computer Interface, Databases, Genetic, Genomics
- Abstract
The University of California Santa Cruz Genome Browser Database (GBD) contains sequence and annotation data for the genomes of about a dozen vertebrate species and several major model organisms. Genome annotations typically include assembly data, sequence composition, genes and gene predictions, mRNA and expressed sequence tag evidence, comparative genomics, regulation, expression and variation data. The database is optimized to support fast interactive performance with web tools that provide powerful visualization and querying capabilities for mining the data. The Genome Browser displays a wide variety of annotations at all scales from single nucleotide level up to a full chromosome. The Table Browser provides direct access to the database tables and sequence data, enabling complex queries on genome-wide datasets. The Proteome Browser graphically displays protein properties. The Gene Sorter allows filtering and comparison of genes by several metrics including expression data and several gene properties. BLAT and In Silico PCR search for sequences in entire genomes in seconds. These tools are highly integrated and provide many hyperlinks to other databases and websites. The GBD, browsing tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.
- Published
- 2006
- Full Text
- View/download PDF
49. Exploring relationships and mining data with the UCSC Gene Sorter.
- Author
-
Kent WJ, Hsu F, Karolchik D, Kuhn RM, Clawson H, Trumbower H, and Haussler D
- Subjects
- Database Management Systems, Humans, Computational Biology methods, Databases, Genetic, Genome, Human, Genomics methods, Software
- Abstract
In parallel with the human genome sequencing and assembly effort, many tools have been developed to examine the structure and function of the human gene set. The University of California Santa Cruz (UCSC) Gene Sorter has been created as a gene-based counterpart to the chromosome-oriented UCSC Genome Browser to facilitate the study of gene function and evolution. This simple, but powerful tool provides a graphical display of related genes that can be sorted and filtered based on a variety of criteria. Genes may be ordered based on such characteristics as expression profiles, proximity in genome, shared Gene Ontology (GO) terms, and protein similarity. The display can be restricted to a gene set meeting a specific set of constraints by filtering on expression levels, gene name or ID, chromosomal position, and so on. The default set of information for each gene entry-gene name, selected expression data, a BLASTP E-value, genomic position, and a description-can be configured to include many other types of data, including expanded expression data, related accession numbers and IDs, orthologs in other species, GO terms, and much more. The Gene Sorter, a CGI-based Web application written in C with a MySQL database, is tightly integrated with the other applications in the UCSC Genome Browser suite. Available on a selected subset of the genome assemblies found in the Genome Browser, it further enhances the usefulness of the UCSC tool set in interactive genomic exploration and analysis.
- Published
- 2005
- Full Text
- View/download PDF
50. The UCSC Proteome Browser.
- Author
-
Hsu F, Pringle TH, Kuhn RM, Karolchik D, Diekhans M, Haussler D, and Kent WJ
- Subjects
- California, Genomics, Humans, Systems Integration, User-Computer Interface, Databases, Protein, Proteins chemistry, Proteins genetics, Proteomics
- Abstract
The University of California Santa Cruz (UCSC) Proteome Browser provides a wealth of protein information presented in graphical images and with links to other protein-related Internet sites. The Proteome Browser is tightly integrated with the UCSC Genome Browser. For the first time, Genome Browser users have both the genome and proteome worlds at their fingertips simultaneously. The Proteome Browser displays tracks of protein and genomic sequences, exon structure, polarity, hydrophobicity, locations of cysteine and glycosylation potential, Superfamily domains and amino acids that deviate from normal abundance. Histograms show genome-wide distribution of protein properties, including isoelectric point, molecular weight, number of exons, InterPro domains and cysteine locations, together with specific property values of the selected protein. The Proteome Browser also provides links to gene annotations in the Genome Browser, the Known Genes details page and the Gene Sorter; domain information from Superfamily, InterPro and Pfam; three-dimensional structures at the Protein Data Bank and ModBase; and pathway data at KEGG, BioCarta/CGAP and BioCyc. As of August 2004, the Proteome Browser is available for human, mouse and rat proteomes. The browser may be accessed from any Known Genes details page of the Genome Browser at http://genome.ucsc.edu. A user's guide is also available on this website.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.