31 results on '"Mesirov J"'
Search Results
2. Distinct physiological states of Plasmodium falciparum in malaria-infected patients
- Author
-
Daily, J. P., Scanfeld, D., Pochet, N., Le Roch, K., Plouffe, D., Kamal, M., Sarr, O., Mboup, S., Ndir, O., Wypij, D., Levasseur, K., Thomas, E., Tamayo, P., Dong, C., Zhou, Y., Lander, E. S., Ndiaye, D., Wirth, D., Winzeler, E. A., Mesirov, J. P., and Regev, A.
- Published
- 2007
- Full Text
- View/download PDF
3. GeneCluster 2.0: an advanced toolset for bioarray analysis
- Author
-
Reich, M., Ohm, K., Angelo, M., Tamayo, P., and Mesirov, J. P.
- Published
- 2004
4. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration
- Author
-
Thorvaldsdottir, H., primary, Robinson, J. T., additional, and Mesirov, J. P., additional
- Published
- 2012
- Full Text
- View/download PDF
5. In vivo profiles in malaria are consistent with a novel physiological state
- Author
-
Wirth, D., primary, Daily, J., additional, Winzeler, E., additional, Mesirov, J. P., additional, and Regev, A., additional
- Published
- 2009
- Full Text
- View/download PDF
6. GeneCruiser: a web service for the annotation of microarray data
- Author
-
Liefeld, T., primary, Reich, M., additional, Gould, J., additional, Zhang, P., additional, Tamayo, P., additional, and Mesirov, J. P., additional
- Published
- 2005
- Full Text
- View/download PDF
7. Characterizing genomic alterations in cancer by complementary functional associations
- Author
-
Kim, J. W., Botvinnik, O. B., Abudayyeh, O., Birger, C., Rosenbluh, J., Shrestha, Y., Abazeed, M. E., Hammerman, P. S., DiCara, D., Konieczkowski, D. J., Johannessen, C. M., Liberzon, A., Alizad-Rahvar, A. R., Alexe, G., Aguirre, A., Ghandi, M., Greulich, H., Vazquez, F., Weir, B. A., Van Allen, E. M., Tsherniak, A., Shao, D. D., Zack, T. I., Noble, M., Getz, G., Beroukhim, R., Garraway, L. A., Ardakani, M., Romualdi, C., Sales, G., Barbie, D. A., Boehm, J. S., Hahn, W. C., Mesirov, J. P., and Tamayo, P.
- Abstract
Systematic efforts to sequence the cancer genome have identified large numbers of relevant mutations and copy number alterations in human cancers; however, elucidating their functional consequences, and their interactions to drive or maintain oncogenic states, is still a significant challenge. Here we introduce REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene-dependency of oncogenic pathways or the sensitivity to a drug treatment. We use REVEALER to uncover complementary genomic alterations associated with the transcriptional activation of β-catenin and NRF2, MEK-inhibitor sensitivity, and KRAS dependency. REVEALER successfully identified both known and new associations demonstrating the power of combining functional profiles with extensive characterization of genomic alterations in cancer genomes.
- Published
- 2016
- Full Text
- View/download PDF
8. Protein structure prediction by a data-level parallel algorithm.
- Author
-
Zhang, X., Waltz, D., and Mesirov, J. P.
- Published
- 1989
- Full Text
- View/download PDF
9. Molecular classification of multiple tumor types
- Author
-
Yeang, C-H., Ramaswamy, S., Tamayo, P., Mukherjee, S., Rifkin, R.M., Angelo, M., Reich, M., Lander, E., Mesirov, J., and Golub, T.
- Abstract
Using gene expression data to classify tumor types is a very promising tool in cancer diagnosis. Previous works show several pairs of tumor types can be successfully distinguished by their gene expression patterns (Golub et al. 1999, Ben-Dor et al. 2000, Alizadeh et al. 2000). However, the simultaneous classification across a heterogeneous set of tumor types has not been well studied yet. We obtained 190 samples from 14 tumor classes and generated a combined expression dataset containing 16063 genes for each of those samples. We performed multi-class classification by combining the outputs of binary classifiers. Three binary classifiers (k-nearest neighbors, weighted voting, and support vector machines) were applied in conjunction with three combination scenarios (one-vs-all, all-pairs, hierarchical partitioning). We achieved the best cross validation error rate of 18.75% and the best test error rate of 21.74% by using the one-vs-all support vector machine algorithm. The results demonstrate the feasibility of performing clinically useful classification from samples of multiple tumor types.Contact: chyeang@mit.edu
- Published
- 2001
10. Human and mouse gene structure: comparative analysis and application to exon prediction.
- Author
-
Batzoglou, S, Pachter, L, Mesirov, J P, Berger, B, and Lander, E S
- Abstract
We describe a novel analytical approach to gene recognition based on cross-species comparison. We first undertook a comparison of orthologous genomic loci from human and mouse, studying the extent of similarity in the number, size and sequence of exons and introns. We then developed an approach for recognizing genes within such orthologous regions by first aligning the regions using an iterative global alignment system and then identifying genes based on conservation of exonic features at aligned positions in both species. The alignment and gene recognition are performed by new programs called and, respectively. performed well at exact identification of coding exons in 117 orthologous pairs tested.
- Published
- 2000
11. Sequencing a genome by walking with clone-end sequences: a mathematical analysis.
- Author
-
Batzoglou, S, Berger, B, Mesirov, J, and Lander, E S
- Abstract
One approach to sequencing a large genome is (1) to sequence a collection of nonoverlapping "seeds" chosen from a genomic library of large-insert clones [such as bacterial artificial chromosomes (BACs)] and then (2) to take successive "walking" steps by selecting and sequencing minimally overlapping clones, using information such as clone-end sequences to identify the overlaps. In this paper we analyze the strategic issues involved in using this approach. We derive formulas showing how two key factors, the initial density of seed clones and the depth of the genomic library used for walking, affect the cost and time of a sequencing project-that is, the amount of redundant sequencing and the number of steps to cover the vast majority of the genome. We also discuss a variant strategy in which a second genomic library with clones having a somewhat smaller insert size is used to close gaps. This approach can dramatically decrease the amount of redundant sequencing, without affecting the rate at which the genome is covered.
- Published
- 1999
- Full Text
- View/download PDF
12. Protein structure prediction by a data-level parallel algorithm
- Author
-
Zhang, X., primary, Waltz, D., additional, and Mesirov, J. P., additional
- Published
- 1989
- Full Text
- View/download PDF
13. Portraits of breast cancer progression
- Author
-
Ganesan Shridar, Mesirov Jill P, Tamayo Pablo, Scanfeld Daniel, Alexe Gabriela, Dalgin Gul S, DeLisi Charles, and Bhanot Gyan
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Clustering analysis of microarray data is often criticized for giving ambiguous results because of sensitivity to data perturbation or clustering techniques used. In this paper, we describe a new method based on principal component analysis and ensemble consensus clustering that avoids these problems. Results We illustrate the method on a public microarray dataset from 36 breast cancer patients of whom 31 were diagnosed with at least two of three pathological stages of disease (atypical ductal hyperplasia (ADH), ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC). Our method identifies an optimum set of genes and divides the samples into stable clusters which correlate with clinical classification into Luminal, Basal-like and Her2+ subtypes. Our analysis reveals a hierarchical portrait of breast cancer progression and identifies genes and pathways for each stage, grade and subtype. An intriguing observation is that the disease phenotype is distinguishable in ADH and progresses along distinct pathways for each subtype. The genetic signature for disease heterogeneity across subtypes is greater than the heterogeneity of progression from DCIS to IDC within a subtype, suggesting that the disease subtypes have distinct progression pathways. Our method identifies six disease subtype and one normal clusters. The first split separates the normal samples from the cancer samples. Next, the cancer cluster splits into low grade (pathological grades 1 and 2) and high grade (pathological grades 2 and 3) while the normal cluster is unchanged. Further, the low grade cluster splits into two subclusters and the high grade cluster into four. The final six disease clusters are mapped into one Luminal A, three Luminal B, one Basal-like and one Her2+. Conclusion We confirm that the cancer phenotype can be identified in early stage because the genes altered in this stage progressively alter further as the disease progresses through DCIS into IDC. We identify six subtypes of disease which have distinct genetic signatures and remain separated in the clustering hierarchy. Our findings suggest that the heterogeneity of disease across subtypes is higher than the heterogeneity of the disease progression within a subtype, indicating that the subtypes are in fact distinct diseases.
- Published
- 2007
- Full Text
- View/download PDF
14. Erratum
- Author
-
Zhang, X., Mesirov, J. P., and Waltz, D. L.
- Published
- 1993
- Full Text
- View/download PDF
15. Two-dimensional, viscous, incompressible flow in complex geometries on a massively parallel processor
- Author
-
Mesirov, J [Thinking Machines Corp., Cambridge, MA (United States)]
- Published
- 1992
- Full Text
- View/download PDF
16. Comprehensive molecular profiling of lung adenocarcinoma
- Author
-
Amie Radenbaugh, Noreen Dhalla, Christina Williamson, Charles Saller, James Suh, Ramaswamy Govindan, Travis I. Zack, Paul T. Spellman, Daniel DiCara, Harvey I. Pass, Deepak Srinivasan, William G. Richards, Robert J. Cerfolio, Igor Letovanec, A. Gordon Robertson, Gabriel Sica, Chad J. Creighton, Hendrik Dienemann, Jeffrey A. Borgia, Boris Reva, Bryan F. Meyers, Yiling Lu, Nikolaus Schultz, Christopher I. Amos, Dante Trusty, Carmelo Gaudioso, Michael Meister, James T. Robinson, Lihua Zou, James Shin, Jeremy Parfitt, Darlene Lee, Junyuan Wu, Carl Morrison, Scott L. Carter, Giovanni Ciriello, Nils Weinhold, Elena Nemirovich-Danchenko, Andrew Wei Xu, Christopher G. Maher, Lori Boice, Irina Zaytseva, Dennis A. Wigle, Kenna R. Mills Shaw, Matthew G. Soloway, Matthew Meyerson, Peng Chieh Chen, Frank Schneider, Troy Shelton, Douglas Voet, Steven E. Schumacher, D L Rotin, Saianand Balu, Stuart R. Jefferys, Tom Bodenheimer, Bradley A. Ozenberger, Eric S. Lander, Edward Gabrielson, Konstantin V. Fedosenko, Rehan Akbani, William D. Travis, Ari B. Kahn, Marcin Imielinski, Jacqueline E. Schein, Thomas L. Bauer, Kai Ye, Samuel A. Yousem, Robert C. Onofrio, Thomas Muley, Ayesha S. Bryant, Michael K. Asiedu, Monique Albert, Pei Lin, Corbin D. Jones, Edwina Duhig, Jean C. Zenklusen, Lucinda Fulton, Christina Yau, J. Todd Auman, Leigh B. Thorne, Elena Helman, Richard T. Cheney, William Lee, Patrick K. Kimes, Juok Cho, Alexei Protopopov, Wenbin Liu, Lee Lichtenstein, Jing Wang, Lixing Yang, W. Kimryn Rathmell, Jo Ellen Weaver, David A. Wheeler, Leslie Cope, Mark A. Watson, Heidi J. Sofia, Angeliki Pantazi, Ronglai Shen, Jeffrey Roach, Eric A. Collisson, Patrick Kwok Shing Ng, Angela Hadjipanayis, Peter S. Hammerman, David Van Den Berg, Kwun M. Fong, Nils Gehlenborg, Natasha Rekhtman, William K. Funkhouser, D. Neil Hayes, Harshad S. Mahadeshwar, Semin Lee, Martin Peifer, David Mallery, Piotr A. Mieczkowski, Ranabir Guin, Madhusmita Behera, Philipp A. Schnabel, Jill M. Siegfried, Carmen Gomez-Fernandez, Johanna Gardner, Lynn M. Herbert, Hailei Zhang, Robert S. Fulton, Travis Sullivan, Sahil Seth, Sam Ng, Chandra Sekhar Pedamallu, Barry S. Taylor, Venkatraman E. Seshan, Valerie W. Rusch, Jinze Liu, Daniel P. Raymond, Jianjiong Gao, Nathan A. Pennell, Marco A. Marra, Jan F. Prins, Payal Sipahimalani, Janae V. Simons, Joel S. Parker, Rileen Sinha, Lindy Hunter, Raju Kucherlapati, Dennis T. Maglinte, Fedor Moiseenko, Eric E. Snyder, Roy Tarnuzzer, Beverly Lee, James Stephen Marron, Kristian Cibulskis, Jerome Myers, Haiyan I. Li, Robert Penny, Hartmut Juhl, Richard K. Wilson, Zhining Wang, Eran Hodis, Carrie Sougnez, Jiabin Tang, William Mallard, Bryan Hernandez, Liming Yang, Jennifer Brown, Gad Getz, Farhad Kosari, Catrina Fronick, Juliann Chmielecki, Jianhua Zhang, Suresh S. Ramalingam, Michael Parfenov, Peter J. Park, Tanja Davidsen, Philip H. Lai, Jeff Boyd, Dang Huy Quoc Thinh, Harmanjatinder S. Sekhon, Malcolm V. Brock, Mark Pool, Margi Sheth, Kimberly M. Rieger-Christ, Michael J. Liptay, E. Getz, S. Onur Sumer, Ian A. Yang, B. Arman Aksoy, Douglas B. Flieder, Bradley M. Broom, Carrie Hirst, Solange Peters, Joshua M. Stuart, Khurram Z. Khan, Scott Morris, Donghui Tan, Andrew J. Mungall, Ming-Sound Tsao, Gordon B. Mills, Stephen B. Baylin, Rebecca Carlsen, Sanja Dacic, Julien Baboud, Brenda Rabeno, Richard A. Hajek, Lauren Averett Byers, Yaron S.N. Butterfield, Miruna Balasundaram, Chip Stewart, Katherine Tarvin, Peter B. Illei, James G. Herman, David J. Kwiatkowski, Andy Chu, David Haussler, Natasja Wye, Charles M. Perou, Peter W. Laird, Timothy J. Triche, Yan Shi, Jill P. Mesirov, Angela N. Brooks, Lori Huelsenbeck-Dill, Steven J.M. Jones, Antonia H. Holway, Lixia Diao, Anthony A. Gal, David G. Beer, Angela Tam, Ashley H. Salazar, Mark A. Jensen, Robert A. Holt, Katherine A. Hoadley, John A. Demchok, Sandra McDonald, Chandra Goparaju, David Pot, Belinda E. Clarke, Gordon Robertson, Michael C. Wendl, Helga Thorvaldsdottir, Kristen Rogers, Joshua D. Campbell, Chris Sander, Rayleen V. Bowman, Marc Danie Nazaire, Michael Mayo, Olga Voronina, Ludmila Danilova, Paul Zippile, Netty Santoso, John V. Heymach, Matthew D. Wilkerson, John Eckman, Morgan Windsor, Cureline Oleg Dolzhanskiy, Nina Thiessen, Mara Rosenberg, Gideon Dresdner, Levi A. Garraway, Eric Chuah, Richard Varhol, Elizabeth Buda, Li Ding, Alice H. Berger, Xingzhi Song, John M. S. Bartlett, Michael D. McLellan, Olga Potapova, Joseph Paulauskis, Igor Jurisica, Benjamin Gross, Jaegil Kim, John N. Weinstein, Kevin Lau, Christopher R. Cabanski, Philip Bonomi, Michael S. Noble, Maureen F. Zakowski, George E. Sandusky, Mary Iacocca, Eric J. Burks, Erin Curley, Lynda Chin, Rajiv Dhir, Singer Ma, Sophie C. Egea, Umadevi Veluvolu, Sugy Kodeeswaran, Christopher A. Miller, Moiz S. Bootwalla, Daniel J. Weisenberger, Shaowu Meng, Mei Huang, Elaine R. Mardis, Gordon Saksena, Nicholas J. Petrelli, Yvonne Owusu-Sarpong, Christopher C. Benz, Bernard Kohl, Jingchun Zhu, David I. Heiman, Carol Farver, Scot Waring, Richard A. Moore, Darshan Singh, Andrew D. Cherniack, Rameen Beroukhim, Michael S. Lawrence, Xiaojia Ren, Marc Ladanyi, Stacey Gabriel, Christine Czerwinski, Alan P. Hoyle, Cancer Genome Atlas Research Network, Collisson, E. A., Campbell, J.D., Brooks, A.N., Berger, A.H., Lee, W., Chmielecki, J., Beer, D.G., Cope, L., Creighton, C.J., Danilova, L., Ding, L., Getz, G., Hammerman, P.S., Hayes, D.N., Hernandez, B., Herman, J.G., Heymach, J.V., Jurisica, I., Kucherlapati, R., Kwiatkowski, D., Ladanyi, M., Robertson, G., Schultz, N., Shen, R., Sinha, R., Sougnez, C., Tsao, M.S., Travis, W.D., Weinstein, J.N., Wigle, D.A., Wilkerson, M.D., Chu, A., Cherniack, A.D., Hadjipanayis, A., Rosenberg, M., Weisenberger, D.J., Laird, P.W., Radenbaugh, A., Ma, S., Stuart, J.M., Averett Byers, L., Baylin, S.B., Govindan, R., Meyerson, M., Gabriel, S.B., Cibulskis, K., Kim, J., Stewart, C., Lichtenstein, L., Lander, E.S., Lawrence, M.S., Kandoth, C., Fulton, R., Fulton, L.L., McLellan, M.D., Wilson, R.K., Ye, K., Fronick, C.C., Maher, C.A., Miller, C.A., Wendl, M.C., Cabanski, C., Mardis, E., Wheeler, D., Balasundaram, M., Butterfield, Y.S., Carlsen, R., Chuah, E., Dhalla, N., Guin, R., Hirst, C., Lee, D., Li, H.I., Mayo, M., Moore, R.A., Mungall, A.J., Schein, J.E., Sipahimalani, P., Tam, A., Varhol, R., Robertson, A., Wye, N., Thiessen, N., Holt, R.A., Jones, S.J., Marra, M.A., Imielinski, M., Onofrio, R.C., Hodis, E., Zack, T., Helman, E., Sekhar Pedamallu, C., Mesirov, J., Saksena, G., Schumacher, S.E., Carter, S.L., Garraway, L., Beroukhim, R., Lee, S., Mahadeshwar, H.S., Pantazi, A., Protopopov, A., Ren, X., Seth, S., Song, X., Tang, J., Yang, L., Zhang, J., Chen, P.C., Parfenov, M., Wei Xu, A., Santoso, N., Chin, L., Park, P.J., Hoadley, K.A., Auman, J.T., Meng, S., Shi, Y., Buda, E., Waring, S., Veluvolu, U., Tan, D., Mieczkowski, P.A., Jones, C.D., Simons, J.V., Soloway, M.G., Bodenheimer, T., Jefferys, S.R., Roach, J., Hoyle, A.P., Wu, J., Balu, S., Singh, D., Prins, J.F., Marron, J.S., Parker, J.S., Perou, C.M., Liu, J., Maglinte, D.T., Lai, P.H., Bootwalla, M.S., Van Den Berg, D.J., Triche, T., Cho, J., DiCara, D., Heiman, D., Lin, P., Mallard, W., Voet, D., Zhang, H., Zou, L., Noble, M.S., Gehlenborg, N., Thorvaldsdottir, H., Nazaire, M.D., Robinson, J., Aksoy, B.A., Ciriello, G., Taylor, B.S., Dresdner, G., Gao, J., Gross, B., Seshan, V.E., Reva, B., Sumer, S.O., Weinhold, N., Sander, C., Ng, S., Zhu, J., Benz, C.C., Yau, C., Haussler, D., Spellman, P.T., Kimes, P.K., Broom, B.M., Wang, J., Lu, Y., Kwok Shing Ng, P., Diao, L., Liu, W., Amos, C.I., Akbani, R., Mills, G.B., Curley, E., Paulauskis, J., Lau, K., Morris, S., Shelton, T., Mallery, D., Gardner, J., Penny, R., Saller, C., Tarvin, K., Richards, W.G., Cerfolio, R., Bryant, A., Raymond, D.P., Pennell, N.A., Farver, C., Czerwinski, C., Huelsenbeck-Dill, L., Iacocca, M., Petrelli, N., Rabeno, B., Brown, J., Bauer, T., Dolzhanskiy, O., Potapova, O., Rotin, D., Voronina, O., Nemirovich-Danchenko, E., Fedosenko, K.V., Gal, A., Behera, M., Ramalingam, S.S., Sica, G., Flieder, D., Boyd, J., Weaver, J., Kohl, B., Huy Quoc Thinh, D., Sandusky, G., Juhl, H., Duhig, E., Illei, P., Gabrielson, E., Shin, J., Lee, B., Rodgers, K., Trusty, D., Brock, M.V., Williamson, C., Burks, E., Rieger-Christ, K., Holway, A., Sullivan, T., Asiedu, M.K., Kosari, F., Rekhtman, N., Zakowski, M., Rusch, V.W., Zippile, P., Suh, J., Pass, H., Goparaju, C., Owusu-Sarpong, Y., Bartlett, J.M., Kodeeswaran, S., Parfitt, J., Sekhon, H., Albert, M., Eckman, J., Myers, J.B., Cheney, R., Morrison, C., Gaudioso, C., Borgia, J.A., Bonomi, P., Pool, M., Liptay, M.J., Moiseenko, F., Zaytseva, I., Dienemann, H., Meister, M., Schnabel, P.A., Muley, T.R., Peifer, M., Gomez-Fernandez, C., Herbert, L., Egea, S., Huang, M., Thorne, L.B., Boice, L., Hill Salazar, A., Funkhouser, W.K., Rathmell, W.K., Dhir, R., Yousem, S.A., Dacic, S., Schneider, F., Siegfried, J.M., Hajek, R., Watson, M.A., McDonald, S., Meyers, B., Clarke, B., Yang, I.A., Fong, K.M., Hunter, L., Windsor, M., Bowman, R.V., Peters, S., Letovanec, I., Khan, K.Z., Jensen, M.A., Snyder, E.E., Srinivasan, D., Kahn, A.B., Baboud, J., Pot, D.A., Mills Shaw, K.R., Sheth, M., Davidsen, T., Demchok, J.A., Wang, Z., Tarnuzzer, R., Zenklusen, J.C., Ozenberger, B.A., Sofia, H.J., Massachusetts Institute of Technology. Department of Biology, and Lander, Eric S.
- Subjects
Male ,Lung Neoplasms ,Adenocarcinoma/genetics ,Adenocarcinoma/pathology ,Cell Cycle Proteins/genetics ,Female ,Gene Dosage ,Gene Expression Regulation, Neoplastic ,Genomics ,Humans ,Lung Neoplasms/genetics ,Lung Neoplasms/pathology ,Molecular Typing ,Mutation/genetics ,Oncogenes/genetics ,Sex Factors ,Transcriptome/genetics ,Adenocarcinoma of Lung ,Cell Cycle Proteins ,Biology ,Adenocarcinoma ,Exon ,Germline mutation ,microRNA ,Adenocarcinoma of the lung ,medicine ,Gene ,Multidisciplinary ,Oncogene ,Oncogenes ,medicine.disease ,MET Exon 14 Skipping Mutation ,Molecular biology ,3. Good health ,Mutation ,Transcriptome - Abstract
Adenocarcinoma of the lung is the leading cause of cancer death worldwide. Here we report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses. High rates of somatic mutation were seen (mean 8.9 mutations per megabase). Eighteen genes were statistically significantly mutated, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification. EGFR mutations were more frequent in female patients, whereas mutations in RBM10 were more common in males. Aberrations in NF1, MET, ERBB2 and RIT1 occurred in 13% of cases and were enriched in samples otherwise lacking an activated oncogene, suggesting a driver role for these events in certain tumours. DNA and mRNA sequence from the same tumour highlighted splicing alterations driven by somatic genomic changes, including exon 14 skipping in MET mRNA in 4% of cases. MAPK and PI(3)K pathway activity, when measured at the protein level, was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation. These data establish a foundation for classification and further investigations of lung adenocarcinoma molecular pathogenesis.
- Published
- 2013
17. Deciphering the Functional Roles of Individual Cancer Alleles Across Comprehensive Cancer Genomic Studies.
- Author
-
Ma JY, Ting S, Tam B, Pham T, Reich M, Mesirov J, Tamayo P, and Kim W
- Abstract
Cancer genome data has been growing in both size and complexity, primarily driven by advances in next-generation sequencing technologies, such as Pan-cancer data from TCGA, ICGC, and single-cell sequencing. Yet, discerning the functional role of individual genomic lesions remains a substantial challenge due to the complexity and scale of the data. Previously, we introduced REVEALER, which identifies groups of genomic alterations that significantly associate with target functional profiles or phenotypes, such as pathway activation, gene dependency, or drug response. In this paper, we present a new mathematical formulation of the algorithm. This version (REVEALER 2.0) is considerably more powerful than the original, allowing for rapid processing and analysis of much larger datasets and facilitating higher-resolution discoveries at the level of individual alleles. REVEALER 2.0 employs the Conditional Information Coefficient (CIC) to pinpoint features that are either complementary or mutually exclusive but still correlate with the target functional profile. The aggregation of these features provides a better explanation for the target functional profile than any single alteration on its own. This is indicative of scenarios where several activating genomic lesions can initiate or stimulate a key pathway or process. We replaced the initial three-dimensional kernel estimation with multiple precomputed one-dimensional kernel estimations, resulting in an approximate 150x increase in speed and efficiency. This improvement, combined with its efficient execution, makes REVEALER 2.0 suitable for much larger datasets and a more extensive range of genomic challenges.
- Published
- 2023
- Full Text
- View/download PDF
18. TORC1/2 kinase inhibition depletes glutathione and synergizes with carboplatin to suppress the growth of MYC-driven medulloblastoma.
- Author
-
Maynard RE, Poore B, Hanaford AR, Pham K, James M, Alt J, Park Y, Slusher BS, Tamayo P, Mesirov J, Archer TC, Pomeroy SL, Eberhart CG, and Raabe EH
- Subjects
- Animals, Antineoplastic Agents pharmacology, Cerebellar Neoplasms drug therapy, Cerebellar Neoplasms enzymology, Cerebellar Neoplasms metabolism, Female, Humans, Medulloblastoma drug therapy, Medulloblastoma enzymology, Medulloblastoma metabolism, Mice, Protein Kinase Inhibitors pharmacology, Xenograft Model Antitumor Assays, Antineoplastic Agents therapeutic use, Carboplatin therapeutic use, Cell Proliferation physiology, Cerebellar Neoplasms pathology, Glutathione metabolism, Mechanistic Target of Rapamycin Complex 1 antagonists & inhibitors, Mechanistic Target of Rapamycin Complex 2 antagonists & inhibitors, Medulloblastoma pathology, Protein Kinase Inhibitors therapeutic use, Proto-Oncogene Proteins c-myc physiology
- Abstract
Medulloblastoma is the most common malignant pediatric brain tumor. Tumors having high levels of c-MYC have the worst clinical prognosis, with only a minority of patients surviving. To address this unmet clinical need, we generated a human neural stem cell model of medulloblastoma that recapitulated the most aggressive subtype phenotypically and by mRNA expression profiling. An in silico analysis of these cells identified mTOR inhibitors as potential therapeutic agents. We hypothesized that the orally bioavailable TORC1/2 kinase inhibitor TAK228 would have activity against MYC-driven medulloblastoma. TAK228 inhibited mTORC1/2, decreased cell growth and caused apoptosis in high-MYC medulloblastoma cell lines. Comprehensive metabolic profiling of medulloblastoma orthotopic xenografts showed upregulation of glutathione compared to matched normal brain. TAK228 suppressed glutathione production. Because glutathione is required to detoxify platinum-containing chemotherapy, we hypothesized that TAK228 would cooperate with carboplatin in medulloblastoma. TAK228 synergized with carboplatin to inhibit cell growth and induce apoptosis and extended survival in orthotopic xenografts of high-MYC medulloblastoma. Brain-penetrant TORC1/2 inhibitors and carboplatin may be an effective combination therapy for high-risk medulloblastoma., (Copyright © 2021 Elsevier B.V. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
19. GeNets: a unified web platform for network-based genomic analyses.
- Author
-
Li T, Kim A, Rosenbluh J, Horn H, Greenfeld L, An D, Zimmer A, Liberzon A, Bistline J, Natoli T, Li Y, Tsherniak A, Narayan R, Subramanian A, Liefeld T, Wong B, Thompson D, Calvo S, Carr S, Boehm J, Jaffe J, Mesirov J, Hacohen N, Regev A, and Lage K
- Subjects
- DNA genetics, Databases, Nucleic Acid, Nucleic Acid Amplification Techniques, RNA genetics, Software, Genomics methods, Internet, Machine Learning
- Abstract
Functional genomics networks are widely used to identify unexpected pathway relationships in large genomic datasets. However, it is challenging to compare the signal-to-noise ratios of different networks and to identify the optimal network with which to interpret a particular genetic dataset. We present GeNets, a platform in which users can train a machine-learning model (Quack) to carry out these comparisons and execute, store, and share analyses of genetic and RNA-sequencing datasets.
- Published
- 2018
- Full Text
- View/download PDF
20. Exome Sequencing of African-American Prostate Cancer Reveals Loss-of-Function ERF Mutations.
- Author
-
Huang FW, Mosquera JM, Garofalo A, Oh C, Baco M, Amin-Mansour A, Rabasha B, Bahl S, Mullane SA, Robinson BD, Aldubayan S, Khani F, Karir B, Kim E, Chimene-Weiss J, Hofree M, Romanel A, Osborne JR, Kim JW, Azabdaftari G, Woloszynska-Read A, Sfanos K, De Marzo AM, Demichelis F, Gabriel S, Van Allen EM, Mesirov J, Tamayo P, Rubin MA, Powell IJ, and Garraway LA
- Subjects
- Black or African American genetics, Animals, Cell Line, Tumor, Class I Phosphatidylinositol 3-Kinases genetics, Exome, Humans, Male, Mice, Mutation, PTEN Phosphohydrolase genetics, Prostatic Neoplasms pathology, Exome Sequencing, Prostatic Neoplasms genetics, Repressor Proteins genetics
- Abstract
African-American men have the highest incidence of and mortality from prostate cancer. Whether a biological basis exists for this disparity remains unclear. Exome sequencing ( n = 102) and targeted validation ( n = 90) of localized primary hormone-naïve prostate cancer in African-American men identified several gene mutations not previously observed in this context, including recurrent loss-of-function mutations in ERF , an ETS transcriptional repressor, in 5% of cases. Analysis of existing prostate cancer cohorts revealed ERF deletions in 3% of primary prostate cancers and mutations or deletions in ERF in 3% to 5% of lethal castration-resistant prostate cancers. Knockdown of ERF confers increased anchorage-independent growth and generates a gene expression signature associated with oncogenic ETS activation and androgen signaling. Together, these results suggest that ERF is a prostate cancer tumor-suppressor gene. More generally, our findings support the application of systematic cancer genomic characterization in settings of broader ancestral diversity to enhance discovery and, eventually, therapeutic applications. Significance: Systematic genomic sequencing of prostate cancer in African-American men revealed new insights into prostate cancer, including the identification of ERF as a prostate cancer gene; somatic copy-number alteration differences; and uncommon PIK3CA and PTEN alterations. This study highlights the importance of inclusion of underrepresented minorities in cancer sequencing studies. Cancer Discov; 7(9); 973-83. ©2017 AACR. This article is highlighted in the In This Issue feature, p. 920 ., (©2017 American Association for Cancer Research.)
- Published
- 2017
- Full Text
- View/download PDF
21. A multi-tool recipe to identify regions of protein-DNA binding and their influence on associated gene expression.
- Author
-
Carlin D, Kosnicki K, Garamszegi S, Ideker T, Thorvaldsdóttir H, Reich M, and Mesirov J
- Abstract
One commonly performed bioinformatics task is to infer functional regulation of transcription factors by observing differential expression under a knockout, and integrating DNA binding information of that transcription factor. However, until now, this this task has required dedicated bioinformatics support to perform the necessary data integration. GenomeSpace provides a protocol, or "recipe", and a user interface with inter-operating software tools to identifying protein occupancies along the genome from a ChIP-seq experiment and associated differentially regulated genes from an RNA-Seq experiment. By integrating RNA-Seq and ChIP-seq analyses, a user is easily able to associate differing expression phenotypes with changing epigenetic landscapes., Competing Interests: Competing interests: No competing interests were disclosed.
- Published
- 2017
- Full Text
- View/download PDF
22. Joint modeling and registration of cell populations in cohorts of high-dimensional flow cytometric data.
- Author
-
Pyne S, Lee SX, Wang K, Irish J, Tamayo P, Nazaire MD, Duong T, Ng SK, Hafler D, Levy R, Nolan GP, Mesirov J, and McLachlan GJ
- Subjects
- Algorithms, Cluster Analysis, Computer Simulation, Humans, Computational Biology methods, Flow Cytometry, Software
- Abstract
In biomedical applications, an experimenter encounters different potential sources of variation in data such as individual samples, multiple experimental conditions, and multivariate responses of a panel of markers such as from a signaling network. In multiparametric cytometry, which is often used for analyzing patient samples, such issues are critical. While computational methods can identify cell populations in individual samples, without the ability to automatically match them across samples, it is difficult to compare and characterize the populations in typical experiments, such as those responding to various stimulations or distinctive of particular patients or time-points, especially when there are many samples. Joint Clustering and Matching (JCM) is a multi-level framework for simultaneous modeling and registration of populations across a cohort. JCM models every population with a robust multivariate probability distribution. Simultaneously, JCM fits a random-effects model to construct an overall batch template--used for registering populations across samples, and classifying new samples. By tackling systems-level variation, JCM supports practical biomedical applications involving large cohorts. Software for fitting the JCM models have been implemented in an R package EMMIX-JCM, available from http://www.maths.uq.edu.au/~gjm/mix_soft/EMMIX-JCM/.
- Published
- 2014
- Full Text
- View/download PDF
23. Reproducibility: In praise of open research measures.
- Author
-
Kolker E, Altintas I, Bourne P, Faris J, Fox G, Frishman D, Geraci C, Hancock W, Lin B, Lancet D, Lisitsa A, Knight R, Martens L, Mesirov J, Özdemir V, Schultes E, Smith T, Snyder M, Srivastava S, Toppo S, and Wilmes P
- Subjects
- Reproducibility of Results, Access to Information, Biological Science Disciplines standards, Periodicals as Topic standards, Research standards
- Published
- 2013
- Full Text
- View/download PDF
24. An oncogenic KRAS2 expression signature identified by cross-species gene-expression analysis.
- Author
-
Sweet-Cordero A, Mukherjee S, Subramanian A, You H, Roix JJ, Ladd-Acosta C, Mesirov J, Golub TR, and Jacks T
- Subjects
- Adenocarcinoma etiology, Animals, Gene Expression Profiling, Lung Neoplasms etiology, Mice, Proto-Oncogene Proteins metabolism, Proto-Oncogene Proteins p21(ras), Reverse Transcriptase Polymerase Chain Reaction, Species Specificity, ras Proteins metabolism, Adenocarcinoma metabolism, Gene Expression physiology, Lung Neoplasms metabolism, Proto-Oncogene Proteins genetics, ras Proteins genetics
- Abstract
Using advanced gene targeting methods, generating mouse models of cancer that accurately reproduce the genetic alterations present in human tumors is now relatively straightforward. The challenge is to determine to what extent such models faithfully mimic human disease with respect to the underlying molecular mechanisms that accompany tumor progression. Here we describe a method for comparing mouse models of cancer with human tumors using gene-expression profiling. We applied this method to the analysis of a model of Kras2-mediated lung cancer and found a good relationship to human lung adenocarcinoma, thereby validating the model. Furthermore, we found that whereas a gene-expression signature of KRAS2 activation was not identifiable when analyzing human tumors with known KRAS2 mutation status alone, integrating mouse and human data uncovered a gene-expression signature of KRAS2 mutation in human lung cancer. We confirmed the importance of this signature by gene-expression analysis of short hairpin RNA-mediated inhibition of oncogenic Kras2. These experiments identified both a pattern of gene expression indicative of KRAS2 mutation and potential effectors of oncogenic KRAS2 activity in human cancer. This approach provides a strategy for using genomic analysis of animal models to probe human disease.
- Published
- 2005
- Full Text
- View/download PDF
25. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype.
- Author
-
Jaillon O, Aury JM, Brunet F, Petit JL, Stange-Thomann N, Mauceli E, Bouneau L, Fischer C, Ozouf-Costaz C, Bernot A, Nicaud S, Jaffe D, Fisher S, Lutfalla G, Dossat C, Segurens B, Dasilva C, Salanoubat M, Levy M, Boudet N, Castellano S, Anthouard V, Jubin C, Castelli V, Katinka M, Vacherie B, Biémont C, Skalli Z, Cattolico L, Poulain J, De Berardinis V, Cruaud C, Duprat S, Brottier P, Coutanceau JP, Gouzy J, Parra G, Lardier G, Chapple C, McKernan KJ, McEwan P, Bosak S, Kellis M, Volff JN, Guigó R, Zody MC, Mesirov J, Lindblad-Toh K, Birren B, Nusbaum C, Kahn D, Robinson-Rechavi M, Laudet V, Schachter V, Quétier F, Saurin W, Scarpelli C, Wincker P, Lander ES, Weissenbach J, and Roest Crollius H
- Subjects
- Animals, Base Composition, Chromosomes, Human genetics, Conserved Sequence genetics, Evolution, Molecular, Genes genetics, Humans, Karyotyping, Mammals genetics, Models, Genetic, Molecular Sequence Data, Physical Chromosome Mapping, Proteome, Sequence Analysis, DNA, Synteny genetics, Urochordata genetics, Chromosomes genetics, Fishes genetics, Gene Duplication, Genome, Vertebrates genetics
- Abstract
Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests approximately 900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype.
- Published
- 2004
- Full Text
- View/download PDF
26. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.
- Author
-
Shipp MA, Ross KN, Tamayo P, Weng AP, Kutok JL, Aguiar RC, Gaasenbeek M, Angelo M, Reich M, Pinkus GS, Ray TS, Koval MA, Last KW, Norton A, Lister TA, Mesirov J, Neuberg DS, Lander ES, Aster JC, and Golub TR
- Subjects
- Antineoplastic Combined Chemotherapy Protocols, Cyclophosphamide, Doxorubicin, Humans, Lymphoma, B-Cell drug therapy, Lymphoma, B-Cell mortality, Lymphoma, Large B-Cell, Diffuse drug therapy, Lymphoma, Large B-Cell, Diffuse mortality, Oligonucleotide Array Sequence Analysis, Predictive Value of Tests, Prednisone, Treatment Outcome, Vincristine, Artificial Intelligence, Gene Expression Profiling methods, Lymphoma, B-Cell diagnosis, Lymphoma, Large B-Cell, Diffuse diagnosis
- Abstract
Diffuse large B-cell lymphoma (DLBCL), the most common lymphoid malignancy in adults, is curable in less than 50% of patients. Prognostic models based on pre-treatment characteristics, such as the International Prognostic Index (IPI), are currently used to predict outcome in DLBCL. However, clinical outcome models identify neither the molecular basis of clinical heterogeneity, nor specific therapeutic targets. We analyzed the expression of 6,817 genes in diagnostic tumor specimens from DLBCL patients who received cyclophosphamide, adriamycin, vincristine and prednisone (CHOP)-based chemotherapy, and applied a supervised learning prediction method to identify cured versus fatal or refractory disease. The algorithm classified two categories of patients with very different five-year overall survival rates (70% versus 12%). The model also effectively delineated patients within specific IPI risk categories who were likely to be cured or to die of their disease. Genes implicated in DLBCL outcome included some that regulate responses to B-cell-receptor signaling, critical serine/threonine phosphorylation pathways and apoptosis. Our data indicate that supervised learning classification techniques can predict outcome in DLBCL and identify rational targets for intervention.
- Published
- 2002
- Full Text
- View/download PDF
27. Multiclass cancer diagnosis using tumor gene expression signatures.
- Author
-
Ramaswamy S, Tamayo P, Rifkin R, Mukherjee S, Yeang CH, Angelo M, Ladd C, Reich M, Latulippe E, Mesirov JP, Poggio T, Gerald W, Loda M, Lander ES, and Golub TR
- Subjects
- Biomarkers, Tumor, Cluster Analysis, Humans, Multigene Family, Neoplasms genetics, Gene Expression Profiling, Neoplasms classification, Neoplasms diagnosis
- Abstract
The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.
- Published
- 2001
- Full Text
- View/download PDF
28. Chemosensitivity prediction by transcriptional profiling.
- Author
-
Staunton JE, Slonim DK, Coller HA, Tamayo P, Angelo MJ, Park J, Scherf U, Lee JK, Reinhold WO, Weinstein JN, Mesirov JP, Lander ES, and Golub TR
- Subjects
- Gene Expression Profiling, Humans, Neoplasms drug therapy, Oligonucleotide Array Sequence Analysis methods, Predictive Value of Tests, Tumor Cells, Cultured, Drug Resistance, Neoplasm genetics, Neoplasms genetics, Transcription, Genetic
- Abstract
In an effort to develop a genomics-based approach to the prediction of drug response, we have developed an algorithm for classification of cell line chemosensitivity based on gene expression profiles alone. Using oligonucleotide microarrays, the expression levels of 6,817 genes were measured in a panel of 60 human cancer cell lines (the NCI-60) for which the chemosensitivity profiles of thousands of chemical compounds have been determined. We sought to determine whether the gene expression signatures of untreated cells were sufficient for the prediction of chemosensitivity. Gene expression-based classifiers of sensitivity or resistance for 232 compounds were generated and then evaluated on independent sets of data. The classifiers were designed to be independent of the cells' tissue of origin. The accuracy of chemosensitivity prediction was considerably better than would be expected by chance. Eighty-eight of 232 expression-based classifiers performed accurately (with P < 0.05) on an independent test set, whereas only 12 of the 232 would be expected to do so by chance. These results suggest that at least for a subset of compounds genomic approaches to chemosensitivity prediction are feasible.
- Published
- 2001
- Full Text
- View/download PDF
29. Initial sequencing and analysis of the human genome.
- Author
-
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, and Szustakowki J
- Subjects
- Animals, Chromosome Mapping, Conserved Sequence, CpG Islands, DNA Transposable Elements, Databases, Factual, Drug Industry, Evolution, Molecular, Forecasting, GC Rich Sequence, Gene Duplication, Genes, Genetic Diseases, Inborn, Genetics, Medical, Humans, Mutation, Private Sector, Proteins genetics, Proteome, Public Sector, RNA genetics, Repetitive Sequences, Nucleic Acid, Species Specificity, Genome, Human, Human Genome Project, Sequence Analysis, DNA methods
- Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
- Published
- 2001
- Full Text
- View/download PDF
30. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.
- Author
-
Tamayo P, Slonim D, Mesirov J, Zhu Q, Kitareewan S, Dmitrovsky E, Lander ES, and Golub TR
- Subjects
- Animals, Cluster Analysis, HL-60 Cells, Humans, Jurkat Cells, Saccharomyces cerevisiae, Software, Gene Expression Regulation, Hematopoiesis genetics
- Abstract
Array technologies have made it straightforward to monitor simultaneously the expression pattern of thousands of genes. The challenge now is to interpret such massive data sets. The first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of self-organizing maps, a type of mathematical cluster analysis that is particularly well suited for recognizing and classifying features in complex, multidimensional data. The method has been implemented in a publicly available computer package, GENECLUSTER, that performs the analytical calculations and provides easy data visualization. To illustrate the value of such analysis, the approach is applied to hematopoietic differentiation in four well studied models (HL-60, U937, Jurkat, and NB4 cells). Expression patterns of some 6,000 human genes were assayed, and an online database was created. GENECLUSTER was used to organize the genes into biologically relevant clusters that suggest novel hypotheses about hematopoietic differentiation-for example, highlighting certain genes and pathways involved in "differentiation therapy" used in the treatment of acute promyelocytic leukemia.
- Published
- 1999
- Full Text
- View/download PDF
31. Hybrid system for protein secondary structure prediction.
- Author
-
Zhang X, Mesirov JP, and Waltz DL
- Subjects
- Amino Acid Sequence, Enzymes chemistry, Mathematics, Models, Statistical, Molecular Sequence Data, Software, Protein Conformation, Proteins chemistry
- Abstract
We have developed a hybrid system to predict the secondary structures (alpha-helix, beta-sheet and coil) of proteins and achieved 66.4% accuracy, with correlation coefficients of C(coil) = 0.429, C alpha = 0.470 and C beta = 0.387. This system contains three subsystems ("experts"): a neural network module, a statistical module and a memory-based reasoning module. First, the three experts independently learn the mapping between amino acid sequences and secondary structures from the known protein structures, then a Combiner learns to combine automatically the outputs of the experts to make final predictions. The hybrid system was tested with 107 protein structures through k-way cross-validation. Its performance was better than each expert and all previously reported methods with greater than 0.99 statistical significance. It was observed that for 20% of the residues, all three experts produced the same but wrong predictions. This may suggest an upper bound on the accuracy of secondary structure predictions based on local information from the currently available protein structures, and indicate places where non-local interactions may play a dominant role in conformation. For 64% of the residues, at least two experts were the same and correct, which shows that the Combiner performed better than majority vote. For 77% of the residues, at least one expert was correct, thus there may still be room for improvement in this hybrid approach. Rigorous evaluation procedures were used in testing the hybrid system, and statistical significance measures were developed in analyzing the differences among different methods. When measured in terms of the number of secondary structures (rather than the number of residues) that were predicted correctly, the prediction produced by the hybrid system was also better than those of individual experts.
- Published
- 1992
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.