22 results on '"Anastasiya Belyaeva"'
Search Results
2. Multi-domain translation between single-cell imaging and sequencing data using autoencoders
- Author
-
Karren Dai Yang, Anastasiya Belyaeva, Saradha Venkatachalapathy, Karthik Damodaran, Abigail Katcoff, Adityanarayanan Radhakrishnan, G. V. Shivashankar, and Caroline Uhler
- Subjects
Science - Abstract
Integration of single cell data modalities increases the richness of information about the heterogeneity of cell states, but integration of imaging and transcriptomics is an open challenge. Here the authors use autoencoders to learn a probabilistic coupling and map these modalities to a shared latent space.
- Published
- 2021
- Full Text
- View/download PDF
3. Multimodal LLMs for Health Grounded in Individual-Specific Data.
- Author
-
Anastasiya Belyaeva, Justin Cosentino, Farhad Hormozdiari, Krish Eswaran, Shravya Shetty, Greg Corrado, Andrew Carroll, Cory Y. McLean, and Nicholas A. Furlotte
- Published
- 2023
- Full Text
- View/download PDF
4. Towards a Personal Health Large Language Model.
- Author
-
Justin Cosentino, Anastasiya Belyaeva, Xin Liu 0034, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider, Robby Bryant, Ryan G. Gomes, Allen Jiang, Roy Lee, Yun Liu 0013, Javier Perez Matos, Jameson K. Rogers, Cathy Speed, Shyam Tailor, Megan Walker, Jeffrey Yu, Tim Althoff, Conor Heneghan, John Hernandez, Mark Malhotra, Leor Stern, Yossi Matias, Gregory S. Corrado, Shwetak N. Patel, Shravya Shetty, Jiening Zhan, Shruthi Prabhakara, Daniel McDuff, and Cory Y. McLean
- Published
- 2024
- Full Text
- View/download PDF
5. Identifying 3D Genome Organization in Diploid Organisms via Euclidean Distance Geometry.
- Author
-
Anastasiya Belyaeva, Kaie Kubjas, Lawrence J. Sun, and Caroline Uhler
- Published
- 2022
- Full Text
- View/download PDF
6. Anchored Causal Inference in the Presence of Measurement Error.
- Author
-
Basil Saeed, Anastasiya Belyaeva, Yuhao Wang 0005, and Caroline Uhler
- Published
- 2020
7. Knowledge distillation for fast and accurate DNA sequence correction.
- Author
-
Anastasiya Belyaeva, Joel Shor, Daniel E. Cook, Kishwar Shafin, Daniel Liu, Armin Töpfer, Aaron M. Wenger, William J. Rowell, Howard Yang, Alexey Kolesnikov, Cory Y. McLean, Maria Nattestad, Andrew Carroll, and Pi-Chuan Chang
- Published
- 2022
- Full Text
- View/download PDF
8. DCI: learning causal differences between gene regulatory networks.
- Author
-
Anastasiya Belyaeva, Chandler Squires, and Caroline Uhler
- Published
- 2021
- Full Text
- View/download PDF
9. A draft human pangenome reference
- Author
-
Wen-Wei Liao, Mobin Asri, Jana Ebler, Daniel Doerr, Marina Haukness, Glenn Hickey, Shuangjia Lu, Julian K. Lucas, Jean Monlong, Haley J. Abel, Silvia Buonaiuto, Xian H. Chang, Haoyu Cheng, Justin Chu, Vincenza Colonna, Jordan M. Eizenga, Xiaowen Feng, Christian Fischer, Robert S. Fulton, Shilpa Garg, Cristian Groza, Andrea Guarracino, William T. Harvey, Simon Heumos, Kerstin Howe, Miten Jain, Tsung-Yu Lu, Charles Markello, Fergal J. Martin, Matthew W. Mitchell, Katherine M. Munson, Moses Njagi Mwaniki, Adam M. Novak, Hugh E. Olsen, Trevor Pesout, David Porubsky, Pjotr Prins, Jonas A. Sibbesen, Jouni Sirén, Chad Tomlinson, Flavia Villani, Mitchell R. Vollger, Lucinda L. Antonacci-Fulton, Gunjan Baid, Carl A. Baker, Anastasiya Belyaeva, Konstantinos Billis, Andrew Carroll, Pi-Chuan Chang, Sarah Cody, Daniel E. Cook, Robert M. Cook-Deegan, Omar E. Cornejo, Mark Diekhans, Peter Ebert, Susan Fairley, Olivier Fedrigo, Adam L. Felsenfeld, Giulio Formenti, Adam Frankish, Yan Gao, Nanibaa’ A. Garrison, Carlos Garcia Giron, Richard E. Green, Leanne Haggerty, Kendra Hoekzema, Thibaut Hourlier, Hanlee P. Ji, Eimear E. Kenny, Barbara A. Koenig, Alexey Kolesnikov, Jan O. Korbel, Jennifer Kordosky, Sergey Koren, HoJoon Lee, Alexandra P. Lewis, Hugo Magalhães, Santiago Marco-Sola, Pierre Marijon, Ann McCartney, Jennifer McDaniel, Jacquelyn Mountcastle, Maria Nattestad, Sergey Nurk, Nathan D. Olson, Alice B. Popejoy, Daniela Puiu, Mikko Rautiainen, Allison A. Regier, Arang Rhie, Samuel Sacco, Ashley D. Sanders, Valerie A. Schneider, Baergen I. Schultz, Kishwar Shafin, Michael W. Smith, Heidi J. Sofia, Ahmad N. Abou Tayoun, Françoise Thibaud-Nissen, Francesca Floriana Tricomi, Justin Wagner, Brian Walenz, Jonathan M. D. Wood, Aleksey V. Zimin, Guillaume Bourque, Mark J. P. Chaisson, Paul Flicek, Adam M. Phillippy, Justin M. Zook, Evan E. Eichler, David Haussler, Ting Wang, Erich D. Jarvis, Karen H. Miga, Erik Garrison, Tobias Marschall, Ira M. Hall, Heng Li, and Benedict Paten
- Subjects
Cancer Research ,Multidisciplinary - Abstract
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
- Published
- 2023
- Full Text
- View/download PDF
10. Direct Estimation of Differences in Causal Graphs.
- Author
-
Yuhao Wang 0005, Chandler Squires, Anastasiya Belyaeva, and Caroline Uhler
- Published
- 2018
11. Recombination between heterologous human acrocentric chromosomes
- Author
-
Barcelona Supercomputing Center, Human Pangenome Reference Consortium: "Haley J. Abel, Lucinda L. Antonacci-Fulton, Mobin Asri, Gunjan Baid, Carl A. Baker, Anastasiya Belyaeva, Konstantinos Billis, Guillaume Bourque, Silvia Buonaiuto, Andrew Carroll, Mark J. P. Chaisson, Pi-Chuan Chang, Xian H. Chang, Haoyu Cheng, Justin Chu, Sarah Cody, Vincenza Colonna, Daniel E. Cook, Robert M. Cook-Deegan, Omar E. Cornejo, Mark Diekhans, Daniel Doerr, Peter Ebert, Jana Ebler, Evan E. Eichler, Jordan M. Eizenga, Susan Fairley, Olivier Fedrigo, Adam L. Felsenfeld, Xiaowen Feng, Christian Fischer, Paul Flicek, Giulio Formenti, Adam Frankish, Robert S. Fulton, Yan Gao, Shilpa Garg, Erik Garrison, Nanibaa’ A. Garrison, Carlos Garcia Giron, Richard E. Green, Cristian Groza, Andrea Guarracino, Leanne Haggerty, Ira Hall, William T. Harvey, Marina Haukness, David Haussler, Simon Heumos, Glenn Hickey, Kendra Hoekzema, Thibaut Hourlier, Kerstin Howe, Miten Jain, Erich D. Jarvis, Hanlee P. Ji, Eimear E. Kenny, Barbara A. Koenig, Alexey Kolesnikov, Jan O. Korbel, Jennifer Kordosky, Sergey Koren, HoJoon Lee, Alexandra P. Lewis, Heng Li, Wen-Wei Liao, Shuangjia Lu, Tsung-Yu Lu, Julian K. Lucas, Hugo Magalhães, Santiago Marco-Sola, Pierre Marijon, Charles Markello, Tobias Marschall, Fergal J. Martin, Ann McCartney, Jennifer McDaniel, Karen H. Miga, Matthew W. Mitchell, Jean Monlong, Jacquelyn Mountcastle, Katherine M. Munson, Moses Njagi Mwaniki, Maria Nattestad, Adam M. Novak, Sergey Nurk, Hugh E. Olsen, Nathan D. Olson, Benedict Paten, Trevor Pesout, Adam M. Phillippy, Alice B. Popejoy, David Porubsky, Pjotr Prins, Daniela Puiu, Mikko Rautiainen, Allison A. Regier, Arang Rhie, Samuel Sacco, Ashley D. Sanders, Valerie A. Schneider, Baergen I. Schultz, Kishwar Shafin, Jonas A. Sibbesen, Jouni Sirén, Michael W. Smith, Heidi J. Sofia, Ahmad N. Abou Tayoun, Françoise Thibaud-Nissen, Chad Tomlinson, Francesca Floriana Tricomi, Flavia Villani, Mitchell R. Vollger, Justin Wagner, Brian Walenz, Ting Wang, Jonathan M. D. Wood, Aleksey, Guarracino, Andrea, Buonaiuto, Silvia, Gomes de Lima, Leonardo, Potapova, Tamara, Rhie, Arang, Marco, Santiago, Barcelona Supercomputing Center, Human Pangenome Reference Consortium: "Haley J. Abel, Lucinda L. Antonacci-Fulton, Mobin Asri, Gunjan Baid, Carl A. Baker, Anastasiya Belyaeva, Konstantinos Billis, Guillaume Bourque, Silvia Buonaiuto, Andrew Carroll, Mark J. P. Chaisson, Pi-Chuan Chang, Xian H. Chang, Haoyu Cheng, Justin Chu, Sarah Cody, Vincenza Colonna, Daniel E. Cook, Robert M. Cook-Deegan, Omar E. Cornejo, Mark Diekhans, Daniel Doerr, Peter Ebert, Jana Ebler, Evan E. Eichler, Jordan M. Eizenga, Susan Fairley, Olivier Fedrigo, Adam L. Felsenfeld, Xiaowen Feng, Christian Fischer, Paul Flicek, Giulio Formenti, Adam Frankish, Robert S. Fulton, Yan Gao, Shilpa Garg, Erik Garrison, Nanibaa’ A. Garrison, Carlos Garcia Giron, Richard E. Green, Cristian Groza, Andrea Guarracino, Leanne Haggerty, Ira Hall, William T. Harvey, Marina Haukness, David Haussler, Simon Heumos, Glenn Hickey, Kendra Hoekzema, Thibaut Hourlier, Kerstin Howe, Miten Jain, Erich D. Jarvis, Hanlee P. Ji, Eimear E. Kenny, Barbara A. Koenig, Alexey Kolesnikov, Jan O. Korbel, Jennifer Kordosky, Sergey Koren, HoJoon Lee, Alexandra P. Lewis, Heng Li, Wen-Wei Liao, Shuangjia Lu, Tsung-Yu Lu, Julian K. Lucas, Hugo Magalhães, Santiago Marco-Sola, Pierre Marijon, Charles Markello, Tobias Marschall, Fergal J. Martin, Ann McCartney, Jennifer McDaniel, Karen H. Miga, Matthew W. Mitchell, Jean Monlong, Jacquelyn Mountcastle, Katherine M. Munson, Moses Njagi Mwaniki, Maria Nattestad, Adam M. Novak, Sergey Nurk, Hugh E. Olsen, Nathan D. Olson, Benedict Paten, Trevor Pesout, Adam M. Phillippy, Alice B. Popejoy, David Porubsky, Pjotr Prins, Daniela Puiu, Mikko Rautiainen, Allison A. Regier, Arang Rhie, Samuel Sacco, Ashley D. Sanders, Valerie A. Schneider, Baergen I. Schultz, Kishwar Shafin, Jonas A. Sibbesen, Jouni Sirén, Michael W. Smith, Heidi J. Sofia, Ahmad N. Abou Tayoun, Françoise Thibaud-Nissen, Chad Tomlinson, Francesca Floriana Tricomi, Flavia Villani, Mitchell R. Vollger, Justin Wagner, Brian Walenz, Ting Wang, Jonathan M. D. Wood, Aleksey, Guarracino, Andrea, Buonaiuto, Silvia, Gomes de Lima, Leonardo, Potapova, Tamara, Rhie, Arang, and Marco, Santiago
- Abstract
The short arms of the human acrocentric chromosomes 13, 14, 15, 21 and 22 (SAACs) share large homologous regions, including ribosomal DNA repeats and extended segmental duplications1,2. Although the resolution of these regions in the first complete assembly of a human genome—the Telomere-to-Telomere Consortium’s CHM13 assembly (T2T-CHM13)—provided a model of their homology3, it remained unclear whether these patterns were ancestral or maintained by ongoing recombination exchange. Here we show that acrocentric chromosomes contain pseudo-homologous regions (PHRs) indicative of recombination between non-homologous sequences. Utilizing an all-to-all comparison of the human pangenome from the Human Pangenome Reference Consortium4 (HPRC), we find that contigs from all of the SAACs form a community. A variation graph5 constructed from centromere-spanning acrocentric contigs indicates the presence of regions in which most contigs appear nearly identical between heterologous acrocentric chromosomes in T2T-CHM13. Except on chromosome 15, we observe faster decay of linkage disequilibrium in the pseudo-homologous regions than in the corresponding short and long arms, indicating higher rates of recombination6,7. The pseudo-homologous regions include sequences that have previously been shown to lie at the breakpoint of Robertsonian translocations8, and their arrangement is compatible with crossover in inverted duplications on chromosomes 13, 14 and 21. The ubiquity of signals of recombination between heterologous acrocentric chromosomes seen in the HPRC draft pangenome suggests that these shared sequences form the basis for recurrent Robertsonian translocations, providing sequence and population-based confirmation of hypotheses first developed from cytogenetic studies 50 years ago9., Our work depends on the HPRC draft human pangenome resource established in the accompanying Article4, and we thank the production and assembly groups for their efforts in establishing this resource. This work used the computational resources of the UTHSC Octopus cluster and NIH HPC Biowulf cluster. We acknowledge support in maintaining these systems that was critical to our analyses. The authors thank M. Miller for the development of a graphical synopsis of our study (Fig. 5); and R. Williams and N. Soranzo for support and guidance in the design and discussion of our work. This work was supported, in part, by National Institutes of Health/NIDA U01DA047638 (E.G.), National Institutes of Health/NIGMS R01GM123489 (E.G.), NSF PPoSS Award no. 2118709 (E.G. and C.F.), the Tennessee Governor’s Chairs programme (C.F. and E.G.), National Institutes of Health/NCI R01CA266339 (T.P., L.G.d.L. and J.L.G.), and the Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health (A.R., S.K. and A.M.P.). We acknowledge support from Human Technopole (A.G.), Consiglio Nazionale delle Ricerche, Italy (S.B. and V.C.), and Stowers Institute for Medical Research (T.P., L.G.d.L., B.R. and J.L.G.)., Peer Reviewed, "Article signat per 13 autors/es: Andrea Guarracino, Silvia Buonaiuto, Leonardo Gomes de Lima, Tamara Potapova, Arang Rhie, Sergey Koren, Boris Rubinstein, Christian Fischer, Human Pangenome Reference Consortium, Jennifer L. Gerton, Adam M. Phillippy, Vincenza Colonna & Erik Garrison " Human Pangenome Reference Consortium: "Haley J. Abel, Lucinda L. Antonacci-Fulton, Mobin Asri, Gunjan Baid, Carl A. Baker, Anastasiya Belyaeva, Konstantinos Billis, Guillaume Bourque, Silvia Buonaiuto, Andrew Carroll, Mark J. P. Chaisson, Pi-Chuan Chang, Xian H. Chang, Haoyu Cheng, Justin Chu, Sarah Cody, Vincenza Colonna, Daniel E. Cook, Robert M. Cook-Deegan, Omar E. Cornejo, Mark Diekhans, Daniel Doerr, Peter Ebert, Jana Ebler, Evan E. Eichler, Jordan M. Eizenga, Susan Fairley, Olivier Fedrigo, Adam L. Felsenfeld, Xiaowen Feng, Christian Fischer, Paul Flicek, Giulio Formenti, Adam Frankish, Robert S. Fulton, Yan Gao, Shilpa Garg, Erik Garrison, Nanibaa’ A. Garrison, Carlos Garcia Giron, Richard E. Green, Cristian Groza, Andrea Guarracino, Leanne Haggerty, Ira Hall, William T. Harvey, Marina Haukness, David Haussler, Simon Heumos, Glenn Hickey, Kendra Hoekzema, Thibaut Hourlier, Kerstin Howe, Miten Jain, Erich D. Jarvis, Hanlee P. Ji, Eimear E. Kenny, Barbara A. Koenig, Alexey Kolesnikov, Jan O. Korbel, Jennifer Kordosky, Sergey Koren, HoJoon Lee, Alexandra P. Lewis, Heng Li, Wen-Wei Liao, Shuangjia Lu, Tsung-Yu Lu, Julian K. Lucas, Hugo Magalhães, Santiago Marco-Sola, Pierre Marijon, Charles Markello, Tobias Marschall, Fergal J. Martin, Ann McCartney, Jennifer McDaniel, Karen H. Miga, Matthew W. Mitchell, Jean Monlong, Jacquelyn Mountcastle, Katherine M. Munson, Moses Njagi Mwaniki, Maria Nattestad, Adam M. Novak, Sergey Nurk, Hugh E. Olsen, Nathan D. Olson, Benedict Paten, Trevor Pesout, Adam M. Phillippy, Alice B. Popejoy, David Porubsky, Pjotr Prins, Daniela Puiu, Mikko Rautiainen, Allison A. Regier, Arang Rhie, Samuel Sacco, Ashley D. Sanders, Valerie A. Schneider, Baergen I. S, Postprint (published version)
- Published
- 2023
12. Best: A Tool for Characterizing Sequencing Errors
- Author
-
Daniel Liu, Anastasiya Belyaeva, Kishwar Shafin, Pi-Chuan Chang, Andrew Carroll, and Daniel E. Cook
- Abstract
SummaryPlatform-dependent sequencing errors must be understood to develop accurate sequencing technologies. We propose a new tool,best(Bam Error Stats Tool), for efficiently quantifying and summarizing error types in sequenced reads.bestingests reads aligned to a high-quality reference assembly and produces per-read metrics, summary statistics, and stratified metrics across genomic intervals. We show thatbestis 16 times faster than a prior method. In addition to being useful to support development that improves the accuracy of sequencing platforms, best can also be applied to evaluate and improve other experimental factors such as library preparation and error correction methods.Availability and implementationbestis an open-source command-line utility available on Github (github.com/google/best) under an MIT license.Contactdanielecook@google.com
- Published
- 2022
- Full Text
- View/download PDF
13. DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer
- Author
-
Gunjan Baid, Daniel E. Cook, Kishwar Shafin, Taedong Yun, Felipe Llinares-López, Quentin Berthet, Anastasiya Belyaeva, Armin Töpfer, Aaron M. Wenger, William J. Rowell, Howard Yang, Alexey Kolesnikov, Waleed Ammar, Jean-Philippe Vert, Ashish Vaswani, Cory Y. McLean, Maria Nattestad, Pi-Chuan Chang, and Andrew Carroll
- Subjects
Biomedical Engineering ,Molecular Medicine ,Bioengineering ,Applied Microbiology and Biotechnology ,Biotechnology - Abstract
Circular consensus sequencing with Pacific Biosciences (PacBio) technology generates long (10-25 kilobases), accurate 'HiFi' reads by combining serial observations of a DNA molecule into a consensus sequence. The standard approach to consensus generation, pbccs, uses a hidden Markov model. We introduce DeepConsensus, which uses an alignment-based loss to train a gap-aware transformer-encoder for sequence correction. Compared to pbccs, DeepConsensus reduces read errors by 42%. This increases the yield of PacBio HiFi reads at Q20 by 9%, at Q30 by 27% and at Q40 by 90%. With two SMRT Cells of HG003, reads from DeepConsensus improve hifiasm assembly contiguity ( NG50 4.9 megabases (Mb) to 17.2 Mb), increase gene completeness (94% to 97%), reduce the false gene duplication rate (1.1% to 0.5%), improve assembly base accuracy (Q43 to Q45) and reduce variant-calling errors by 24%. DeepConsensus models could be trained to the general problem of analyzing the alignment of other types of sequences, such as unique molecular identifiers or genome assemblies.
- Published
- 2022
- Full Text
- View/download PDF
14. Multi-domain translation between single-cell imaging and sequencing data using autoencoders
- Author
-
Karthik Damodaran, Saradha Venkatachalapathy, Abigail Katcoff, Karren Dai Yang, Adityanarayanan Radhakrishnan, G. V. Shivashankar, Anastasiya Belyaeva, and Caroline Uhler
- Subjects
CD4-Positive T-Lymphocytes ,0301 basic medicine ,genetic structures ,Computer science ,Science ,Sequencing data ,Population ,General Physics and Astronomy ,Translation (geometry) ,Machine learning ,computer.software_genre ,Article ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,0302 clinical medicine ,otorhinolaryngologic diseases ,Humans ,education ,Cell Nucleus ,Principal Component Analysis ,education.field_of_study ,Multidisciplinary ,Modalities ,Sequence Analysis, RNA ,business.industry ,Gene Expression Profiling ,Probabilistic logic ,Reproducibility of Results ,General Chemistry ,Chromatin ,Multi domain ,030104 developmental biology ,Gene Expression Regulation ,ROC Curve ,Cellular heterogeneity ,Data integration ,Artificial intelligence ,Single-Cell Analysis ,Systems biology ,business ,computer ,Algorithms ,030217 neurology & neurosurgery ,psychological phenomena and processes - Abstract
The development of single-cell methods for capturing different data modalities including imaging and sequencing has revolutionized our ability to identify heterogeneous cell states. Different data modalities provide different perspectives on a population of cells, and their integration is critical for studying cellular heterogeneity and its function. While various methods have been proposed to integrate different sequencing data modalities, coupling imaging and sequencing has been an open challenge. We here present an approach for integrating vastly different modalities by learning a probabilistic coupling between the different data modalities using autoencoders to map to a shared latent space. We validate this approach by integrating single-cell RNA-seq and chromatin images to identify distinct subpopulations of human naive CD4+ T-cells that are poised for activation. Collectively, our approach provides a framework to integrate and translate between data modalities that cannot yet be measured within the same cell for diverse applications in biomedical discovery., Nature Communications, 12 (1), ISSN:2041-1723
- Published
- 2021
15. Network Analysis Identifies Regulatory Hotspots in Regions of Chromosome Interactions.
- Author
-
Anastasiya Belyaeva, Saradha Venkatachalapathy, Mallika Nagarajan, G. V. Shivashankar, and Caroline Uhler
- Published
- 2017
- Full Text
- View/download PDF
16. IBCL-365 Variety of Mutations In Russian B-Cell Lymphoma Patients
- Author
-
Bella Biderman, Natalya Severina, Darya Koroleva, Nelli Gabeeva, Anastasiya Belyaeva, Svetlana Tatarnikova, Fatima Babaeva, Ekaterina Nesterova, Yana Mangasarova, Oleg Margolin, Sergey Kravchenko, Tatyana Obukhova, Hunan Julhakyan, Evgeny Zvonkov, and Andrey Sudarikov
- Subjects
Cancer Research ,Oncology ,Hematology - Published
- 2022
- Full Text
- View/download PDF
17. Causal network models of SARS-CoV-2 expression and aging to identify candidates for drug repurposing
- Author
-
Caroline Uhler, G. V. Shivashankar, Karren Dai Yang, Adityanarayanan Radhakrishnan, Louis Cammarata, Chandler Squires, and Anastasiya Belyaeva
- Subjects
0301 basic medicine ,Proteomics ,Aging ,Computer science ,High-throughput screening ,Molecular Networks (q-bio.MN) ,Science ,Gene regulatory network ,Druggability ,General Physics and Astronomy ,Gene Expression ,Context (language use) ,Computational biology ,Virtual drug screening ,Antiviral Agents ,General Biochemistry, Genetics and Molecular Biology ,Article ,Gene regulatory networks ,03 medical and health sciences ,0302 clinical medicine ,Machine learning ,Drug Discovery ,Humans ,Quantitative Biology - Molecular Networks ,Multidisciplinary ,Drug discovery ,SARS-CoV-2 ,Drug Repositioning ,COVID-19 ,General Chemistry ,COVID-19 Drug Treatment ,Clinical trial ,Drug repositioning ,030104 developmental biology ,A549 Cells ,FOS: Biological sciences ,Data integration ,Angiotensin-Converting Enzyme 2 ,Transcriptome ,030217 neurology & neurosurgery ,Algorithms - Abstract
Given the severity of the SARS-CoV-2 pandemic, a major challenge is to rapidly repurpose existing approved drugs for clinical interventions. While a number of data-driven and experimental approaches have been suggested in the context of drug repurposing, a platform that systematically integrates available transcriptomic, proteomic and structural data is missing. More importantly, given that SARS-CoV-2 pathogenicity is highly age-dependent, it is critical to integrate aging signatures into drug discovery platforms. We here take advantage of large-scale transcriptional drug screens combined with RNA-seq data of the lung epithelium with SARS-CoV-2 infection as well as the aging lung. To identify robust druggable protein targets, we propose a principled causal framework that makes use of multiple data modalities. Our analysis highlights the importance of serine/threonine and tyrosine kinases as potential targets that intersect the SARS-CoV-2 and aging pathways. By integrating transcriptomic, proteomic and structural data that is available for many diseases, our drug discovery platform is broadly applicable. Rigorous in vitro experiments as well as clinical trials are needed to validate the identified candidate drugs., Nature Communications, 12 (1), ISSN:2041-1723
- Published
- 2021
18. DCI: Learning Causal Differences between Gene Regulatory Networks
- Author
-
Caroline Uhler, Chandler Squires, and Anastasiya Belyaeva
- Subjects
Statistics and Probability ,Computer science ,Computation ,Stability (learning theory) ,Gene regulatory network ,Machine learning ,computer.software_genre ,Biochemistry ,01 natural sciences ,010104 statistics & probability ,03 medical and health sciences ,0101 mathematics ,Molecular Biology ,Selection (genetic algorithm) ,030304 developmental biology ,Regulation of gene expression ,0303 health sciences ,Causal graph ,business.industry ,Applications Notes ,Expression (mathematics) ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,Sample size determination ,Causal inference ,Artificial intelligence ,business ,computer - Abstract
Summary Designing interventions to control gene regulation necessitates modeling a gene regulatory network by a causal graph. Currently, large-scale gene expression datasets from different conditions, cell types, disease states, and developmental time points are being collected. However, application of classical causal inference algorithms to infer gene regulatory networks based on such data is still challenging, requiring high sample sizes and computational resources. Here, we describe an algorithm that efficiently learns the differences in gene regulatory mechanisms between different conditions. Our difference causal inference (DCI) algorithm infers changes (i.e. edges that appeared, disappeared, or changed weight) between two causal graphs given gene expression data from the two conditions. This algorithm is efficient in its use of samples and computation since it infers the differences between causal graphs directly without estimating each possibly large causal graph separately. We provide a user-friendly Python implementation of DCI and also enable the user to learn the most robust difference causal graph across different tuning parameters via stability selection. Finally, we show how to apply DCI to single-cell RNA-seq data from different conditions and cell states, and we also validate our algorithm by predicting the effects of interventions. Availability and implementation Python package freely available at http://uhlerlab.github.io/causaldag/dci. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2020
19. Image schemas in contrastive linguistics: the case of concept EDUCATION in English, French, Ukrainian and Russian languages
- Author
-
Anastasiya Belyaeva
- Subjects
Ukrainian ,language ,Sociology ,Contrastive linguistics ,Linguistics ,language.human_language ,Image (mathematics) - Published
- 2019
- Full Text
- View/download PDF
20. Network analysis identifies chromosome intermingling regions as regulatory hotspots for transcription
- Author
-
G. V. Shivashankar, Saradha Venkatachalapathy, Mallika Nagarajan, Anastasiya Belyaeva, and Caroline Uhler
- Subjects
0301 basic medicine ,chromosome intermingling ,Transcription, Genetic ,Correlation clustering ,RNA polymerase II ,Computational biology ,Genome ,Chromosome conformation capture ,03 medical and health sciences ,Hi-C ,Transcription (biology) ,Chromosome regions ,Animals ,Chromosomes, Human ,Humans ,Multidisciplinary ,epigenetics ,3D FISH ,biology ,Sequence Analysis, DNA ,Biological Sciences ,DNA binding site ,Biophysics and Computational Biology ,030104 developmental biology ,biology.protein ,Weighted network ,network and clustering analysis - Abstract
Significance We develop a network analysis approach for identifying clusters of interactions between chromosomes, which we validate experimentally. Our method integrates 1D features of the genome, such as epigenetic marks, with 3D interactions, allowing us to study spatially colocalized regions between chromosomes that are functionally relevant. We observe that clusters of interchromosomal regions fall into active and inactive categories. We find that active clusters share transcription factors and are enriched for transcriptional machinery, suggesting that chromosome intermingling regions play a key role in genome regulation. Our method provides a unique quantitative framework that can be broadly applied to study the principles of genome organization and regulation during processes such as cell differentiation and reprogramming., The 3D structure of the genome plays a key role in regulatory control of the cell. Experimental methods such as high-throughput chromosome conformation capture (Hi-C) have been developed to probe the 3D structure of the genome. However, it remains a challenge to deduce from these data chromosome regions that are colocalized and coregulated. Here, we present an integrative approach that leverages 1D functional genomic features (e.g., epigenetic marks) with 3D interactions from Hi-C data to identify functional interchromosomal interactions. We construct a weighted network with 250-kb genomic regions as nodes and Hi-C interactions as edges, where the edge weights are given by the correlation between 1D genomic features. Individual interacting clusters are determined using weighted correlation clustering on the network. We show that intermingling regions generally fall into either active or inactive clusters based on the enrichment for RNA polymerase II (RNAPII) and H3K9me3, respectively. We show that active clusters are hotspots for transcription factor binding sites. We also validate our predictions experimentally by 3D fluorescence in situ hybridization (FISH) experiments and show that active RNAPII is enriched in predicted active clusters. Our method provides a general quantitative framework that couples 1D genomic features with 3D interactions from Hi-C to probe the guiding principles that link the spatial organization of the genome with regulatory control.
- Published
- 2017
- Full Text
- View/download PDF
21. Architecture of HPC clusters for Oil&Gas Industry
- Author
-
Eugeniy Biryaltsev, Ol’ga Zhibrik, Aleksandr Elizarov, Marat Galimov, Anastasiya Belyaeva, and Denis Demidov
- Subjects
business.industry ,Geography, Planning and Development ,Environmental science ,Development ,Architecture ,Process engineering ,business - Published
- 2017
- Full Text
- View/download PDF
22. Application of Quality by Design (QbD) Approach to Ultrasonic Atomization Spray Coating of Drug-Eluting Stents
- Author
-
Sharmista Chatterjee, Minerva Hughes, Martin K. McDermott, Dinesh V. Patwardhan, Tina Zhang, Anastasiya Belyaeva, Celia N. Cruz, Joanne Leadbetter, Nancy Tang, David M. Saylor, Ariel Ash-Shakoor, Conrad W. Merkle, Xiaoli Hu, Reginald K. Avery, Taylor Moot, and Sepideh Parvinian
- Subjects
Materials science ,Surface Properties ,Kinetics ,Pharmaceutical Science ,Surface finish ,Aquatic Science ,engineering.material ,Microscopy, Atomic Force ,Quality by Design ,Coating ,Drug Discovery ,Ultrasonics ,Everolimus ,Composite material ,Polyglactin 910 ,Chromatography, High Pressure Liquid ,Ecology, Evolution, Behavior and Systematics ,Ecology ,Elution ,Design of experiments ,Drug-Eluting Stents ,General Medicine ,Biodegradable polymer ,Microscopy, Electron, Scanning ,engineering ,Ultrasonic sensor ,Agronomy and Crop Science ,Research Article - Abstract
The drug coating process for coated drug-eluting stents (DES) has been identified as a key source of inter- and intra-batch variability in drug elution rates. Quality-by-design (QbD) principles were applied to gain an understanding of the ultrasonic spray coating process of DES. Statistically based design of experiments (DOE) were used to understand the relationship between ultrasonic atomization spray coating parameters and dependent variables such as coating mass ratio, roughness, drug solid state composite microstructure, and elution kinetics. Defect-free DES coatings composed of 70% 85:15 poly(DL-lactide-co-glycolide) and 30% everolimus were fabricated with a constant coating mass. The drug elution profile was characterized by a mathematical model describing biphasic release kinetics. Model coefficients were analyzed as a DOE response. Changes in ultrasonic coating processing conditions resulted in substantial changes in roughness and elution kinetics. Based on the outcome from the DOE study, a design space was defined in terms of the critical coating process parameters resulting in optimum coating roughness and drug elution. This QbD methodology can be useful to enhance the quality of coated DES.
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.