11 results on '"Rajini Haraksingh"'
Search Results
2. Implementation of genomic surveillance of SARS-CoV-2 in the Caribbean: Lessons learned for sustainability in resource-limited settings.
- Author
-
Nikita S D Sahadeo, Soren Nicholls, Filipe R R Moreira, Áine O'Toole, Vernie Ramkissoon, Charles Whittaker, Verity Hill, John T McCrone, Nicholas Mohammed, Anushka Ramjag, Arianne Brown Jordan, Sarah C Hill, Risha Singh, Sue-Min Nathaniel-Girdharrie, Avery Hinds, Nuala Ramkissoon, Kris V Parag, Naresh Nandram, Roshan Parasram, Zobida Khan-Mohammed, Lisa Edghill, Lisa Indar, Aisha Andrewin, Rhonda Sealey-Thomas, Pearl McMillan, Ayoola Oyinloye, Kenneth George, Irad Potter, John Lee, David Johnson, Shawn Charles, Narine Singh, Jacquiline Bisesor-McKenzie, Hazel Laws, Sharon Belmar-George, Simone Keizer-Beache, Sharra Greenaway-Duberry, Nadia Ashwood, Jerome E Foster, Karla Georges, Rahul Naidu, Marsha Ivey, Stanley Giddings, Rajini Haraksingh, Adesh Ramsubhag, Jayaraj Jayaraman, Chinnaraja Chinnadurai, Christopher Oura, Oliver G Pybus, Joy St John, Gabriel Gonzalez-Escobar, Nuno R Faria, and Christine V F Carrington
- Subjects
Public aspects of medicine ,RA1-1270 - Abstract
The COVID-19 pandemic highlighted the importance of global genomic surveillance to monitor the emergence and spread of SARS-CoV-2 variants and inform public health decision-making. Until December 2020 there was minimal capacity for viral genomic surveillance in most Caribbean countries. To overcome this constraint, the COVID-19: Infectious disease Molecular epidemiology for PAthogen Control & Tracking (COVID-19 IMPACT) project was implemented to establish rapid SARS-CoV-2 whole genome nanopore sequencing at The University of the West Indies (UWI) in Trinidad and Tobago (T&T) and provide needed SARS-CoV-2 sequencing services for T&T and other Caribbean Public Health Agency Member States (CMS). Using the Oxford Nanopore Technologies MinION sequencing platform and ARTIC network sequencing protocols and bioinformatics pipeline, a total of 3610 SARS-CoV-2 positive RNA samples, received from 17 CMS, were sequenced in-situ during the period December 5th 2020 to December 31st 2021. Ninety-one Pango lineages, including those of five variants of concern (VOC), were identified. Genetic analysis revealed at least 260 introductions to the CMS from other global regions. For each of the 17 CMS, the percentage of reported COVID-19 cases sequenced by the COVID-19 IMPACT laboratory ranged from 0·02% to 3·80% (median = 1·12%). Sequences submitted to GISAID by our study represented 73·3% of all SARS-CoV-2 sequences from the 17 CMS available on the database up to December 31st 2021. Increased staffing, process and infrastructural improvement over the course of the project helped reduce turnaround times for reporting to originating institutions and sequence uploads to GISAID. Insights from our genomic surveillance network in the Caribbean region directly influenced non-pharmaceutical countermeasures in the CMS countries. However, limited availability of associated surveillance and clinical data made it challenging to contextualise the observed SARS-CoV-2 diversity and evolution, highlighting the need for development of infrastructure for collecting and integrating genomic sequencing data and sample-associated metadata.
- Published
- 2023
- Full Text
- View/download PDF
3. Abstract P3-19-05: Genomic machine learning model predicts radiation therapy benefit in early-stage breast cancer patients with high accuracy
- Author
-
Kimberly Badal, Jerome Foster, Rajini Haraksingh, and Melford John
- Subjects
Cancer Research ,Oncology - Abstract
Background: Radiation therapy (RT) is frequently recommended for post-surgery treatment of early-stage breast cancer (BC) patients, though not all benefit. Clinical factors currently guide RT treatment decisions. This work presents a high-accuracy genomic machine learning (ML) model to predict RT-benefit in early-stage BC patients and a novel method for selecting genomic features for ML algorithms. Methods: Gene expression data from 477 early-stage BC patients treated with surgery and RT from the METABRIC cohort were obtained. Wilcoxon Rank Sum (Wilcox RS) test and Cox Proportional Hazards (Cox PH) were used to reduce the number of genes used to train 8 ML algorithms. Each ML algorithm was trained on a random subset of 80% of the data using 10-fold cross-validation and tested on the remaining 20% to assess its performance in predicting relapse status within 30 years. Results: The genomic data were reduced using Wilcox RS and Cox PH to a 1,596 gene set and a 977 gene set. These gene sets when used to train the 8 ML algorithms resulted in models that ranged in performance accuracies from 54.01% to 95.6%. The highest accuracies were obtained using Support Vector Machine (SVM977 - 93.41%, SVM1596 - 95.6%) and Neural Networks (NN977 - 92.31%, NN1596 - 93.41%). The accuracy of all models when tested on RT-untreated patients was 30-40% lower compared to RT-treated patients. SVM977 had the highest sensitivity of 91.09%. Members of the 977 gene set were enriched with genes involved in the cell cycle and differentiation as well as radiogenes. Conclusion: We developed an SVM model that used 977 differentially expressed genes as features that predicted RT-benefit in early-stage BC patients with 93.41% accuracy and 91.09% sensitivity. We also developed a novel genomic feature selection approach that used Wilcoxon RS followed by Cox PH that resulted in expression values of only 4% of all genes being used as features in the models. Citation Format: Kimberly Badal, Jerome Foster, Rajini Haraksingh, Melford John. Genomic machine learning model predicts radiation therapy benefit in early-stage breast cancer patients with high accuracy [abstract]. In: Proceedings of the 2021 San Antonio Breast Cancer Symposium; 2021 Dec 7-10; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2022;82(4 Suppl):Abstract nr P3-19-05.
- Published
- 2022
- Full Text
- View/download PDF
4. Genomic Machine Learning Model Predicts Radiation Therapy Benefit in Early-Stage Breast Cancer Patients with High Accuracy
- Author
-
Kimberly Badal, Jerome E. Foster, Rajini Haraksingh, and Melford John
- Abstract
BackgroundRadiation therapy (RT) is frequently recommended for post-surgery treatment of early-stage breast cancer (BC) patients, though not all benefit. Clinical factors currently guide RT treatment decisions. At present, models to predict RT-benefit predominantly use statistical methods with modest performance. In this paper we present a high-accuracy genomic Machine Learning (ML) model to predict RT-benefit in early-stage BC patients. We also present a novel method for selecting genomic features for training ML algorithms. MethodsGene expression data from 463 early-stage BC patients treated with surgery and RT from the METABRIC cohort were obtained. Wilcoxon Rank Sum (Wilcoxon RS) test and Cox Proportional Hazards (Cox PH) were used to reduce the number of genes used to train eight ML algorithms. ML algorithms were trained on 80% of data using 10-fold cross validation and tested on 20% of data to assess performance in predicting relapse status. Results Genome-wide gene expression data was reduced by 96% using Wilcoxon RS and Cox PH to a 1,596 gene set and a 977 gene set. These gene sets were used to train eight ML algorithms resulting in models that ranged in performance accuracies from 54.01% to 95.6%. Highest accuracies were obtained using Support Vector Machine (SVM977–93.41%, SVM1596–95.6%) and Neural Networks algorithms (NN977 – 92.31%, NN1596 – 93.41%). In RT-untreated patients, accuracies of all models were 30% to 40% lower compared to RT-treated patients. SVM977 had the highest sensitivity of 91.09%. Members of the 977 set were enriched with genes involved in cell cycle and differentiation as well as genes associated with radiosensitivity and radioresistance. Conclusion This study presents a novel genomic feature selection approach that used Wilcoxon RS followed by Cox PH to reduce the number of genes from genome-wide gene expression data used for training ML algorithms by 96%. This approach led to an SVM model that used the expression values of 977 genes to predict RT-benefit in early-stage BC patients with 93.41% accuracy. This work demonstrates that ML models can be clinically useful for predicting cancer patient outcomes.
- Published
- 2022
- Full Text
- View/download PDF
5. Additional file 6: Table 3. of Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans
- Author
-
Rajini Haraksingh, Abyzov, Alexej, and Urban, Alexander
- Subjects
ComputingMethodologies_PATTERNRECOGNITION - Abstract
Manifest and cluster files used in Genome Studio analysis of Illumina arrays. Lists the Illumina-supplied manifest and cluster files for each array that were used in Genome Studio analysis. These files were downloaded from http://support.illumina.com/array/downloads.html . (DOCX 57 kb)
- Published
- 2017
- Full Text
- View/download PDF
6. Additional file 4: Table S2. of Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans
- Author
-
Rajini Haraksingh, Abyzov, Alexej, and Urban, Alexander
- Subjects
genetic structures ,sense organs ,eye diseases - Abstract
Array detection of gold standard deletions > 100 kb in size. Summarizes the detection of the seven gold standard CNVs > 100 kb by the different arrays including the number of arrays that detected each large CNV, which arrays detected each CNV, and possible reasons why a particular large CNV may not be detectable by certain arrays. (DOCX 114 kb)
- Published
- 2017
- Full Text
- View/download PDF
7. Additional file 3: Figure S2. of Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans
- Author
-
Rajini Haraksingh, Abyzov, Alexej, and Urban, Alexander
- Abstract
Cumulative frequencies of sizes of validated and non-validated CNV calls using platform specific algorithm. Cumulative frequencies of the sizes of validated CNVs are shown in red. Cumulative frequencies of sizes of non-validated CNVs are shown in green. CNV size is plotted on a log scale. Plots are shown for all arrays with more than 50 validated CNVs called using the platform specific algorithm. P-values were computed using a Mannâ Whitney U test that corrects for ties and uses a continuity correction. The p-values correspond to a one-sided hypothesis. (PDF 88 kb)
- Published
- 2017
- Full Text
- View/download PDF
8. The variant call format and VCFtools
- Author
-
Javier Herrero, Carla Gallo, Harold Swerdlow, Goncalo Abecasis, Ewan Birney, Richard Durbin, Gavin Ha, Rajini Haraksingh, Gil McVean, Vineet Bafna, Paul Kersey, Laura Clarke, Robert Handsaker, Daniel MacArthur, Daniel Zerbino, Tatiana Borodina, Stephen Sherry, Andres C Garcia-Montero, Ralf Sudbrak, Sarah Dunstan, Klaudia Walter, Jonathan Sebat, Gerton Lunter, John Marioni, Ran Blekhman, Matthias Haimel, and Andreas Dahl
- Subjects
0106 biological sciences ,Statistics and Probability ,dbSNP ,Genotype ,Computer science ,Information Storage and Retrieval ,Genomics ,Single-nucleotide polymorphism ,Computational biology ,010603 evolutionary biology ,01 natural sciences ,Biochemistry ,Genome ,Population genomics ,03 medical and health sciences ,Genome resequencing ,Genetic variation ,Humans ,1000 Genomes Project ,Molecular Biology ,Exome ,Alleles ,030304 developmental biology ,computer.programming_language ,0303 health sciences ,Variant Call Format ,Information retrieval ,Genome, Human ,Dna polymorphism ,Genetic Variation ,Computer Science Applications ,Applications Note ,Computational Mathematics ,Computational Theory and Mathematics ,Perl ,Sequence Analysis ,computer ,Software ,Reference genome - Abstract
Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: rd@sanger.ac.uk
- Published
- 2016
9. Variation in genome-wide mutation rates within and between human families
- Author
-
Javier Herrero, Harold Swerdlow, Rajini Haraksingh, Christopher Hartl, Laura Clarke, Ryan Mills, David N. Cooper, Carlos Torroja, Daniel MacArthur, Carlos D. Bustamante, Tatiana Borodina, Ralf Sudbrak, Philip Rosenstiel, Eugene Kulesha, Klaudia Walter, Simon Myers, Jonathan Sebat, Eric Stone, Kiran Garimella, Rajesh Radhakrishnan, Sarah Lindsay, William McLaren, Vadim Zalunin, Andrew Clark, Rasko Leinonen, Thomas Keane, Stephen Keenan, and Andreas Dahl
- Subjects
Male ,Genetics ,Mutation ,Mutation rate ,Genome, Human ,DNA Mutational Analysis ,Chromosome Mapping ,Genetic Variation ,Germline mosaicism ,Biology ,medicine.disease_cause ,Polymerase Chain Reaction ,Genome ,Article ,Germline ,Germline mutation ,medicine ,Humans ,Family ,Female ,Human genome ,Gene ,Germ-Line Mutation - Abstract
J.B.S. Haldane proposed in 1947 that the male germline may be more mutagenic than the female germline. Diverse studies have supported Haldane's contention of a higher average mutation rate in the male germline in a variety of mammals, including humans. Here we present, to our knowledge, the first direct comparative analysis of male and female germline mutation rates from the complete genome sequences of two parent-offspring trios. Through extensive validation, we identified 49 and 35 germline de novo mutations (DNMs) in two trio offspring, as well as 1,586 non-germline DNMs arising either somatically or in the cell lines from which the DNA was derived. Most strikingly, in one family, we observed that 92% of germline DNMs were from the paternal germline, whereas, in contrast, in the other family, 64% of DNMs were from the maternal germline. These observations suggest considerable variation in mutation rates within and between families.
- Published
- 2011
- Full Text
- View/download PDF
10. The 1000 Genomes Project: data management and community access
- Author
-
Javier Herrero, Brendan Vaughan, Harold Swerdlow, Rajini Haraksingh, David Altshuler, Laura Clarke, Ryan Mills, Jin Yu, Daniel MacArthur, Tatiana Borodina, Stephen Sherry, Paul Flicek, Ralf Sudbrak, Loukas Moutsianas, Philip Rosenstiel, Eugene Kulesha, Klaudia Walter, Simon Myers, Jonathan Sebat, Rajesh Radhakrishnan, Aarno Palotie, William McLaren, Angie Hinrichs, Vadim Zalunin, Andrew Clark, Rasko Leinonen, Thomas Keane, Stephen Keenan, Andreas Dahl, and Xiangqun Zheng Bradley
- Subjects
Data collection ,Genome, Human ,business.industry ,Data management ,Computational Biology ,Genetic Variation ,Genomics ,Sequence Analysis, DNA ,Cell Biology ,Computational biology ,Human genetic variation ,Variation (game tree) ,Biology ,Biochemistry ,Data science ,Article ,Data access ,Databases, Genetic ,Humans ,Human genome ,1000 Genomes Project ,business ,Molecular Biology ,Biotechnology - Abstract
The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to the primary scientific goals of creating both a deep catalogue of human genetic variation and extensive methods to accurately discover and characterize variation using new sequencing technologies, the project makes all of its data publicly available for community use. The project data coordination center has developed and deployed several tools to enable widespread data access.
- Published
- 2012
11. Diversity of human copy number variation and multicopy genes
- Author
-
Javier Herrero, Harold Swerdlow, Michael Eberle, Alexej Abyzov, Kristian Cibulskis, Rajini Haraksingh, David Altshuler, Can Alkan, Laura Clarke, Ryan Mills, Richa Agarwala, David N. Cooper, Francesca Antonacci, Shankar Balasubramanian, Daniel MacArthur, Jesus Christ Aguinaga, Stephen Sherry, Jay Shendure, Ralf Sudbrak, Philip Rosenstiel, Eugene Kulesha, Klaudia Walter, Simon Myers, Jonathan Sebat, Ken Chen, Eric Stone, Peter Sudmant, Stephen Schaffner, Rajesh Radhakrishnan, MICHAEL MCLELLAN, William McLaren, Angie Hinrichs, Vadim Zalunin, Afidalina Tumian, Andrew Clark, Rasko Leinonen, Jacob Kitzman, Thomas Keane, Stephen Keenan, and Andreas Dahl
- Subjects
Male ,DNA Copy Number Variations ,Genotype ,Population ,Copy number analysis ,Gene Conversion ,Gene Dosage ,Biology ,Polymorphism, Single Nucleotide ,Article ,Evolution, Molecular ,Gene Frequency ,Genes, Duplicate ,Gene Duplication ,Gene family ,Humans ,Gene conversion ,Copy-number variation ,education ,Gene ,Genetics ,education.field_of_study ,Genetic diversity ,Multidisciplinary ,Genome, Human ,Racial Groups ,Chromosome Mapping ,Genetic Variation ,Genomics ,Sequence Analysis, DNA ,Haplotypes ,Human genome ,Female ,Databases, Nucleic Acid - Abstract
Evolution, Gene Number, and Disease Slight variations in the numbers of copies of genes influence human disease and other characters. Variants can be hard to detect when they lie in heavily duplicated and widely similar regions of sequence known as “dark matter.” Sudmant et al. (p. 641 ) have methods to tease apart the duplicated regions to reveal singly unique nucleotide identifiers. These have turned out to be among the most variable seen in different human population groups—most notably among genes for neurodevelopment and neurological diseases. Such polymorphisms can be genotyped with specificity and may help us understand how variation in copy number may affect human evolution and disease.
- Published
- 2010
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.