258 results on '"Ritchie ME"'
Search Results
2. Systematic comparison of sequencing-based spatial transcriptomic methods
- Author
-
You, Y, Fu, Y, Li, L, Zhang, Z, Jia, S, Lu, S, Ren, W, Liu, Y, Xu, Y, Liu, X, Jiang, F, Peng, G, Sampath Kumar, A, Ritchie, ME, Tian, L, You, Y, Fu, Y, Li, L, Zhang, Z, Jia, S, Lu, S, Ren, W, Liu, Y, Xu, Y, Liu, X, Jiang, F, Peng, G, Sampath Kumar, A, Ritchie, ME, and Tian, L
- Abstract
Recent developments of sequencing-based spatial transcriptomics (sST) have catalyzed important advancements by facilitating transcriptome-scale spatial gene expression measurement. Despite this progress, efforts to comprehensively benchmark different platforms are currently lacking. The extant variability across technologies and datasets poses challenges in formulating standardized evaluation metrics. In this study, we established a collection of reference tissues and regions characterized by well-defined histological architectures, and used them to generate data to compare 11 sST methods. We highlighted molecular diffusion as a variable parameter across different methods and tissues, significantly affecting the effective resolutions. Furthermore, we observed that spatial transcriptomic data demonstrate unique attributes beyond merely adding a spatial axis to single-cell data, including an enhanced ability to capture patterned rare cell states along with specific markers, albeit being influenced by multiple factors including sequencing depth and resolution. Our study assists biologists in sST platform selection, and helps foster a consensus on evaluation standards and establish a framework for future benchmarking efforts that can be used as a gold standard for the development and benchmarking of computational tools for spatial transcriptomic analysis.
- Published
- 2024
3. Venetoclax treatment in patients with cancer has limited impact on circulating T and NK cells
- Author
-
Teh, CE, Peng, H, Luo, M-X, Tan, T, Trussart, M, Howson, LJ, Chua, CC, Muttiah, C, Brown, F, Ritchie, ME, Wei, AH, Roberts, AW, Bryant, VL, Anderson, MA, Lindeman, GJ, Huang, DCS, Thijssen, R, Gray, DHD, Teh, CE, Peng, H, Luo, M-X, Tan, T, Trussart, M, Howson, LJ, Chua, CC, Muttiah, C, Brown, F, Ritchie, ME, Wei, AH, Roberts, AW, Bryant, VL, Anderson, MA, Lindeman, GJ, Huang, DCS, Thijssen, R, and Gray, DHD
- Abstract
Venetoclax is an effective treatment for certain blood cancers, such as chronic lymphocytic leukemia (CLL) and acute myeloid leukemia (AML). However, most patients relapse while on venetoclax and further treatment options are limited. Combining venetoclax with immunotherapies is an attractive approach; however, a detailed understanding of how venetoclax treatment impacts normal immune cells in patients is lacking. In this study, we performed deep profiling of peripheral blood (PB) cells from patients with CLL and AML before and after short-term treatment with venetoclax using mass cytometry (cytometry by time of flight) and found no impact on the concentrations of key T-cell subsets or their expression of checkpoint molecules. We also analyzed PB from patients with breast cancer receiving venetoclax long-term using a single-cell multiomics approach (cellular indexing of transcriptomes and epitopes by sequencing) and functional assays. We found significant depletion of B-cell populations with low expression of MCL-1 relative to other immune cells, attended by extensive transcriptomic changes. By contrast, there was less impact on circulating T cells and natural killer (NK) cells, with no changes in their subset composition, transcriptome, or function following venetoclax treatment. Our data indicate that venetoclax has minimal impact on circulating T or NK cells, supporting the rationale of combining this BH3 mimetic drug with cancer immunotherapies for more durable antitumor responses.
- Published
- 2023
4. Arginine-rich C9ORF72 ALS proteins stall ribosomes in a manner distinct from a canonical ribosome-associated quality control substrate
- Author
-
Kriachkov, V, Ormsby, AR, Kusnadi, EP, McWilliam, HEG, Mintern, JD, Amarasinghe, SL, Ritchie, ME, Furic, L, Hatters, DM, Kriachkov, V, Ormsby, AR, Kusnadi, EP, McWilliam, HEG, Mintern, JD, Amarasinghe, SL, Ritchie, ME, Furic, L, and Hatters, DM
- Abstract
Hexanucleotide expansion mutations in C9ORF72 are a frequent cause of amyotrophic lateral sclerosis. We previously reported that long arginine-rich dipeptide repeats (DPRs), mimicking abnormal proteins expressed from the hexanucleotide expansion, caused translation stalling when expressed in cell culture models. Whether this stalling provides a mechanism of pathogenicity remains to be determined. Here, we explored the molecular features of DPR-induced stalling and examined whether known mechanisms such as ribosome quality control (RQC) regulate translation elongation on sequences that encode arginine-rich DPRs. We demonstrate that arginine-rich DPRs lead to stalling in a length-dependent manner, with lengths longer than 40 repeats invoking severe translation arrest. Mutational screening of 40×Gly-Xxx DPRs shows that stalling is most pronounced when Xxx is a charged amino acid (Arg, Lys, Glu, or Asp). Through a genome-wide knockout screen, we find that genes regulating stalling on polyadenosine mRNA coding for poly-Lys, a canonical RQC substrate, act differently in the case of arginine-rich DPRs. Indeed, these findings point to a limited scope for natural regulatory responses to resolve the arginine-rich DPR stalls, even though the stalls may be sensed, as evidenced by an upregulation of RQC gene expression. These findings therefore implicate arginine-rich DPR-mediated stalled ribosomes as a source of stress and toxicity and may be a crucial component in pathomechanisms.
- Published
- 2023
5. BAF complex-mediated chromatin relaxation is required for establishment of X chromosome inactivation
- Author
-
Keniry, A, Jansz, N, Gearing, LJ, Wanigasuriya, I, Chen, J, Nefzger, CM, Hickey, PF, Gouil, Q, Liu, J, Breslin, KA, Iminitoff, M, Beck, T, del Fierro, AT, Whitehead, L, Jarratt, A, Kinkel, SA, Taberlay, PC, Willson, T, Pakusch, M, Ritchie, ME, Hilton, DJ, Polo, JM, Blewitt, ME, Keniry, A, Jansz, N, Gearing, LJ, Wanigasuriya, I, Chen, J, Nefzger, CM, Hickey, PF, Gouil, Q, Liu, J, Breslin, KA, Iminitoff, M, Beck, T, del Fierro, AT, Whitehead, L, Jarratt, A, Kinkel, SA, Taberlay, PC, Willson, T, Pakusch, M, Ritchie, ME, Hilton, DJ, Polo, JM, and Blewitt, ME
- Abstract
The process of epigenetic silencing, while fundamentally important, is not yet completely understood. Here we report a replenishable female mouse embryonic stem cell (mESC) system, Xmas, that allows rapid assessment of X chromosome inactivation (XCI), the epigenetic silencing mechanism of one of the two X chromosomes that enables dosage compensation in female mammals. Through a targeted genetic screen in differentiating Xmas mESCs, we reveal that the BAF complex is required to create nucleosome-depleted regions at promoters on the inactive X chromosome during the earliest stages of establishment of XCI. Without this action gene silencing fails. Xmas mESCs provide a tractable model for screen-based approaches that enable the discovery of unknown facets of the female-specific process of XCI and epigenetic silencing more broadly.
- Published
- 2022
6. A transcriptomic dataset evaluating the effect of radiotherapy injury on cells of skin and soft tissue
- Author
-
Shukla, L, Lee, SA, Du, MRM, Karnezis, T, Ritchie, ME, Shayan, R, Shukla, L, Lee, SA, Du, MRM, Karnezis, T, Ritchie, ME, and Shayan, R
- Abstract
Radiotherapy injury to cells of the skin and subcutaneous tissue is an inevitable consequence of external beam radiation for treatment of cancer. This sublethal injury to normal tissues plays a significant role in the development of fibrosis, lymphedema, impaired wound healing, and recurrent infections. To elucidate the transcriptional changes that occur in cells of the skin and soft tissues after radiotherapy injury, we performed genome-wide RNA-sequencing comparing irradiated cells (10Gy) with non-irradiated (0Gy) controls in normal human dermal fibroblasts, normal human keratinocytes, human microvascular endothelial cells, human dermal lymphatic endothelial cells, pericytes and adipose derived stem cell populations. These data are publicly available from the Gene Expression Omnibus database (accession number GSE184119). Further insights can be gained by comparing the mRNA signatures arising from radiation injury derived from these data to publicly available signatures from other studies involving similar or different tissue types. These global targets hold potential for manipulation to mitigate radiotherapy soft tissue injury.
- Published
- 2022
7. Maternal SMCHD1 controls both imprinted Xist expression and imprinted X chromosome inactivation
- Author
-
Wanigasuriya, I, Kinkel, SA, Beck, T, Roper, EA, Breslin, K, Lee, HJ, Keniry, A, Ritchie, ME, Blewitt, ME, Gouil, Q, Wanigasuriya, I, Kinkel, SA, Beck, T, Roper, EA, Breslin, K, Lee, HJ, Keniry, A, Ritchie, ME, Blewitt, ME, and Gouil, Q
- Abstract
Embryonic development is dependent on the maternal supply of proteins through the oocyte, including factors setting up the adequate epigenetic patterning of the zygotic genome. We previously reported that one such factor is the epigenetic repressor SMCHD1, whose maternal supply controls autosomal imprinted expression in mouse preimplantation embryos and mid-gestation placenta. In mouse preimplantation embryos, X chromosome inactivation is also an imprinted process. Combining genomics and imaging, we show that maternal SMCHD1 is required not only for the imprinted expression of Xist in preimplantation embryos, but also for the efficient silencing of the inactive X in both the preimplantation embryo and mid-gestation placenta. These results expand the role of SMCHD1 in enforcing the silencing of Polycomb targets. The inability of zygotic SMCHD1 to fully restore imprinted X inactivation further points to maternal SMCHD1's role in setting up the appropriate chromatin environment during preimplantation development, a critical window of epigenetic remodelling.
- Published
- 2022
8. Epigenetic modulators of B cell fate identified through coupled phenotype-transcriptome analysis
- Author
-
Kong, IY, Trezise, S, Light, A, Todorovski, I, Arnau, GM, Gadipally, S, Yoannidis, D, Simpson, KJ, Dong, X, Whitehead, L, Tempany, JC, Farchione, AJ, Sheikh, AA, Groom, JR, Rogers, KL, Herold, MJ, Bryant, VL, Ritchie, ME, Willis, SN, Johnstone, RW, Hodgkin, PD, Nutt, SL, Vervoort, SJ, Hawkins, ED, Kong, IY, Trezise, S, Light, A, Todorovski, I, Arnau, GM, Gadipally, S, Yoannidis, D, Simpson, KJ, Dong, X, Whitehead, L, Tempany, JC, Farchione, AJ, Sheikh, AA, Groom, JR, Rogers, KL, Herold, MJ, Bryant, VL, Ritchie, ME, Willis, SN, Johnstone, RW, Hodgkin, PD, Nutt, SL, Vervoort, SJ, and Hawkins, ED
- Abstract
High-throughput methodologies are the cornerstone of screening approaches to identify novel compounds that regulate immune cell function. To identify novel targeted therapeutics to treat immune disorders and haematological malignancies, there is a need to integrate functional cellular information with the molecular mechanisms that regulate changes in immune cell phenotype. We facilitate this goal by combining quantitative methods for dissecting complex simultaneous cell phenotypic effects with genomic analysis. This combination strategy we term Multiplexed Analysis of Cells sequencing (MAC-seq), a modified version of Digital RNA with perturbation of Genes (DRUGseq). We applied MAC-seq to screen compounds that target the epigenetic machinery of B cells and assess altered humoral immunity by measuring changes in proliferation, survival, differentiation and transcription. This approach revealed that polycomb repressive complex 2 (PRC2) inhibitors promote antibody secreting cell (ASC) differentiation in both murine and human B cells in vitro. This is further validated using T cell-dependent immunization in mice. Functional dissection of downstream effectors of PRC2 using arrayed CRISPR screening uncovered novel regulators of B cell differentiation, including Mybl1, Myof, Gas7 and Atoh8. Together, our findings demonstrate that integrated phenotype-transcriptome analyses can be effectively combined with drug screening approaches to uncover the molecular circuitry that drives lymphocyte fate decisions.
- Published
- 2022
9. NAb-seq: an accurate, rapid, and cost-effective method for antibody long-read sequencing in hybridoma cell lines and single B cells
- Author
-
Satish, HPS, Zeglinski, K, Uren, RT, Nutt, SL, Ritchie, ME, Gouil, Q, Kluck, RM, Satish, HPS, Zeglinski, K, Uren, RT, Nutt, SL, Ritchie, ME, Gouil, Q, and Kluck, RM
- Abstract
Despite their common use in research, monoclonal antibodies are currently not systematically sequenced. This can lead to issues with reproducibility and the occasional loss of antibodies with loss of cell lines. Hybridoma cell lines have been the primary means of generating monoclonal antibodies from immunized animals, including mice, rats, rabbits, and alpacas. Excluding therapeutic antibodies, few hybridoma-derived antibody sequences are known. Sanger sequencing has been "the gold standard" for antibody gene sequencing, but this method relies on the availability of species-specific degenerate primer sets for amplification of light and heavy antibody genes and it requires lengthy and expensive cDNA preparation. Here, we leveraged recent improvements in long-read Oxford Nanopore Technologies (ONT) sequencing to develop Nanopore Antibody sequencing (NAb-seq): a three-day, species-independent, and cost-effective workflow to characterize paired full-length immunoglobulin light- and heavy-chain genes from hybridoma cell lines. When compared to Sanger sequencing of two hybridoma cell lines, long-read ONT sequencing was highly accurate, reliable, and amenable to high throughput. We further show that the method is applicable to single cells, allowing efficient antibody discovery in rare populations such as memory B cells. In summary, NAb-seq promises to accelerate identification and validation of hybridoma antibodies as well as antibodies from single B cells used in research, diagnostics, and therapeutics.
- Published
- 2022
10. A transcriptomic dataset evaluating the effect of radiotherapy injury on cells of skin and soft tissue (vol 41, 107828, 2022)
- Author
-
Shuka, L, Lee, SA, Du, MRM, Karnezis, T, Ritchie, ME, Shayan, R, Shuka, L, Lee, SA, Du, MRM, Karnezis, T, Ritchie, ME, and Shayan, R
- Abstract
[This corrects the article DOI: 10.1016/j.dib.2022.107828.].
- Published
- 2022
11. A specialized tyrosine-based endocytosis signal in MR1 controls antigen presentation to MAIT cells
- Author
-
Lim, HJ, Wubben, JM, Garcia, CP, Cruz-Gomez, S, Deng, J, Mak, JYW, Hachani, A, Anderson, RJ, Painter, GF, Goyette, J, Amarasinghe, SL, Ritchie, ME, Roquilly, A, Fairlie, DP, Gaus, K, Rossjohn, J, Villadangos, JA, McWilliam, HEG, Lim, HJ, Wubben, JM, Garcia, CP, Cruz-Gomez, S, Deng, J, Mak, JYW, Hachani, A, Anderson, RJ, Painter, GF, Goyette, J, Amarasinghe, SL, Ritchie, ME, Roquilly, A, Fairlie, DP, Gaus, K, Rossjohn, J, Villadangos, JA, and McWilliam, HEG
- Abstract
MR1 is a highly conserved microbial immune-detection system in mammals. It captures vitamin B-related metabolite antigens from diverse microbes and presents them at the cell surface to stimulate MR1-restricted lymphocytes including mucosal-associated invariant T (MAIT) cells. MR1 presentation and MAIT cell recognition mediate homeostasis through host defense and tissue repair. The cellular mechanisms regulating MR1 cell surface expression are critical to its function and MAIT cell recognition, yet they are poorly defined. Here, we report that human MR1 is equipped with a tyrosine-based motif in its cytoplasmic domain that mediates low affinity binding with the endocytic adaptor protein 2 (AP2) complex. This interaction controls the kinetics of MR1 internalization from the cell surface and minimizes recycling. We propose MR1 uses AP2 endocytosis to define the duration of antigen presentation to MAIT cells and the detection of a microbial metabolic signature by the immune system.
- Published
- 2022
12. Dashboard-style interactive plots for RNA-seq analysis are R Markdown ready with Glimma 2.0
- Author
-
Kariyawasam, H, Su, S, Voogd, O, Ritchie, ME, Law, CW, Kariyawasam, H, Su, S, Voogd, O, Ritchie, ME, and Law, CW
- Abstract
Glimma 1.0 introduced intuitive, point-and-click interactive graphics for differential gene expression analysis. Here, we present a major update to Glimma that brings improved interactivity and reproducibility using high-level visualization frameworks for R and JavaScript. Glimma 2.0 plots are now readily embeddable in R Markdown, thus allowing users to create reproducible reports containing interactive graphics. The revamped multidimensional scaling plot features dashboard-style controls allowing the user to dynamically change the colour, shape and size of sample points according to different experimental conditions. Interactivity was enhanced in the MA-style plot for comparing differences to average expression, which now supports selecting multiple genes, export options to PNG, SVG or CSV formats and includes a new volcano plot function. Feature-rich and user-friendly, Glimma makes exploring data for gene expression analysis more accessible and intuitive and is available on Bioconductor and GitHub.
- Published
- 2021
13. CD36 family members are TCR-independent ligands for CD1 antigen-presenting molecules.
- Author
-
Gherardin, NA, Redmond, SJ, McWilliam, HEG, Almeida, CF, Gourley, KHA, Seneviratna, R, Li, S, De Rose, R, Ross, FJ, Nguyen-Robertson, CV, Su, S, Ritchie, ME, Villadangos, JA, Moody, DB, Pellicci, DG, Uldrich, AP, Godfrey, DI, Gherardin, NA, Redmond, SJ, McWilliam, HEG, Almeida, CF, Gourley, KHA, Seneviratna, R, Li, S, De Rose, R, Ross, FJ, Nguyen-Robertson, CV, Su, S, Ritchie, ME, Villadangos, JA, Moody, DB, Pellicci, DG, Uldrich, AP, and Godfrey, DI
- Abstract
CD1c presents lipid-based antigens to CD1c-restricted T cells, which are thought to be a major component of the human T cell pool. However, the study of CD1c-restricted T cells is hampered by the presence of an abundantly expressed, non-T cell receptor (TCR) ligand for CD1c on blood cells, confounding analysis of TCR-mediated CD1c tetramer staining. Here, we identified the CD36 family (CD36, SR-B1, and LIMP-2) as ligands for CD1c, CD1b, and CD1d proteins and showed that CD36 is the receptor responsible for non-TCR-mediated CD1c tetramer staining of blood cells. Moreover, CD36 blockade clarified tetramer-based identification of CD1c-restricted T cells and improved identification of CD1b- and CD1d-restricted T cells. We used this technique to characterize CD1c-restricted T cells ex vivo and showed diverse phenotypic features, TCR repertoire, and antigen-specific subsets. Accordingly, this work will enable further studies into the biology of CD1 and human CD1-restricted T cells.
- Published
- 2021
14. Single-Cell Transcriptomic Analysis Reveals BCMA CAR-T Cell Dynamics in a Patient with Refractory Primary Plasma Cell Leukemia
- Author
-
Li, X, Guo, X, Zhu, Y, Wei, G, Zhang, Y, Xu, H, Cui, J, Wu, W, He, J, Ritchie, ME, Weiskittel, TM, Li, H, Yu, H, Ding, L, Shao, M, Luo, Q, Xu, X, Teng, X, Chang, AH, Zhang, J, Huang, H, Hu, Y, Li, X, Guo, X, Zhu, Y, Wei, G, Zhang, Y, Xu, H, Cui, J, Wu, W, He, J, Ritchie, ME, Weiskittel, TM, Li, H, Yu, H, Ding, L, Shao, M, Luo, Q, Xu, X, Teng, X, Chang, AH, Zhang, J, Huang, H, and Hu, Y
- Abstract
Chimeric antigen receptor T cell (CAR-T) therapy has revolutionized the clinical treatment of hematological malignancies due to the prominent anti-tumor effects. B cell maturation antigen (BCMA) CAR-T cells have demonstrated promising effects in patients with relapsed/refractory multiple myeloma. However, the dynamics of CAR-T cell proliferation and cytotoxicity in clinical patients remains unexplored. Here, we longitudinally profiled the transcriptomes of 55,488 T cells including CAR-T products, CAR-T cells, and endogenous T cells at the peak and remission phases in a plasma cell leukemia (PCL) patient treated with BCMA CAR-T cells by single-cell transcriptomic analysis. Our results showed distinct CAR-T and endogenous T cell subsets indicating stage-specific expression in proliferation, cytotoxicity, and intercellular signaling pathways. Furthermore, we found that CAR-T cells at peak phase gradually convert to a highly cytotoxic state from a highly proliferative state along a development trajectory. Moreover, re-analysis of a single cell study from CD8+ CD19 CAR-T confirmed our findings. These commonalities suggest conserved mechanisms for CAR-T treatment across hematological malignancies. Taken together, our current study provides insight into CAR-T cell dynamics during CAR-T therapy and proves that both BCMA CAR-T and CD19 CAR-T have similar transcriptional characteristics, especially at the CAR-T peak phase.
- Published
- 2021
15. The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools
- Author
-
Dong, X, Tian, L, Gouil, Q, Kariyawasam, H, Su, S, De Paoli-Iseppi, R, Prawer, YDJ, Clark, MB, Breslin, K, Iminitoff, M, Blewitt, ME, Law, CW, Ritchie, ME, Dong, X, Tian, L, Gouil, Q, Kariyawasam, H, Su, S, De Paoli-Iseppi, R, Prawer, YDJ, Clark, MB, Breslin, K, Iminitoff, M, Blewitt, ME, Law, CW, and Ritchie, ME
- Abstract
Application of Oxford Nanopore Technologies' long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging due to the high sequence error and small library sizes, which decreases quantification accuracy and reduces power for statistical testing. Here, we report the analysis of two nanopore RNA-seq datasets with the goal of obtaining gene- and isoform-level differential expression information. A dataset of synthetic, spliced, spike-in RNAs ('sequins') as well as a mouse neural stem cell dataset from samples with a null mutation of the epigenetic regulator Smchd1 was analysed using a mix of long-read specific tools for preprocessing together with established short-read RNA-seq methods for downstream analysis. We used limma-voom to perform differential gene expression analysis, and the novel FLAMES pipeline to perform isoform identification and quantification, followed by DRIMSeq and limma-diffSplice (with stageR) to perform differential transcript usage analysis. We compared results from the sequins dataset to the ground truth, and results of the mouse dataset to a previous short-read study on equivalent samples. Overall, our work shows that transcriptomic analysis of long-read nanopore data using long-read specific preprocessing methods together with short-read differential expression methods and software that are already in wide use can yield meaningful results.
- Published
- 2021
16. Community-wide hackathons to identify central themes in single-cell multi-omics (vol 22, 220, 2021)
- Author
-
Cao, K-AL, Abadi, AJ, Davis-Marcisak, EF, Hsu, L, Arora, A, Coullomb, A, Deshpande, A, Feng, Y, Jeganathan, P, Loth, M, Meng, C, Mu, W, Pancaldi, V, Sankaran, K, Righelli, D, Singh, A, Sodicoff, JS, Stein-O'Brien, GL, Subramanian, A, Welch, JD, You, Y, Argelaguet, R, Carey, VJ, Dries, R, Greene, CS, Holmes, S, Love, MI, Ritchie, ME, Yuan, G-C, Culhane, AC, Fertig, E, Cao, K-AL, Abadi, AJ, Davis-Marcisak, EF, Hsu, L, Arora, A, Coullomb, A, Deshpande, A, Feng, Y, Jeganathan, P, Loth, M, Meng, C, Mu, W, Pancaldi, V, Sankaran, K, Righelli, D, Singh, A, Sodicoff, JS, Stein-O'Brien, GL, Subramanian, A, Welch, JD, You, Y, Argelaguet, R, Carey, VJ, Dries, R, Greene, CS, Holmes, S, Love, MI, Ritchie, ME, Yuan, G-C, Culhane, AC, and Fertig, E
- Published
- 2021
17. Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing
- Author
-
Tian, L, Jabbari, JS, Thijssen, R, Gouil, Q, Amarasinghe, SL, Voogd, O, Kariyawasam, H, Du, MRM, Schuster, J, Wang, C, Su, S, Dong, X, Law, CW, Lucattini, A, Prawer, YDJ, Collar-Fernandez, C, Chung, JD, Naim, T, Chan, A, Ly, CH, Lynch, GS, Ryall, JG, Anttila, CJA, Peng, H, Anderson, MA, Flensburg, C, Majewski, I, Roberts, AW, Huang, DCS, Clark, MB, Ritchie, ME, Tian, L, Jabbari, JS, Thijssen, R, Gouil, Q, Amarasinghe, SL, Voogd, O, Kariyawasam, H, Du, MRM, Schuster, J, Wang, C, Su, S, Dong, X, Law, CW, Lucattini, A, Prawer, YDJ, Collar-Fernandez, C, Chung, JD, Naim, T, Chan, A, Ly, CH, Lynch, GS, Ryall, JG, Anttila, CJA, Peng, H, Anderson, MA, Flensburg, C, Majewski, I, Roberts, AW, Huang, DCS, Clark, MB, and Ritchie, ME
- Abstract
A modified Chromium 10x droplet-based protocol that subsamples cells for both short-read and long-read (nanopore) sequencing together with a new computational pipeline (FLAMES) is developed to enable isoform discovery, splicing analysis, and mutation detection in single cells. We identify thousands of unannotated isoforms and find conserved functional modules that are enriched for alternative transcript usage in different cell types and species, including ribosome biogenesis and mRNA splicing. Analysis at the transcript level allows data integration with scATAC-seq on individual promoters, improved correlation with protein expression data, and linked mutations known to confer drug resistance to transcriptome heterogeneity.
- Published
- 2021
18. Community-wide hackathons to identify central themes in single-cell multi-omics
- Author
-
Cao, K-AL, Abadi, AJ, Davis-Marcisak, EF, Hsu, L, Arora, A, Coullomb, A, Deshpande, A, Feng, Y, Jeganathan, P, Loth, M, Meng, C, Mu, W, Pancaldi, V, Sankaran, K, Singh, A, Sodicoff, JS, Stein-O'Brien, GL, Subramanian, A, Welch, JD, You, Y, Argelaguet, R, Carey, VJ, Dries, R, Greene, CS, Holmes, S, Love, M, Ritchie, ME, Yuan, G-C, Culhane, AC, Fertig, E, Cao, K-AL, Abadi, AJ, Davis-Marcisak, EF, Hsu, L, Arora, A, Coullomb, A, Deshpande, A, Feng, Y, Jeganathan, P, Loth, M, Meng, C, Mu, W, Pancaldi, V, Sankaran, K, Singh, A, Sodicoff, JS, Stein-O'Brien, GL, Subramanian, A, Welch, JD, You, Y, Argelaguet, R, Carey, VJ, Dries, R, Greene, CS, Holmes, S, Love, M, Ritchie, ME, Yuan, G-C, Culhane, AC, and Fertig, E
- Published
- 2021
19. Benchmarking UMI-based single-cell RNA-seq preprocessing workflows
- Author
-
You, Y, Tian, L, Su, S, Dong, X, Jabbari, JS, Hickey, PF, Ritchie, ME, You, Y, Tian, L, Su, S, Dong, X, Jabbari, JS, Hickey, PF, and Ritchie, ME
- Abstract
BACKGROUND: Single-cell RNA-sequencing (scRNA-seq) technologies and associated analysis methods have rapidly developed in recent years. This includes preprocessing methods, which assign sequencing reads to genes to create count matrices for downstream analysis. While several packaged preprocessing workflows have been developed to provide users with convenient tools for handling this process, how they compare to one another and how they influence downstream analysis have not been well studied. RESULTS: Here, we systematically benchmark the performance of 10 end-to-end preprocessing workflows (Cell Ranger, Optimus, salmon alevin, alevin-fry, kallisto bustools, dropSeqPipe, scPipe, zUMIs, celseq2, and scruff) using datasets yielding different biological complexity levels generated by CEL-Seq2 and 10x Chromium platforms. We compare these workflows in terms of their quantification properties directly and their impact on normalization and clustering by evaluating the performance of different method combinations. While the scRNA-seq preprocessing workflows compared vary in their detection and quantification of genes across datasets, after downstream analysis with performant normalization and clustering methods, almost all combinations produce clustering results that agree well with the known cell type labels that provided the ground truth in our analysis. CONCLUSIONS: In summary, the choice of preprocessing method was found to be less important than other steps in the scRNA-seq analysis process. Our study comprehensively compares common scRNA-seq preprocessing workflows and summarizes their characteristics to guide workflow users.
- Published
- 2021
20. NanoMethViz: An R/Bioconductor package for visualizing long-read methylation data
- Author
-
Schneidman-Duhovny, D, Su, S, Gouil, Q, Blewitt, ME, Cook, D, Hickey, PF, Ritchie, ME, Schneidman-Duhovny, D, Su, S, Gouil, Q, Blewitt, ME, Cook, D, Hickey, PF, and Ritchie, ME
- Abstract
A key benefit of long-read nanopore sequencing technology is the ability to detect modified DNA bases, such as 5-methylcytosine. The lack of R/Bioconductor tools for the effective visualization of nanopore methylation profiles between samples from different experimental groups led us to develop the NanoMethViz R package. Our software can handle methylation output generated from a range of different methylation callers and manages large datasets using a compressed data format. To fully explore the methylation patterns in a dataset, NanoMethViz allows plotting of data at various resolutions. At the sample-level, we use dimensionality reduction to look at the relationships between methylation profiles in an unsupervised way. We visualize methylation profiles of classes of features such as genes or CpG islands by scaling them to relative positions and aggregating their profiles. At the finest resolution, we visualize methylation patterns across individual reads along the genome using the spaghetti plot and heatmaps, allowing users to explore particular genes or genomic regions of interest. In summary, our software makes the handling of methylation signal more convenient, expands upon the visualization options for nanopore data and works seamlessly with existing methylation analysis tools available in the Bioconductor project. Our software is available at https://bioconductor.org/packages/NanoMethViz.
- Published
- 2021
21. long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data
- Author
-
Amarasinghe, SL, Ritchie, ME, Gouil, Q, Amarasinghe, SL, Ritchie, ME, and Gouil, Q
- Abstract
BACKGROUND: The data produced by long-read third-generation sequencers have unique characteristics compared to short-read sequencing data, often requiring tailored analysis tools for tasks ranging from quality control to downstream processing. The rapid growth in software that addresses these challenges for different genomics applications is difficult to keep track of, which makes it hard for users to choose the most appropriate tool for their analysis goal and for developers to identify areas of need and existing solutions to benchmark against. FINDINGS: We describe the implementation of long-read-tools.org, an open-source database that organizes the rapidly expanding collection of long-read data analysis tools and allows its exploration through interactive browsing and filtering. The current database release contains 478 tools across 32 categories. Most tools are developed in Python, and the most frequent analysis tasks include base calling, de novo assembly, error correction, quality checking/filtering, and isoform detection, while long-read single-cell data analysis and transcriptomics are areas with the fewest tools available. CONCLUSION: Continued growth in the application of long-read sequencing in genomics research positions the long-read-tools.org database as an essential resource that allows researchers to keep abreast of both established and emerging software to help guide the selection of the most relevant tool for their analysis needs.
- Published
- 2021
22. Modulation of Vagal Sensory Neurons via High Mobility Group Box-1 and Receptor for Advanced Glycation End Products: Implications for Respiratory Viral Infections
- Author
-
Mazzone, SB, Yang, S-K, Keller, JA, Simanauskaite, J, Arikkatt, J, Fogarty, MJ, Moe, AAK, Chen, C, Trewella, MW, Tian, L, Ritchie, ME, Chua, BY, Phipps, S, Short, KR, McGovern, AE, Mazzone, SB, Yang, S-K, Keller, JA, Simanauskaite, J, Arikkatt, J, Fogarty, MJ, Moe, AAK, Chen, C, Trewella, MW, Tian, L, Ritchie, ME, Chua, BY, Phipps, S, Short, KR, and McGovern, AE
- Abstract
Vagal sensory neurons contribute to the symptoms and pathogenesis of inflammatory pulmonary diseases through processes that involve changes to their morphological and functional characteristics. The alarmin high mobility group box-1 (HMGB1) is an early mediator of pulmonary inflammation and can have actions on neurons in a range of inflammatory settings. We hypothesized that HMGB1 can regulate the growth and function of vagal sensory neurons and we set out to investigate this and the mechanisms involved. Culturing primary vagal sensory neurons from wildtype mice in the presence of HMGB1 significantly increased neurite outgrowth, while acute application of HMGB1 to isolated neurons under patch clamp electrophysiological investigation produced inward currents and enhanced action potential firing. Transcriptional analyses revealed the expression of the cognate HMGB1 receptors, Receptor for Advanced Glycation End products (RAGE) and Toll-like Receptor 4 (TLR4), in subsets of vagal sensory neurons. HMGB1-evoked growth and electrophysiological responses were significantly reduced in primary vagal sensory neurons harvested from RAGE deficient mice and completely absent in neurons from RAGE/TLR4 double deficient mice. Immunohistochemical analysis of vagal sensory neurons collected from mice after intranasal infection with murine pneumovirus or influenza A virus (IAV), or after intratracheal administration with the viral mimetic PolyI:C, revealed a significant increase in nuclear-to-cytoplasm translocation of HMGB1 compared to mock-inoculated mice. Neurons cultured from virus infected wildtype mice displayed a significant increase in neurite outgrowth, which was not observed for neurons from virus infected RAGE or RAGE/TLR4 deficient mice. These data suggest that HMGB1 can enhance vagal sensory neuron growth and excitability, acting primarily via sensory neuron RAGE. Activation of the HMGB1-RAGE axis in vagal sensory neurons could be an important mechanism leading to vagal h
- Published
- 2021
23. Covering all your bases: incorporating intron signal from RNA-seq data
- Author
-
Lee, S, Zhang, AY, Su, S, Ng, AP, Holik, AZ, Asselin-Labat, M-L, Ritchie, ME, Law, CW, Lee, S, Zhang, AY, Su, S, Ng, AP, Holik, AZ, Asselin-Labat, M-L, Ritchie, ME, and Law, CW
- Abstract
RNA-seq datasets can contain millions of intron reads per library that are typically removed from downstream analysis. Only reads overlapping annotated exons are considered to be informative since mature mRNA is assumed to be the major component sequenced, especially for poly(A) RNA libraries. In this study, we show that intron reads are informative, and through exploratory data analysis of read coverage that intron signal is representative of both pre-mRNAs and intron retention. We demonstrate how intron reads can be utilized in differential expression analysis using our index method where a unique set of differentially expressed genes can be detected using intron counts. In exploring read coverage, we also developed the superintronic software that quickly and robustly calculates user-defined summary statistics for exonic and intronic regions. Across multiple datasets, superintronic enabled us to identify several genes with distinctly retained introns that had similar coverage levels to that of neighbouring exons. The work and ideas presented in this paper is the first of its kind to consider multiple biological sources for intron reads through exploratory data analysis, minimizing bias in discovery and interpretation of results. Our findings open up possibilities for further methods development for intron reads and RNA-seq data in general.
- Published
- 2020
24. Smchd1 is a maternal effect gene required for genomic imprinting
- Author
-
Wanigasuriya, I, Gouil, Q, Kinkel, SA, del Fierro, AT, Beck, T, Roper, EA, Breslin, K, Stringer, J, Hutt, K, Lee, HJ, Keniry, A, Ritchie, ME, Blewitt, ME, Wanigasuriya, I, Gouil, Q, Kinkel, SA, del Fierro, AT, Beck, T, Roper, EA, Breslin, K, Stringer, J, Hutt, K, Lee, HJ, Keniry, A, Ritchie, ME, and Blewitt, ME
- Abstract
Genomic imprinting establishes parental allele-biased expression of a suite of mammalian genes based on parent-of-origin specific epigenetic marks. These marks are under the control of maternal effect proteins supplied in the oocyte. Here we report epigenetic repressor Smchd1 as a novel maternal effect gene that regulates the imprinted expression of ten genes in mice. We also found zygotic SMCHD1 had a dose-dependent effect on the imprinted expression of seven genes. Together, zygotic and maternal SMCHD1 regulate three classic imprinted clusters and eight other genes, including non-canonical imprinted genes. Interestingly, the loss of maternal SMCHD1 does not alter germline DNA methylation imprints pre-implantation or later in gestation. Instead, what appears to unite most imprinted genes sensitive to SMCHD1 is their reliance on polycomb-mediated methylation as germline or secondary imprints, therefore we propose that SMCHD1 acts downstream of polycomb imprints to mediate its function.
- Published
- 2020
25. A guide to creating design matrices for gene expression experiments.
- Author
-
Law, CW, Zeglinski, K, Dong, X, Alhamdoosh, M, Smyth, GK, Ritchie, ME, Law, CW, Zeglinski, K, Dong, X, Alhamdoosh, M, Smyth, GK, and Ritchie, ME
- Abstract
Differential expression analysis of genomic data types, such as RNA-sequencing experiments, use linear models to determine the size and direction of the changes in gene expression. For RNA-sequencing, there are several established software packages for this purpose accompanied with analysis pipelines that are well described. However, there are two crucial steps in the analysis process that can be a stumbling block for many -- the set up an appropriate model via design matrices and the set up of comparisons of interest via contrast matrices. These steps are particularly troublesome because an extensive catalogue for design and contrast matrices does not currently exist. One would usually search for example case studies across different platforms and mix and match the advice from those sources to suit the dataset they have at hand. This article guides the reader through the basics of how to set up design and contrast matrices. We take a practical approach by providing code and graphical representation of each case study, starting with simpler examples (e.g. models with a single explanatory variable) and move onto more complex ones (e.g. interaction models, mixed effects models, higher order time series and cyclical models). Although our work has been written specifically with a limma-style pipeline in mind, most of it is also applicable to other software packages for differential expression analysis, and the ideas covered can be adapted to data analysis of other high-throughput technologies. Where appropriate, we explain the interpretation and differences between models to aid readers in their own model choices. Unnecessary jargon and theory is omitted where possible so that our work is accessible to a wide audience of readers, from beginners to those with experience in genomics data analysis.
- Published
- 2020
26. Unique properties of a subset of human pluripotent stem cells with high capacity for self-renewal
- Author
-
Lau, KX, Mason, EA, Kie, J, De Souza, DP, Kloehn, J, Tull, D, McConville, MJ, Keniry, A, Beck, T, Blewitt, ME, Ritchie, ME, Naik, SH, Zalcenstein, D, Korn, O, Su, S, Romero, IG, Spruce, C, Baker, CL, McGarr, TC, Wells, CA, Pera, MF, Lau, KX, Mason, EA, Kie, J, De Souza, DP, Kloehn, J, Tull, D, McConville, MJ, Keniry, A, Beck, T, Blewitt, ME, Ritchie, ME, Naik, SH, Zalcenstein, D, Korn, O, Su, S, Romero, IG, Spruce, C, Baker, CL, McGarr, TC, Wells, CA, and Pera, MF
- Abstract
Archetypal human pluripotent stem cells (hPSC) are widely considered to be equivalent in developmental status to mouse epiblast stem cells, which correspond to pluripotent cells at a late post-implantation stage of embryogenesis. Heterogeneity within hPSC cultures complicates this interspecies comparison. Here we show that a subpopulation of archetypal hPSC enriched for high self-renewal capacity (ESR) has distinct properties relative to the bulk of the population, including a cell cycle with a very low G1 fraction and a metabolomic profile that reflects a combination of oxidative phosphorylation and glycolysis. ESR cells are pluripotent and capable of differentiation into primordial germ cell-like cells. Global DNA methylation levels in the ESR subpopulation are lower than those in mouse epiblast stem cells. Chromatin accessibility analysis revealed a unique set of open chromatin sites in ESR cells. RNA-seq at the subpopulation and single cell levels shows that, unlike mouse epiblast stem cells, the ESR subset of hPSC displays no lineage priming, and that it can be clearly distinguished from gastrulating and extraembryonic cell populations in the primate embryo. ESR hPSC correspond to an earlier stage of post-implantation development than mouse epiblast stem cells.
- Published
- 2020
27. CellBench: R/Bioconductor software for comparing single-cell RNA-seq analysis methods
- Author
-
Gorodkin, J, Su, S, Tian, L, Dong, X, Hickey, PF, Freytag, S, Ritchie, ME, Gorodkin, J, Su, S, Tian, L, Dong, X, Hickey, PF, Freytag, S, and Ritchie, ME
- Abstract
MOTIVATION: Bioinformatic analysis of single-cell gene expression data is a rapidly evolving field. Hundreds of bespoke methods have been developed in the past few years to deal with various aspects of single-cell analysis and consensus on the most appropriate methods to use under different settings is still emerging. Benchmarking the many methods is therefore of critical importance and since analysis of single-cell data usually involves multi-step pipelines, effective evaluation of pipelines involving different combinations of methods is required. Current benchmarks of single-cell methods are mostly implemented with ad-hoc code that is often difficult to reproduce or extend, and exhaustive manual coding of many combinations is infeasible in most instances. Therefore, new software is needed to manage pipeline benchmarking. RESULTS: The CellBench R software facilitates method comparisons in either a task-centric or combinatorial way to allow pipelines of methods to be evaluated in an effective manner. CellBench automatically runs combinations of methods, provides facilities for measuring running time and delivers output in tabular form which is highly compatible with tidyverse R packages for summary and visualization. Our software has enabled comprehensive benchmarking of single-cell RNA-seq normalization, imputation, clustering, trajectory analysis and data integration methods using various performance metrics obtained from data with available ground truth. CellBench is also amenable to benchmarking other bioinformatics analysis tasks. AVAILABILITY AND IMPLEMENTATION: Available from https://bioconductor.org/packages/CellBench.
- Published
- 2020
28. Targeting triple-negative breast cancers with the Smac-mimetic birinapant
- Author
-
Lalaoui, N, Merino, D, Giner, G, Vaillant, F, Chau, D, Liu, L, Kratina, T, Pal, B, Whittle, JR, Etemadi, N, Berthelet, J, Grasel, J, Hall, C, Ritchie, ME, Ernst, M, Smyth, GK, Vaux, DL, Visvader, JE, Lindeman, GJ, Silke, J, Lalaoui, N, Merino, D, Giner, G, Vaillant, F, Chau, D, Liu, L, Kratina, T, Pal, B, Whittle, JR, Etemadi, N, Berthelet, J, Grasel, J, Hall, C, Ritchie, ME, Ernst, M, Smyth, GK, Vaux, DL, Visvader, JE, Lindeman, GJ, and Silke, J
- Abstract
Smac mimetics target inhibitor of apoptosis (IAP) proteins, thereby suppressing their function to facilitate tumor cell death. Here we have evaluated the efficacy of the preclinical Smac-mimetic compound A and the clinical lead birinapant on breast cancer cells. Both exhibited potent in vitro activity in triple-negative breast cancer (TNBC) cells, including those from patient-derived xenograft (PDX) models. Birinapant was further studied using in vivo PDX models of TNBC and estrogen receptor-positive (ER+) breast cancer. Birinapant exhibited single agent activity in all TNBC PDX models and augmented response to docetaxel, the latter through induction of TNF. Transcriptomic analysis of TCGA datasets revealed that genes encoding mediators of Smac-mimetic-induced cell death were expressed at higher levels in TNBC compared with ER+ breast cancer, resulting in a molecular signature associated with responsiveness to Smac mimetics. In addition, the cell death complex was preferentially formed in TNBCs versus ER+ cells in response to Smac mimetics. Taken together, our findings provide a rationale for prospectively selecting patients whose breast tumors contain a competent death receptor signaling pathway for the further evaluation of birinapant in the clinic.
- Published
- 2020
29. Germline heterozygous mutations in Nxf1 perturb RNA metabolism and trigger thrombocytopenia and lymphopenia in mice
- Author
-
Chappaz, S, Law, CW, Dowling, MR, Carey, KT, Lane, RM, Ngo, LH, Wickramasinghe, VO, Smyth, GK, Ritchie, ME, Kile, BT, Chappaz, S, Law, CW, Dowling, MR, Carey, KT, Lane, RM, Ngo, LH, Wickramasinghe, VO, Smyth, GK, Ritchie, ME, and Kile, BT
- Abstract
In eukaryotic cells, messenger RNA (mRNA) molecules are exported from the nucleus to the cytoplasm, where they are translated. The highly conserved protein nuclear RNA export factor1 (Nxf1) is an important mediator of this process. Although studies in yeast and in human cell lines have shed light on the biochemical mechanisms of Nxf1 function, its contribution to mammalian physiology is less clear. Several groups have identified recurrent NXF1 mutations in chronic lymphocytic leukemia (CLL), placing it alongside several RNA-metabolism factors (including SF3B1, XPO, RPS15) whose dysregulation is thought to contribute to CLL pathogenesis. We report here an allelic series of germline point mutations in murine Nxf1. Mice heterozygous for these loss-of-function Nxf1 mutations exhibit thrombocytopenia and lymphopenia, together with milder hematological defects. This is primarily caused by cell-intrinsic defects in the survival of platelets and peripheral lymphocytes, which are sensitized to intrinsic apoptosis. In contrast, Nxf1 mutations have almost no effect on red blood cell homeostasis. Comparative transcriptome analysis of platelets, lymphocytes, and erythrocytes from Nxf1-mutant mice shows that, in response to impaired Nxf1 function, the cytoplasmic representation of transcripts encoding regulators of RNA metabolism is altered in a unique, lineage-specific way. Thus, blood cell lineages exhibit differential requirements for Nxf1-mediated global mRNA export.
- Published
- 2020
30. Opportunities and challenges in long-read sequencing data analysis
- Author
-
Amarasinghe, SL, Su, S, Dong, X, Zappia, L, Ritchie, ME, Gouil, Q, Amarasinghe, SL, Su, S, Dong, X, Zappia, L, Ritchie, ME, and Gouil, Q
- Abstract
Long-read technologies are overcoming early limitations in accuracy and throughput, broadening their application domains in genomics. Dedicated analysis tools that take into account the characteristics of long-read data are thus required, but the fast pace of development of such tools can be overwhelming. To assist in the design and analysis of long-read sequencing projects, we review the current landscape of available tools and present an online interactive database, long-read-tools.org, to facilitate their browsing. We further focus on the principles of error correction, base modification detection, and long-read transcriptomics analysis and highlight the challenges that remain.
- Published
- 2020
31. The EMT modulator SNAI1 contributes to AML pathogenesis via its interaction with LSD1.
- Author
-
Carmichael, CL, Wang, J, Nguyen, T, Kolawole, O, Benyoucef, A, De Mazière, C, Milne, AR, Samuel, S, Gillinder, K, Hediyeh-Zadeh, S, Vo, ANQ, Huang, Y, Knezevic, K, McInnes, WRL, Shields, BJ, Mitchell, H, Ritchie, ME, Lammens, T, Lintermans, B, Van Vlierberghe, P, Wong, NC, Haigh, K, Thoms, JAI, Toulmin, E, Curtis, DJ, Oxley, EP, Dickins, RA, Beck, D, Perkins, A, McCormack, MP, Davis, MJ, Berx, G, Zuber, J, Pimanda, JE, Kile, BT, Goossens, S, Haigh, JJ, Carmichael, CL, Wang, J, Nguyen, T, Kolawole, O, Benyoucef, A, De Mazière, C, Milne, AR, Samuel, S, Gillinder, K, Hediyeh-Zadeh, S, Vo, ANQ, Huang, Y, Knezevic, K, McInnes, WRL, Shields, BJ, Mitchell, H, Ritchie, ME, Lammens, T, Lintermans, B, Van Vlierberghe, P, Wong, NC, Haigh, K, Thoms, JAI, Toulmin, E, Curtis, DJ, Oxley, EP, Dickins, RA, Beck, D, Perkins, A, McCormack, MP, Davis, MJ, Berx, G, Zuber, J, Pimanda, JE, Kile, BT, Goossens, S, and Haigh, JJ
- Abstract
Modulators of epithelial-to-mesenchymal transition (EMT) have recently emerged as novel players in the field of leukemia biology. The mechanisms by which EMT modulators contribute to leukemia pathogenesis, however, remain to be elucidated. Here we show that overexpression of SNAI1, a key modulator of EMT, is a pathologically relevant event in human acute myeloid leukemia (AML) that contributes to impaired differentiation, enhanced self-renewal, and proliferation of immature myeloid cells. We demonstrate that ectopic expression of Snai1 in hematopoietic cells predisposes mice to AML development. This effect is mediated by interaction with the histone demethylase KDM1A/LSD1. Our data shed new light on the role of SNAI1 in leukemia development and identify a novel mechanism of LSD1 corruption in cancer. This is particularly pertinent given the current interest surrounding the use of LSD1 inhibitors in the treatment of multiple different malignancies, including AML.
- Published
- 2020
32. Distinct initiating events underpin the immune and metabolic heterogeneity of KRAS-mutant lung adenocarcinoma
- Author
-
Best, SA, Ding, S, Kersbergen, A, Dong, X, Song, J-Y, Xie, Y, Reljic, B, Li, K, Vince, JE, Rathi, V, Wright, GM, Ritchie, ME, Sutherland, KD, Best, SA, Ding, S, Kersbergen, A, Dong, X, Song, J-Y, Xie, Y, Reljic, B, Li, K, Vince, JE, Rathi, V, Wright, GM, Ritchie, ME, and Sutherland, KD
- Abstract
The KRAS oncoprotein, a critical driver in 33% of lung adenocarcinoma (LUAD), has remained an elusive clinical target due to its perceived undruggable nature. The identification of dependencies borne through common co-occurring mutations are sought to more effectively target KRAS-mutant lung cancer. Approximately 20% of KRAS-mutant LUAD carry loss-of-function mutations in KEAP1, a negative regulator of the antioxidant response transcription factor NFE2L2/NRF2. We demonstrate that Keap1-deficient KrasG12D lung tumors arise from a bronchiolar cell-of-origin, lacking pro-tumorigenic macrophages observed in tumors originating from alveolar cells. Keap1 loss activates the pentose phosphate pathway, inhibition of which, using 6-AN, abrogated tumor growth. These studies highlight alternative therapeutic approaches to specifically target this unique subset of KRAS-mutant LUAD cancers.
- Published
- 2019
33. Using long-read sequencing to detect imprinted DNA methylation
- Author
-
Gigante, S, Gouil, Q, Lucattini, A, Keniry, A, Beck, T, Tinning, M, Gordon, L, Woodruff, C, Speed, TP, Blewitt, ME, Ritchie, ME, Gigante, S, Gouil, Q, Lucattini, A, Keniry, A, Beck, T, Tinning, M, Gordon, L, Woodruff, C, Speed, TP, Blewitt, ME, and Ritchie, ME
- Abstract
Systematic variation in the methylation of cytosines at CpG sites plays a critical role in early development of humans and other mammals. Of particular interest are regions of differential methylation between parental alleles, as these often dictate monoallelic gene expression, resulting in parent of origin specific control of the embryonic transcriptome and subsequent development, in a phenomenon known as genomic imprinting. Using long-read nanopore sequencing we show that, with an average genomic coverage of ∼10, it is possible to determine both the level of methylation of CpG sites and the haplotype from which each read arises. The long-read property is exploited to characterize, using novel methods, both methylation and haplotype for reads that have reduced basecalling precision compared to Sanger sequencing. We validate the analysis both through comparison of nanopore-derived methylation patterns with those from Reduced Representation Bisulfite Sequencing data and through comparison with previously reported data. Our analysis successfully identifies known imprinting control regions (ICRs) as well as some novel differentially methylated regions which, due to their proximity to hitherto unknown monoallelically expressed genes, may represent new ICRs.
- Published
- 2019
34. scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data
- Author
-
Pertea, M, Tian, L, Su, S, Dong, X, Amann-Zalcenstein, D, Biben, C, Seidi, A, Hilton, DJ, Naik, SH, Ritchie, ME, Pertea, M, Tian, L, Su, S, Dong, X, Amann-Zalcenstein, D, Biben, C, Seidi, A, Hilton, DJ, Naik, SH, and Ritchie, ME
- Abstract
Single-cell RNA sequencing (scRNA-seq) technology allows researchers to profile the transcriptomes of thousands of cells simultaneously. Protocols that incorporate both designed and random barcodes have greatly increased the throughput of scRNA-seq, but give rise to a more complex data structure. There is a need for new tools that can handle the various barcoding strategies used by different protocols and exploit this information for quality assessment at the sample-level and provide effective visualization of these results in preparation for higher-level analyses. To this end, we developed scPipe, an R/Bioconductor package that integrates barcode demultiplexing, read alignment, UMI-aware gene-level quantification and quality control of raw sequencing data generated by multiple protocols that include CEL-seq, MARS-seq, Chromium 10X, Drop-seq and Smart-seq. scPipe produces a count matrix that is essential for downstream analysis along with an HTML report that summarises data quality. These results can be used as input for downstream analyses including normalization, visualization and statistical testing. scPipe performs this processing in a few simple R commands, promoting reproducible analysis of single-cell data that is compatible with the emerging suite of open-source scRNA-seq analysis tools available in R/Bioconductor and beyond. The scPipe R package is available for download from https://www.bioconductor.org/packages/scPipe.
- Published
- 2018
35. Herbivore impact on grassland plant diversity depends on habitat productivity and herbivore size
- Author
-
Bakker, ES, Ritchie, ME, Olff, H, Milchunas, DG, Knops, JMH, Bakker, Elisabeth S., Ritchie, Mark E., Milchunas, Daniel G., Knops, Johannes M.H., Olff group, and Aquatic Ecology (AqE)
- Subjects
Biodiversity ,Grazing ,species-richness ,Body Size ,species richness ,DISTURBANCE ,environments ,disturbance ,Mammals ,fertility ,Ecology ,Agroforestry ,cross-site ,food and beverages ,PE&RC ,plant-animal ,gradient ,Europe ,Habitat ,Productivity (ecology) ,Plantenecologie en Natuurbeheer ,community structure ,COMMUNITY STRUCTURE ,GERMINATION ,RECRUITMENT ,Conservation of Natural Resources ,SEED ,Plant Ecology and Nature Conservation ,Biology ,Poaceae ,SPECIES-RICHNESS ,establishment ,GRADIENT ,Animals ,Ecosystem ,grazing ,Ecology, Evolution, Behavior and Systematics ,Herbivore ,WIMEK ,Diet ,tallgrass prairie ,Habitat destruction ,germination ,recruitment ,Wildlife Ecology and Conservation ,North America ,ESTABLISHMENT ,Species richness ,seed ,TALLGRASS PRAIRIE ,ENVIRONMENTS - Abstract
Mammalian herbivores can have pronounced effects on plant diversity but are currently declining in many productive ecosystems through direct extirpation, habitat loss and fragmentation, while being simultaneously introduced as livestock in other, often unproductive, ecosystems that lacked such species during recent evolutionary times. The biodiversity consequences of these changes are still poorly understood. We experimentally separated the effects of primary productivity and herbivores of different body size on plant species richness across a 10-fold productivity gradient using a 7-year field experiment at seven grassland sites in North America and Europe. We show that assemblages including large herbivores increased plant diversity at higher productivity but decreased diversity at low productivity, while small herbivores did not have consistent effects along the productivity gradient. The recognition of these large-scale, cross-site patterns in herbivore effects is important for the development of appropriate biodiversity conservation strategies. [KEYWORDS: Cross-site ; fertility ; grazing ; plant–animal ;species richness]
- Published
- 2006
36. Glimma: interactive graphics for gene expression analysis
- Author
-
Berger, B, Su, S, Law, CW, Ah-Cann, C, Asselin-Labat, M-L, Blewitt, ME, Ritchie, ME, Berger, B, Su, S, Law, CW, Ah-Cann, C, Asselin-Labat, M-L, Blewitt, ME, and Ritchie, ME
- Abstract
MOTIVATION: graphics for RNA-sequencing and microarray gene expression analyses may contain upwards of tens of thousands of points. Details about certain genes or samples of interest are easily obscured in such dense summary displays. Incorporating interactivity into summary plots would enable additional information to be displayed on demand and facilitate intuitive data exploration. RESULTS: The open-source Glimma package creates interactive graphics for exploring gene expression analysis with a few simple R commands. It extends popular plots found in the limma package, such as multi-dimensional scaling plots and mean-difference plots, to allow individual data points to be queried and additional annotation information to be displayed upon hovering or selecting particular points. It also offers links between plots so that more information can be revealed on demand. Glimma is widely applicable, supporting data analyses from a number of well-established Bioconductor workflows ( limma , edgeR and DESeq2 ) and uses D3/JavaScript to produce HTML pages with interactive displays that enable more effective data exploration by end-users. Results from Glimma can be easily shared between bioinformaticians and biologists, enhancing reporting capabilities while maintaining reproducibility. AVAILABILITY AND IMPLEMENTATION: The Glimma R package is available from http://bioconductor.org/packages/Glimma/ . CONTACT: su.s@wehi.edu.au , law@wehi.edu.au or mritchie@wehi.edu.au.
- Published
- 2017
37. Easy and efficient ensemble gene set testing with EGSEA.
- Author
-
Alhamdoosh, M, Law, CW, Tian, L, Sheridan, JM, Ng, M, Ritchie, ME, Alhamdoosh, M, Law, CW, Tian, L, Sheridan, JM, Ng, M, and Ritchie, ME
- Abstract
Gene set enrichment analysis is a popular approach for prioritising the biological processes perturbed in genomic datasets. The Bioconductor project hosts over 80 software packages capable of gene set analysis. Most of these packages search for enriched signatures amongst differentially regulated genes to reveal higher level biological themes that may be missed when focusing only on evidence from individual genes. With so many different methods on offer, choosing the best algorithm and visualization approach can be challenging. The EGSEA package solves this problem by combining results from up to 12 prominent gene set testing algorithms to obtain a consensus ranking of biologically relevant results.This workflow demonstrates how EGSEA can extend limma-based differential expression analyses for RNA-seq and microarray data using experiments that profile 3 distinct cell populations important for studying the origins of breast cancer. Following data normalization and set-up of an appropriate linear model for differential expression analysis, EGSEA builds gene signature specific indexes that link a wide range of mouse or human gene set collections obtained from MSigDB, GeneSetDB and KEGG to the gene expression data being investigated. EGSEA is then configured and the ensemble enrichment analysis run, returning an object that can be queried using several S4 methods for ranking gene sets and visualizing results via heatmaps, KEGG pathway views, GO graphs, scatter plots and bar plots. Finally, an HTML report that combines these displays can fast-track the sharing of results with collaborators, and thus expedite downstream biological validation. EGSEA is simple to use and can be easily integrated with existing gene expression analysis pipelines for both human and mouse data.
- Published
- 2017
38. Combining multiple tools outperforms individual methods in gene set enrichment analyses
- Author
-
Birol, I, Alhamdoosh, M, Ng, M, Wilson, NJ, Sheridan, JM, Huynh, H, Wilson, MJ, Ritchie, ME, Birol, I, Alhamdoosh, M, Ng, M, Wilson, NJ, Sheridan, JM, Huynh, H, Wilson, MJ, and Ritchie, ME
- Abstract
MOTIVATION: Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become increasingly difficult for researchers to select an optimal GSE tool based on their particular dataset. Moreover, the majority of GSE analysis methods do not allow researchers to simultaneously compare gene set level results between multiple experimental conditions. RESULTS: The ensemble of genes set enrichment analyses (EGSEA) is a method developed for RNA-sequencing data that combines results from twelve algorithms and calculates collective gene set scores to improve the biological relevance of the highest ranked gene sets. EGSEA's gene set database contains around 25 000 gene sets from sixteen collections. It has multiple visualization capabilities that allow researchers to view gene sets at various levels of granularity. EGSEA has been tested on simulated data and on a number of human and mouse datasets and, based on biologists' feedback, consistently outperforms the individual tools that have been combined. Our evaluation demonstrates the superiority of the ensemble approach for GSE analysis, and its utility to effectively and efficiently extrapolate biological functions and potential involvement in disease processes from lists of differentially regulated genes. AVAILABILITY AND IMPLEMENTATION: EGSEA is available as an R package at http://www.bioconductor.org/packages/EGSEA/ . The gene sets collections are available in the R package EGSEAdata from http://www.bioconductor.org/packages/EGSEAdata/ . CONTACTS: monther.alhamdoosh@csl.com.au mritchie@wehi.edu.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- Published
- 2017
39. RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
- Author
-
Holik, AZ, Law, CW, Liu, R, Wang, Z, Wang, W, Ahn, J, Asselin-Labat, M-L, Smyth, GK, Ritchie, ME, Holik, AZ, Law, CW, Liu, R, Wang, Z, Wang, W, Ahn, J, Asselin-Labat, M-L, Smyth, GK, and Ritchie, ME
- Abstract
Carefully designed control experiments provide a gold standard for benchmarking different genomics research tools. A shortcoming of many gene expression control studies is that replication involves profiling the same reference RNA sample multiple times. This leads to low, pure technical noise that is atypical of regular studies. To achieve a more realistic noise structure, we generated a RNA-sequencing mixture experiment using two cell lines of the same cancer type. Variability was added by extracting RNA from independent cell cultures and degrading particular samples. The systematic gene expression changes induced by this design allowed benchmarking of different library preparation kits (standard poly-A versus total RNA with Ribozero depletion) and analysis pipelines. Data generated using the total RNA kit had more signal for introns and various RNA classes (ncRNA, snRNA, snoRNA) and less variability after degradation. For differential expression analysis, voom with quality weights marginally outperformed other popular methods, while for differential splicing, DEXSeq was simultaneously the most sensitive and the most inconsistent method. For sample deconvolution analysis, DeMix outperformed IsoPure convincingly. Our RNA-sequencing data set provides a valuable resource for benchmarking different protocols and data pre-processing workflows. The extra noise mimics routine lab experiments more closely, ensuring any conclusions are widely applicable.
- Published
- 2017
40. Setdb1-mediated H3K9 methylation is enriched on the inactive X and plays a role in its epigenetic silencing
- Author
-
Keniry, A, Gearing, LJ, Jansz, N, Liu, J, Holik, AZ, Hickey, PF, Kinkel, SA, Moore, DL, Breslin, K, Chen, K, Liu, R, Phillips, C, Pakusch, M, Biben, C, Sheridan, JM, Kile, BT, Carmichael, C, Ritchie, ME, Hilton, DJ, Blewitt, ME, Keniry, A, Gearing, LJ, Jansz, N, Liu, J, Holik, AZ, Hickey, PF, Kinkel, SA, Moore, DL, Breslin, K, Chen, K, Liu, R, Phillips, C, Pakusch, M, Biben, C, Sheridan, JM, Kile, BT, Carmichael, C, Ritchie, ME, Hilton, DJ, and Blewitt, ME
- Abstract
BACKGROUND: The presence of histone 3 lysine 9 (H3K9) methylation on the mouse inactive X chromosome has been controversial over the last 15 years, and the functional role of H3K9 methylation in X chromosome inactivation in any species has remained largely unexplored. RESULTS: Here we report the first genomic analysis of H3K9 di- and tri-methylation on the inactive X: we find they are enriched at the intergenic, gene poor regions of the inactive X, interspersed between H3K27 tri-methylation domains found in the gene dense regions. Although H3K9 methylation is predominantly non-genic, we find that depletion of H3K9 methylation via depletion of H3K9 methyltransferase Set domain bifurcated 1 (Setdb1) during the establishment of X inactivation, results in failure of silencing for around 150 genes on the inactive X. By contrast, we find a very minor role for Setdb1-mediated H3K9 methylation once X inactivation is fully established. In addition to failed gene silencing, we observed a specific failure to silence X-linked long-terminal repeat class repetitive elements. CONCLUSIONS: Here we have shown that H3K9 methylation clearly marks the murine inactive X chromosome. The role of this mark is most apparent during the establishment phase of gene silencing, with a more muted effect on maintenance of the silent state. Based on our data, we hypothesise that Setdb1-mediated H3K9 methylation plays a role in epigenetic silencing of the inactive X via silencing of the repeats, which itself facilitates gene silencing through alterations to the conformation of the whole inactive X chromosome.
- Published
- 2016
41. Transcriptional profiling of the epigenetic regulator Smchd1
- Author
-
Liu, R, Chen, K, Jansz, N, Blewitt, ME, Ritchie, ME, Liu, R, Chen, K, Jansz, N, Blewitt, ME, and Ritchie, ME
- Abstract
Smchd1 is an epigenetic repressor with important functions in healthy cellular processes and disease. To elucidate its role in transcriptional regulation, we performed two independent genome-wide RNA-sequencing studies comparing wild-type and Smchd1 null samples in neural stem cells and lymphoma cell lines. Using an R-based analysis pipeline that accommodates observational and sample-specific weights in the linear modeling, we identify key genes dysregulated by Smchd1 deletion such as clustered protocadherins in the neural stem cells and imprinted genes in both experiments. Here we provide a detailed description of this analysis, from quality control to read mapping and differential expression analysis. These data sets are publicly available from the Gene Expression Omnibus database (accession numbers GSE64099 and GSE65747).
- Published
- 2016
42. RNA-seq analysis is easy as 1-2-3 with limma, Glimma and edgeR.
- Author
-
Law, CW, Alhamdoosh, M, Su, S, Dong, X, Tian, L, Smyth, GK, Ritchie, ME, Law, CW, Alhamdoosh, M, Su, S, Dong, X, Tian, L, Smyth, GK, and Ritchie, ME
- Abstract
The ability to easily and efficiently analyse RNA-sequencing data is a key strength of the Bioconductor project. Starting with counts summarised at the gene-level, a typical analysis involves pre-processing, exploratory data analysis, differential expression testing and pathway analysis with the results obtained informing future experiments and validation studies. In this workflow article, we analyse RNA-sequencing data from the mouse mammary gland, demonstrating use of the popular edgeR package to import, organise, filter and normalise the data, followed by the limma package with its voom method, linear modelling and empirical Bayes moderation to assess differential expression and perform gene set testing. This pipeline is further enhanced by the Glimma package which enables interactive exploration of the results so that individual samples and genes can be examined by the user. The complete analysis offered by these three packages highlights the ease with which researchers can turn the raw counts from an RNA-sequencing experiment into biological insights using Bioconductor.
- Published
- 2016
43. Loss of PUMA (BBC3) does not prevent thrombocytopenia caused by the loss of BCL-XL (BCL2L1)
- Author
-
Delbridge, ARD, Chappaz, S, Ritchie, ME, Kile, BT, Strasser, A, Grabow, S, Delbridge, ARD, Chappaz, S, Ritchie, ME, Kile, BT, Strasser, A, and Grabow, S
- Abstract
Apoptosis is required to maintain tissue homeostasis in multicellular organisms. Platelets, the anucleate cells that are essential for blood clotting, are a prime example. Their brief life span in the circulation is regulated by the intrinsic apoptosis pathway. Pro-survival BCL-XL (also termed BCL2L1) is essential for platelet viability. It functions to restrain the pro-apoptotic BCL-2 family members BAK (also termed BAK1) and BAX, the essential mediators of intrinsic apoptosis. Genetic deletion or pharmacological inhibition of BCL-XL results in thrombocytopenia. Conversely, deletion of BAK in platelets doubles their circulating life span. However, what triggers platelet apoptosis in vivo remains unclear. The pro-apoptotic BH3-only proteins are essential for initiating apoptosis in nucleated cells, and there is some evidence to suggest they also play a role in platelet biology. We investigated whether PUMA (also termed BBC3), a potent BH3-only protein that can inhibit all pro-survival BCL-2 family members as well as directly activate BAX, regulates the death of platelets. Surprisingly, loss of PUMA had no impact on the loss of platelets caused by loss of BCL-XL. It therefore remains to be established whether other BH3-only proteins play a critical role in induction of apoptosis in platelets or whether their death is controlled solely by the interactions between BCL-XL with BAK and BAX.
- Published
- 2016
44. High concordance between Illumina HiSeq2500 and NextSeq500 for reduced representation bisulfite sequencing (RRBS)
- Author
-
Yin, D, Ritchie, ME, Jabbari, JS, Beck, T, Blewitt, ME, Keniry, A, Yin, D, Ritchie, ME, Jabbari, JS, Beck, T, Blewitt, ME, and Keniry, A
- Abstract
Reduced representation bisulfite sequencing (RRBS) provides an efficient method for measuring DNA methylation at single base resolution in regions of high CpG density. This technique has been extensively tested on the HiSeq2500, which uses a 4-colour detection method, however it is unclear if the method will also work on the NextSeq500 platform, which employs a 2-colour detection system. We created an RRBS library and sequenced it on both the HiSeq2500 and NextSeq500, and found no significant difference in the base composition of reads derived from either machine. Moreover, the methylation calls made from the data of each instrument were highly concordant, with methylation patterns across the genome appearing as expected. Therefore, RRBS can be sequenced on the Nextseq500 with comparable quality to that of the HiSeq2500. All sequencing data are deposited in the GEO database under accession number GSE87097.
- Published
- 2016
45. A pooled shRNA screen for regulators of primary mammary stem and progenitor cells identifies roles for Asap1 and Prox1
- Author
-
Sheridan, JM, Ritchie, ME, Best, SA, Jiang, K, Beck, TJ, Vaillant, F, Liu, K, Dickins, RA, Smyth, GK, Lindeman, GJ, Visvader, JE, Sheridan, JM, Ritchie, ME, Best, SA, Jiang, K, Beck, TJ, Vaillant, F, Liu, K, Dickins, RA, Smyth, GK, Lindeman, GJ, and Visvader, JE
- Abstract
BACKGROUND: The molecular regulators that orchestrate stem cell renewal, proliferation and differentiation along the mammary epithelial hierarchy remain poorly understood. Here we have performed a large-scale pooled RNAi screen in primary mouse mammary stem cell (MaSC)-enriched basal cells using 1295 shRNAs against genes principally involved in transcriptional regulation. METHODS: MaSC-enriched basal cells transduced with lentivirus pools carrying shRNAs were maintained as non-adherent mammospheres, a system known to support stem and progenitor cells. Integrated shRNAs that altered culture kinetics were identified by next generation sequencing as relative frequency changes over time. RNA-seq-based expression profiling coupled with in vitro progenitor and in vivo transplantation assays was used to confirm a role for candidate genes in mammary stem and/or progenitor cells. RESULTS: Utilizing a mammosphere-based assay, the screen identified several candidate regulators. Although some genes had been previously implicated in mammary gland development, the vast majority of genes uncovered have no known function within the mammary gland. RNA-seq analysis of freshly purified primary mammary epithelial populations and short-term cultured mammospheres was used to confirm the expression of candidate regulators. Two genes, Asap1 and Prox1, respectively implicated in breast cancer metastasis and progenitor cell function in other systems, were selected for further analysis as their roles in the normal mammary gland were unknown. Both Prox1 and Asap1 were shown to act as negative regulators of progenitor activity in vitro, and Asap1 knock-down led to a marked increase in repopulating activity in vivo, implying a role in stem cell activity. CONCLUSIONS: This study has revealed a number of novel genes that influence the activity or survival of mammary stem and/or progenitor cells. Amongst these, we demonstrate that Prox1 and Asap1 behave as negative regulators of mammary stem/progen
- Published
- 2015
46. limma powers differential expression analyses for RNA-sequencing and microarray studies
- Author
-
Ritchie, ME, Phipson, B, Wu, D, Hu, Y, Law, CW, Shi, W, Smyth, GK, Ritchie, ME, Phipson, B, Wu, D, Hu, Y, Law, CW, Shi, W, and Smyth, GK
- Abstract
limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
- Published
- 2015
47. Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses
- Author
-
Liu, R, Holik, AZ, Su, S, Jansz, N, Chen, K, Leong, HS, Blewitt, ME, Asselin-Labat, M-L, Smyth, GK, Ritchie, ME, Liu, R, Holik, AZ, Su, S, Jansz, N, Chen, K, Leong, HS, Blewitt, ME, Asselin-Labat, M-L, Smyth, GK, and Ritchie, ME
- Abstract
Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean-variance relationship of the log-counts-per-million using 'voom'. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source 'limma' package.
- Published
- 2015
48. Repression of Igf1 expression by Ezh2 prevents basal cell differentiation in the developing lung
- Author
-
Galvis, LA, Holik, AZ, Short, KM, Pasquet, J, Lun, ATL, Blewitt, ME, Smyth, IM, Ritchie, ME, Asselin-Labat, M-L, Galvis, LA, Holik, AZ, Short, KM, Pasquet, J, Lun, ATL, Blewitt, ME, Smyth, IM, Ritchie, ME, and Asselin-Labat, M-L
- Abstract
Epigenetic mechanisms involved in the establishment of lung epithelial cell lineage identities during development are largely unknown. Here, we explored the role of the histone methyltransferase Ezh2 during lung lineage determination. Loss of Ezh2 in the lung epithelium leads to defective lung formation and perinatal mortality. We show that Ezh2 is crucial for airway lineage specification and alveolarization. Using optical projection tomography imaging, we found that branching morphogenesis is affected in Ezh2 conditional knockout mice and the remaining bronchioles are abnormal, lacking terminally differentiated secretory club cells. Remarkably, RNA-seq analysis revealed the upregulation of basal genes in Ezh2-deficient epithelium. Three-dimensional imaging for keratin 5 further showed the unexpected presence of a layer of basal cells from the proximal airways to the distal bronchioles in E16.5 embryos. ChIP-seq analysis indicated the presence of Ezh2-mediated repressive marks on the genomic loci of some but not all basal genes, suggesting an indirect mechanism of action of Ezh2. We found that loss of Ezh2 de-represses insulin-like growth factor 1 (Igf1) expression and that modulation of IGF1 signaling ex vivo in wild-type lungs could induce basal cell differentiation. Altogether, our work reveals an unexpected role for Ezh2 in controlling basal cell fate determination in the embryonic lung endoderm, mediated in part by repression of Igf1 expression.
- Published
- 2015
49. Transcriptome and H3K27 tri-methylation profiling of Ezh2-deficient lung epithelium
- Author
-
Holik, AZ, Galvis, LA, Lun, ATL, Ritchie, ME, Asselin-Labat, M-L, Holik, AZ, Galvis, LA, Lun, ATL, Ritchie, ME, and Asselin-Labat, M-L
- Abstract
The adaptation of the lungs to air breathing at birth requires the fine orchestration of different processes to control lung morphogenesis and progenitor cell differentiation. However, there is little understanding of the role that epigenetic modifiers play in the control of lung development. We found that the histone methyl transferase Ezh2 plays a critical role in lung lineage specification and survival at birth. We performed a genome-wide transcriptome study combined with a genome-wide analysis of the distribution of H3K27 tri-methylation marks to interrogate the role of Ezh2 in lung epithelial cells. Lung cells isolated from Ezh2-deficient and control mice at embryonic day E16.5 were sorted into epithelial and mesenchymal populations based on EpCAM expression. This enabled us to dissect the transcriptional and epigenetic changes induced by the loss of Ezh2 specifically in the lung epithelium. Here we provide a detailed description of the analysis of the RNA-seq and ChIP-seq data, including quality control, read mapping, differential expression and differential binding analyses, as well as visualisation methods used to present the data. These data can be accessed from the Gene Expression Omnibus database (super-series accession number GSE57393).
- Published
- 2015
50. edgeR: a versatile tool for the analysis of shRNA-seq and CRISPR-Cas9 genetic screens.
- Author
-
Dai, Z, Sheridan, JM, Gearing, LJ, Moore, DL, Su, S, Wormald, S, Wilcox, S, O'Connor, L, Dickins, RA, Blewitt, ME, Ritchie, ME, Dai, Z, Sheridan, JM, Gearing, LJ, Moore, DL, Su, S, Wormald, S, Wilcox, S, O'Connor, L, Dickins, RA, Blewitt, ME, and Ritchie, ME
- Abstract
Pooled library sequencing screens that perturb gene function in a high-throughput manner are becoming increasingly popular in functional genomics research. Irrespective of the mechanism by which loss of function is achieved, via either RNA interference using short hairpin RNAs (shRNAs) or genetic mutation using single guide RNAs (sgRNAs) with the CRISPR-Cas9 system, there is a need to establish optimal analysis tools to handle such data. Our open-source processing pipeline in edgeR provides a complete analysis solution for screen data, that begins with the raw sequence reads and ends with a ranked list of candidate genes for downstream biological validation. We first summarize the raw data contained in a fastq file into a matrix of counts (samples in the columns, genes in the rows) with options for allowing mismatches and small shifts in sequence position. Diagnostic plots, normalization and differential representation analysis can then be performed using established methods to prioritize results in a statistically rigorous way, with the choice of either the classic exact testing methodology or generalized linear modeling that can handle complex experimental designs. A detailed users' guide that demonstrates how to analyze screen data in edgeR along with a point-and-click implementation of this workflow in Galaxy are also provided. The edgeR package is freely available from http://www.bioconductor.org.
- Published
- 2014
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.