Back to Search Start Over

A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle

Authors :
Sandra Plancade
Eiliv Lund
Lars Holden
Jean-Christophe Thalabard
Marit Holden
Gregory Nuel
Hege Bøvelstad
Nicolle A. Mode
Clara-Cecilie Günther
Department of Community Medicine
The Arctic University of Norway [Tromsø, Norway] (UiT)
Norsk Regnesentral [Oslo] (NR)
Mathématiques et Informatique Appliquées du Génome à l'Environnement [Jouy-En-Josas] (MaIAGE)
Institut National de la Recherche Agronomique (INRA)
Laboratoire de Probabilités et Modèles Aléatoires (LPMA)
Université Pierre et Marie Curie - Paris 6 (UPMC)-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS)
Mathématiques Appliquées Paris 5 (MAP5 - UMR 8145)
Université Paris Descartes - Paris 5 (UPD5)-Institut National des Sciences Mathématiques et de leurs Interactions (INSMI)-Centre National de la Recherche Scientifique (CNRS)
European Project: 232997,EC:FP7:ERC,ERC-2008-AdG,TICE(2009)
The Arctic University of Norway
Lund, Eiliv
Norsk Regnesentral
MAP5 - Mathématiques Appliquées à Paris 5 (MAP5)
Université Paris Descartes - Paris 5 (UPD5) - Institut National des Sciences Mathématiques et de leurs Interactions - Centre National de la Recherche Scientifique (CNRS)
Université Pierre et Marie Curie - Paris 6 (UPMC) - Université Paris Diderot - Paris 7 (UP7) - Centre National de la Recherche Scientifique (CNRS)
Centre National de la Recherche Scientifique (CNRS)-Institut National des Sciences Mathématiques et de leurs Interactions (INSMI)-Université Paris Descartes - Paris 5 (UPD5)
Source :
BMC Medical Research Methodology, BMC Medical Research Methodology, 2016, 16 (1), pp.28. ⟨10.1186/s12874-016-0129-z⟩, BMC Medical Research Methodology, BioMed Central, 2016, 16 (1), pp.28. ⟨10.1186/s12874-016-0129-z⟩, BMC Medical Research Methodology, Vol 16, Iss 1, Pp 1-12 (2016), BMC Medical Research Methodology (16), . (2016), BMC Medical Research Methodology, BioMed Central, 2016, 16 (1), pp.1. , BioMed Central, BMC Medical Research Methodology, BioMed Central, 2016, 16, pp.28. <10.1186/s12874-016-0129-z>
Publication Year :
2016
Publisher :
HAL CCSD, 2016.

Abstract

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. This article is also available via DOI:10.1186/s12874-016-0129-z Background: The understanding of changes in temporal processes related to human carcinogenesis is limited. One approach for prospective functional genomic studies is to compile trajectories of differential expression of genes, based on measurements from many case-control pairs. We propose a new statistical method that does not assume any parametric shape for the gene trajectories. Methods: The trajectory of a gene is defined as the curve representing the changes in gene expression levels in the blood as a function of time to cancer diagnosis. In a nested case–control design it consists of differences in gene expression levels between cases and controls. Genes can be grouped into curve groups, each curve group corresponding to genes with a similar development over time. The proposed new statistical approach is based on a set of hypothesis testing that can determine whether or not there is development in gene expression levels over time, and whether this development varies among different strata. Curve group analysis may reveal significant differences in gene expression levels over time among the different strata considered. This new method was applied as a “proof of concept” to breast cancer in the Norwegian Women and Cancer (NOWAC) postgenome cohort, using blood samples collected prospectively that were specifically preserved for transcriptomic analyses (PAX tube). Cohort members diagnosed with invasive breast cancer through 2009 were identified through linkage to the Cancer Registry of Norway, and for each case a random control from the postgenome cohort was also selected, matched by birth year and time of blood sampling, to create a case-control pair. After exclusions, 441 case-control pairs were available for analyses, in which we considered strata of lymph node status at time of diagnosis and time of diagnosis with respect to breast cancer screening visits. Results: The development of gene expression levels in the NOWAC postgenome cohort varied in the last years before breast cancer diagnosis, and this development differed by lymph node status and participation in the Norwegian Breast Cancer Screening Program. The differences among the investigated strata appeared larger in the year before breast cancer diagnosis compared to earlier years. Conclusions: This approach shows good properties in term of statistical power and type 1 error under minimal assumptions. When applied to a real data set it was able to discriminate between groups of genes with non-linear similar patterns before diagnosis.

Details

Language :
English
ISSN :
14712288
Database :
OpenAIRE
Journal :
BMC Medical Research Methodology, BMC Medical Research Methodology, 2016, 16 (1), pp.28. ⟨10.1186/s12874-016-0129-z⟩, BMC Medical Research Methodology, BioMed Central, 2016, 16 (1), pp.28. ⟨10.1186/s12874-016-0129-z⟩, BMC Medical Research Methodology, Vol 16, Iss 1, Pp 1-12 (2016), BMC Medical Research Methodology (16), . (2016), BMC Medical Research Methodology, BioMed Central, 2016, 16 (1), pp.1. <BioMed Central>, BioMed Central, BMC Medical Research Methodology, BioMed Central, 2016, 16, pp.28. <10.1186/s12874-016-0129-z>
Accession number :
edsair.doi.dedup.....efcdba3796263c227e6c8c502beb2b0a
Full Text :
https://doi.org/10.1186/s12874-016-0129-z⟩