Back to Search
Start Over
Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
- Source :
- BMC Research Notes
- Publisher :
- Springer Nature
-
Abstract
- Background Formalin fixed, paraffin embedded tissues are most commonly used for routine pathology analysis and for long term tissue preservation in the clinical setting. Many institutions have large archives of Formalin fixed, paraffin embedded tissues that provide a unique opportunity for understanding genomic signatures of disease. However, genome-wide expression profiling of Formalin fixed, paraffin embedded samples have been challenging due to RNA degradation. Because of the significant heterogeneity in tissue quality, normalization and analysis of these data presents particular challenges. The distribution of intensity values from archival tissues are inherently noisy and skewed due to differential sample degradation raising two primary concerns; whether a highly skewed array will unduly influence initial normalization of the data and whether outlier arrays can be reliably identified. Findings Two simple extensions of common regression diagnostic measures are introduced that measure the stress an array undergoes during normalization and how much a given array deviates from the remaining arrays post-normalization. These metrics are applied to a study involving 1618 formalin-fixed, paraffin-embedded HER2-positive breast cancer samples from the N9831 adjuvant trial processed with Illumina’s cDNA-mediated Annealing Selection extension and Ligation assay. Conclusion Proper assessment of array quality within a research study is crucial for controlling unwanted variability in the data. The metrics proposed in this paper have direct biological interpretations and can be used to identify arrays that should either be removed from analysis all together or down-weighted to reduce their influence in downstream analyses.
- Subjects :
- Quality Control
DNA, Complementary
Paclitaxel
Receptor, ErbB-2
Paraffin-embedded tissue
Breast Neoplasms
High-dimensional array quality
Computational biology
Biology
Antibodies, Monoclonal, Humanized
Bioinformatics
Genome
General Biochemistry, Genetics and Molecular Biology
Formalin-Fixed
03 medical and health sciences
0302 clinical medicine
Technical Note
Outlier detection
Humans
Cyclophosphamide
Oligonucleotide Array Sequence Analysis
Randomized Controlled Trials as Topic
030304 developmental biology
Medicine(all)
0303 health sciences
Models, Statistical
Paraffin Embedding
Genome, Human
Quality assessment
Biochemistry, Genetics and Molecular Biology(all)
Gene Expression Profiling
General Medicine
Formalin fixed
Trastuzumab
Paraffin embedded
Gene expression profiling
Clinical Trials, Phase III as Topic
Doxorubicin
030220 oncology & carcinogenesis
Outlier
RNA
Female
Anomaly detection
Regression diagnostic
Subjects
Details
- Language :
- English
- ISSN :
- 17560500
- Volume :
- 6
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Research Notes
- Accession number :
- edsair.doi.dedup.....fd9db5c30e202f65a8624883c61d584a
- Full Text :
- https://doi.org/10.1186/1756-0500-6-33