1. Finding the needle in a high-dimensional haystack: Canonical correlation analysis for neuroscientists
- Author
Hao-Ting Wang, Jonathan Smallwood, Janaina Mourao-Miranda, Cedric Huchuan Xia, Theodore D. Satterthwaite, Danielle S. Bassett, and Danilo Bzdok
- Subjects
Machine learning, Big data, Data science, Deep phenotyping, Modality fusion, Neurosciences. Biological psychiatry. Neuropsychiatry, RC321-571
- Abstract
The 21st century marks the emergence of “big data” with a rapid increase in the availability of datasets with multiple measurements. In neuroscience, brain-imaging datasets are more commonly accompanied by dozens or hundreds of phenotypic subject descriptors on the behavioral, neural, and genomic levels. The complexity of such “big data” repositories offers new opportunities and poses new challenges for systems neuroscience. Canonical correlation analysis (CCA) is a prototypical family of methods that is useful in identifying the links between variable sets from different modalities. Importantly, CCA is well suited to describing relationships across multiple sets of data, such as in recently available big biomedical datasets. Our primer discusses the rationale, promises, and pitfalls of CCA.
- Published
2020