Lingling Ding, Ravikiran Mane, Zhenzhou Wu, Yong Jiang, Xia Meng, Jing Jing, Weike Ou, Xueyun Wang, Yu Liu, Jinxi Lin, Xingquan Zhao, Hao Li, Yongjun Wang, and Zixiao Li
Summary: Background: Acute ischaemic stroke (AIS) is a highly heterogeneous disorder and warrants further investigation to stratify patients with different outcomes and treatment responses. Using a large-scale stroke registry cohort, we applied data-driven approach to identify novel phenotypes based on multiple biomarkers. Methods: In a nationwide, prospective, 201-hospital registry study taking place in China between August 01, 2015 and March 31, 2018, the patients with AIS who were over 18 years of age and admitted to the hospital within 7 days from symptom onset were included. 92 biomarkers were included in the analysis. In the derivation cohort (n=9539), an unsupervised Gaussian mixture model was applied to categorize patients into distinct phenotypes. A classifier was developed using the most important biomarkers and was applied to categorize patients into their corresponding phenotypes in an validation cohort (n=2496). The differences in biological features, clinical outcomes, and treatment response were compared across the phenotypes. Findings: We identified four phenotypes with distinct characteristics in 9288 patients with non-cardioembolic ischaemic stroke. Phenotype 1 was associated with abnormal glucose and lipid metabolism. Phenotype 2 was characterized by inflammation and abnormal renal function. Phenotype 3 had the least laboratory abnormalities and small infarct lesions. Phenotype 4 was characterized by disturbance in homocysteine metabolism. Findings were replicated in the validation cohort. In comparison with phenotype 3, the risk of stroke recurrence (adjusted hazard ratio [aHR] 2.02, 95% confidence intervals [CI] 1.04-3.94), and mortality (aHR 18.14, 95%CI 6.62-49.71) at 3-month post-stroke were highest in phenotype 2, followed by phenotype 4 and phenotype 1, after adjustment for age, gender, smoking, drinking, history of stroke, hypertension, diabetes mellitus, dyslipidemia, and coronary heart disease. The Monte Carlo simulation showed that the patients with phenotype 2 could benefit from high-intensity statin therapy. Interpretation: A data-driven approach could aid in the identification of patients at a higher risk of adverse clinical outcomes following non-cardioembolic ischaemic stroke. These phenotypes, based on different pathophysiology, can suggest individualized treatment plans. Funding: Beijing Natural Science Foundation (grant number Z200016), Beijing Municipal Committee of Science and Technology (grant number Z201100005620010), National Natural Science Foundation of China (grant number 82101360, 92046016, 82171270), Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences (grant number 2019-I2M-5-029).