Abdulrhman Fahad Aljouie,1,2 Ahmed Almazroa,2,3 Yahya Bokhari,1,2 Mohammed Alawad,1,2 Ebrahim Mahmoud,4 Eman Alawad,4 Ali Alsehawi,5 Mamoon Rashid,1,2 Lamya Alomair,1,2 Shahad Almozaai,6 Bedoor Albesher,6 Hassan Alomaish,5 Rayyan Daghistani,5 Naif Khalaf Alharbi,2,7 Manal Alaamery,2,8– 10 Mohammad Bosaeed,2– 4 Hesham Alshaalan5 1Bioinformatics Section, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia; 2King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia; 3Department of Imaging Research, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia; 4Department of Medicine, Ministry of National Guard - Health Affairs, Riyadh, Saudi Arabia; 5Department of Medical Imaging, Ministry of National Guard - Health Affairs, Riyadh, Saudi Arabia; 6College of Pharmacy, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia; 7Department of Infectious Disease Research, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia; 8Department of Developmental Medicine, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia; 9KACST-BWH Center of Excellence for Biomedicine, Joint Centers of Excellence Program, King Abdulaziz City for Science and Technology (KACST), Riyadh, Saudi Arabia; 10King Abdulaziz City for Science and Technology (KACST)-Saudi Human Genome Satellite Lab at Abdulaziz Medical City, Ministry of National Guard Health Affairs (MNGHA), Riyadh, Saudi ArabiaCorrespondence: Mohammed AlawadBioinformatics Section, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi ArabiaEmail malawad@sdaia.gov.saAhmed AlmazroaDepartment of Imaging Research, King Abdullah International Medical Research Center King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi ArabiaEmail almazroaah@ngha.med.saBackground: Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in Wuhan, China, in late 2019 and created a global pandemic that overwhelmed healthcare systems. COVID-19, as of July 3, 2021, yielded 182 million confirmed cases and 3.9 million deaths globally according to the World Health Organization. Several patients who were initially diagnosed with mild or moderate COVID-19 later deteriorated and were reclassified to severe disease type.Objective: The aim is to create a predictive model for COVID-19 ventilatory support and mortality early on from baseline (at the time of diagnosis) and routinely collected data of each patient (CXR, CBC, demographics, and patient history).Methods: Four common machine learning algorithms, three data balancing techniques, and feature selection are used to build and validate predictive models for COVID-19 mechanical requirement and mortality. Baseline CXR, CBC, demographic, and clinical data were retrospectively collected from April 2, 2020, till June 18, 2020, for 5739 patients with confirmed PCR COVID-19 at King Abdulaziz Medical City in Riyadh. However, of those patients, only 1508 and 1513 have met the inclusion criteria for ventilatory support and mortalilty endpoints, respectively.Results: In an independent test set, ventilation requirement predictive model with top 20 features selected with reliefF algorithm from baseline radiological, laboratory, and clinical data using support vector machines and random undersampling technique attained an AUC of 0.87 and a balanced accuracy of 0.81. For mortality endpoint, the top model yielded an AUC of 0.83 and a balanced accuracy of 0.80 using all features with balanced random forest. This indicates that with only routinely collected data our models can predict the outcome with good performance. The predictive ability of combined data consistently outperformed each data set individually for intubation and mortality. For the ventilator support, chest X-ray severity annotations alone performed better than comorbidity, complete blood count, age, or gender with an AUC of 0.85 and balanced accuracy of 0.79. For mortality, comorbidity alone achieved an AUC of 0.80 and a balanced accuracy of 0.72, which is higher than models that use either chest radiograph, laboratory, or demographic features only.Conclusion: The experimental results demonstrate the practicality of the proposed COVID-19 predictive tool for hospital resource planning and patients’ prioritization in the current COVID-19 pandemic crisis.Keywords: COVID-19, mortality, NIV, X-rays, CBC, random forest, SMOTE; machine learning