The operation of photovoltaic (PV) systems, like any other system, in fault free environment ensures maximum performance. Hence, accurate and timely fault identification of PV system demands greater importance. Present investigation proposes a stacking ensemble-based PV array fault diagnosis method, which is integrated with automatic feature engineering and selection technique, handling of imbalanced dataset for unbiased classification. The proposed ensemble utilizes decision tree (DT), random forest (RF), extra trees (EXT), extreme gradient boosting machine (XGBoost) as base learners, and light gradient boosting machine (LightGBM) as both base and meta-learner. To validate the proposed technique, a test system of 4.8kWp capacity has been built using the MATLAB/Simulink environment incorporating one-year real-time irradiance and module temperature data. Irradiance, module temperature, voltage, current, and power are collected as primary raw data which are then concocted using the autofeat python library for automatic feature engineering and selection. Subsequently, the proposed fault diagnosis strategy is built in python 3.8.5 using the engineered and class label balanced dataset, prepared using the synthetic minority over-sampling technique (SMOTE). Results demonstrate promising performance of the proposed ensemble technique with 97.04% accuracy in classifying the different faults in the PV array, which accounts for approximately 1โ3% improvement over other individual machine learning (ML) models. [ABSTRACT FROM AUTHOR]