736 results on '"Moreno J"'
Search Results
2. Multiple remotely sensed datasets and machine learning models to predict chlorophyll-a concentration in the Nakdong River, South Korea.
- Author
-
Lee B, Im JK, Han JW, Kang T, Kim W, Kim M, and Lee S
- Subjects
- Republic of Korea, Chlorophyll analysis, Remote Sensing Technology, Support Vector Machine, Machine Learning, Chlorophyll A, Environmental Monitoring methods, Rivers chemistry
- Abstract
The Nakdong River is a crucial water resource in South Korea, supplying water for various purposes such as potable water, irrigation, and recreation. However, the river is vulnerable to algal blooms due to the inflow of pollutants from multiple points and non-point sources. Monitoring chlorophyll-a (Chl-a) concentrations, a proxy for algal biomass is essential for assessing the trophic status of the river and managing its ecological health. This study aimed to improve the accuracy and reliability of Chl-a estimation in the Nakdong River using machine learning models (MLMs) and simultaneous use of multiple remotely sensed datasets. This study compared the performances of four MLMs: multi-layer perceptron (MLP), support vector machine (SVM), random forest (RF), and eXetreme Gradient Boosting (XGB) using three different input datasets: (1) two remotely sensed datasets (Sentinel-2 and Landsat-8), (2) standalone Sentinel-2, and (3) standalone Landsat-8. The results showed that the MLP model with multiple remotely sensed datasets outperformed other MLMs with 0.43 - 0.86 greater in R
2 and 0.36 - 5.88 lower in RMSE. The MLP model demonstrated the highest performance across the range of Chl-a concentrations and predicted peaks above 20 mg/m3 relatively well compared to other models. This was likely due to the capacity of MLP to handle imbalanced datasets. The predictive map of the spatial distribution of Chl-a generated by MLP well captured the areas with high and low Chl-a concentrations. This study pointed out the impacts of imbalanced Chl-a concentration observations (dominated by low Chl-a concentrations) on the performance of MLMs. The data imbalance likely led to MLMs poorly trained for high Chl-a values, producing low prediction accuracy. In conclusion, this study demonstrated the value of multiple remotely sensed datasets in enhancing the accuracy and reliability of Chl-a estimation, mainly when using the MLP model. These findings would provide valuable insights into utilizing MLMs effectively for Chl-a monitoring., (© 2024. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.)- Published
- 2024
- Full Text
- View/download PDF
3. Surfing the OCEAN: The machine learning psycholexical approach 2.0 to detect personality traits in texts.
- Author
-
Giannini F, Marelli M, Stella F, Monzani D, and Pancani L
- Subjects
- Humans, Adult, Female, Male, Young Adult, Neural Networks, Computer, Personality, Machine Learning
- Abstract
Objective: We aimed to develop a machine learning model to infer OCEAN traits from text., Background: The psycholexical approach allows retrieving information about personality traits from human language. However, it has rarely been applied because of methodological and practical issues that current computational advancements could overcome., Method: Classical taxonomies and a large Yelp corpus were leveraged to learn an embedding for each personality trait. These embeddings were used to train a feedforward neural network for predicting trait values. Their generalization performances have been evaluated through two external validation studies involving experts (N = 11) and laypeople (N = 100) in a discrimination task about the best markers of each trait and polarity., Results: Intrinsic validation of the model yielded excellent results, with R
2 values greater than 0.78. The validation studies showed a high proportion of matches between participants' choices and model predictions, confirming its efficacy in identifying new terms related to the OCEAN traits. The best performance was observed for agreeableness and extraversion, especially for their positive polarities. The model was less efficient in identifying the negative polarity of openness and conscientiousness., Conclusions: This innovative methodology can be considered a "psycholexical approach 2.0," contributing to research in personality and its practical applications in many fields., (© 2024 Wiley Periodicals LLC.)- Published
- 2024
- Full Text
- View/download PDF
4. A high hydrophobic moment arginine-rich peptide screened by a machine learning algorithm enhanced ADC antitumor activity.
- Author
-
Su RL, Cao XW, Zhao J, and Wang FJ
- Subjects
- Humans, Cell-Penetrating Peptides chemistry, Cell-Penetrating Peptides pharmacology, Arginine chemistry, Trastuzumab chemistry, Trastuzumab pharmacology, Immunoconjugates chemistry, Immunoconjugates pharmacology, Antineoplastic Agents pharmacology, Antineoplastic Agents chemistry, Cell Line, Tumor, Receptor, ErbB-2 metabolism, Hydrophobic and Hydrophilic Interactions, Machine Learning
- Abstract
Cell-penetrating peptides (CPPs) with better biomolecule delivery properties will expand their clinical applications. Using the MLCPP2.0 machine algorithm, we screened multiple candidate sequences with potential cellular uptake ability from the nuclear localization signal/nuclear export signal database and verified them through cell-penetrating fluorescent tracing experiments. A peptide (NCR) derived from the Rev protein of the caprine arthritis-encephalitis virus exhibited efficient cell-penetrating activity, delivering over four times more EGFP than the classical CPP TAT, allowing it to accumulate in lysosomes. Structural and property analysis revealed that a high hydrophobic moment and an appropriate hydrophobic region contribute to the high delivery activity of NCR. Trastuzumab emtansine (T-DM1), a HER2-targeted antibody-drug conjugate, could improve its anti-tumor activity by enhancing targeted delivery efficiency and increasing lysosomal drug delivery. This study designed a new NCR vector to non-covalently bind T-DM1 by fusing domain Z, which can specifically bind to the Fc region of immunoglobulin G and effectively deliver T-DM1 to lysosomes. MTT results showed that the domain Z-NCR vector significantly enhanced the cytotoxicity of T-DM1 against HER2-positive tumor cells while maintaining drug specificity. Our results make a useful attempt to explore the potential application of CPP as a lysosome-targeted delivery tool., (© 2024 European Peptide Society and John Wiley & Sons Ltd.)
- Published
- 2024
- Full Text
- View/download PDF
5. Machine learning model prediction of 6-month functional outcome in elderly patients with intracerebral hemorrhage.
- Author
-
Trevisi G, Caccavella VM, Scerrati A, Signorelli F, Salamone GG, Orsini K, Fasciani C, D'Arrigo S, Auricchio AM, D'Onofrio G, Salomi F, Albanese A, De Bonis P, Mangiola A, and Sturiale CL
- Subjects
- Aged, Glasgow Coma Scale, Glasgow Outcome Scale, Humans, Prognosis, Retrospective Studies, Cerebral Hemorrhage epidemiology, Cerebral Hemorrhage surgery, Machine Learning
- Abstract
Spontaneous intracerebral hemorrhage (ICH) has an increasing incidence and a worse outcome in elderly patients. The ability to predict the functional outcome in these patients can be helpful in supporting treatment decisions and establishing prognostic expectations. We evaluated the performance of a machine learning (ML) model to predict the 6-month functional status in elderly patients with ICH leveraging the predictive value of the clinical characteristics at hospital admission. Data were extracted by a retrospective multicentric database of patients ≥ 70 years of age consecutively admitted for the management of spontaneous ICH between January 1, 2014 and December 31, 2019. Relevant demographic, clinical, and radiological variables were selected by a feature selection algorithm (Boruta) and used to build a ML model. Outcome was determined according to the Glasgow Outcome Scale (GOS) at 6 months from ICH: dead (GOS 1), poor outcome (GOS 2-3: vegetative status/severe disability), and good outcome (GOS 4-5: moderate disability/good recovery). Ten features were selected by Boruta with the following relative importance order in the ML model: Glasgow Coma Scale, Charlson Comorbidity Index, ICH score, ICH volume, pupillary status, brainstem location, age, anticoagulant/antiplatelet agents, intraventricular hemorrhage, and cerebellar location. Random forest prediction model, evaluated on the hold-out test set, achieved an AUC of 0.96 (0.94-0.98), 0.89 (0.86-0.93), and 0.93 (0.90-0.95) for dead, poor, and good outcome classes, respectively, demonstrating high discriminative ability. A random forest classifier was successfully trained and internally validated to stratify elderly patients with spontaneous ICH into prognostic subclasses. The predictive value is enhanced by the ability of ML model to identify synergy among variables., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
6. A Machine Learning Approach to Growth Direction Finding for Automated Planting of Bulbous Plants.
- Author
-
Booth BG, Sijbers J, and De Beenhouwer J
- Subjects
- Algorithms, Forecasting, Imaging, Three-Dimensional, Neural Networks, Computer, Plant Roots anatomy & histology, Sensitivity and Specificity, X-Rays, Agriculture methods, Machine Learning, Plant Roots growth & development, Robotics methods
- Abstract
In agricultural robotics, a unique challenge exists in the automated planting of bulbous plants: the estimation of the bulb's growth direction. To date, no existing work addresses this challenge. Therefore, we propose the first robotic vision framework for the estimation of a plant bulb's growth direction. The framework takes as input three x-ray images of the bulb and extracts shape, edge, and texture features from each image. These features are then fed into a machine learning regression algorithm in order to predict the 2D projection of the bulb's growth direction. Using the x-ray system's geometry, these 2D estimates are then mapped to the 3D world coordinate space, where a filtering on the estimate's variance is used to determine whether the estimate is reliable. We applied our algorithm on 27,200 x-ray simulations from T. Apeldoorn bulbs on a standard desktop workstation. Results indicate that our machine learning framework is fast enough to meet industry standards (<0.1 seconds per bulb) while providing acceptable accuracy (e.g. error < 30° in 98.40% of cases using an artificial 3-layer neural network). The high success rates of the proposed framework indicate that it is worthwhile to proceed with the development and testing of a physical prototype of a robotic bulb planting system.
- Published
- 2020
- Full Text
- View/download PDF
7. A kernel-based multi-feature image representation for histopathology image classification
- Author
-
MORENO, J, CAICEDO, J, and GONZÁLEZ, F
- Subjects
machine learning ,anotación automática de imágenes ,Automatic image annotation ,aprendizaje máquina - Abstract
This paper presents a novel strategy for building a high-dimensional feature space to represent histopathology image contents. Histogram features, related to colors, textures and edges, are combined together in a unique image representation space using kernel functions. This feature space is further enhanced by the application of Latent Semantic Analysis, to model hidden relationships among visual patterns. All that information is included in the new image representation space. Then, Support Vector Machine classifiers are used to assign semantic labels to images. Processing and classification algorithms operate on top of kernel functions, so that, the structure of the feature space is completely controlled using similarity measures and a dual representation. The proposed approach has shown a successful performance in a classification task using a dataset with 1,502 real histopathology images in 18 different classes. The results show that our approach for histological image classification obtains an improved average performance of 20.6% when compared to a conventional classification approach based on SVM directly applied to the original kernel. Este trabajo presenta una estrategia nueva para la construcción de un espacio de características de gran dimensionalidad para la representación del contenido de imágenes de histopatología. Histogramas de características, relacionados con colores, texturas y bordes, son combinados para obtener una única representación de la imagen utilizando funciones de kernels. Este espacio de características es mejorado mediante la aplicación de Análisis de Semántica Latente, para modelar relaciones ocultas entre los patrones visuales. Esta información es incluida en la representación de la imagen en el nuevo espacio. Luego, un clasificador de Máquinas de Vectores de Soporte es utilizado para asignar etiquetas semánticas a las imágenes. Algoritmos de procesamiento y de clasificación son utilizados en las funciones del kernel, por lo que la estructura del espacio de características es completamente controlada mediante medidas de similitud y la representación dual. El enfoque propuesto mostró un desempeño exitoso en la tarea de clasificación con un conjunto de datos de 1.502 imágenes reales de histopatología en 18 clases diferentes. Los resultados muestran que nuestro enfoque para la clasificación de imágenes histológicas obtiene una mejora promedio en el rendimiento del 20,6% en comparación con un método de clasificación convencional, basado en la aplicación de una Máquina de Vectores de Soporte sobre la función de kernel original.
- Published
- 2010
8. Evolutionary optimization of multi-parametric kernel $$\epsilon$$-SVMr for forecasting problems.
- Author
-
Gascón-Moreno, J., Ortiz-García, E., Salcedo-Sanz, S., Carro-Calvo, L., Saavedra-Moreno, B., and Portilla-Figueras, A.
- Subjects
SUPPORT vector machines ,MATHEMATICAL optimization ,KERNEL functions ,MACHINE learning ,SOFT computing - Abstract
In this paper, we propose a novel multi-parametric kernel Support Vector Regression algorithm (SVMr) optimized with an evolutionary technique, specially well suited for forecasting problems. The multi-parametric SVMr model and the evolutionary algorithm proposed are both described in detail in the paper. In addition, several new bounds for the multi-parametric kernel considered are obtained, in such a way that the SVMr hyper-parameters' search space is reduced. We present experimental evidences of the good performance of the evolutionary algorithm for optimizing the multi-parametric kernel, when compared to a standard SVMr with a Grid Search approach. Specifically, results in different real regression problems from public repositories are obtained, and also a real application focused on the short-term temperature prediction at Barcelona's airport. The results obtained have shown the good performance of the multi-parametric kernel approach both in accuracy and computation time. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
9. New validation methods for improving standard and multi-parametric support vector regression training time
- Author
-
Gascón-Moreno, J., Salcedo-Sanz, S., Ortiz-Garcı´a, E.G., Acevedo-Rodrı´guez, J., and Portilla-Figueras, Jose A.
- Subjects
- *
PARAMETER estimation , *SUPPORT vector machines , *REGRESSION analysis , *MACHINE learning , *SEARCH algorithms , *PERFORMANCE evaluation , *HEURISTIC algorithms , *EXPERT systems , *ARTIFICIAL intelligence - Abstract
Abstract: The selection of hyper-parameters in support vector regression algorithms (SVMr) is an essential process in the training of these learning machines. Unfortunately, there is not an exact method to obtain the optimal values of SVMr hyper-parameters. Therefore, it is necessary to use a search algorithm and sometimes a validation method in order to find the best combination of hyper-parameters. The problem is that the SVMr training time can be huge in large training databases if standard search algorithms and validation methods (such as grid search and K-fold cross validation), are used. In this paper we propose two novel validation methods which reduce the SVMr training time, maintaining the accuracy of the final machine. We show the good performance of both methods in the standard SVMr with 3 hyper-parameters (where the hyper-parameters search is usually carried out by means of a grid search) and also in the extension to multi-parametric kernels, where meta-heuristic approaches such as evolutionary algorithms must be used to look for the best set of SVMr hyper-parameters. In all cases the new validation methods have provided very good results in terms of training time, without affecting the final SVMr accuracy. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
10. Current Earth System Models Overestimate Ecosystem Respiration in Mid‐To‐High Latitude Dryland Regions.
- Author
-
Wu, Dongxing, Liu, Shaomin, He, Bin, Xu, Ziwei, Wu, Xiuchen, Xu, Tongren, Yang, Xiaofan, Wei, Jiaxing, Peng, Zhixing, and Wang, Xiaona
- Subjects
ARID regions ,EARTH currents ,SOIL respiration ,SOIL moisture ,CARBON cycle - Abstract
The inhibition of foliar respiration by light is a crucial yet often overlooked component in estimating ecosystem respiration. However, current estimations of the light inhibition of ecosystem respiration are biased by ignoring the effects of moisture factors. We developed a novel physics‐constrained machine learning method to quantify the extent of light inhibition (Reli) driven by multiple factors in global ecosystems. Our findings revealed significant seasonal variations in light inhibition rate aligned with vegetation growth. Temperature predominantly influenced variations in Reli, and the temperature‐Reli relationship was regulated by vapor pressure deficit rather than soil water content. A reassessment of global ecosystem respiration revealed that current Earth system models (ESMs) overestimate ecosystem respiration in mid‐to‐high latitude dryland regions, with a global average light inhibition strength of 0.51 (±0.16). Knowledge from this study provides an accurate understanding of light inhibition driven by temperature and moisture coupling in simulating carbon cycle. Plain Language Summary: Ecosystem respiration is mainly composed of vegetation respiration and soil respiration. However, estimates of ecosystem respiration are biased by ignoring the inhibition of leaf respiration in the light. We developed a novel physics‐constrained machine learning method to estimate the ecosystem respiration incorporating the ecological process of light inhibition. We found a clear relationship between light inhibition rate and vegetation growth on a seasonal scale. The light inhibition of ecosystem respiration was driven by both temperature and moisture factors. The global average light inhibition strength was 0.51 (±0.16). Key Points: Light inhibition of ecosystem respiration showed seasonal variations that match vegetation growthRelationship between temperature and light inhibition was regulated by vapor pressure deficitCurrent Earth system models overestimated ecosystem respiration in mid‐to‐high latitude dryland regions [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Estimation of above-ground biomass in dry temperate forests using Sentinel-2 data and random forest: a case study of the Swat area of Pakistan.
- Author
-
Muhammad, Bilal, Rehman, Arif U. R., Mumtaz, Faisal, Qun, Yin, and Zhongkui, Jia
- Subjects
NORMALIZED difference vegetation index ,FOREST density ,BIOMASS estimation ,TEMPERATE forests ,TROPICAL dry forests ,FOREST biomass - Abstract
Accurate mapping of above-ground biomass (AGB) is essential for carbon stock quantification and climate change impact assessment, particularly in mountainous areas. This study applies a random forest (RF) regression model to predict the spatial distribution of AGB in Usho (site A) and Utror (site B) forests located in the northern mountainous region of Pakistan. The predicted maps elucidate AGB variations across these sites, with non-forest areas excluded based on an normalized difference vegetation index (NDVI) threshold value of <0.4. Three different combinations of input datasets were used to predict the biomass, including spectral bands (SBs) only, vegetation indexes (VIs) only, and a combination of both spectral bands and vegetation indexes (SBVIs). Utilizing SBs, the biomass ranged between 150 and 286 mg/ha in site A and 99 and 376 mg/ha in site B. Meanwhile, using VIs indicated a biomass range of 163 Mg/ha–337 Mg/ha and 131–392 Mg/ha for sites A and B, respectively. The combination of spectral bands and vegetation indexes yielded AGB values of 145–290 Mg/ha in site A and 116–389 Mg/ha in site B. The northern and western regions of site A, characterized by higher altitudes and lower forest density, notably showed lower biomass values than other regions. Conversely, similar regions in site B, situated at lower latitudes, demonstrated different biomass ranges. The RF model exhibited robust accuracy, with R
2 values of 0.74 and 0.83 for spectral bands and vegetation indexes, respectively. However, with a combination of both, an R2 of 0.79 was achieved. Furthermore, altitudinal gradients significantly influence the biomass distribution across both sites, with specific elevation ranges yielding optimal results. The AGB variation along the slope further corroborated these findings. In both sites, the western aspects showed the highest biomass across all combinations of input datasets. The variable importance analysis highlighted that ARVI8a, NDI45, Band12, Band11, TSAVI8, and ARVI8a are significant predictors in sites A and B. This comprehensive analysis enhances our understanding of AGB distribution in the mountainous forests of Pakistan, offering valuable insights for forest management and ecological studies. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
12. Integration of remote sensing and artificial neural networks for prediction of soil organic carbon in arid zones.
- Author
-
Gouda, Mohamed, Abu-hashim, Mohamed, Nassrallah, Attyat, Khalil, Mohamed N., Hendawy, Ehab, benhasher, Fahdah F., Shokr, Mohamed S., Elshewy, Mohamed A., and Mohamed, Elsayed said
- Subjects
NORMALIZED difference vegetation index ,ARTIFICIAL neural networks ,CARBON cycle ,SOIL management ,AGRICULTURE - Abstract
Introduction: Mapping soil organic carbon (SOC) with high precision is useful for controlling soil fertility and comprehending the global carbon cycle. Low-relief locations are characterized by minimal variability in traditional soil-forming elements, such as terrain and climatic conditions, which make it difficult to reflect the spatial variation of soil properties. In the meantime, vegetation cover makes it more difficult to obtain direct knowledge about agricultural soil. Crop growth and biomass are reflected by the normalized difference vegetation index (NDVI), a significant indicator. Rather than using conventional soil-forming variables. Methods: In this study, a novel model for predicting SOC was developed using Landsat-8 Operational Land Imager (OLI) band data (Blue (B), Green (G), Red (R), and Near Infrared (NIR), NDVI data as the supporting variables, and Artificial Neural Networks (ANNs). A total of 120 surface soil samples were collected at a depth of 25 cm in the northeastern Nile Delta near Damietta City. Of these, 80% (96 samples) were randomly selected for model training, while the remaining 24 samples were used for testing and validation. Additionally, Gaussian Process Regression (GPR) models were trained to estimate SOC levels using the Matern 5/ 2 kernel within the Regression Learner framework. Results and discussion: The results demonstrate that both the ANN with a multilayer feedforward network and the GPR model offer effective frameworks for SOC prediction. The ANN achieved an R² value of 0.84, while the GPR model with the Matern 5/2 kernel achieved a higher R² value of 0.89. These findings, supported by visual and statistical evaluations through crossvalidation, confirm the reliability and accuracy of the models. Conclusion: The systematic application of GPR within the Regression Learner framework provides a robust tool for SOC prediction, contributing to sustainable soil management and agricultural practices. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Product Recommendation System With Machine Learning Algorithms for SME Banking.
- Author
-
Met, Ilker, Erkoc, Ayfer, Seker, Sadi Evren, Erturk, Mehmet Ali, Ulug, Baha, and Ortale, Riccardo
- Abstract
In the present era, where competition pervades across all domains, profitability holds crucial economic importance for numerous companies, including the banking industry. Offering the right products to customers is a fundamental problem that directly affects banks' net revenue. Machine learning (ML) approaches can address this issue using customer behavior analysis from historical customer data. This study addresses the issue by processing customer transactions using a bank's current account debt (CAD) product with state‐of‐the‐art ML approaches. In the first step, exploratory data analysis (EDA) is performed to examine the data and detect patterns and anomalies. Then, different regression methods (tree‐based methods) are tested to analyze the model's performance. The obtained results show that the light gradient boosting machine (LGBM) algorithm outperforms other methods with an 84% accuracy rate in the light gradient boosting algorithm, which is the most accurate of the three methods used. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Comparative Impact of Fog and Rainfall on Vegetation in a Foggy Desert.
- Author
-
Qiao, Na, Wang, Honglang, Li, Yue, and Wang, Lixin
- Subjects
VEGETATION greenness ,VEGETATION dynamics ,METEOROLOGICAL observations ,PLANT variation ,SOIL temperature - Abstract
Fog is an important water source that alleviates vegetation water stress, especially for dryland ecosystems. Comprehensive knowledge of fog and rainfall effects can help us better understand dryland vegetation responses to current and future climates. However, the differences between fog and rainfall effects on vegetation are poorly understood. This study compared the effects of fog and rainfall on vegetation greenness changes based on the ground‐level meteorological observations in the Namib Desert and the satellite vegetation index. The vegetation index and its first derivative were utilized to indicate vegetation greenness and its change rate, respectively. Results showed that fog played a more significant role than rainfall in explaining vegetation greenness change rates, while accumulated rainfall was more important than fog in determining vegetation greenness. Soil temperature was an important factor in explaining vegetation greenness changes. These findings offer key insights into how fog and rainfall differentially contribute to vegetation greenness changes. Plain Language Summary: Dryland plant significantly impacts the global carbon budget, making up about 40% of global net primary productivity. The importance of fog on dryland productivity is highlighted since plant growth in dry areas is strongly constrained by water resources. This study showed that fog consistently appeared among the top five most critical factors in explaining variations in plant greenness and its change rates for both herbaceous and woody areas. Notably, it was found that plant greenness change rates were more influenced by fog than rainfall, although plant greenness was more sensitive to accumulated rainfall than fog. Soil temperature consistently showed large effects on plant greenness changes, potentially limiting plant growth in the context of future global warming. Key Points: The first study to simultaneously quantify the effects of fog and rainfall on vegetation greenness changes in a dryland ecosystemFog was more important than rainfall in explaining vegetation greenness change ratesSoil temperature played an important role in vegetation greenness changes with a negative effect [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Advances in Remote Sensing and Machine Learning Methods for Invasive Plants Study: A Comprehensive Review.
- Author
-
Zaka, Muhammad Murtaza and Samat, Alim
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,AERIAL photography ,OPTICAL radar ,LIDAR - Abstract
This paper provides a comprehensive review of advancements in the detection; evaluation; and management of invasive plant species (IPS) using diverse remote sensing (RS) techniques and machine learning (ML) methods. Analyzing the high-resolution datasets received from drones, satellites, and aerial photography enables the perfect cartography technique and analysis of the spread and various impacts of ecology on IPS. The majority of current research on hyperspectral imaging with unmanned aerial vehicle (UAV) enhanced by ML has significantly improved the accuracy and efficiency of identifying mapping IPS, and it also serves as a powerful instrument for ecological management. The integrative association is essential to manage the alien species better, as researchers from multiple other fields participate in modeling innovative methods and structures. Incorporating advanced technologies like light detection and ranging (LiDAR) and hyperspectral imaging shows potential for improving spatial and spectral analysis approaches and utilizing ML approaches such as a support vector machine (SVM), random forest (RF), artificial neural network (ANN), convolutional neural network (CNN), and deep convolutional neural network (DCNN) analysis for detecting complex IPS. The significant results indicate that ML methods, most importantly SVM and RF, are victorious in recognizing the alien species via analyzing RS data. This report emphasizes the importance of continuous research efforts to improve predictive models, fill gaps in our understanding of the connections between climate, urbanization and invasion dynamics, and expands conservation initiatives via utilizing RS techniques. This study also highlights the potential for RS data to refine management plans, enabling the implementation of more efficient strategies for controlling IPS and preserving ecosystems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. From Spectral Characteristics to Index Bands: Utilizing UAV Hyperspectral Index Optimization on Algorithms for Estimating Canopy Nitrogen Concentration in Carya Cathayensis Sarg.
- Author
-
Feng, Hailin, Zhou, Tong, Wang, Ketao, Huang, Jianqin, Liang, Hao, Lu, Chenghao, Ruan, Yaoping, and Xu, Liuchang
- Subjects
STANDARD deviations ,DRONE aircraft ,HICKORIES ,MACHINE learning ,NITROGEN - Abstract
Employing drones and hyperspectral imagers for large-scale, precise evaluation of nitrogen (N) concentration in Carya cathayensis Sarg canopies is crucial for accurately managing nitrogen fertilization in C. cathayensis Sarg cultivation. This study gathered five sets of hyperspectral imagery data from C. cathayensis Sarg plantations across four distinct locations with varying environmental stresses using drones. The research assessed the canopy nitrogen concentration of C. cathayensis Sarg trees both during singular growth periods and throughout their entire growth cycles. The objective was to explore the influence of band combinations and spectral index formula configurations on the predictive capability of the hyperspectral indices (HIs) for canopy N concentration (CNC), optimize the performance between HIs and machine learning approaches, and validate the efficacy of optimized HI algorithms. The findings revealed the following: (i) Optimized HIs demonstrated optimal predictive performance during both singular growth periods and the full growth cycles of C. cathayensis Sarg. The most effective HI model for singular growth periods was the optimized–modified–normalized difference vegetation index (opt-mNDVI), achieving an adjusted coefficient of determination (R
2 ) of 0.96 and a root mean square error (RMSE) of 0.71. For the entire growth cycle, the HI model, also opt-mNDVI, attained an R2 of 0.75 and an RMSE of 2.11; (ii) optimized band combinations substantially enhanced HIs' predictive performance by 16% to 71%, while the choice between three-band and two-band combinations influenced the predictive capacity of optimized HIs by 4% to 46%. Hence, utilizing optimized HIs combined with Unmanned Aerial Vehicle (UAV) hyperspectral imaging to evaluate nitrogen concentration in C. cathayensis Sarg trees under complex field conditions offers significant practical value. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
17. A Graph Similarity Algorithm Based on Graph Partitioning and Attention Mechanism.
- Author
-
Miao, Fengyu, Zhou, Xiuzhuang, Xiao, Shungen, and Zhang, Shiliang
- Subjects
GRAPH neural networks ,MACHINE learning ,SUBGRAPHS ,ALGORITHMS - Abstract
In recent years, graph similarity algorithms have been extensively developed based on neural networks. However, with an increase in the node count in graphs, these models either suffer from a reduced representation ability or face a significant increase in the computational cost. To address this issue, a graph similarity algorithm based on graph partitioning and attention mechanisms was proposed in this study. Our method first divided each input graph into the subgraphs to directly extract the local structural features. The residual graph convolution and multihead self-attention mechanisms were employed to generate node embeddings for each subgraph, extract the feature information from the nodes, and regenerate the subgraph embeddings using varying attention weights. Initially, rough cosine similarity calculations were performed on all subgraph pairs from the two sets of subgraphs, with highly similar pairs selected for precise similarity computation. These results were then integrated into the similarity score for the input graph. The experimental results indicated that the proposed learning algorithm outperformed the traditional algorithms and similar computing models in terms of graph similarity computation performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Epidemiology and Ecology of Usutu Virus Infection and Its Global Risk Distribution.
- Author
-
Chen, Jiahao, Zhang, Yuanyuan, Zhang, Xiaoai, Zhang, Meiqi, Yin, Xiaohong, Zhang, Lei, Peng, Cong, Fu, Bokang, Fang, Liqun, and Liu, Wei
- Subjects
AEDES albopictus ,CULEX pipiens ,ENGLISH sparrow ,VIRAL ecology ,MACHINE learning ,AEDES aegypti - Abstract
Usutu virus (USUV) is an emerging mosquito-transmitted flavivirus with increasing incidence of human infection and geographic expansion, thus posing a potential threat to public health. In this study, we established a comprehensive spatiotemporal database encompassing USUV infections in vectors, animals, and humans worldwide by an extensive literature search. Based on this database, we characterized the geographic distribution and epidemiological features of USUV infections. By employing boosted regression tree (BRT) models, we projected the distributions of three main vectors (Culex pipiens, Aedes albopictus, and Culiseta longiareolata) and three main hosts (Turdus merula, Passer domesticus, and Ardea cinerea) to obtain the mosquito index and bird index. These indices were further incorporated as predictors into the USUV infection models. Through an ensemble learning model, we achieved a decent model performance, with an area under the curve (AUC) of 0.992. The mosquito index contributed significantly, with relative contributions estimated at 25.51%. Our estimations revealed a potential exposure area for USUV spanning 1.80 million km
2 globally with approximately 1.04 billion people at risk. This can guide future surveillance efforts for USUV infections, especially for countries located within high-risk areas and those that have not yet conducted surveillance activities. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
19. Extraction of Maize Distribution Information Based on Critical Fertility Periods and Active–Passive Remote Sensing.
- Author
-
Lv, Xiaoran, Zhang, Xiangjun, Yu, Haikun, Lu, Xiaoping, Zhou, Junli, Feng, Junbiao, and Su, Hang
- Abstract
This study proposes a new method for integrating active and passive remote sensing data during critical reproductive periods in order to extract maize areas early and to address the problem of low accuracy in the classification of maize-growing areas affected by climate change. Focusing on Jiaozuo City, this study utilized active–passive remote sensing images to determine the optimal time for maize identification. The relative importance of features was assessed using a feature selection method combined with a machine learning algorithm, the impact of both single-source and multi-source features on accuracy was analyzed to generate the optimal feature subset, and the classification accuracies of different machine learning classification methods for maize at the tasseling stage were compared. Ultimately, this study identified the most effective remote sensing features and methods for maize detection during the optimal fertility period. The experimental results show that the feature set optimized for the tasseling stage significantly enhanced maize recognition accuracy. Specifically, the random forest (RF) method, when applied to the multi-source data fusion feature set, yielded the highest accuracy, improving classification accuracy by 24.6% and 4.86% over single-source features, and achieving an overall accuracy of 93.38% with a Kappa coefficient of 0.91. Data on the study area's maize area were also extracted for the years 2018–2022, with accuracy values of 93.83%, 98.77%, 97%, and 98.05%, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Trends and Gaps in the Scientific Literature about the Effects of Nutritional Supplements on Canine Leishmaniosis.
- Author
-
Previti, Annalisa, Biondi, Vito, Sicuso, Diego Antonio, Pugliese, Michela, and Passantino, Annamaria
- Subjects
SCIENTIFIC literature ,LEISHMANIASIS ,EVIDENCE gaps ,DIETARY supplements ,TEXT mining - Abstract
In canine leishmaniosis (CanL), complex interactions between the parasites and the immunological background of the host influence the clinical presentation and evolution of infection and disease. Therefore, the potential use of nutraceuticals as immunomodulatory agents becomes of considerable interest. Some biological principles, mainly derived from plants and referred to as plant-derived nutraceuticals, are considered as supplementation for Leishmania spp. infection. This study provides a systematic review regarding the use of nutraceuticals as a treatment using a text mining (TM) and topic analysis (TA) approach to identify dominant topics of nutritional supplements in leishmaniosis-based research, summarize the temporal trend in topics, interpret the evolution within the last century and highlight any possible research gaps. Scopus
® database was screened to select 18 records. Findings revealed an increasing trend in research records since 1994. TM identified terms with the highest weighted frequency and TA highlighted the main research areas, namely "Nutraceutical supports and their anti-inflammatory/antioxidant properties", "AHCC and nucleotides in CanL", "Vit. D3 and Leishmaniosis", "Functional food effects and Leishmaniosis" and "Extract effects and Leishmaniosis". Despite the existing academic interest, there are only a few studies on this issue so far, which reveals a gap in the literature that should be filled. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
21. Spectral Intelligence: AI-Driven Hyperspectral Imaging for Agricultural and Ecosystem Applications.
- Author
-
Ali, Faizan, Razzaq, Ali, Tariq, Waheed, Hameed, Akhtar, Rehman, Abdul, Razzaq, Khizar, Sarfraz, Sohaib, Rajput, Nasir Ahmed, Zaki, Haitham E. M., Shahid, Muhammad Shafiq, and Ondrasek, Gabrijel
- Subjects
MACHINE learning ,AGRICULTURE ,REMOTE sensing ,PHYTOPATHOGENIC microorganisms ,FARMERS - Abstract
Ensuring global food security amid mounting challenges, such as population growth, disease infestations, resource limitations, and climate change, is a pressing concern. Anticipated increases in food demand add further complexity to this critical issue. Plant pathogens, responsible for substantial crop losses (up to 41%) in major crops like wheat, rice, maize, soybean, and potato, exacerbate the situation. Timely disease detection is crucial, yet current practices often identify diseases at advanced stages, leading to severe infestations. To address this, remote sensing and Hyperspectral imaging (HSI) have emerged as robust and nondestructive techniques, exhibiting promising results in early disease identification. Integrating machine learning algorithms with image data sets enables precise spatial–temporal disease identification, facilitating timely detection, predictive modeling, and effective disease management without compromising fitness or climate adaptability. By harnessing these cutting-edge technologies and data-driven decision-making, growers can optimize input costs while achieving enhanced yields, making significant strides toward global food security in the face of climate change risks. This review will discuss some of the foundational concepts of remote sensing, several platforms used for remote sensing data collection, successful application of the approach, and its future perspective. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. A Diboronic Acid-Based Fluorescent Sensor Array for Rapid Identification of Lonicerae Japonicae Flos and Lonicerae Flos.
- Author
-
Bian, Ying, Xiang, Chenqing, Xu, Yi, Zhu, Rongping, Qin, Shuanglin, and Zhang, Zhijun
- Subjects
FISHER discriminant analysis ,MACHINE learning ,FLUORESCENT probes ,SENSOR arrays ,RATE setting - Abstract
Lonicerae japonicae flos (LJF) and Lonicerae flos (LF) are traditional Chinese herbs that are commonly used and widely known for their medicinal properties and edibility. Although they may have a similar appearance and vary slightly in chemical composition, their effectiveness as medicine and their use in clinical settings vary significantly, making them unsuitable for substitution. In this study, a novel 2 × 3 six-channel fluorescent sensor array is proposed that uses machine learning algorithms in combination with the indicator displacement assay (IDA) method to quickly identify LJF and LF. This array comprises two coumarin-based fluorescent indicators (ES and MS) and three diboronic acid-substituted 4,4′-bipyridinium cation quenchers (Q1–Q3), forming six dynamic complexes (C1–C6). When these complexes react with the ortho-dihydroxy groups of phenolic acid compounds in LJF and LF, they release different fluorescent indicators, which in turn causes distinct fluorescence recovery. By optimizing eight machine learning algorithms, the model achieved 100% and 98.21% accuracy rates in the testing set and the cross-validation predictions, respectively, in distinguishing between LJF and LF using Linear Discriminant Analysis (LDA). The integration of machine learning with this fluorescent sensor array shows great potential in analyzing and detecting foods and pharmaceuticals that contain polyphenols. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Identification of autophagy-related genes as potential biomarkers correlated with immune infiltration in bipolar disorder: a bioinformatics analysis.
- Author
-
Cao, Dong, Liu, Yafang, Mei, Jinghong, Yu, Shuailong, Zeng, Cong, Zhang, Jing, and Li, Yujuan
- Subjects
MACHINE learning ,BRAIN abnormalities ,BIPOLAR disorder ,NEUROLOGICAL disorders ,BIOMARKERS - Abstract
Background: Bipolar disorder (BPD) is a kind of manic and depressive phase alternate episodes of serious mental illness, and it is correlated with well-documented cortical brain abnormalities. Emerging evidence supports that autophagy dysfunction in neuronal system contributes to pathophysiological changes in neurological disease. However, the role of autophagy in bipolar disorder has rarely been elucidated. This study aimed to identify the autophagy-related gene as a potential biomarker Correlated to immune infiltration in BPD. Methods: The microarray dataset GSE23848 and autophagy-related genes (ARGs) were downloaded. Differentially expressed genes (DEGs) between normal and BPD samples were screened using the R software. Machine learning algorithms were performed to screen the significant candidate biomarker from autophagy-related differentially expressed genes (ARDEGs). The correlation between the screened ARDEGs and infiltrating immune cells was explored through correlation analysis. Results: In this study, the autophagy pathway was abundantly enriched and activated in BPD, as indicated by Pathway enrichment analysis. We identified 16 ARDEGs in BPD compared to the normal group. A signature of 4 ARDEGs (ERN1, ATG3, CTSB, and EIF2AK3) was screened. ROC analysis showed that the above genes have good diagnostic performance. In addition, immune correlation analysis considered that the above four genes significantly correlated with immune cells in BPD. Conclusions: Autophagy - immune cell axis mediates pathophysiological changes in BPD. Four important ARDEGs are prospective to be potential biomarkers associated with immune infiltration in BPD and helpful for the prediction or diagnosis of BPD. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. A hybrid Transformer-LSTM model apply to glucose prediction.
- Author
-
Bian, QingXiang, As'arry, Azizan, Cong, XiangGuo, Rezali, Khairil Anas bin Md, and Raja Ahmad, Raja Mohd Kamil bin
- Subjects
CONTINUOUS glucose monitoring ,PREDICTION models ,HYPOGLYCEMIA ,MACHINE learning ,GLUCOSE ,HYPERGLYCEMIA - Abstract
The global prevalence of diabetes is escalating, with estimates indicating that over 536.6 million individuals were afflicted by 2021, accounting for approximately 10.5% of the world's population. Effective management of diabetes, particularly monitoring and prediction of blood glucose levels, remains a significant challenge due to the severe health risks associated with inaccuracies, such as hypoglycemia and hyperglycemia. This study addresses this critical issue by employing a hybrid Transformer-LSTM (Long Short-Term Memory) model designed to enhance the accuracy of future glucose level predictions based on data from Continuous Glucose Monitoring (CGM) systems. This innovative approach aims to reduce the risk of diabetic complications and improve patient outcomes. We utilized a dataset which contain more than 32000 data points comprising CGM data from eight patients collected by Suzhou Municipal Hospital in Jiangsu Province, China. This dataset includes historical glucose readings and equipment calibration values, making it highly suitable for developing predictive models due to its richness and real-time applicability. Our findings demonstrate that the hybrid Transformer-LSTM model significantly outperforms the standard LSTM model, achieving Mean Square Error (MSE) values of 1.18, 1.70, and 2.00 at forecasting intervals of 15, 30, and 45 minutes, respectively. This research underscores the potential of advanced machine learning techniques in the proactive management of diabetes, a critical step toward mitigating its impact. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Multi‐Objective Bayesian Optimization for Laminate‐Inspired Mechanically Reinforced Piezoelectric Self‐Powered Sensing Yarns.
- Author
-
Yang, Ziyue, Park, Kundo, Nam, Jisoo, Cho, Jaewon, Choi, Yong Jun, Kim, Yong‐Il, Kim, Hyeonsoo, Ryu, Seunghwa, and Kim, Miso
- Subjects
MACHINE learning ,TENSILE strength ,STRAINS & stresses (Mechanics) ,MECHANICAL ability ,LAMINATED materials ,YARN - Abstract
Piezoelectric fiber yarns produced by electrospinning offer a versatile platform for intelligent devices, demonstrating mechanical durability and the ability to convert mechanical strain into electric signals. While conventional methods involve twisting a single poly(vinylidene fluoride‐co‐trifluoroethylene)(P(VDF‐TrFE)) fiber mat to create yarns, by limiting control over the mechanical properties, an approach inspired by composite laminate design principles is proposed for strengthening. By stacking multiple electrospun mats in various sequences and twisting them into yarns, the mechanical properties of P(VDF‐TrFE) yarn structures are efficiently optimized. By leveraging a multi‐objective Bayesian optimization‐based machine learning algorithm without imposing specific stacking restrictions, an optimal stacking sequence is determined that simultaneously enhances the ultimate tensile strength (UTS) and failure strain by considering the orientation angles of each aligned fiber mat as discrete design variables. The conditions on the Pareto front that achieve a balanced improvement in both the UTS and failure strain are identified. Additionally, applying corona poling induces extra dipole polarization in the yarn state, successfully fabricating mechanically robust and high‐performance piezoelectric P(VDF‐TrFE) yarns. Ultimately, the mechanically strengthened piezoelectric yarns demonstrate superior capabilities in self‐powered sensing applications, particularly in challenging environments and sports scenarios, substantiating their potential for real‐time signal detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Overview of HOMO-MEX at IberLEF 2024: Hate Speech Detection Towards the Mexican Spanish speaking LGBT+ Population.
- Author
-
Gómez-Adorno, Helena, Bel-Enguix, Gemma, Calvo, Hiram, Ojeda-Trueba, Sergio, Andersen, Scott Thomas, Vásquez, Juan, Alcántara, Tania, Soto, Miguel, and Macias, Cesar
- Subjects
SONG lyrics ,DATA augmentation ,SOCIOLINGUISTICS ,HATE speech ,SPANISH language - Abstract
Copyright of Procesamiento del Lenguaje Natural is the property of Sociedad Espanola para el Procesamiento del Lenguaje Natural and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
27. Drought assessment in Kabul River basin using machine learnings.
- Author
-
KHAN, UZAIR, KHALIL, ALAMGIR, and JAN, SHABIR
- Subjects
MACHINE learning ,DROUGHT forecasting ,WATERSHEDS ,WATER supply ,DECISION trees - Abstract
Droughts significantly impact water resources and agriculture, leading to economic losses and potential human fatalities. This study aims to predict droughts by analysing changes in the Standardised Precipitation Evapotranspiration Index (SPEI) for the Kabul River basin using data from 1981 to 2022. The research is divided into three phases: calculating SPEI, splitting the dataset into training (80%) and testing (20%) subsets, and evaluating model performance. Various machine learning algorithms, including XGBoost, Decision tree, AdaBoost, and KNN, were employed alongside different climatic variables. The models were assessed using statistical metrics such as R², RMSE, MAE, MSE for regression, and confusion matrix, accuracy, precision, recall, F1 score, ROC AUC, and Log loss for classification. Results showed strong performance, with R² values of 0.97, 0.86, 0.92, and 0.96 for XGBoost, KNN, Decision tree, and AdaBoost, respectively. SPEI demonstrated significant potential for drought forecasting, and spatial distribution mapping revealed persistent moderate drought occurrences. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Evading Cyber-Attacks on Hadoop Ecosystem: A Novel Machine Learning-Based Security-Centric Approach towards Big Data Cloud.
- Author
-
Sharma, Neeraj A., Kumar, Kunal, Khorshed, Tanzim, Ali, A B M Shawkat, Khalid, Haris M., Muyeen, S. M., and Jose, Linju
- Subjects
VIRTUAL machine systems ,CLOUD computing security measures ,BIG data ,CYBERTERRORISM ,HYPERVISOR (Computer software) - Abstract
The growing industry and its complex and large information sets require Big Data (BD) technology and its open-source frameworks (Apache Hadoop) to (1) collect, (2) analyze, and (3) process the information. This information usually ranges in size from gigabytes to petabytes of data. However, processing this data involves web consoles and communication channels which are prone to intrusion from hackers. To resolve this issue, a novel machine learning (ML)-based security-centric approach has been proposed to evade cyber-attacks on the Hadoop ecosystem while considering the complexity of Big Data in Cloud (BDC). An Apache Hadoop-based management interface "Ambari" was implemented to address the variation and distinguish between attacks and activities. The analyzed experimental results show that the proposed scheme effectively (1) blocked the interface communication and retrieved the performance measured data from (2) the Ambari-based virtual machine (VM) and (3) BDC hypervisor. Moreover, the proposed architecture was able to provide a reduction in false alarms as well as cyber-attack detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Knowledge Graphs and Semantic Web Tools in Cyber Threat Intelligence: A Systematic Literature Review.
- Author
-
Bratsas, Charalampos, Anastasiadis, Efstathios Konstantinos, Angelidis, Alexandros K., Ioannidis, Lazaros, Kotsakis, Rigas, and Ougiaroglou, Stefanos
- Subjects
KNOWLEDGE graphs ,SEMANTIC Web ,COMPUTER crimes ,DATA mining ,INTERNET security - Abstract
The amount of data related to cyber threats and cyber attack incidents is rapidly increasing. The extracted information can provide security analysts with useful Cyber Threat Intelligence (CTI) to enhance their decision-making. However, because the data sources are heterogeneous, there is a lack of common representation of information, rendering the analysis of CTI complicated. With this work, we aim to review ongoing research on the use of semantic web tools such as ontologies and Knowledge Graphs (KGs) within the CTI domain. Ontologies and KGs can effectively represent information in a common and structured schema, enhancing interoperability among the Security Operation Centers (SOCs) and the stakeholders on the field of cybersecurity. When fused with Machine Learning (ML) and Deep Learning (DL) algorithms, the constructed ontologies and KGs can be augmented with new information and advanced inference capabilities, facilitating the discovery of previously unknown CTI. This systematic review highlights the advancements of this field over the past and ongoing decade and provides future research directions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Combination of Multiple Variables and Machine Learning for Regional Cropland Water and Carbon Fluxes Estimation: A Case Study in the Haihe River Basin.
- Author
-
Cheng, Minghan, Liu, Kaihua, Liu, Zhangxin, Xu, Junzeng, Zhang, Zhengxian, and Sun, Chengming
- Subjects
WATER management ,MACHINE learning ,ECOSYSTEM management ,EDDY flux ,RADIATION ,WATER bikes - Abstract
Understanding the water and carbon cycles within terrestrial ecosystems is crucial for effective monitoring and management of regional water resources and the ecological environment. However, physical models like the SEB- and LUE-based ones can be complex and demand extensive input data. In our study, we leveraged multiple variables (vegetation growth, surface moisture, radiative energy, and other relative variables) as inputs for various regression algorithms, including Multiple Linear Regression (MLR), Random Forest Regression (RFR), and Backpropagation Neural Network (BPNN), to estimate water (ET) and carbon fluxes (NEE) in the Haihe River Basin, and compared the estimated results with the observations from six eddy covariance flux towers. We aimed to (1) assess the impacts of different input variables on the accuracy of ET and NEE estimations, (2) compare the accuracy of the three regression methods, including three machine learning algorithms and Multiple Linear Regression, and (3) evaluate the performance of ET and NEE estimation models across various regions. The key findings include: (1) Increasing the number of input variables typically improved the accuracy of ET and NEE estimations. (2) RFR proved to be the most accurate for both ET and NEE estimations among the three regression algorithms. Of these, the four types of variables used together with RFR resulted in the best accuracy for ET (R
2 of 0.81 and an RMSE of 1.13 mm) and NEE (R2 of 0.83 and an RMSE of 2.83 gC/m2 ) estimations. (3) Vegetation growth variables (i.e., VIs) are the most important inputs for ET and NEE estimation. (4) The proposed ET and NEE estimation models exhibited some variation in accuracy across different validation sites. Despite these variations, the accuracy levels across all six validation sites remained relatively high. Overall, this study lays the groundwork for an efficient approach to agricultural water resources and ecosystem monitoring and management. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
31. Improved Winter Wheat Yield Estimation by Combining Remote Sensing Data, Machine Learning, and Phenological Metrics.
- Author
-
Li, Shiji, Huang, Jianxi, Xiao, Guilong, Huang, Hai, Sun, Zhigang, and Li, Xuecao
- Subjects
NORMALIZED difference vegetation index ,MACHINE learning ,AGRICULTURE ,CROP yields ,SOIL weathering - Abstract
Accurate yield prediction is essential for global food security and effective agricultural management. Traditional empirical statistical models and crop models face significant limitations, including high computational demands and dependency on high-resolution soil and daily weather data, that restrict their scalability across different temporal and spatial scales. Moreover, the lack of sufficient observational data further hinders the broad application of these methods. In this study, building on the SCYM method, we propose an integrated framework that combines crop models and machine learning techniques to optimize crop yield modeling methods and the selection of vegetation indices. We evaluated three commonly used vegetation indices and three widely applied ML techniques. Additionally, we assessed the impact of combining meteorological and phenological variables on yield estimation accuracy. The results indicated that the green chlorophyll vegetation index (GCVI) outperformed the normalized difference vegetation index (NDVI) and enhanced vegetation index (EVI) in linear models, achieving an R
2 of 0.31 and an RMSE of 396 kg/ha. Non-linear ML methods, particularly LightGBM, demonstrated superior performance, with an R2 of 0.42 and RMSE of 365 kg/ha for GCVI. The combination of GCVI with meteorological and phenological data provided the best results, with an R2 of 0.60 and an RMSE of 295 kg/ha. Our proposed framework significantly enhances the accuracy and efficiency of winter wheat yield estimation, supporting more effective agricultural management and policymaking. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
32. Streamflow prediction using machine learning models in selected rivers of Southern India.
- Author
-
Sharma, Rajat Kr, Kumar, Sudhanshu, Padmalal, D., and Roy, Arka
- Subjects
MACHINE learning ,WATER management ,SUPPORT vector machines ,CLIMATE extremes ,FLOW simulations ,WATER table ,WATERSHEDS - Abstract
The need for adequate data on the spatial and temporal variability of freshwater resources is a significant challenge to the water managers of the world in water resource planning and management. The problems will be acute in the coming years because of the increase in frequency and intensity of hydrologic extremes due to climate change. Therefore, streamflow prediction has become an important area of research because of its importance in flood mitigation, reservoir operation, and water resource management. In this paper, we have tested four Machine Learning models (ML models): Support Vector Machines (SVM), Random Forest (RF), Long Short-Term Memory (LSTM), and Multivariate Adaptive Regression Splines (MARS) for streamflow prediction at daily and monthly time scales in three rivers draining in the different climatic and geological settings. The SVM, RF, LSTM, and MARS models have been trained and tested in the Suvarna, Aghanashini, and Kunderu River Basins in peninsular India. Model intercomparison was made to identify the best suitable model for streamflow prediction. The RF outperforms other models for daily streamflow, and MARS outperforms other models for monthly streamflow prediction in the Suvarna river with Nash-Sutcliffe efficiency (NSE) values of 0.676 and 0.924, respectively. SVM (NSE = 0.741) and RF (NSE = 0.826) are found to be the best models for daily and monthly streamflow prediction in the Aghanashini river. MARS outperformed other models in the case of high, severe, and extreme flow simulation with NSE values of 0.481, 0.374, and 0.455, respectively, in the Aghanashini river. Other hydrological variables (groundwater level data, antecedent soil moisture, potential evapotranspiration data) and a better spatial resolution of rainfall data can be used to develop more accurate machine-learning models for streamflow predictions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Enhancing the Resolution of Satellite Ocean Data Using Discretized Satellite Gridding Neural Networks.
- Author
-
Liu, Shirong, Jia, Wentao, Wang, Qianyun, Zhang, Weimin, and Wang, Huizan
- Subjects
ARTIFICIAL neural networks ,MULTISENSOR data fusion ,MACHINE learning ,SPATIAL resolution ,NONLINEAR systems - Abstract
Ocean satellite data are often impeded by intrinsic limitations in resolution and accuracy. However, conventional data reconstruction approaches encounter substantial challenges when facing the nonlinear oceanic system and high-resolution fusion of variables. This research presents a Discrete Satellite Gridding Neural Network (DSGNN), a new machine learning method that processes satellite data within a discrete grid framework. By transforming the positional information of grid elements into a standardized vector format, the DSGNN significantly elevates the accuracy and resolution of data fusion through a neural network model. This method's innovative aspect lies in its discretization and fusion technique, which not only enhances the spatial resolution of oceanic data but also, through the integration of multi-element datasets, better reflects the true physical state of the ocean. A comprehensive analysis of the reconstructed datasets indicates the DSGNN's consistency and reliability across different seasons and oceanic regions, especially in its adept handling of complex nonlinear interactions and small-scale oceanic features. The DSGNN method has demonstrated exceptional competence in reconstructing global ocean datasets, maintaining small error variance, and achieving high congruence with in situ observations, which is almost equivalent to 1/12° hybrid coordinate ocean model (HYCOM) data. This study offers a novel and potent strategy for the high-resolution reconstruction and fusion of ocean satellite datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. The fusion of vegetation indices increases the accuracy of cotton leaf area prediction.
- Author
-
Xianglong Fan, Pan Gao, Mengli Zhang, Hao Cang, Lifu Zhang, Ze Zhang, Jin Wang, Xin Lv, Qiang Zhang, and Lulu Ma
- Subjects
LEAF area index ,BACK propagation ,SUPPORT vector machines ,MACHINE learning ,CROP canopies ,COTTON - Abstract
Introduction: Rapid and accurate estimation of leaf area index (LAI) is of great significance for the precision agriculture because LAI is an important parameter to evaluate crop canopy structure and growth status. Methods: In this study, 20 vegetation indices were constructed by using cotton canopy spectra. Then, cotton LAI estimation models were constructed based on multiple machine learning (ML) methods extreme learning machine (ELM), random forest (RF), back propagation (BP), multivariable linear regression (MLR), support vector machine (SVM)], and the optimal modeling strategy (RF) was selected. Finally, the vegetation indices with a high correlation with LAI were fused to construct the VI-fusion RF model, to explore the potential of multivegetation index fusion in the estimation of cotton LAI. Results: The RF model had the highest estimation accuracy among the LAI estimation models, and the estimation accuracy of models constructed by fusing multiple VIs was higher than that of models constructed based on single VIs. Among the multi-VI fusion models, the RF model constructed based on the fusion of seven vegetation indices (MNDSI, SRI, GRVI, REP, CIred-edge, MSR, and NVI) had the highest estimation accuracy, with coefficient of determination (R2), rootmean square error (RMSE), normalized rootmean square error (NRMSE), and mean absolute error (MAE) of 0.90, 0.50, 0.14, and 0.26, respectively. Discussion: Appropriate fusion of vegetation indices can include more spectral features in modeling and significantly improve the cotton LAI estimation accuracy. This study will provide a technical reference for improving the cotton LAI estimation accuracy, and the proposed method has great potential for crop growth monitoring applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Predicting Food‐Security Crises in the Horn of Africa Using Machine Learning.
- Author
-
Busker, Tim, van den Hurk, Bart, de Moel, Hans, van den Homberg, Marc, van Straaten, Chiem, Odongo, Rhoda A., and Aerts, Jeroen C. J. H.
- Subjects
MACHINE learning ,FOOD security ,LEAD time (Supply chain management) ,PREDICTION models ,DROUGHTS - Abstract
In this study, we present a machine‐learning model capable of predicting food insecurity in the Horn of Africa, which is one of the most vulnerable regions worldwide. The region has frequently been affected by severe droughts and food crises over the last several decades, which will likely increase in future. Therefore, exploring novel methods of increasing early warning capabilities is of vital importance to reducing food‐insecurity risk. We present a XGBoost machine‐learning model to predict food‐security crises up to 12 months in advance. We used >20 data sets and the FEWS IPC current‐situation estimates to train the machine‐learning model. Food‐security dynamics were captured effectively by the model up to 3 months in advance (R2 > 0.6). Specifically, we predicted 20% of crisis onsets in pastoral regions (n = 96) and 20%–50% of crisis onsets in agro‐pastoral regions (n = 22) with a 3‐month lead time. We also compared our 8‐month model predictions to the 8‐month food‐security outlooks produced by FEWS NET. Over a relatively short test period (2019–2022), results suggest the performance of our predictions is similar to FEWS NET for agro‐pastoral and pastoral regions. However, our model is clearly less skilled in predicting food security for crop‐farming regions than FEWS NET. With the well‐established FEWS NET outlooks as a basis, this study highlights the potential for integrating machine‐learning methods into operational systems like FEWS NET. Plain Language Summary: In the face of increasing droughts and food crises, this study explored the use of machine learning to provide predictions of food crises in the Horn of Africa, up to 12 months in advance. We used an algorithm called "XGBoost," which we fed with over 20 data sets of potential food security drivers. After training the model, we found that food security dynamics were accurately predicted up to 3 months in advance, especially in pastoral and agro‐pastoral regions. The model accurately predicted 20% of crisis onsets in pastoral areas and 20%–50% in agro‐pastoral regions with a 3‐month lead time. In agro‐pastoral and pastoral regions, our machine learning algorithm showed a similar performance to the established early warning system from FEWS NET. The machine‐learning model did not show good performance in crop‐farming areas. Nonetheless, this study underscores the potential of integrating machine‐learning methods into existing operational systems like FEWS NET. By doing so, it paves the way for improved early warning capabilities, crucial in mitigating the looming threat of food insecurity in the Horn of Africa. Key Points: A machine‐learning model is presented to predict food‐security crises in the Horn of AfricaThe model demonstrates high overall performance, and performs similarly to FEWS NET outlooks in the (agro‐) pastoral regionsThis study can be utilized to integrate machine learning into existing early warning systems, thereby creating hybrid solutions for the future [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Unlocking treatment success: predicting atypical antipsychotic continuation in youth with mania.
- Author
-
Yang, Xiangying, Huang, Wenbo, Liu, Li, Li, Lei, Qing, Song, Huang, Na, Zeng, Jun, and Yang, Kai
- Abstract
Purpose: This study aimed to create and validate robust machine-learning-based prediction models for antipsychotic drug (risperidone) continuation in children and teenagers suffering from mania over one year and to discover potential variables for clinical treatment. Method: The study population was collected from the national claims database in China. A total of 4,532 patients aged 4–18 who began risperidone therapy for mania between September 2013 and October 2019 were identified. The data were randomly divided into two datasets: training (80%) and testing (20%). Five regularly used machine learning methods were employed, in addition to the SuperLearner (SL) algorithm, to develop prediction models for the continuation of atypical antipsychotic therapy. The area under the receiver operating characteristic curve (AUC) with a 95% confidence interval (CI) was utilized. Results: In terms of discrimination and robustness in predicting risperidone treatment continuation, the generalized linear model (GLM) performed the best (AUC: 0.823, 95% CI: 0.792–0.854, intercept near 0, slope close to 1.0). The SL model (AUC: 0.823, 95% CI: 0.791–0.853, intercept near 0, slope close to 1.0) also exhibited significant performance. Furthermore, the present findings emphasize the significance of several unique clinical and socioeconomic variables, such as the frequency of emergency room visits for nonmental health disorders. Conclusions: The GLM and SL models provided accurate predictions regarding risperidone treatment continuation in children and adolescents with episodes of mania and hypomania. Consequently, applying prediction models in atypical antipsychotic medicine may aid in evidence-based decision-making. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Evaluation of machine learning-dynamical hybrid method incorporating remote sensing data for in-season maize yield prediction under drought.
- Author
-
Luo, Yi, Wang, Huijing, Cao, Junjun, Li, Jinxiao, Tian, Qun, Leng, Guoyong, and Niyogi, Dev
- Subjects
REMOTE sensing ,CORN ,METEOROLOGICAL satellites ,PRODUCTION losses ,CHLOROPHYLL spectra ,DROUGHTS ,DROUGHT forecasting - Abstract
Effective yield forecasting is a key strategy for adaptation when facing food loss to climate variability. Currently, solar-induced chlorophyll fluorescence (SIF) is an emerging remote-sensing index owing to its high relevance to plant photosynthesis, and sensitivity to drought. Despite many studies have focused on drought monitoring and production assessment by SIF, little puts it into practice for in-season yield prediction. In this study, we combined multi-source satellite and meteorological data, especially coupling with subseasonal-to-seasonal (S2S) dynamic atmospheric prediction climate model (IAP-CAS FGOALS-f2), with an addition of SIF, to predict maize yields in the U.S. Corn Belt, based on the developed machine learning dynamical hybrid model (MHCF). By comparison, we found that SIF performed well in the correlation analysis with yield, with average correlations up to 0.719 in August. Then we utilized different algorithms, different models (S2S data for MHCF, climate data for the Benchmark), and different input combinations to train and predict maize yields. All four algorithms using SIF significantly improved prediction performance. S2S + VIs + SIF combination (FGOALS-f2、NDVI、EVI、SIF) can achieve the best performance, while the XGBoost algorithm reached 0.897 of R
2 . With the best combination, it can achieve 4 months before maize harvest (with R2 value of 0.85, and RMSE < 13 bu/acre). In 2012, the year had a severe drought, although predictive capability decreased in all the predictions, the models with SIF still maintained robust and improved the prediction (improved R2 by 5.92%, and RMSE decreased by 18.08% of XGBoost). According to the study, it can be expected, the combination of MHCF and SIF will play a greater role in subseasonal yield prediction. We also provide an operational proposition of hybrid yield forecasting method to fully integrating climate prediction and machine learning for early notice of crop production losses. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
38. Classification of Garlic (Allium sativum L.) Crops by Fertilizer Differences Using Ground-Based Hyperspectral Imaging System.
- Author
-
Chung, Hwanjo, Wi, Seunghwan, Cho, Byoung-Kwan, and Lee, Hoonsoo
- Subjects
HYPERSPECTRAL imaging systems ,FERTILIZER application ,STATISTICAL learning ,MACHINE learning ,AGRICULTURAL productivity - Abstract
In contemporary agriculture, enhancing the efficient production of crops and optimizing resource utilization have become paramount objectives. Garlic growth and quality are influenced by various factors, with fertilizers playing a pivotal role in shaping both aspects. This study aimed to develop classification models for distinguishing garlic fertilizer application differences by employing statistical and machine learning techniques, such as partial least squares (PLS), based on data acquired from a ground-based hyperspectral imaging system in the agricultural sector. The garlic variety chosen for this study was Hongsan, and the fertilizer application plots were segmented into three distinct sections. Data were acquired within the VIS/NIR wavelength range using hyperspectral imaging. Following data acquisition, the standard normal variate (SNV) pre-processing technique was applied to enhance the dataset. To identify the optimal wavelengths, various techniques such as sequential forward selection (SFS), successive projections algorithm (SPA), variable importance in projection (VIP), and interval partial least squares (iPLS) were employed, resulting in the selection of 12 optimal wavelengths. For the fertilizer application difference model, six integrated vegetation indices were chosen for comparison with existing growth indicators. Using the same methodology, the model construction showed accuracies of 90.7% for PLS. Thus, the proposed model suggests that efficient regulation of garlic fertilizer application can be achieved by utilizing statistical and machine learning techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. The present and future of seizure detection, prediction, and forecasting with machine learning, including the future impact on clinical trials.
- Author
-
Kerr, Wesley T., McFarlane, Katherine N., and Pucci, Gabriela Figueiredo
- Subjects
MACHINE learning ,CLINICAL trials ,SEIZURES (Medicine) ,EPILEPTIFORM discharges ,ARTIFICIAL intelligence - Abstract
Seizures have a profound impact on quality of life and mortality, in part because they can be challenging both to detect and forecast. Seizure detection relies upon accurately differentiating transient neurological symptoms caused by abnormal epileptiform activity from similar symptoms with different causes. Seizure forecasting aims to identify when a person has a high or low likelihood of seizure, which is related to seizure prediction. Machine learning and artificial intelligence are data-driven techniques integrated with neurodiagnostic monitoring technologies that attempt to accomplish both of those tasks. In this narrative review, we describe both the existing software and hardware approaches for seizure detection and forecasting, as well as the concepts for how to evaluate the performance of new technologies for future application in clinical practice. These technologies include long-term monitoring both with and without electroencephalography (EEG) that report very high sensitivity as well as reduced false positive detections. In addition, we describe the implications of seizure detection and forecasting upon the evaluation of novel treatments for seizures within clinical trials. Based on these existing data, long-term seizure detection and forecasting with machine learning and artificial intelligence could fundamentally change the clinical care of people with seizures, but there are multiple validation steps necessary to rigorously demonstrate their benefits and costs, relative to the current standard. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Prediction of wheat SPAD using integrated multispectral and support vector machines.
- Author
-
Wei Wang, Na Sun, Bin Bai, Hao Wu, Yukun Cheng, Hongwei Geng, JiKun Song, JinPing Zhou, Zhiyuan Pang, SongTing Qian, and Wanyin Zeng
- Subjects
SUPPORT vector machines ,WINTER wheat ,MACHINE learning ,WATER restrictions ,WHEAT ,MULTISPECTRAL imaging - Abstract
Rapidly obtaining the chlorophyll content of crop leaves is of great significance for timely diagnosis of crop health and effective field management. Multispectral imagery obtained from unmanned aerial vehicles (UAV) is being used to remotely sense the SPAD (Soil and Plant Analyzer Development) values of wheat crops. However, existing research has not yet fully considered the impact of different growth stages and crop populations on the accuracy of SPAD estimation. In this study, 300 materials from winter wheat natural populations in Xinjiang, collected between 2020 to 2022, were analyzed. UAV multispectral images were obtained in the experimental area, and vegetation indices were extracted to analyze the correlation between the selected vegetation indices and SPAD values. The input variables for the model were screened, and a support vector machine (SVM) model was constructed to estimate SPAD values during the heading, flowering, and filling stages under different water stresses. The aim was to provide a method for the rapid acquisition of winter wheat SPAD values. The results showed that the SPAD values under normal irrigation were higher than those under water restriction. Multiple vegetation indices were significantly correlated with SPAD values. In the prediction model construction of SPAD, the different models had high estimation accuracy under both normal irrigation and water limitation treatments, with correlation coefficients of predicted and measured values under normal irrigation in different environments the value of r from 0.59 to 0.81 and RMSE from 2.15 to 11.64, compared to RE from 0.10% to 1.00%; and under drought stress in different environments, correlation coefficients of predicted and measured values of r was 0.69--0.79, RMSE was 2.30--12.94, and RE was 0.10%--1.30%. This study demonstrated that the optimal combination of feature selection methods and machine learning algorithms can lead to a more accurate estimation of winter wheat SPAD values. In summary, the SVM model based on UAV multispectral images can rapidly and accurately estimate winter wheat SPAD value. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Spatially distributed snow depth, bulk density, and snow water equivalent from ground-based and airborne sensor integration at Grand Mesa, Colorado, USA.
- Author
-
Meehan, Tate G., Hojatimalekshah, Ahmad, Marshall, Hans-Peter, Deeb, Elias J., O'Neel, Shad, McGrath, Daniel, Webb, Ryan W., Bonnell, Randall, Raleigh, Mark S., Hiemstra, Christopher, and Elder, Kelly
- Subjects
SNOW accumulation ,GROUND penetrating radar ,MACHINE learning ,WATER depth ,LIDAR - Abstract
Estimating snow mass in the mountains remains a major challenge for remote-sensing methods. Airborne lidar can retrieve snow depth, and some promising results have recently been obtained from spaceborne platforms, yet density estimates are required to convert snow depth to snow water equivalent (SWE). However, the retrieval of snow bulk density remains unsolved, and limited data are available to evaluate model estimates of density in mountainous terrain. Toward the goal of landscape-scale retrievals of snow density, we estimated bulk density and length-scale variability by combining ground-penetrating radar (GPR) two-way travel-time observations and airborne-lidar snow depths collected during the mid-winter NASA SnowEx 2020 campaign at Grand Mesa, Colorado, USA. Key advancements of our approach include an automated layer-picking method that leverages the GPR reflection coherence and the distributed lidar–GPR-retrieved bulk density with machine learning. The root-mean-square error between the distributed estimates and in situ observations is 11 cm for depth, 27 kgm-3 for density, and 46 mm for SWE. The median relative uncertainty in distributed SWE is 13 %. Interactions between wind, terrain, and vegetation display corroborated controls on bulk density that show model and observation agreement. Knowledge of the spatial patterns and predictors of density is critical for the accurate assessment of SWE and essential snow research applications. The spatially continuous snow density and SWE estimated over approximately 16 km2 may serve as necessary calibration and validation for stepping prospective remote-sensing techniques toward broad-scale SWE retrieval. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Research on the Evaluation of Baijiu Flavor Quality Based on Intelligent Sensory Technology Combined with Machine Learning.
- Author
-
Aliya, Liu, Shi, Zhang, Danni, Cao, Yufa, Sun, Jinyuan, Jiang, Shui, and Liu, Yuan
- Subjects
FISHER discriminant analysis ,ELECTRONIC tongues ,ELECTRONIC noses ,MACHINE learning ,SENSORY evaluation ,FLAVOR - Abstract
Baijiu, one of the world's six major distilled spirits, has an extremely rich flavor profile, which increases the complexity of its flavor quality evaluation. This study employed an electronic nose (E-nose) and electronic tongue (E-tongue) to detect 42 types of strong-aroma Baijiu. Linear discriminant analysis (LDA) was performed based on the different production origins, alcohol content, and grades. Twelve trained Baijiu evaluators participated in the quantitative descriptive analysis (QDA) of the Baijiu samples. By integrating characteristic values from the intelligent sensory detection data and combining them with the human sensory evaluation results, machine learning was used to establish a multi-submodel-based flavor quality prediction model and classification model for Baijiu. The results showed that different Baijiu samples could be well distinguished, with a prediction model R
2 of 0.9994 and classification model accuracy of 100%. This study provides support for the establishment of a flavor quality evaluation system for Baijiu. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
43. Predictive Model for Bark Beetle Outbreaks in European Forests.
- Author
-
Fernández-Carrillo, Ángel, Franco-Nieto, Antonio, Yagüe-Ballester, María Julia, and Gómez-Giménez, Marta
- Subjects
MACHINE learning ,BARK beetles ,FOREST management ,RANDOM forest algorithms ,REMOTE sensing - Abstract
Bark beetle outbreaks and forest mortality have rocketed in European forests because of warmer winters, intense droughts, and poor management. The methods developed to predict a bark beetle outbreak have three main limitations: (i) a small-spatial-scale implementation; (ii) specific field-based input datasets that are usually hard to obtain at large scales; and (iii) predictive models constrained by coarse climatic factors. Therefore, a methodological approach accounting for a comprehensive set of environmental traits that can predict a bark beetle outbreak accurately is needed. In particular, we aimed to (i) analyze the influence of environmental traits that cause bark beetle outbreaks; (ii) compare different machine learning architectures for predicting bark beetle attacks; and (iii) map the attack probability before the start of the bark beetle life cycle. Random Forest regression achieved the best-performing results. The predicted bark beetle damage reached a high robustness in the test area (F1 = 96.9, OA = 94.4) and showed low errors (CE = 2.0, OE = 4.2). Future improvements should focus on including additional variables, e.g., forest age and validation sites. Remote sensing-based methods contributed to detecting bark beetle outbreaks in large extensive forested areas in a cost-effective and robust manner. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. Toward Solving a Puzzle of Fragmented Archeological Textiles.
- Author
-
Gigilashvili, Davit, Gulbrandsen, Casper Fabian, Ha Thu Nguyen, Havgar, Margrethe, Vedeler, Marianne, and Hardeberg, Jon Yngve
- Subjects
FEATURE extraction ,JIGSAW puzzles ,MACHINE learning ,DIGITAL images ,COMPUTER scientists - Abstract
Archeological textiles can provide invaluable insight into the past. However, they are often highly fragmented, and a puzzle has to be solved to re-assemble the object and recover the original motifs. Unlike common jigsaw puzzles, archeological fragments are highly damaged, and no correct solution to the puzzle is known. Although automatic puzzle solving has fascinated computer scientists for a long time, this work is one of the first attempts to apply modern machine learning solutions to archeological textile re-assembly. First and foremost, it is important to know which fragments belong to the same object. Therefore, features are extracted from digital images of textile fragments using color statistics, classical texture descriptors, and deep learning methods. These features are used to conduct clustering and identify similar fragments. Four different case studies with increasing complexity are discussed in this article: from well - preserved textiles with available ground truth to an actual open problem of Oseberg archeological tapestry with unknown solution. This work reveals significant knowledge gaps in current machine learning, which helps us to outline a future avenue toward more specialized application-specific models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Open Remote Sensing Data in Digital Soil Organic Carbon Mapping: A Review.
- Author
-
Radočaj, Dorijan, Gašparović, Mateo, and Jurišić, Mladen
- Subjects
DIGITAL soil mapping ,DIGITAL mapping ,DIGITAL maps ,ENVIRONMENTAL mapping ,LITERATURE reviews - Abstract
This review focuses on digital soil organic carbon (SOC) mapping at regional or national scales in spatial resolutions up to 1 km using open data remote sensing sources, emphasizing its importance in achieving United Nations' Sustainable Development Goals (SDGs) related to hunger, climate action, and land conservation. The literature review was performed according to scientific studies indexed in the Web of Science Core Collection database since 2000. The analysis reveals a steady rise in total digital soil mapping studies since 2000, with digital SOC mapping studies accounting for over 20% of these studies in 2023, among which SDGs 2 (Zero Hunger) and 13 (Climate Action) were the most represented. Notably, countries like the United States, China, Germany, and Iran lead in digital SOC mapping research. The shift towards machine and deep learning methods in digital SOC mapping has surged post-2010, necessitating environmental covariates like topography, climate, and spectral data, which are cornerstones of machine and deep learning prediction methods. It was noted that the available climate data primarily restrict the spatial resolution of digital SOC mapping to 1 km, which typically requires downscaling to harmonize with topography (up to 30 m) and multispectral data (up to 10–30 m). Future directions include the integration of diverse remote sensing data sources, the development of advanced algorithms leveraging machine learning, and the utilization of high-resolution remote sensing for more precise SOC mapping. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. A machine learning approach for differentiating bipolar disorder type II and borderline personality disorder using electroencephalography and cognitive abnormalities.
- Author
-
Nazari, Mohammad-Javad, Shalbafan, Mohammadreza, Eissazade, Negin, Khalilian, Elham, Vahabi, Zahra, Masjedi, Neda, Ghidary, Saeed Shiry, Saadat, Mozafar, and Sadegh-Zadeh, Seyed-Ali
- Subjects
BORDERLINE personality disorder ,BIPOLAR disorder ,MACHINE learning ,ELECTROENCEPHALOGRAPHY ,WISCONSIN Card Sorting Test ,COGNITIVE testing ,MULTIMODAL user interfaces ,COGNITIVE computing - Abstract
This study addresses the challenge of differentiating between bipolar disorder II (BD II) and borderline personality disorder (BPD), which is complicated by overlapping symptoms. To overcome this, a multimodal machine learning approach was employed, incorporating both electroencephalography (EEG) patterns and cognitive abnormalities for enhanced classification. Data were collected from 45 participants, including 20 with BD II and 25 with BPD. Analysis involved utilizing EEG signals and cognitive tests, specifically the Wisconsin Card Sorting Test and Integrated Cognitive Assessment. The k-nearest neighbors (KNN) algorithm achieved a balanced accuracy of 93%, with EEG features proving to be crucial, while cognitive features had a lesser impact. Despite the strengths, such as diverse model usage, it's important to note limitations, including a small sample size and reliance on DSM diagnoses. The study suggests that future research should explore multimodal data integration and employ advanced techniques to improve classification accuracy and gain a better understanding of the neurobiological distinctions between BD II and BPD. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Retrieval of Crop Canopy Chlorophyll: Machine Learning vs. Radiative Transfer Model.
- Author
-
Alam, Mir Md Tasnim, Simic Milas, Anita, Gašparović, Mateo, and Osei, Henry Poku
- Subjects
CROP canopies ,RADIATIVE transfer ,MACHINE learning ,PARTIAL least squares regression ,STANDARD deviations - Abstract
In recent years, the utilization of machine learning algorithms and advancements in unmanned aerial vehicle (UAV) technology have caused significant shifts in remote sensing practices. In particular, the integration of machine learning with physical models and their application in UAV–satellite data fusion have emerged as two prominent approaches for the estimation of vegetation biochemistry. This study evaluates the performance of five machine learning regression algorithms (MLRAs) for the mapping of crop canopy chlorophyll at the Kellogg Biological Station (KBS) in Michigan, USA, across three scenarios: (1) application to Landsat 7, RapidEye, and PlanetScope satellite images; (2) application to UAV–satellite data fusion; and (3) integration with the PROSAIL radiative transfer model (hybrid methods PROSAIL + MLRAs). The results indicate that the majority of the five MLRAs utilized in UAV–satellite data fusion perform better than the five PROSAIL + MLRAs. The general trend suggests that the integration of satellite data with UAV-derived information, including the normalized difference red-edge index (NDRE), canopy height model, and leaf area index (LAI), significantly enhances the performance of MLRAs. The UAV–RapidEye dataset exhibits the highest coefficient of determination (R
2 ) and the lowest root mean square errors (RMSE) when employing kernel ridge regression (KRR) and Gaussian process regression (GPR) (R2 = 0.89 and 0.89 and RMSE = 8.99 µg/cm2 and 9.65 µg/cm2 , respectively). Similar performance is observed for the UAV–Landsat and UAV–PlanetScope datasets (R2 = 0.86 and 0.87 for KRR, respectively). For the hybrid models, the maximum performance is attained with the Landsat data using KRR and GPR (R2 = 0.77 and 0.51 and RMSE = 33.10 µg/cm2 and 42.91 µg/cm2 , respectively), followed by R2 = 0.75 and RMSE = 39.78 µg/cm2 for the PlanetScope data upon integrating partial least squares regression (PLSR) into the hybrid model. Across all hybrid models, the RapidEye data yield the most stable performance, with the R2 ranging from 0.45 to 0.71 and RMSE ranging from 19.16 µg/cm2 to 33.07 µg/cm2 . The study highlights the importance of synergizing UAV and satellite data, which enables the effective monitoring of canopy chlorophyll in small agricultural lands. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
48. Machine learning versus deep learning in land system science: a decision-making framework for effective land classification.
- Author
-
Southworth, Jane, Smith, Audrey C., Safaei, Mohammad, Rahaman, Mashoukur, Alruzuq, Ali, Tefera, Bewuket B., Muir, Carly S., and Herrero, Hannah V.
- Subjects
DEEP learning ,SYSTEMS theory ,MACHINE learning ,CHOICE (Psychology) ,CLASSIFICATION ,DATA integration ,AUTOMATIC classification - Abstract
This review explores the comparative utility of machine learning (ML) and deep learning (DL) in land system science (LSS) classification tasks. Through a comprehensive assessment, the study reveals that while DL techniques have emerged with transformative potential, their application in LSS often faces challenges related to data availability, computational demands, model interpretability, and overfitting. In many instances, traditional ML models currently present more effective solutions, as illustrated in our decisionmaking framework. Integrative opportunities for enhancing classification accuracy include data integration from diverse sources, the development of advanced DL architectures, leveraging unsupervised learning, and infusing domain-specific knowledge. The research also emphasizes the need for regular model evaluation, the creation of diversified training datasets, and fostering interdisciplinary collaborations. Furthermore, while the promise of DL for future advancements in LSS is undeniable, present considerations often tip the balance in favor of ML models for many classification schemes. This review serves as a guide for researchers, emphasizing the importance of choosing the right computational tools in the evolving landscape of LSS, to achieve reliable and nuanced land-use change data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Chickpea leaf water potential estimation from ground and VENµS satellite.
- Author
-
Sadeh, Roy, Avneri, Asaf, Tubul, Yaniv, Lati, Ran N., Bonfil, David J., Peleg, Zvi, and Herrmann, Ittai
- Subjects
ARTIFICIAL neural networks ,CHICKPEA ,LEAF area index ,VEGETATION monitoring ,IRRIGATION scheduling ,LEGUME farming - Abstract
Chickpea (Cicer arietinum L.) is a major grain legume grown worldwide as a staple protein source. Traditionally, it is a rain-fed crop, but supplemental irrigation can increase yields and counteract the challenges posed by the changing climate worldwide. A fast and non-destructive plant water status assessment method may streamline irrigation management. The main objective of this study was to remotely assess the leaf water potential (LWP) and leaf area index (LAI) of field-grown chickpea. Five irrigation treatments were applied in two farm experiments and two commercial fields. Ground hyperspectral canopy reflectance and Vegetation and Environment monitoring on a New Micro-Satellite (VENµS) images acquired throughout the study. In parallel, LWP and LAI measurements were captured in the field. Vegetation indices (VIs) and machine learning (ML) based on all spectral bands were used to calibrate and validate spectral estimation models. The normalized difference spectral index (NDSI) that used bands on 1600 and 1730 nm (NDSI
(1600,1730) ) selected in the current study yielded the LWP lowest estimation error on independent validation (RMSE = 0.19 [MPa]) using linear regression. VENµS based VIs resulted in relatively lower LWP estimation accuracy (RMSE = 0.23–0.29 [MPa]) compared to VIs calculated from ground hyperspectral data (RMSE = 0.19–0.21 [MPa]). Artificial neural network (ANN) models for LWP from ground and space spectral data showed similar performances (RMSE = 0.15–0.17 [MPa]), and were both more accurate than VIs. LWP response to the irrigation treatments was faster than the LAI response and was captured by the NDSI(1600,1730) . The low correlation found between LWP and LAI (r = 0.08–0.44) supports the conclusion that spectral reflectance of chickpea canopy can be used to estimate LWP per se and is only partially affected by morphological changes induced by irrigation treatments and canopy development. The ability to rapidly estimate chickpea LWP may improve irrigation scheduling in the future. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
50. Estimating rainfed groundnut's leaf area index using Sentinel-2 based on Machine Learning Regression Algorithms and Empirical Models.
- Author
-
Ekwe, Michael Chibuike, Adeluyi, Oluseun, Verrelst, Jochem, Kross, Angela, and Odiji, Caleb Akoji
- Subjects
LEAF area index ,MACHINE learning ,NORMALIZED difference vegetation index ,KRIGING - Abstract
The leaf area index (LAI), a crucial biophysical indicator, is used to assess and monitor crop growth for effective agricultural management. This study assessed the LAI at the seedling stage after conducting a field experiment with rainfed groundnut. The study tests the performance of multiple machine learning regression algorithms (MLRAs) and empirical vegetation indices (VIs) in retrieving groundnut's LAI using freely available Sentinel-2 data. The bands at 665 nm, 705 nm, 842 nm, and 2190 nm are the most sensitive for retrieving groundnut's LAI, according to an analysis of its band spectrum. Results suggest that VIs computed with wavebands centered at red (665 nm), red edge (705 nm), and near-infrared (842 nm) exhibited optimal R
2 with Sentinel-2 data. Normalized difference vegetation index (NDVI), red edge normalized difference vegetation index (NDVIre), simple ratio (SR), red edge simple ratio (SRre), and green normalized difference vegetation index (gNDVI) were utilized as predictors for LAI. Regarding the results of the validation between estimated and measured LAI, SR demonstrated the highest accuracy for groundnut LAI prediction (r2 = 0.67, RMSE = 0.89). Ten MLRAs were tested, and results indicate from the perspective of the accuracy of models, the Gaussian processes regression, GPR (r2 = 0.73 and RMSE = 0.81), Kernel ridge regression, KRR (r2 = 0.72 and RMSE = 0.82) and Support vector regression, SVR (r2 = 0.70 and RMSE = 0.85) demonstrated to be the most suitable for LAI estimation for rainfed groundnut at the seedling stage. The systematic analysis based on the regression approaches tested here revealed that the GPR outperformed other models combined, therefore, most suitable for estimating rainfed groundnut LAI at the seedling stage. These findings serve as a benchmark for obtaining crop biophysical parameters in the framework of groundnut traits monitoring in a tropical West Africa. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.