47 results on '"and machine learning"'
Search Results
2. Reliability Testing of Machine Learning Model Prediction Capability towards Unidentifiable Microplastic Spectral Data: Triple Battery and Colorant Investigation.
- Author
-
Williams, Wesley A. and Aravamudhan, Shyam
- Subjects
ARTIFICIAL intelligence ,K-nearest neighbor classification ,COPPER phthalocyanine ,MICROPLASTICS ,ANALYTICAL chemistry - Abstract
Microplastics (MPs) are exudated fragments and fibers from environmental plastic refuse in the lower millimeter to micrometer range with some intentionally created particles and degraded exudates from anthropogenic sources. These particles are often hard to identify due to extensive weathering and material heterogeneity from additives. Leveraging the ability of machine learning (ML) models, which train on features from selected polymer classes, can aid in determining particle identity despite heterogeneity. Herein, a 1,800 spectral training dataset was employed for model training (900 from µ-FTIR and 900 from µ-Raman) using data from common MP polymers (synthetically or naturally derived) and plastics. A threefold battery; synthetic data (SD), mixed synthetic data (MSD), and real-world data from an FTIR Library of Plastic Particles sourced from the Environment (FLOPP-E) and a [Raman] Spectral Library of Plastic Particles aged in the Environment (SLOPP-E); was administered for reliability. Firstly, the SD test determined subspace k-nearest neighbors (SKNN) and wide neural network (WNN) as champions (µ-FTIR and µ-Raman, respectively) with an accuracy of, 99%/100% and 98%/100% (χ
2 = 31.99/69, p =.0024/ <.0001). Secondly, the MSD test served as a progenitor to the multi-class prediction from µ-Raman's SKNN showing consistency across 5 replicates (H =.25—6.59, p =.156—.993). And thirdly, the real-world test exhibited a loss in accuracy rate with only the champion, SKNN, retaining ~ 73% and ~ 49% of the correct predictions. Investigation into the probable causes of misprediction via colorant additives led to the discovery of white and blue as the most unpredictable across both databases. Lastly, an investigation of 2 samples revealed confounding colorant additives (SKNN's predictions) as copper phthalocyanine (C2. Blue Fiber) and a derivative from the family of diketo-pyrrolo-pyrroles (Polyester 12. Red Fiber). [ABSTRACT FROM AUTHOR]- Published
- 2025
- Full Text
- View/download PDF
3. The Role of Social Media Platforms in Spreading Misinformation Targeting Specific Racial and Ethnic Groups: A Brief Review
- Author
-
Farzaneh Saadati, Isun Chehreh, and Ebrahim Ansari
- Subjects
social media ,misinformation ,racial ,ethnic ,fake news ,generative artificial intelligence ,and machine learning ,Telecommunication ,TK5101-6720 - Abstract
This study discusses the impacts of misinformation on social cohesion, trust, and well-being, particularly when targeting specific racial and ethnic groups. It categorizes and reviews various articles to identify the sources and types of misinformation on social media, highlighting common themes and origins. The study briefly acknowledges that generative artificial intelligence and machine learning tools can increase the chance of the generation and spread of harmful misinformation across digital platforms. It also highlights the importance of digital and media literacy education in helping individuals critically evaluate information and navigate online spaces responsibly. Promoting racial and ethnic digital literacy is crucial for protecting against misinformation and fostering informed, representative online discourse. The study calls for a multifaceted approach centered on trust, transparency, and accountability in addressing social media misinformation. Ultimately, it advocates for a culture of critical thinking, factchecking, and ethical behavior to create an online environment that respects diversity, inclusion, and truth, thereby contributing to an informed and empowered society.
- Published
- 2024
- Full Text
- View/download PDF
4. Enhancing Drug Safety: AIs Role in Pharmacovigilance and Adverse Event Reporting.
- Author
-
Alex, Hannah
- Subjects
MEDICATION safety ,MEDICAL equipment ,ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning - Abstract
The role of pharmacovigilance (PV) in healthcare is to optimize the safety and efficacy of the delivery of pharmaceutical drugs and medical equipment. The increasingly dynamic nature of adverse drug reactions and pharmacovigilance has rendered traditional approaches susceptible to the underreporting of ADRs. Subsequently, the integration of Artificial intelligence in adverse drug reaction reporting is an outstanding technological advancement in pharmacovigilance. Therefore, this critical analysis applied a systematic literature review to comprehend the extensive role of AI in Pharmacovigilance. The research findings acknowledged that AI technologies such as machine learning, deep learning, and natural learning processing (NLP) have automated PV, leading to enhanced signal detection, analysis of unstructured data, risk assessment, and regulatory compliance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Data-driven approach for identifying the factors related to debt collector performance
- Author
-
Keerthana Sivamayilvelan, Elakkiya Rajasekar, Santhi Balachandran, Ketan Kotecha, and Subramaniyaswamy Vairavasundaram
- Subjects
Continual learning ,Debt collector performance ,Debt collection ,Deep reinforcement learning ,Statistical analysis ,And machine learning ,Management. Industrial management ,HD28-70 ,Business ,HF5001-6182 - Abstract
The company's success and growth heavily rely on its workforce's performance, yet the evaluation of employees has been only partially and inconclusively executed so far. The primary goal of this research is to build an open innovation framework for analyzing the performance of the debt collector. We have developed the Reinforcement Learning based Continual Learning (RLC) approach for evaluating the performance by analyzing the metrics such as visit patterns and collection percentage. We have used the private debt collection dataset to assess the debt collector's performance. We formulated hypotheses derived from insights gained during exploratory data analysis and subsequently validated them through statistical testing. Whether there are noticeable distinctions among debt collectors in terms of visitation frequency, collection rates, and collection modes. This proposed open innovation framework for analyzing the debt collector performance provides significant variation in terms of collection rate. The proposed EDQN-CL achieved a 13.56 % higher classification rate than the existing algorithm for categorizing the debt collector performance.
- Published
- 2024
- Full Text
- View/download PDF
6. AI in Space.
- Author
-
Kavyashree N., Zareen, Afshan, Amrutha J., and Sharath M. N.
- Subjects
ARTIFICIAL intelligence ,DEEP learning ,SPACE industrialization ,CONSTELLATIONS - Abstract
Artificial intelligence (AI) and its subset, machine learning (ML), are growing in acceptance within the space industry. These days, autonomous navigation, spacecraft health monitoring, and operational management of satellite constellations all make extensive use of ML algorithms. However, a lot of research on AI applications in space missions can be categorized into two main categories. One disadvantage of the first category is that it is out of date and ignores a number of noteworthy and recent advances in the field, such as the contributions of deep learning (DL) and bioinspired AI algorithms. The issue with the second category is that it describes specific AI techniques and algorithms in excessive depth, which might be problematic. Owing to these limitations, a concise. [ABSTRACT FROM AUTHOR]
- Published
- 2024
7. Quantum-Powered Insights: Unravelling the Nexus of Quantum Computing, Machine Learning, and Quantum Machine Learning.
- Author
-
Jeure, Vijayalaxmi and K., Veena
- Subjects
QUANTUM computing ,MACHINE learning ,QUANTUM computers ,QUBITS ,QUANTUM states ,QUANTUM mechanics - Abstract
Quantum computing represents a paradigm shift in computational theory, offering unprecedented capabilities for solving complex problems that are intractable for classical computers. The foundational concepts are quantum mechanics, including superposition, entanglement, and quantum gates, these principles form the basis of quantum computation. Quantum bits, or qubits, leverage superposition and entanglement to perform calculations in a fundamentally different manner than classical bits, allowing for exponential parallelism and potentially revolutionary computational power. Quantum machine learning (QML) stands at the nexus of two cutting-edge fields: quantum computing and data science. Combining the principles of quantum mechanics with the computational power of quantum computers, QML aims to harness quantum phenomena to enhance traditional machine learning algorithms. Quantum machine learning algorithms leverage quantum states, such as superposition and entanglement, to process and analyse data in ways that surpass the capabilities of classical methods. Quantum machine learning represents a frontier in both quantum computing and data science, offering novel approaches to data analysis and decision-making with the potential to transform industries and drive scientific discovery. [ABSTRACT FROM AUTHOR]
- Published
- 2024
8. Machine learning framework for simulation of artifacts in paranasal sinuses diagnosis using CT images.
- Author
-
Musleh, Abdullah
- Subjects
- *
ARTIFICIAL neural networks , *COMPUTED tomography , *MACHINE learning , *ENDOSCOPIC surgery , *DIAGNOSIS , *PARANASAL sinuses , *COMPUTER monitors - Abstract
In the medical field, diagnostic tools that make use of deep neural networks have reached a level of performance never before seen. A proper diagnosis of a patient's condition is crucial in modern medicine since it determines whether or not the patient will receive the care they need. Data from a sinus CT scan is uploaded to a computer and displayed on a high-definition monitor to give the surgeon a clear anatomical orientation before endoscopic sinus surgery. In this study, a unique method is presented for detecting and diagnosing paranasal sinus disorders using machine learning. The researchers behind the current study designed their own approach. To speed up diagnosis, one of the primary goals of our study is to create an algorithm that can accurately evaluate the paranasal sinuses in CT scans. The proposed technology makes it feasible to automatically cut down on the number of CT scan images that require investigators to manually search through them all. In addition, the approach offers an automatic segmentation that may be used to locate the paranasal sinus region and crop it accordingly. As a result, the suggested method dramatically reduces the amount of data that is necessary during the training phase. As a result, this results in an increase in the efficiency of the computer while retaining a high degree of performance accuracy. The suggested method not only successfully identifies sinus irregularities but also automatically executes the necessary segmentation without requiring any manual cropping. This eliminates the need for time-consuming and error-prone human labor. When tested with actual CT scans, the method in question was discovered to have an accuracy of 95.16 percent while retaining a sensitivity of 99.14 percent throughout. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. A Novel Iris Recognition System Development Using Convolutional Neural Network
- Author
-
Manimaran, M., Shalini, M. Angel, Dineshkumar, S., Dinesh, C., Dhinesh, S., Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Agrawal, Jitendra, editor, Shukla, Rajesh K., editor, Sharma, Sanjeev, editor, and Shieh, Chin-Shiuh, editor
- Published
- 2024
- Full Text
- View/download PDF
10. The Use of Big Data Analytics in E-commerce Marketing: A Case Study of Morocco
- Author
-
Hicham, El Yousfi, Abdelilah, Zrida, Imane, Najih, Abdelghaffar, Imane, Rocha, Álvaro, Series Editor, Hameurlain, Abdelkader, Editorial Board Member, Idri, Ali, Editorial Board Member, Vaseashta, Ashok, Editorial Board Member, Dubey, Ashwani Kumar, Editorial Board Member, Montenegro, Carlos, Editorial Board Member, Laporte, Claude, Editorial Board Member, Moreira, Fernando, Editorial Board Member, Peñalvo, Francisco, Editorial Board Member, Dzemyda, Gintautas, Editorial Board Member, Mejia-Miranda, Jezreel, Editorial Board Member, Hall, Jon, Editorial Board Member, Piattini, Mário, Editorial Board Member, Holanda, Maristela, Editorial Board Member, Tang, Mincong, Editorial Board Member, Ivanovíc, Mirjana, Editorial Board Member, Muñoz, Mirna, Editorial Board Member, Kanth, Rajeev, Editorial Board Member, Anwar, Sajid, Editorial Board Member, Herawan, Tutut, Editorial Board Member, Colla, Valentina, Editorial Board Member, Devedzic, Vladan, Editorial Board Member, and Farhaoui, Yousef, editor
- Published
- 2024
- Full Text
- View/download PDF
11. An Innovative SALO-IDT-Based Intrusion Detection Model for Increasing the Security of IoT Networks
- Author
-
Venkatesan, S., Ramakrishnan, M., Archana, M., Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Tan, Kay Chen, Series Editor, George, V. I., editor, Santhosh, K. V., editor, and Lakshminarayanan, Samavedham, editor
- Published
- 2024
- Full Text
- View/download PDF
12. Traffic Noise Modeling in Sambalpur City Using Machine Learning Technique
- Author
-
Meher, K., Majhi, S., Khandualo, S. K., Pradhan, P. K., Chaari, Fakher, Series Editor, Gherardini, Francesco, Series Editor, Ivanov, Vitalii, Series Editor, Haddar, Mohamed, Series Editor, Cavas-Martínez, Francisco, Editorial Board Member, di Mare, Francesca, Editorial Board Member, Kwon, Young W., Editorial Board Member, Tolio, Tullio A. M., Editorial Board Member, Trojanowska, Justyna, Editorial Board Member, Schmitt, Robert, Editorial Board Member, Xu, Jinyang, Editorial Board Member, Sahoo, Seshadev, editor, and Yedla, Natraj, editor
- Published
- 2024
- Full Text
- View/download PDF
13. Automated alzheimer's disease detection and diagnosis method based on Bayesian optimization and CNN-based pre-trained features
- Author
-
Saim, Meriem and Feroui, Amel
- Published
- 2024
- Full Text
- View/download PDF
14. A novel probabilistic intermittent neural network (PINN) and artificial jelly fish optimization (AJFO)-based plant leaf disease detection system.
- Author
-
Saraswathi, E. and Faritha Banu, J.
- Subjects
- *
PLANT diseases , *FOLIAGE plants , *JELLY , *SYSTEM identification - Abstract
Plant leaf disease identification and classification are the most essential and demanding tasks in the agriculture field. In traditional researches, various automated detection technologies have been developed with the goal of more accurately identifying plant leaf disease. Nevertheless, it faces some problems related to complex mathematical modeling, increased time consumption, processing overhead, and mis-prediction results. Therefore, a novel probabilistic intermittent neural network and artificial jelly fish optimization-based plant leaf disease detection system is proposed in this paper. The proposed work aims to "make a new detection scheme to identify correctly plant leaf disease from the given dataset." Here, the probabilistic intermittent neural network (PINN) classification technique is used to predict label as normal or affected by disease. If it is disease affected, the residual multi-scale Unet segmentation (RMUNet) segmentation technique is applied to segment the disease affected region. Finally, the simulation outcomes confirm the efficiency of the proposed leaf disease identification system under some variables. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Geometrical Features Based-mmWave UAV Path Loss Prediction Using Machine Learning for 5G and Beyond
- Author
-
Sajjad Hussain, Syed Faraz Naeem Bacha, Adnan Ahmad Cheema, Berk Canberk, and Trung Q. Duong
- Subjects
UAVs ,millimeter-wave (mmWave) ,5G ,path loss (PL) ,ray tracing ,and machine learning ,Telecommunication ,TK5101-6720 ,Transportation and communications ,HE1-9990 - Abstract
Unmanned aerial vehicles (UAVs) are envisioned to play a pivotal role in modern telecommunication and wireless sensor networks, offering unparalleled flexibility and mobility for communication and data collection in diverse environments. This paper presents a comprehensive investigation into the performance of supervised machine learning (ML) models for path loss (PL) prediction in UAV-assisted millimeter-wave (mmWave) radio networks. Leveraging a unique set of interpretable geometrical features, six distinct ML models–linear regression (LR), support vector regressor (SVR), K nearest neighbors (KNN), random forest (RF), extreme gradient boosting (XGBoost), and deep neural network (DNN)–are rigorously evaluated using a massive dataset generated from extensive raytracing (RT) simulations in a typical urban environment. Our results demonstrate that the RF algorithm outperforms other models showcasing superior predictive performance for the test dataset with a root mean square error (RMSE) of 2.38 dB. The proposed ML models demonstrate superior accuracy compared to 3GPP and ITU-R models for mmWave radio networks. This study thoroughly investigates the adaptability of these models to unseen environments and examines the feasibility of training them with sparse datasets to improve accuracy. The reduction in computation time achieved by using ML models instead of extensive RT computations for sparse training datasets is evaluated, and an efficient algorithm for training such models is proposed. Additionally, the sensitivity of ML models to noisy input features is analyzed. We also assess the importance of geometrical features and the impact of sequentially increasing the number of these features on model performance. The results emphasize the significance of the proposed geometrical features and demonstrate the potential of ML models to provide computationally efficient and relatively accurate PL predictions in diverse urban environments.
- Published
- 2024
- Full Text
- View/download PDF
16. Intelligent System for Assessing University Student Personality Development and Career Readiness.
- Author
-
Izbassar, Assylzhan, Muratbekova, Muragul, Amangeldi, Daniyar, Oryngozha, Nazzere, Ogorodova, Anna, and Shamoi, Pakizar
- Subjects
MACHINE learning ,PERSONALITY development ,STUDENT development ,COLLEGE students ,KNOWLEDGE acquisition (Expert systems) ,PREPAREDNESS - Abstract
While academic metrics such as transcripts and GPA are commonly used to evaluate students' knowledge acquisition, there are limited comprehensive metrics to measure their preparedness for the challenges of post-graduation life. This research paper explores the impact of various factors on university students' readiness for change and transition, with a focus on their preparedness for careers. The methodology employed in this study involves designing a survey based on Paul J. Mayer's "The Balance Wheel" to capture students' sentiments on various life aspects, including satisfaction with the educational process and expectations of salary. The collected data from a KBTU student survey (n=47) were processed through machine learning models: Linear Regression, Support Vector Regression (SVR), and Random Forest Regression. Subsequently, an intelligent system was built using these models and fuzzy sets. The system is capable of evaluating graduates' readiness for their future careers and demonstrates a high predictive power. The findings of this research have practical implications for educational institutions. Such an intelligent system can serve as a valuable tool for universities to assess and enhance students' preparedness for post-graduation challenges. By recognizing the factors contributing to students' readiness for change, universities can refine curricula and processes to better prepare students for their career journeys. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Revolutionizing Cancer Detection through AI Algorithms.
- Author
-
Srivastava, Nidhi
- Subjects
EARLY detection of cancer ,ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning ,COMPUTED tomography - Abstract
Artificial Intelligence (AI) is a field which has spread its wing everywhere. The two subsets of AI – Machine Learning (ML) and Deep Learning (DL) are very much in demand nowadays. Use of AI in healthcare is now seen at a large scale. One of the prominent uses of AI algorithm is in detection of cancer. The number of cancer cases is increasing rapidly and both doctors and researchers are trying to find a solution for the same. Detection of cancer is possible through various tests like MRI, CT scan, etc. It is a challenge for the doctors to interpret the tests correctly as it is very complex and time-consuming. It becomes even more challenging to infer the tests if the cancer is in the early stage. This is where AI can help. Various ML/ DL algorithms can be used to speed up the detection process and thus identify cancerous cells in the initial stage so that the survival rate of the patient increases. Early diagnosis and timely intervention can make a significant difference between life and death for cancer patients. This paper briefly outlines the role of AI including ML and DL in cancer detection and how various researchers are using it for prediction of disease. The paper also outlines the general steps used in AI for detection of cancer. [ABSTRACT FROM AUTHOR]
- Published
- 2024
18. A robust supervised machine learning based approach for offline-online traffic classification of software-defined networking.
- Author
-
Eissa, Menas Ebrahim, Mohamed, M. A., and Ata, Mohamed Maher
- Abstract
Due to the exponential increase of internet applications and network users, network traffic classification (NTC) is a crucial study subject. It successfully improves network service identifiability and security concerns of the traffic network and provides a way that improves the Quality of services (QoS). Recently, with the emergence of software-defined networking (SDN) and its ability to get the entire network overview using a centralized controller, machine learning (ML) has been used for NTC. In this paper, an SDN QoS guarantee framework with machine learning traffic classification has been proposed. The framework includes a classification system with two stages, the offline stage, where the classifier was trained and tested, and the online stage, where dealing with the flows and testing the classifier speed is simulated using spark streaming. The result shows that the classifier successfully identifies the specific traffic application with an accuracy of 100% on the "IP-network-traffic-flows-labeled-with-87-apps" dataset and identifies the traffic type with an accuracy of 99.95% on the "ISCX-VPN-NONVPN" dataset. In addition, the classifier speed is proven to be a round 3500 record/sec and a patch duration of 917.3 ms on average with 3210 flows/Trigger. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. How Does Artificial Intelligence Synergize to Make Investment Decisions? A Critical Analysis
- Author
-
Maharani, Satia Nur, Rahardiansyah, Risal Fadhil, Appolloni, Andrea, Series Editor, Caracciolo, Francesco, Series Editor, Ding, Zhuoqi, Series Editor, Gogas, Periklis, Series Editor, Huang, Gordon, Series Editor, Nartea, Gilbert, Series Editor, Ngo, Thanh, Series Editor, Striełkowski, Wadim, Series Editor, Zutiasari, Ika, editor, and Kurniawan, Dediek Tri, editor
- Published
- 2023
- Full Text
- View/download PDF
20. A Novel Approach to Project Employee’s Performance and Confinement
- Author
-
Shirisha, Gangarapu, Sekar, M. Raja, Powers, David M. W., Series Editor, Kumar, Amit, editor, Ghinea, Gheorghita, editor, Merugu, Suresh, editor, and Hashimoto, Takako, editor
- Published
- 2023
- Full Text
- View/download PDF
21. Construction and evaluation of endometriosis diagnostic prediction model and immune infiltration based on efferocytosis-related genes
- Author
-
Fang-Li Pei, Jin-Jin Jia, Shu-Hong Lin, Xiao-Xin Chen, Li-Zheng Wu, Zeng-Xian Lin, Bo-Wen Sun, and Cheng Zeng
- Subjects
endometriosis ,efferocytosis ,immune infiltration ,bioinformatics ,and machine learning ,Biology (General) ,QH301-705.5 - Abstract
Background: Endometriosis (EM) is a long-lasting inflammatory disease that is difficult to treat and prevent. Existing research indicates the significance of immune infiltration in the progression of EM. Efferocytosis has an important immunomodulatory function. However, research on the identification and clinical significance of efferocytosis-related genes (EFRGs) in EM is sparse.Methods: The EFRDEGs (differentially expressed efferocytosis-related genes) linked to datasets associated with endometriosis were thoroughly examined utilizing the Gene Expression Omnibus (GEO) and GeneCards databases. The construction of the protein-protein interaction (PPI) and transcription factor (TF) regulatory network of EFRDEGs ensued. Subsequently, machine learning techniques including Univariate logistic regression, LASSO, and SVM classification were applied to filter and pinpoint diagnostic biomarkers. To establish and assess the diagnostic model, ROC analysis, multivariate regression analysis, nomogram, and calibration curve were employed. The CIBERSORT algorithm and single-cell RNA sequencing (scRNA-seq) were employed to explore immune cell infiltration, while the Comparative Toxicogenomics Database (CTD) was utilized for the identification of potential therapeutic drugs for endometriosis. Finally, immunohistochemistry (IHC) and reverse transcription quantitative polymerase chain reaction (RT-qPCR) were utilized to quantify the expression levels of biomarkers in clinical samples of endometriosis.Results: Our findings revealed 13 EFRDEGs associated with EM, and the LASSO and SVM regression model identified six hub genes (ARG2, GAS6, C3, PROS1, CLU, and FGL2). Among these, ARG2, GAS6, and C3 were confirmed as diagnostic biomarkers through multivariate logistic regression analysis. The ROC curve analysis of GSE37837 (AUC = 0.627) and GSE6374 (AUC = 0.635), along with calibration and DCA curve assessments, demonstrated that the nomogram built on these three biomarkers exhibited a commendable predictive capacity for the disease. Notably, the ratio of nine immune cell types exhibited significant differences between eutopic and ectopic endometrial samples, with scRNA-seq highlighting M0 Macrophages, Fibroblasts, and CD8 Tex cells as the cell populations undergoing the most substantial changes in the three biomarkers. Additionally, our study predicted seven potential medications for EM. Finally, the expression levels of the three biomarkers in clinical samples were validated through RT-qPCR and IHC, consistently aligning with the results obtained from the public database.Conclusion: we identified three biomarkers and constructed a diagnostic model for EM in this study, these findings provide valuable insights for subsequent mechanistic research and clinical applications in the field of endometriosis.
- Published
- 2024
- Full Text
- View/download PDF
22. AI Tools for Assessing Human Fertility Using Risk Factors: A State-of-the-Art Review.
- Author
-
GhoshRoy, Debasmita, Alvi, P. A., and Santosh, KC
- Subjects
- *
BIOMARKERS , *ONLINE information services , *LIFESTYLES , *OBESITY , *MEN'S health , *META-analysis , *SYSTEMATIC reviews , *AGE distribution , *ARTIFICIAL intelligence , *MACHINE learning , *RISK assessment , *INFERTILITY , *FERTILITY , *MEDLINE , *WOMEN'S health , *REPRODUCTIVE health , *DISEASE risk factors - Abstract
Infertility has massively disrupted social and marital life, resulting in stressful emotional well-being. Early diagnosis is the utmost need for faster adaption to respond to these changes, which makes possible via AI tools. Our main objective is to comprehend the role of AI in fertility detection since we have primarily worked to find biomarkers and related risk factors associated with infertility. This paper aims to vividly analyse the role of AI as an effective method in screening, predicting for infertility and related risk factors. Three scientific repositories: PubMed, Web of Science, and Scopus, are used to gather relevant articles via technical terms: (human infertility OR human fertility) AND risk factors AND (machine learning OR artificial intelligence OR intelligent system). In this way, we systematically reviewed 42 articles and performed a meta-analysis. The significant findings and recommendations are discussed. These include the rising importance of data augmentation, feature extraction, explainability, and the need to revisit the meaning of an effective system for fertility analysis. Additionally, the paper outlines various mitigation actions that can be employed to tackle infertility and its related risk factors. These insights contribute to a better understanding of the role of AI in fertility analysis and the potential for improving reproductive health outcomes. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
23. Convergence Research and Training in Computational Bioengineering: A Case Study on AI/ML-Driven Biofilm–Material Interaction Discovery
- Author
-
Zylla, Jessica L. S., Bomgni, Alain B., Sani, Rajesh K., Subramaniam, Mahadevan, Lushbough, Carol, Winter, Robb, Gadhamshetty, Venkataramana R., Chundi, Parvathi, and Gnimpieba, Etienne Z.
- Published
- 2024
- Full Text
- View/download PDF
24. Effective Heart Disease Classification for Telehealth Systems
- Author
-
Jose, Deepa, Rajesh, Arya, Jaisharmila, P., Jasvine, Jeba M., Andal, V., Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Marriwala, N., editor, Tripathi, C. C., editor, Jain, Shruti, editor, and Mathapathi, Shivakumar, editor
- Published
- 2022
- Full Text
- View/download PDF
25. Data-driven approach for identifying the factors related to debt collector performance.
- Author
-
Sivamayilvelan, Keerthana, Rajasekar, Elakkiya, Balachandran, Santhi, Kotecha, Ketan, and Vairavasundaram, Subramaniyaswamy
- Subjects
DEEP reinforcement learning ,COLLECTING of accounts ,EMPLOYEE reviews ,STATISTICAL learning ,MACHINE learning - Abstract
The company's success and growth heavily rely on its workforce's performance, yet the evaluation of employees has been only partially and inconclusively executed so far. The primary goal of this research is to build an open innovation framework for analyzing the performance of the debt collector. We have developed the Reinforcement Learning based Continual Learning (RLC) approach for evaluating the performance by analyzing the metrics such as visit patterns and collection percentage. We have used the private debt collection dataset to assess the debt collector's performance. We formulated hypotheses derived from insights gained during exploratory data analysis and subsequently validated them through statistical testing. Whether there are noticeable distinctions among debt collectors in terms of visitation frequency, collection rates, and collection modes. This proposed open innovation framework for analyzing the debt collector performance provides significant variation in terms of collection rate. The proposed EDQN-CL achieved a 13.56 % higher classification rate than the existing algorithm for categorizing the debt collector performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. An efficient fraud detection framework with credit card imbalanced data in financial services.
- Author
-
Abd El-Naby, Aya, Hemdan, Ezz El-Din, and El-Sayed, Ayman
- Subjects
FRAUD investigation ,CREDIT card fraud ,CREDIT cards ,FRAUD ,K-nearest neighbor classification - Abstract
Credit card fraud has adversely impacted market economic order and has broken stakeholders, financial entities, and consumers' trust and interest. Card fraud losses are increasing annually and billions of dollars are being lost. Therefore, this work provides a framework for fraud card detection to be tackled efficiently. Recently, the imbalanced dataset for fraud card transactions due to the number of ordinary transactions being far greater than the amount of fraud. Before solving the fraud problem, we first have to solve the imbalanced data problem which occurred when one class considerably outnumbers the examples of the other class. So, the classification of fraud come to be very tough as the result may get biased towards the majority group. Thus, this paper aims firstly to use hybrid sampling and oversampling preprocessing techniques to solve the imbalanced data problem, and secondly to resolve the fraud. The performance of the proposed framework is estimated based on different metrics accuracy, precision, and recall in comparing existing algorithms such as KNN, LR, LDA, NB, and CART. The obtained results revealing that when the data is highly imbalanced, the model strives to detect fraudulent transactions. Besides, it can predict positive classes improved significantly, reaching an accuracy of 99.9. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. News' Credibility Detection on Social Media Using Machine Learning Algorithms.
- Author
-
Yasser, Farah, AbdelMawgoud, Sayed AbdelGaber, and Idrees, Amira M.
- Subjects
SOCIAL media ,MACHINE learning ,TEXT mining ,COVID-19 pandemic ,PUBLIC trustees - Abstract
Social media is essential in many aspects of our lives. Social media allows us to find news for free. anyone can access it easily at any time. However, social media may also facilitate the rapid spread of misleading news. As a result, there is a probability that low-quality news, including incorrect and fake information, will spread over social media. As well as detecting news credibility on social media becomes essential because fake news can affect society negatively, and the spread of false news has a considerable impact on personal reputation and public trust. In this research, we conducted a model that detects the credibility of Arabic news from social media; particularly Arabic tweets. The content of the tweets revolves around the COVID-19 pandemic. The proposed model applied to detect news credibility using text mining techniques and one of the well-known machine learning classifiers, Decision tree which has the best accuracy equal to 86.6%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. AI-Based Analysis for Industry 4.0 Maturity Models -- A Systematic Review and Bibliometric Analysis.
- Author
-
Almarri, Sadeem and Bouras, Abdelghani
- Subjects
INDUSTRY 4.0 ,TRANSPORTATION ,TRUCK drivers ,ARTIFICIAL intelligence ,MACHINE learning - Abstract
The fourth industrial revolution (industry 4.0) is a term used to describe the ongoing transformation of business processes combined with the latest technologies that are changing the way businesses operate, leading to the faster, cheaper, and effective delivery of products and services. Different companies and countries are preparing for this revolution by developing Industry 4.0 strategies. To assist the companies seeking to adopt Industry 4.0 in manageable phases, the maturity models (MMs) were created and used. However, in the context of Industry 4.0, literature reviews indicate that the number of models has increased sharply in recent years. This paper aims to present and analyze the results of bibliometric analysis and systematic literature review, which highlight the recent developments in the field of Industry 4.0 MMs, with a focus on investigating the existing models, their types, levels, and dimensions. Based on our findings, we introduced the core dimensions of the MMs that will require more intensive efforts to set the ground for the transformation journey, utilizing the power of AI and machine learning. This work will enable the scientific community to investigate the publication hierarchy in this emerging filed, and it will serve as a starting point for future MMs developments. [ABSTRACT FROM AUTHOR]
- Published
- 2022
29. IoT Intrusion Detection Using Machine Learning with a Novel High Performing Feature Selection Method.
- Author
-
Albulayhi, Khalid, Abu Al-Haija, Qasem, Alsuhibany, Suliman A., Jillepalli, Ananth A., Ashrafuzzaman, Mohammad, and Sheldon, Frederick T.
- Subjects
FEATURE selection ,INTRUSION detection systems (Computer security) ,MACHINE learning ,INTERNET of things ,BOOTSTRAP aggregation (Algorithms) ,SET theory - Abstract
The Internet of Things (IoT) ecosystem has experienced significant growth in data traffic and consequently high dimensionality. Intrusion Detection Systems (IDSs) are essential self-protective tools against various cyber-attacks. However, IoT IDS systems face significant challenges due to functional and physical diversity. These IoT characteristics make exploiting all features and attributes for IDS self-protection difficult and unrealistic. This paper proposes and implements a novel feature selection and extraction approach (i.e., our method) for anomaly-based IDS. The approach begins with using two entropy-based approaches (i.e., information gain (IG) and gain ratio (GR)) to select and extract relevant features in various ratios. Then, mathematical set theory (union and intersection) is used to extract the best features. The model framework is trained and tested on the IoT intrusion dataset 2020 (IoTID20) and NSL-KDD dataset using four machine learning algorithms: Bagging, Multilayer Perception, J48, and IBk. Our approach has resulted in 11 and 28 relevant features (out of 86) using the intersection and union, respectively, on IoTID20 and resulted 15 and 25 relevant features (out of 41) using the intersection and union, respectively, on NSL-KDD. We have further compared our approach with other state-of-the-art studies. The comparison reveals that our model is superior and competent, scoring a very high 99.98% classification accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
30. Cardiovascular disease detection using machine learning and carotid/femoral arterial imaging frameworks in rheumatoid arthritis patients.
- Author
-
Konstantonis, George, Singh, Krishna V., Sfikakis, Petros P., Jamthikar, Ankush D., Kitas, George D., Gupta, Suneet K., Saba, Luca, Verrou, Kleio, Khanna, Narendra N., Ruzsa, Zoltan, Sharma, Aditya M., Laird, John R., Johri, Amer M., Kalra, Manudeep, Protogerou, Athanasios, and Suri, Jasjit S.
- Subjects
- *
MACHINE learning , *RHEUMATOID arthritis , *FISHER discriminant analysis , *CARDIOVASCULAR diseases , *PERIPHERAL vascular diseases , *CORONARY disease - Abstract
The study proposes a novel machine learning (ML) paradigm for cardiovascular disease (CVD) detection in individuals at medium to high cardiovascular risk using data from a Greek cohort of 542 individuals with rheumatoid arthritis, or diabetes mellitus, and/or arterial hypertension, using conventional or office-based, laboratory-based blood biomarkers and carotid/femoral ultrasound image-based phenotypes. Two kinds of data (CVD risk factors and presence of CVD—defined as stroke, or myocardial infarction, or coronary artery syndrome, or peripheral artery disease, or coronary heart disease) as ground truth, were collected at two-time points: (i) at visit 1 and (ii) at visit 2 after 3 years. The CVD risk factors were divided into three clusters (conventional or office-based, laboratory-based blood biomarkers, carotid ultrasound image-based phenotypes) to study their effect on the ML classifiers. Three kinds of ML classifiers (Random Forest, Support Vector Machine, and Linear Discriminant Analysis) were applied in a two-fold cross-validation framework using the data augmented by synthetic minority over-sampling technique (SMOTE) strategy. The performance of the ML classifiers was recorded. In this cohort with overall 46 CVD risk factors (covariates) implemented in an online cardiovascular framework, that requires calculation time less than 1 s per patient, a mean accuracy and area-under-the-curve (AUC) of 98.40% and 0.98 (p < 0.0001) for CVD presence detection at visit 1, and 98.39% and 0.98 (p < 0.0001) at visit 2, respectively. The performance of the cardiovascular framework was significantly better than the classical CVD risk score. The ML paradigm proved to be powerful for CVD prediction in individuals at medium to high cardiovascular risk. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
31. Risk Factors and Prediction of ST-segment Elevation Myocardial Infarction.
- Author
-
Emakhu, Joshua, Monplaisir, Leslie, Aguwa, Celestine, Masoud, Sara, Arslanturk, Suzan, Etu, Egbe-Etu, and Miller, Joseph
- Subjects
MORTALITY ,MYOCARDIAL infarction ,ELECTRONIC health records ,RECEIVER operating characteristic curves ,DEVELOPING countries - Abstract
Studies have shown that the mortality rate due to ST-segment elevation myocardial infarction (STEMI) has drastically increased in developed and developing countries. The purpose of this study is to develop a diagnostic support tool to help classify STEMI patients and validate the predictive risk factors associated with STEMI using an ensemble learning approach. In this retrospective data-mining study, the data are retrieved from electronic health records of an urban emergency department between January 2017 and August 2020. A Random Forest model is trained to classify non-acute coronary syndrome (non-ACS) etiologies and STEMI patients using 38 features. Of the study cohort, 411 patients with chest pain fulfilled inclusion criteria, of whom 225 (55%) are STEMI, and 186 (45%) are non-ACS etiologies patients. The proposed framework successfully classifies the non-ACS etiologies and STEMI patients with recall and area under the receiver operating characteristic (AUROC) values of 73% and 90%, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2021
32. Using Machine Learning to Assess Solar Energy Grid Disturbances.
- Author
-
Ramirez, Jose, Soto, Esteban A., Wollega, Ebisa, and Bosman, Lisa B.
- Subjects
MACHINE learning ,SOLAR energy ,ELECTRIC power ,ENERGY consumption ,ELECTRIC utilities - Abstract
Energy generation, sources and distribution methods have been continuously evolving over the past decade. With the increased efficiency associated with solar energy production and distribution, local homeowners have also assumed the role of energy generators, even getting credit for access electricity supplied to the grid given the policy around net-metering. When planning their energy distribution frameworks, electricity providers have to take these changes in energy consumption and generation into account. However, little is known about how solar energy systems impact the demand and supply of grid electricity managed by utility companies. This study proposes a new approach to solar energy predictive modeling which combines machine learning and a variety of publicly available data sources to predict site-specific temperature and solar irradiance (the two primary "missing ingredients"). The preliminary findings show a decreased error when using the new approach (near-future data) in comparison to the traditional approach (historical data) for predicting solar energy generation. As the adoption of solar energy increases, so will potential disruptions to the grid. These preliminary findings show the potential for aggregating individual site-specific predictions to the regional level for the purpose of estimating area-specific solar energy disturbances and moving efforts towards predictive grid optimization. [ABSTRACT FROM AUTHOR]
- Published
- 2020
33. Putative biomarkers for predicting tumor sample purity based on gene expression data
- Author
-
Yuanyuan Li, David M. Umbach, Adrienna Bingham, Qi-Jing Li, Yuan Zhuang, and Leping Li
- Subjects
Tumor purity ,RNA-seq ,Gene expression ,XGBoost ,Gradient boosted trees ,And machine learning ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Tumor purity is the percent of cancer cells present in a sample of tumor tissue. The non-cancerous cells (immune cells, fibroblasts, etc.) have an important role in tumor biology. The ability to determine tumor purity is important to understand the roles of cancerous and non-cancerous cells in a tumor. Methods We applied a supervised machine learning method, XGBoost, to data from 33 TCGA tumor types to predict tumor purity using RNA-seq gene expression data. Results Across the 33 tumor types, the median correlation between observed and predicted tumor-purity ranged from 0.75 to 0.87 with small root mean square errors, suggesting that tumor purity can be accurately predicted υσινγ expression data. We further confirmed that expression levels of a ten-gene set (CSF2RB, RHOH, C1S, CCDC69, CCL22, CYTIP, POU2AF1, FGR, CCL21, and IL7R) were predictive of tumor purity regardless of tumor type. We tested whether our set of ten genes could accurately predict tumor purity of a TCGA-independent data set. We showed that expression levels from our set of ten genes were highly correlated (ρ = 0.88) with the actual observed tumor purity. Conclusions Our analyses suggested that the ten-gene set may serve as a biomarker for tumor purity prediction using gene expression data.
- Published
- 2019
- Full Text
- View/download PDF
34. Automatic segmentation of medical images using a novel Harris Hawk optimization method and an active contour model.
- Author
-
Tamoor, Maria and Younas, Irfan
- Subjects
- *
DIAGNOSTIC imaging , *IMAGE segmentation , *CARDIAC magnetic resonance imaging , *CONGENITAL heart disease , *THERAPEUTICS , *HEART diseases - Abstract
Medical image segmentation is a key step to assist diagnosis of several diseases, and accuracy of a segmentation method is important for further treatments of different diseases. Different medical imaging modalities have different challenges such as intensity inhomogeneity, noise, low contrast, and ill-defined boundaries, which make automated segmentation a difficult task. To handle these issues, we propose a new fully automated method for medical image segmentation, which utilizes the advantages of thresholding and an active contour model. In this study, a Harris Hawks optimizer is applied to determine the optimal thresholding value, which is used to obtain the initial contour for segmentation. The obtained contour is further refined by using a spatially varying Gaussian kernel in the active contour model. The proposed method is then validated using a standard skin dataset (ISBI 2016), which consists of variable-sized lesions and different challenging artifacts, and a standard cardiac magnetic resonance dataset (ACDC, MICCAI 2017) with a wide spectrum of normal hearts, congenital heart diseases, and cardiac dysfunction. Experimental results show that the proposed method can effectively segment the region of interest and produce superior segmentation results for skin (overall Dice Score 0.90) and cardiac dataset (overall Dice Score 0.93), as compared to other state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
35. IoT Intrusion Detection Using Machine Learning with a Novel High Performing Feature Selection Method
- Author
-
Khalid Albulayhi, Qasem Abu Al-Haija, Suliman A. Alsuhibany, Ananth A. Jillepalli, Mohammad Ashrafuzzaman, and Frederick T. Sheldon
- Subjects
cybersecurity ,anomaly detection accuracy ,feature selection ,Internet of Things (IoT) ,intrusion detection system ,and machine learning ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
The Internet of Things (IoT) ecosystem has experienced significant growth in data traffic and consequently high dimensionality. Intrusion Detection Systems (IDSs) are essential self-protective tools against various cyber-attacks. However, IoT IDS systems face significant challenges due to functional and physical diversity. These IoT characteristics make exploiting all features and attributes for IDS self-protection difficult and unrealistic. This paper proposes and implements a novel feature selection and extraction approach (i.e., our method) for anomaly-based IDS. The approach begins with using two entropy-based approaches (i.e., information gain (IG) and gain ratio (GR)) to select and extract relevant features in various ratios. Then, mathematical set theory (union and intersection) is used to extract the best features. The model framework is trained and tested on the IoT intrusion dataset 2020 (IoTID20) and NSL-KDD dataset using four machine learning algorithms: Bagging, Multilayer Perception, J48, and IBk. Our approach has resulted in 11 and 28 relevant features (out of 86) using the intersection and union, respectively, on IoTID20 and resulted 15 and 25 relevant features (out of 41) using the intersection and union, respectively, on NSL-KDD. We have further compared our approach with other state-of-the-art studies. The comparison reveals that our model is superior and competent, scoring a very high 99.98% classification accuracy.
- Published
- 2022
- Full Text
- View/download PDF
36. Enhanced Android Malware Detection and Family Classification, using Conversation-level Network Traffic Features.
- Author
-
Abuthawabeh, Mohammad and Mahmoud, Khaled
- Published
- 2020
- Full Text
- View/download PDF
37. A hybrid feature selection method for credit scoring
- Author
-
Sang Ha Van, Nam Nguyen Ha, and Hien Nguyen Thi Bao
- Subjects
Credit risk ,Credit scoring ,Hybrid Feature selection ,GBM ,RFE ,Information Values ,and Machine learning ,Technology (General) ,T1-995 - Abstract
Reliable credit scoring models played a very important role of retail banks to evaluate credit applications and it has been widely studied. The main objective of this paper is to build a hybrid credit scoring model using feature selection approach. In this study, we constructed a credit scoring model based on parallel GBM (Gradient Boosted Model), filter and wrapper approaches to evaluate the applicant’s credit score from the input features. Feature scoring expression are combined by feature important (Gini index) and Information Value. Backward sequential scheme is used for selecting optimal subset of relevant features while the subset is evaluated by GBM classifier. To reduce the running time, we applied parallel GBM classifier to evaluate the proposed subset of features. The experimental results showed that the proposed method obtained a higher predictive accuracy than a baseline method for some certain datasets. It also showed faster speed and better generalization than traditional feature selection methods widely used in credit scoring.
- Published
- 2017
- Full Text
- View/download PDF
38. Putative biomarkers for predicting tumor sample purity based on gene expression data.
- Author
-
Li, Yuanyuan, Umbach, David M., Bingham, Adrienna, Li, Qi-Jing, Zhuang, Yuan, and Li, Leping
- Subjects
GENES ,TUMOR markers ,GENE expression ,STANDARD deviations ,SUPERVISED learning - Abstract
Background: Tumor purity is the percent of cancer cells present in a sample of tumor tissue. The non-cancerous cells (immune cells, fibroblasts, etc.) have an important role in tumor biology. The ability to determine tumor purity is important to understand the roles of cancerous and non-cancerous cells in a tumor. Methods: We applied a supervised machine learning method, XGBoost, to data from 33 TCGA tumor types to predict tumor purity using RNA-seq gene expression data. Results: Across the 33 tumor types, the median correlation between observed and predicted tumor-purity ranged from 0.75 to 0.87 with small root mean square errors, suggesting that tumor purity can be accurately predicted υσινγ expression data. We further confirmed that expression levels of a ten-gene set (CSF2RB, RHOH, C1S, CCDC69, CCL22, CYTIP, POU2AF1, FGR, CCL21, and IL7R) were predictive of tumor purity regardless of tumor type. We tested whether our set of ten genes could accurately predict tumor purity of a TCGA-independent data set. We showed that expression levels from our set of ten genes were highly correlated (ρ = 0.88) with the actual observed tumor purity. Conclusions: Our analyses suggested that the ten-gene set may serve as a biomarker for tumor purity prediction using gene expression data. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
39. Verification of an Expert System for Forecasting Ice-Block-Formation: The Case of the Northern Dvina River.
- Author
-
Aleshin, I. M. and Malygin, I. V.
- Subjects
- *
MACHINE learning , *MATHEMATICAL models , *COMPUTER software , *ONLINE monitoring systems , *EXPERT systems - Abstract
Abstract—: Here we provide a short description of an expert system for predicting the ice-jamming power in the area of the Northern Dvina River and a procedure to verify this system. This expert system is based on hydrological and meteorological data for 1991-2016. The data were processed using a machine learning technique and adjacent mathematics, because there is no mathematical model of the ice-jamming process and time series of observations are too short to apply classical statistics. This expert system was developed in 2012; it was adjusted using data from 1991-2010 seasons obtained at hydrological stations. The current investigation involves additional data on 2011-2016 seasons to repeat learning and estimate system quality. The developed system demonstrates a reliable efficiency: the forecast results coincide with observations for all six added seasons (2011-2016). It should be noted that the additional data do not change forecast accuracy, which remained approximately 85%, like in the previous study. All developed software is cross-platform, written with a C++ language, and is implemented as a command line application. This software can be easily adopted to operate as a part of the Northern Dvina River online monitoring service. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
40. A Filtering-Based Approach for Improving Crowdsourced GNSS Traces in a Data Update Context
- Author
-
Stefan S. Ivanovic, Ana-Maria Olteanu-Raimond, Sébastien Mustière, and Thomas Devogele
- Subjects
data quality ,outlier ,crowdsourced GNSS traces ,and machine learning ,Geography (General) ,G1-922 - Abstract
Traces collected by citizens using GNSS (Global Navigation Satellite System) devices during sports activities such as running, hiking or biking are now widely available through different sport-oriented collaborative websites. The traces are collected by citizens for their own purposes and frequently shared with the sports community on the internet. Our research assumption is that crowdsourced GNSS traces may be a valuable source of information to detect updates in authoritative datasets. Despite their availability, the traces present some issues such as poor metadata, attribute incompleteness and heterogeneous positional accuracy. Moreover, certain parts of the traces (GNSS points composing the traces) are results of the displacements made out of the existing paths. In our context (i.e., update authoritative data) these off path GNSS points are considered as noise and should be filtered. Two types of noise are examined in this research: Points representing secondary activities (e.g., having a lunch break) and points representing errors during the acquisition. The first ones we named secondary human behaviour (SHB), whereas we named the second ones outliers. The goal of this paper is to improve the smoothness of traces by detecting and filtering both SHB and outliers. Two methods are proposed. The first one allows for the detection secondary human behaviour by analysing only traces geometry. The second one is a rule-based machine learning method that detects outliers by taking into account the intrinsic characteristics of points composing the traces, as well as the environmental conditions during traces acquisition. The proposed approaches are tested on crowdsourced GNSS traces collected in mountain areas during sports activities.
- Published
- 2019
- Full Text
- View/download PDF
41. COVID-19 Prediction Models and Unexploited Data
- Author
-
Santosh, K. C.
- Published
- 2020
- Full Text
- View/download PDF
42. Wind Farm Fault Detection by Monitoring Wind Speed in the Wake Region
- Author
-
Minh-Quang Tran, Yi-Chen Li, Meng-Kun Liu, and Chen-Yang Lan
- Subjects
Control and Optimization ,010504 meteorology & atmospheric sciences ,Computer science ,020209 energy ,Astrophysics::High Energy Astrophysical Phenomena ,Energy balance ,Energy Engineering and Power Technology ,Upwind scheme ,02 engineering and technology ,Wake ,Computational fluid dynamics ,01 natural sciences ,Turbine ,lcsh:Technology ,Fault detection and isolation ,Wind speed ,Physics::Fluid Dynamics ,and machine learning ,feature selection ,0202 electrical engineering, electronic engineering, information engineering ,Astrophysics::Solar and Stellar Astrophysics ,Electrical and Electronic Engineering ,Engineering (miscellaneous) ,Physics::Atmospheric and Oceanic Physics ,0105 earth and related environmental sciences ,Wind power ,Finite volume method ,wind turbine fault detection ,wind energy dissipation model ,machine learning ,Renewable Energy, Sustainability and the Environment ,business.industry ,Turbulence ,lcsh:T ,Physics::Space Physics ,business ,Energy (miscellaneous) ,Marine engineering - Abstract
A novel concept of wind farm fault detection by monitoring the wind speed in the wake region is proposed in this study. A wind energy dissipation model was coupled with a computational fluid dynamics solver to simulate the fluid field of a wind turbine array, and the wind velocity and direction in the simulation were exported for identifying wind turbine faults. The 3D steady Navier–Stokes equations were solved by using the cell center finite volume method with a second order upwind scheme and a k−ε turbulence model. In addition, the wind energy dissipation model, derived from energy balance and Betz’s law, was added to the Navier–Stokes equations’ source term. The simulation results indicate that the wind speed distribution in the wake region contains significant information regarding multiple wind turbine faults. A feature selection algorithm specifically designed for the analysis of wind flow was proposed to reduce the number of features. This algorithm proved to have better performance than fuzzy entropy measures and recursive feature elimination methods under a limited number of features. As a result, faults in the wind turbine array could be detected and identified by machine learning algorithms.
- Published
- 2020
43. Gas channels and chimneys prediction using artificial neural networks and multi-seismic attributes, offshore West Nile Delta, Egypt
- Author
-
Amir Ismail, Hatem Farouk Ewida, Aldo Zollo, Sahar Nazeri, Mohammad Galal Al-Ibiary, Ismail, A., Ewida, H. F., Nazeri, S., Al-Ibiary, M. G., and Zollo, A.
- Subjects
Artificial neural network ,Petrophysics ,Well logging ,Drilling ,Geotechnical Engineering and Engineering Geology ,computer.software_genre ,Physics::Geophysics ,Current (stream) ,Multi-seismic attribute ,Fuel Technology ,Petroleum exploration ,And machine learning ,Multilayer perceptron ,Nile delta ,Submarine pipeline ,Data mining ,computer ,Geology ,Gas chimney - Abstract
Machine learning techniques combined with multi-seismic attributes and well logs datasets have been successfully used in reducing the risk of drilling operations and petroleum exploration by providing precise petrophysical and seismic information extracted from the hydrocarbon reservoir rocks. For this purpose, Artificial Neural Networks (ANNs) work as a multi-channel processing system with a high degree of interconnection to classify various faces and predict the reservoir properties through the seismic profile by involving multi-seismic attributes and optionally well logs to the inputs. The main aim of this study is to use both supervised and unsupervised neural networks for the first time in the West Delta Deep Marine (WDDM) concession to identify the spatial dimensions of the gas-bearing channels and the detection of gas chimneys across the seismic profiles. We use back-error propagation algorithms of the Multilayer Perceptron (MLP) and self-organizing Unsupervised Vector Quantizer (UVQ) as supervised and unsupervised neural network methods, respectively, to detect the gas zones and channels, and to classify the gas chimneys and non-gas chimneys zones, as well as classification of the seismic reflections and lithologies. The output acquires a detailed analysis of the distribution pattern of gas channels and accurate information to image the gas chimneys. In the current study, the approach adopted is beneficial to image the gas chimneys and channels in different basins in any region of the world with similar geological settings.
- Published
- 2022
- Full Text
- View/download PDF
44. A Filtering-Based Approach for Improving Crowdsourced GNSS Traces in a Data Update Context
- Author
-
Sébastien Mustière, Thomas Devogele, Ana-Maria Olteanu-Raimond, Stefan S. Ivanovic, Laboratoire des Sciences et Technologies de l'Information Géographique (LaSTIG), École nationale des sciences géographiques (ENSG), Institut National de l'Information Géographique et Forestière [IGN] (IGN)-Institut National de l'Information Géographique et Forestière [IGN] (IGN), Laboratoire d'Informatique Fondamentale et Appliquée de Tours (LIFAT), Université de Tours (UT)-Institut National des Sciences Appliquées - Centre Val de Loire (INSA CVL), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS), and Université de Tours-Institut National des Sciences Appliquées - Centre Val de Loire (INSA CVL)
- Subjects
crowdsourced GNSS traces ,Computer science ,[SHS.INFO]Humanities and Social Sciences/Library and information sciences ,Geography, Planning and Development ,0211 other engineering and technologies ,lcsh:G1-922 ,Context (language use) ,Satellite system ,02 engineering and technology ,computer.software_genre ,and machine learning ,Earth and Planetary Sciences (miscellaneous) ,data quality ,Computers in Earth Sciences ,ComputingMilieux_MISCELLANEOUS ,021101 geological & geomatics engineering ,[SHS.STAT]Humanities and Social Sciences/Methods and statistics ,business.industry ,021107 urban & regional planning ,outlier ,Metadata ,GNSS applications ,Data quality ,Outlier ,The Internet ,Noise (video) ,Data mining ,business ,computer ,lcsh:Geography (General) - Abstract
Traces collected by citizens using GNSS (Global Navigation Satellite System) devices during sports activities such as running, hiking or biking are now widely available through different sport-oriented collaborative websites. The traces are collected by citizens for their own purposes and frequently shared with the sports community on the internet. Our research assumption is that crowdsourced GNSS traces may be a valuable source of information to detect updates in authoritative datasets. Despite their availability, the traces present some issues such as poor metadata, attribute incompleteness and heterogeneous positional accuracy. Moreover, certain parts of the traces (GNSS points composing the traces) are results of the displacements made out of the existing paths. In our context (i.e., update authoritative data) these off path GNSS points are considered as noise and should be filtered. Two types of noise are examined in this research: Points representing secondary activities (e.g., having a lunch break) and points representing errors during the acquisition. The first ones we named secondary human behaviour (SHB), whereas we named the second ones outliers. The goal of this paper is to improve the smoothness of traces by detecting and filtering both SHB and outliers. Two methods are proposed. The first one allows for the detection secondary human behaviour by analysing only traces geometry. The second one is a rule-based machine learning method that detects outliers by taking into account the intrinsic characteristics of points composing the traces, as well as the environmental conditions during traces acquisition. The proposed approaches are tested on crowdsourced GNSS traces collected in mountain areas during sports activities.
- Published
- 2019
- Full Text
- View/download PDF
45. Deep Learning Black Box Problem
- Author
-
Hussain, Jabbar and Hussain, Jabbar
- Abstract
Application of neural networks in deep learning is rapidly growing due to their ability to outperform other machine learning algorithms in different kinds of problems. But one big disadvantage of deep neural networks is its internal logic to achieve the desired output or result that is un-understandable and unexplainable. This behavior of the deep neural network is known as “black box”. This leads to the following questions: how prevalent is the black box problem in the research literature during a specific period of time? The black box problems are usually addressed by socalled rule extraction. The second research question is: what rule extracting methods have been proposed to solve such kind of problems? To answer the research questions, a systematic literature review was conducted for data collection related to topics, the black box, and the rule extraction. The printed and online articles published in higher ranks journals and conference proceedings were selected to investigate and answer the research questions. The analysis unit was a set of journals and conference proceedings articles related to the topics, the black box, and the rule extraction. The results conclude that there has been gradually increasing interest in the black box problems with the passage of time mainly because of new technological development. The thesis also provides an overview of different methodological approaches used for rule extraction methods.
- Published
- 2019
46. Using Artificial Intelligence Techniques for Solar Irradiation Forecasting: The Case of Saudi Arabia.
- Author
-
Alhebshi, Fatima, Alnabilsi, Heba, Bensenouci, Ahmed, and Brahimi, Tayeb
- Subjects
SOLAR technology ,ARTIFICIAL intelligence ,RENEWABLE energy sources ,PHOTOVOLTAIC power systems ,SOLAR radiation ,RENEWABLE natural resources ,GRIDS (Cartography) - Abstract
The Middle East and North Africa (MENA) region is considered as one of the best areas, for solar radiation, in the world as seen in Figure 1 [1-2]. Sustainable Development Goals (SDGs), adopted by 193 nations in 2015, include, for the first time, a target to ensure access to affordable, reliable, and modern energy for all by 2030 [3]. In an analysis by IRENA [4] on the investment opportunities in the GCC, it is reported that close to 60% of the GCC's surface is hit by the sun and has significant potential for solar PV power systems. The European Photovoltaic Industry Association (EPIA) and Greenpeace expect that PV could provide up to 12% of electricity demand in European countries by 2020 [5]. Developing just 1% of this technology could eventually result in 470 GW of solar PV capacity. In its vision 2030 [ref], Saudi Arabia identified renewable energy as one of the pillars of economic diversification, away from oil, with an initial target set to 9.5 GW. The abundance of solar resource potential in KSA combined with the falling cost of associated technologies, including photovoltaic (PV) modules, represent the major factor behind the increased use of this source of energy. Recently, Saudi Arabia launched one of the Kingdom's largest project to build $500 billion mega-city business and industrial zone "NEOM" with a high ambition to make this new area running on 100% renewable energy. However, despite its abundance and the many advantages over other sources of energy, this technology is related to the challenging whether predictions and modeling accuracy. The integration of solar energy into the electrical network would be more efficient if the fluctuation of the Global Horizontal Irradiation/Irradiance (GHI) is more reliable and hence the PV energy output well predicted more accurately. Although a solar PV system relies on many components such as inverter, charge controller, batteries, and panels, forecasting the solar irradiation represents the main step in ensuring and designing an efficient solar PV system since it depends on several parameters namely, PV system location and orientation, daytime, and sunshine period. This paper attempts to develop a prediction model using the artificial neural network (ANN) [6, 7] for estimating the monthly average daily solar irradiation in the city of Jeddah, Kingdom of Saudi Arabia but, may be extended to other cities of the Kingdom. An in-depth ANN forecast method for solar irradiation is presented along with the statistical approach and techniques for predicting the Global Horizontal Irradiation (GHI). By using case examples, it is possible to build models capable of predicting and generating rules that can be translated into natural query language and provide a measure of the confidence of the classification on the basis of its attributes. The data used in this study uses attributes such as variable weather and solar irradiation data of 10 cities provided by KACARE [8] as part of the Renewable Resource Monitoring and Mapping (RRMM) Program. The simulation is performed at Jeddah in an attempt to compare with the experimental measurements by KACARE and measurements done at Effat Solar PV system installed at the roof of the Deanship for Graduate Studies and Research (DGSR). An example of solar radiation and air temperature distributions computed using RETScreen [9], an energy management software created by the Government of Canada, to help predicting the viability and feasibility of energy project including renewable energy sources, is given in Figure 2. ANN method may use up to 12 attributes with MATLAB and/or WEKA [10]. The set of attributes being used for training are namely the time of the day, the year, the latitude and longitude, the air temperature, the wind speed and its direction, the azimuth angle, the diffuse horizontal irradiance, the direct horizontal irradiance, the global horizontal irradiance, the humidity, the pressure, and the zenith angle. The significance of this study relies on its capability of predicting the solar irradiation to quantify and improve the PV system design, to ensure a secure and reliable electrical output, and to help electrical grid operators to manage the entire grid system. Another key element of this study is the outreach and dissemination of renewable energy technologies on Effat University campus, in addition to providing students with opportunities to perform experiments on the installed PV systems and to help Effat University becoming a leader in best practice campus sustainability. [ABSTRACT FROM AUTHOR]
- Published
- 2019
47. A Filtering-Based Approach for Improving Crowdsourced GNSS Traces in a Data Update Context.
- Author
-
Ivanovic, Stefan S., Olteanu-Raimond, Ana-Maria, Mustière, Sébastien, and Devogele, Thomas
- Subjects
GLOBAL Positioning System ,METADATA ,FILTERS & filtration - Abstract
Traces collected by citizens using GNSS (Global Navigation Satellite System) devices during sports activities such as running, hiking or biking are now widely available through different sport-oriented collaborative websites. The traces are collected by citizens for their own purposes and frequently shared with the sports community on the internet. Our research assumption is that crowdsourced GNSS traces may be a valuable source of information to detect updates in authoritative datasets. Despite their availability, the traces present some issues such as poor metadata, attribute incompleteness and heterogeneous positional accuracy. Moreover, certain parts of the traces (GNSS points composing the traces) are results of the displacements made out of the existing paths. In our context (i.e., update authoritative data) these off path GNSS points are considered as noise and should be filtered. Two types of noise are examined in this research: Points representing secondary activities (e.g., having a lunch break) and points representing errors during the acquisition. The first ones we named secondary human behaviour (SHB), whereas we named the second ones outliers. The goal of this paper is to improve the smoothness of traces by detecting and filtering both SHB and outliers. Two methods are proposed. The first one allows for the detection secondary human behaviour by analysing only traces geometry. The second one is a rule-based machine learning method that detects outliers by taking into account the intrinsic characteristics of points composing the traces, as well as the environmental conditions during traces acquisition. The proposed approaches are tested on crowdsourced GNSS traces collected in mountain areas during sports activities. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.