Publisher: elsevier b.v. / Topic: machine learning - Searchworks@Jio Institute Digital Library Search Results

Showing total 17,301 results

Start Over Topic machine learning Publisher elsevier b.v.

17,301 results

1. An Approach to Automate the Scientific Paper's Evaluation Based on NLP Technologies: the Experience in the Russian Segment of Financial Technologies Field.

Author: Terentieva, Elena, Zheltova, Kristina, and Dukhanov, Alexey
Subjects: MACHINE learning, FINANCIAL technology, LANGUAGE models
Abstract: This paper reflects the research to design and application the complex method to evaluate the quality of the paper written in the IMRaD format. As a field we choose Russian segment of the financial technologies area since we had interest to use multilingual language model. The method covers paper's relevance to the chosen field and internal topics, the actuality, relation between parts, text borrowing, and experiment part. The clustering and classifying algorithms, and language machine learning model were used. The experimental part based on corpus of Russian papers shows the deviation of method's recommendations from expert opinion not more than 20%. The results may be interested not only reviewers but student as authors of futures papers to avoid common mistakes during paper's writing. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

2. An Overview of Machine Learning in Orthopedic Surgery: An Educational Paper.

Author: Padash, Sirwa, Mickley, John P., Vera Garcia, Diana V., Nugen, Fred, Khosravi, Bardia, Erickson, Bradley J., Wyles, Cody C., and Taunton, Michael J.
Abstract: The growth of artificial intelligence combined with the collection and storage of large amounts of data in the electronic medical record collection has created an opportunity for orthopedic research and translation into the clinical environment. Machine learning (ML) is a type of artificial intelligence tool well suited for processing the large amount of available data. Specific areas of ML frequently used by orthopedic surgeons performing total joint arthroplasty include tabular data analysis (spreadsheets), medical imaging processing, and natural language processing (extracting concepts from text). Previous studies have discussed models able to identify fractures in radiographs, identify implant type in radiographs, and determine the stage of osteoarthritis based on walking analysis. Despite the growing popularity of ML, there are limitations including its reliance on "good" data, potential for overfitting, long life cycle for creation, and ability to only perform one narrow task. This educational article will further discuss a general overview of ML, discussing these challenges and including examples of successfully published models. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

3. State of the art paper: Cardiac computed tomography of the left atrium in atrial fibrillation.

Author: Bodagh, Neil, Williams, Michelle C., Vickneson, Keeran, Gharaviri, Ali, Niederer, Steven, and Williams, Steven E.
Abstract: The clinical spectrum of atrial fibrillation means that a patient-individualized approach is required to ensure optimal treatment. Cardiac computed tomography can accurately delineate atrial structure and function and could contribute to a personalized care pathway for atrial fibrillation patients. The imaging modality offers excellent spatial resolution and has been utilised in pre-, peri- and post-procedural care for patients with atrial fibrillation. Advances in temporal resolution, acquisition times and analysis techniques suggest potential expanding roles for cardiac computed tomography in the future management of patients with atrial fibrillation. The aim of the current review is to discuss the use of cardiac computed tomography in atrial fibrillation in pre-, peri- and post-procedural settings. Potential future applications of cardiac computed tomography including atrial wall thickness assessment and epicardial fat volume quantification are discussed together with emerging analysis techniques including computational modelling and machine learning with attention paid to how these developments may contribute to a personalized approach to atrial fibrillation management. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

4. SMIAltmetric: A comprehensive metric for evaluating social media impact of scientific papers on Twitter (X).

Author: Wang, Zuzheng, Lu, Yongxu, Zhou, Yuanyuan, and Ji, Jiaojiao
Subjects: MACHINE learning, SOCIAL impact, SCHOLARLY communication, SOCIAL classes, ALTMETRICS, SOCIAL media
Abstract: • A new indicator, SMIAltmetric, for measuring scientific paper's social media impact. • Key impact factors: Followers, retweets, mentions, and Citation. • SMIAltmetric outperforms Altmetric in finer differentiation. The rise of social media has significantly influenced scholarly communication, knowledge dissemination, and research evaluation, leading to the enrichment of alternative metrics (altmetrics) for evaluating academic papers' social impact, which assesses the social impact of academic papers through online activities, including reading, bookmarking, downloading, and commenting. However, these altmetrics often focus on the number of mentions on social media rather than thoroughly evaluating the source, content, and dissemination of these mentions. To address this gap, this study introduces the social media impact altmetric (SMIAltmetric), which is based on 44,087 publications and 860,680 tweets (now "posts"), a comprehensive scoring system for evaluating scientific papers on Twitter (now "X"), using diverse features, including literature-related, social media engagement-related, user-related, and content-related features. Employing Altmetric Attention Acores (AAS) as labels, we tested eight machine learning algorithms, with XGBoost demonstrating the highest accuracy at 0.8672. Crucial factors influencing SMIAltmetric, as identified by the SHAP value, were followers, retweets, mentions, and citation. Furthermore, consistency analysis and convergent validation between the proposed SMIAltmetric and AAS confirm the reliability and finer differentiation of SMIAltmetric. The proposed SMIAltmetric provides a more comprehensive understanding of a paper's social media impact, enhancing the evaluation of scientific discourse and its engagement with society. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Late Middle Ages watermarked Italian paper: A Machine Learning spatial-temporal approach.

Author: Teodonio, Lorenzo, Scatigno, Claudia, Missori, Mauro, and Festa, Giulia
Subjects: *WATERMARKS, *CATALOGS, *MACHINE learning, *MIDDLE Ages, *FOURIER transform infrared spectroscopy, *PAPER mills
Abstract: • Machine Learning to classify watermarked Italian paper. • XRF and FTIR spectroscopies to study late middle-aged Italian watermarked paper. • A new spatial-temporal approach to trace the local recipes and the geographical paper mills' production. • Bergamo and Bologna stand out as marked watermarked Italian paper mills. Manuscripts, illuminated codex, books, documents and letters are composite materials, traces of the past starting from the invention of the writing. In this context, dating is one of the most important information for document attribution, and watermarked papeItaly'smarkers for studying their time-spatial distribution. In the Late Middle Ages, Italy's most important centre of paper mill was located in the town of Fabriano (Marche region, Italy). Here, a selection of ten Italian Late Middle Ages watermarked papers belonging to the Corpus Chartarum Italicarum (Corpus of Italian papers) is characterised by elemental and molecular spectroscopies and collected data are analysed by Machine Learning (ML) to trace the local fabrication recipes and the geographical paper mills production. Data from portable X-ray fluorescence and Fourier transform infrared spectroscopy were analysed through Singular Vector Machine, Soft Independent Modeling of Class Analogy and Moving Blocks methods. This innovative ML spatial-temporal approach based on keeping the temporal variable fixed is used to find elemental benchmarks for classifying the watermarked Italian notarial catalogue of the Late Middle Ages finding differences in the local recipes and studying the homogeneity in the paper mills' production. Results show that watermarked paper from Northern Italy, from the town of Strozza, Piacenza and Bergamo , as well as Bologna, present a high elemental and molecular homogeneity, which indicates that the hand-made processing technique could be the same helped by the proximity of the three cities, starting point for technologies exchange or influence. No heavy metals are found in the watermarked paper and K, Ca, Fe and Zn are identified as elemental benchmarks. Finally, Ca, Ti, Mn, Cr and Fe are particularly present on the edges of the watermarked papers. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

6. Content-based quality evaluation of scientific papers using coarse feature and knowledge entity network.

Author: Wang, Zhongyi, Zhang, Haoxuan, Chen, Haihua, Feng, Yunhe, and Ding, Junhua
Subjects: MACHINE learning, SCIENCE education, COMPUTER science, PEER pressure, RANDOM forest algorithms
Abstract: Pre-evaluating scientific paper quality aids in alleviating peer review pressure and fostering scientific advancement. Although prior studies have identified numerous quality-related features, their effectiveness and representativeness of paper content remain to be comprehensively investigated. Addressing this issue, we propose a content-based interpretable method for pre-evaluating the quality of scientific papers. Firstly, we define quality attributes of computer science (CS) papers as integrity , clarity , novelty , and significance , based on peer review criteria from 11 top-tier CS conferences. We formulate the problem as two classification tasks: Accepted/Disputed/Rejected (ADR) and Accepted/Rejected (AR). Subsequently, we construct fine-grained features from metadata and knowledge entity networks, including text structure, readability, references, citations, semantic novelty, and network structure. We empirically evaluate our method using the ICLR paper dataset, achieving optimal performance with the Random Forest model, yielding F1 scores of 0.715 and 0.762 for the two tasks, respectively. Through feature analysis and case studies employing SHAP interpretable methods, we demonstrate that the proposed features enhance the performance of machine learning models in scientific paper quality evaluation, offering interpretable evidence for model decisions. • Define four criteria for quality evaluation of scientific papers: integrity, clarity, novelty, and significance. • Propose a framework for quality evaluation of scientific papers based on coarse features and knowledge entity network. • An effective algorithm for measuring the novelty and significance of scientific papers based on knowledge entity networks. • Create and release a rigorous dataset, which could serve as the gold standard for quality evaluation of scientific papers. • Conduct extensive experiments to validate the effectiveness of the proposed framework. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. Textual features of peer review predict top-cited papers: An interpretable machine learning perspective.

Author: Sun, Zhuanlan
Subjects: MACHINE learning, ARTIFICIAL intelligence, TECHNICAL reports, PEER communication, THEATER reviews, EMOTION recognition
Abstract: • A framework combining machine learning models and SHAP to interpret how peer review improves the research impact. • The importance of key linguistic, sentiment, and peer review features from peer review reports in determining the scientific significance of papers. • Valuable insights for authors to improve the quality of work and increase academic influence by paying closer attention to peer review characteristics. • Textual features of peer review reports play an important role in predicting post-publication scientific impact. Peer review is crucial in improving the quality and reliability of scientific research. However, the mechanisms through which peer review practices ensure papers become top-cited papers (TCPs) after publication are not well understood. In this study, by collecting a data set containing 13, 066 papers published between 2016 and 2020 from Nature communications with open peer review reports, we aim to examine how textual features embedded within the peer review reports of papers that reflect the reviewers' emotions may predict the papers to be TCPs. We compiled a list of 15 textual features and classified them into three categories: peer review features, linguistic features, and sentiment features. We then chose the XGBoost machine learning model with the best performance in predicting TCPs, and utilized the explainable artificial intelligence techniques SHAP to interpret the role of feature importance on the prediction results. The distribution of feature importance ranking results demonstrates that sentiment features play a crucial role in determining papers' potential to be highly cited. This conclusion still holds, even when the ranking of the feature importance changes in the subgroup analysis of dividing the samples into four disciplines (biological sciences, health sciences, physical sciences, and earth and environmental sciences), as well as two groups based on whether reviewers' identities were revealed. This research emphasizes the textual features retrieved from peer review reports that play role in improving manuscript quality can predict the post-publication research impact. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. Cigarette paper as evidence: Forensic profiling using ATR-FTIR spectroscopy and machine learning algorithms.

Author: Kapoor, Muskaan, Sharma, Akanksha, and Sharma, Vishal
Subjects: *CIGARETTES, *FORENSIC sciences, *FOURIER transform infrared spectroscopy, *MACHINE learning, *ALGORITHMS
Abstract: This research highlights the underestimated significance of cigarette paper as evidence at crime scenes. The primary objective is to distinguish cigarette paper from similar-looking alternatives, addressing the first research objective. The second objective involves identifying cigarette paper brands using attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy and machine learning (ML) algorithms. Accurate differentiation of cigarette paper from normal paper is emphasized. ATR-FTIR spectroscopy, coupled with principal component analysis (PCA) for dimensionality reduction, is employed for brand identification. Among fifteen ML algorithms compared, the CatBoost classifier excels for both objectives. This research presents a non-destructive, effective method for studying cigarette paper, contributing valuable insights to crime scene investigations. [Display omitted] • Forensic evaluation of cigarette paper utilizing ATR-FTIR spectroscopy and Machine learning algorithms. • Peak characterization and differentiation-distinguishing cigarette paper from other types. • Machine learning algorithm comparison: assessing discrimination across nine cigarette brands. • External validation of the dominant algorithm using unknown samples. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. Machine learning-assisted wide-gamut fluorescence visual test paper for propazine determination in fish and seawater samples.

Author: Liu, Hua, You, Jinjie, Liu, Chenxi, Zhang, Zeming, Sun, Aili, Hao, Guijie, and Shi, Xizhi
Subjects: *FLUORESCENCE, *SEAWATER, *SUPPORT vector machines, *FLUORESCENCE quenching, *IMPRINTED polymers
Abstract: Molecularly imprinted polymer (MIP-QDs) with fluorescence quenching ability toward propazine was synthesized for propazine detection. b(Blue)-MIP-QDs were prepared using ZnCdS/ZnS QDs via reverse micro-emulsion, whereas r(red)-MIP-QDs were synthesized using CdSe/ZnS QDs. By utilizing graphene quantum dots (GQDs) as a stable fluorescence intensity reference, the wide-gamut fluorescence test paper was constructed on the basis of mixing b-MIP-QDs, r-MIP-QDs, and GQDs under the optimal ratio. When analyzing spiked propazine in fish and seawater samples using a test paper, satisfactory recoveries of 104.0 %–114.6 % and 92.0 %–96.4 % were obtained, with corresponding limits of detection of 5.0 μg/kg and 1.0 μg/L, respectively. The RGB extractor was utilized to extract the actual fluorescence color and construct a dataset consisting of R, G, and B values, as well as concentration data from 400 samples. The SVR model of Python 3.9.7 was used to obtain and analyze the concentration and feature data. After optimization, the constructed model achieved a correlation coefficient of 0.98 and an RMSE of only 1.81, indicating high prediction accuracy and excellent generalization ability that meet quenching prediction requirements. As an intelligent and rapid detection method, this model holds significant practical significance. [Display omitted] • A wide-gamut fluorescence visual test paper was fabricated. • The visual test paper demonstrates excellent linearity, accuracy, precision, and sensitivity for promazine detect. • An RGB model by support vector machine were constructed. • The model exhibits high accuracy and strong generalization ability. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. Nonparametric functional analysis under joint estimation with applications to identifying highly cited papers.

Author: Chowdhury, K.P.
Subjects: MACHINE learning, ARTIFICIAL intelligence, FUNCTIONAL analysis, ELECTRONIC data processing, STATISTICAL significance, CITATION indexes, SCIENTOMETRICS
Abstract: This article introduces a nonparametric methodology combining the strengths of binary regression and latent variable formulations, while overcoming their disadvantages. The mathematical results are implemented through a novel Bayesian Hierarchical estimation methodology called Latent Adaptive Hierarchical Expectation Maximization Like algorithm. Requiring minimal assumptions, it extends extant methodologies, and in simulation studies gives better prediction and inference performances for asymmetric data generating processes. A new classification statistic, called Adjusted Receiver Operating Curve Statistic is also introduced. Utilizing it we demonstrate better overall model fit, inference and prediction performance of the proposed methodology over widely used existing methods in the sciences. In addition, the methodology can be used to perform model diagnostics for any model specification. This is a highly useful result, and it extends existing work for categorical model diagnostics broadly across the sciences. Furthermore, the mathematical results also highlight important new findings regarding the interplay of statistical significance and scientific significance. Finally, the methodology is applied to identifying highly-cited papers in the social sciences in a joint estimation framework. The results indicate that the methodology outperforms widely used existing artificial intelligence and machine learning models with very few Monte Carlo iterations. In Scientometric application, it finds Journal Impact Factor to be more important than Keyword Popularity parameters for explaining citation outcomes in select social science fields. It further finds that the percentage change in Published Popularity may also help to explain citation outcomes in the field. The findings appear to be new to the Scientometric field. • Introduces new statistical method for regressions requiring few assumptions. • Outperforms existing artificial intelligence/machine learning algorithms. • Introduces ARS: A general statistic for predication/comparison of models. • Applies methodology to identify highly-cited papers in the social sciences. • JIF more important than keywords for getting cited in some social science fields. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

11. Primate eye tracking with carbon-nanotube-paper-composite based capacitive sensors and machine learning algorithms.

Author: Li, Tianyi, Sakthivelpathi, Vigneshwar, Qian, Zhongjie, Soetedjo, Robijanto, and Chung, Jae-Hyun
Subjects: *PROXIMITY detectors, *EYE tracking, *CAPACITIVE sensors, *MACHINE learning, *EYE movements
Abstract: Accurate real-time eye tracking is crucial in oculomotor system research. While the scleral search coil system is the gold standard, its implantation procedure and bulkiness pose challenges. Camera-based systems are affected by ambient lighting and require high computational and electric power. This study presents a novel eye tracker using proximity capacitive sensors made of carbon-nanotube-paper-composite (CPC). These sensors detect femtofarad-level capacitance changes caused by primate corneal movement during horizontal and vertical eye rotations. Data processing and machine learning algorithms are evaluated to enhance the accuracy of gaze angle prediction. The system performance is benchmarked against the scleral coil during smooth pursuits, saccades tracking, and fixations. The eye tracker demonstrates up to 0.97 correlation with the coil in eye tracking and is capable of estimating gaze angle with a median absolute error as low as 0.30°. The capacitive eye tracker demonstrates good consistency and accuracy in comparison to the gold-standard scleral search coil method. This lightweight, non-invasive capacitive eye tracker offers potential as an alternative to traditional coil and camera-based systems in oculomotor research and vision science. [Display omitted] • Novel capacitive proximity sensors are evaluated for non-human primate eye tracking. • The sensors demonstrate a high correlation (up to 0.97) with the scleral coil system. • Machine learning estimates the gaze with a median absolute error as low as 0.30°. • Show promise as a non-contact, low-cost, and low-power eye tracking tool. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. A portable europium complex-loaded fluorescent test paper combined with smartphone analysis for the on-site and visual detection of mancozeb in food samples.

Author: Cheng, Zheng, Liu, Xinfang, Zhao, Beibei, Liu, Xu, Yang, Xiaorui, Zhang, Xiaoyu, and Feng, Xun
Subjects: *MACHINE learning, *MANCOZEB, *FARM produce, *FLUORESCENCE quenching, *HIERARCHICAL clustering (Cluster analysis)
Abstract: The on-site detection of mancozeb in food samples holds immense value for food safety. A red-fluorescent europium complex (Eu-PYDC-Phen) has been prepared and employed as a fluorescence probe for mancozeb detection. The optimized probe suspension exhibits excellent detection performances, including a wide linear range (0–0.24 mM), low detection limit (65 nM), rapid response (2 mins) and high selectivity. Moreover, a portable detection platform was carefully designed, integrating the Eu-PYDC-Phen -based fluorescent test strips with smartphone color recognition software. This innovative platform enables visual and on-site detection of mancozeb in tomato, apple, and lettuce, achieving satisfactory recovery rates (90.34 to 106.50%). Furthermore, the integration of machine learning techniques based on hierarchical clustering algorithm has the potential to further improve the prediction and decision-making efficiency in mancozeb detection. This work provides an economical, convenient, and reliable strategy for on-site detection of pesticide in agricultural products, thereby making a meaningful contribution to food safety. A Eu-PYDC-Phen probe was employed for the detection of mancozeb through a fluorescence quenching effect. A smartphone-assisted test paper platform was designed for the rapid and visual detection of mancozeb in lettuce, tomato, and apple samples. [Display omitted] • A Eu-PYDC-Phen probe was applied for highly sensitive and selective detection of mancozeb. • Smartphone was integration with paper-based probe for visual identification. • The platform was applied for rapid detection of target in food samples. • A machine learning model was developed to simplify the detection process. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Detection of SARS-CoV-2 using machine learning-enabled paper-assisted ratiometric fluorescent sensors based on target-induced magnetic DNAzyme.

Author: Wang, Wenhai, Luo, Lun, Li, Yanmei, Hong, Bin, Ma, Yi, Kang, Keren, and Wang, Jufang
Subjects: *DEOXYRIBOZYMES, *SARS-CoV-2, *ARTIFICIAL vision, *MACHINE learning, *DETECTORS, *TARGETED drug delivery
Abstract: The development of an advanced analytical platform with regard to SARS-CoV-2 is crucial for public health. Herein, we present a machine learning platform based on paper-assisted ratiometric fluorescent sensors for highly sensitive detection of the SARS-CoV-2 RdRp gene. The assay involves target-induced rolling circle amplification to generate magnetic DNAzyme, which is then detectable using the paper-assisted ratiometric fluorescent sensor. This sensor detects the SARS-CoV-2 RdRp gene with a visible-fluorescence color response. Moreover, leveraging different fluorescence responses, the ResNet algorithm of machine learning assists in accurately identifying fluorescence images and differentiating the concentration of the SARS-CoV-2 RdRp gene with over 99% recognition accuracy. The machine learning platform exhibits exceptional sensitivity and color responsiveness, achieving a limit of detection of 30 fM for the SARS-CoV-2 RdRp gene. The integration of intelligent artificial vision with the paper-assisted ratiometric fluorescent sensor presents a novel approach for the on-site detection of COVID-19 and holds potential for broader use in disease diagnostics in the future. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. Machine learning assisted identification of antibiotic-resistant Staphylococcus aureus strains using a paper-based ratiometric sensor array.

Author: Laliwala, Aayushi, Gupta, Ritika, Svechkarev, Denis, Bayles, Kenneth W., Sadykov, Marat R., and Mohs, Aaron M.
Subjects: *SENSOR arrays, *MACHINE learning, *SUPPORT vector machines, *STAPHYLOCOCCUS aureus, *SKIN infections
Abstract: [Display omitted] • Sensing tool for identifying antimicrobial-resistant Staphylococcus aureus strains. • Fluorescent sensor array pre-adsorbed on paper microzone plates for better efficiency. • Classification of laboratory and clinical S. aureus strains powered by machine learning. • Sensor identifies biofilms associated with antimicrobial-resistant S. aureus strains. • Rapid, low-volume identification with >90% classification accuracy. Staphylococcus aureus , a versatile human pathogen, significantly impacts global health causing a broad spectrum of medical conditions that range from minor skin infections to life-threatening diseases. The clinical importance of S. aureus is underscored by its resistance to multiple antibiotics and formation of biofilms, providing protection against antimicrobials and immune responses. To date, the identification of antimicrobial-resistant (AMR) S. aureus strains, such as methicillin-resistant S. aureus (MRSA) and vancomycin-intermediate S. aureus (VISA), requires time-consuming and expensive methodologies, including culture-based, molecular, and phenotypic techniques. Previously, we developed a paper-based ratiometric sensor array composed of fluorescent sensor dyes (3-hydroxyflavone derivatives) pre-adsorbed on paper microzone plates. Combined with machine learning algorithms such as neural networks, this sensor effectively discriminated 16 bacterial species and determined their Gram status. In this study, we evaluate its ability to distinguish antibiotic-resistant S. aureus strains and their biofilms. Our results demonstrate that the sensor array, in conjunction with LDA and neural networks, successfully differentiated three common laboratory MRSA strains from three methicillin-susceptible S. aureus (MSSA) strains with 82.5% accuracy. Furthermore, using support vector machines, this sensor was able to distinguish and categorically classify MRSA, MSSA, and VISA clinical isolates with 97.5% accuracy. Remarkably, beyond distinguishing planktonic cultures, this sensor array demonstrated a formidable capability to discriminate AMR S. aureus biofilms, achieving over 80% accuracy. Combined, the results of this study highlight the paper-based sensor array's significant potential as a robust diagnostic tool to accurately, rapidly, and easily identify drug-resistant S. aureus strains in clinically relevant settings. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. Enhancing pathogen identification in cheese with high background microflora using an artificial neural network-enabled paper chromogenic array sensor approach.

Author: Jia, Zhen, Lin, Zhuangsheng, Luo, Yaguang, Cardoso, Zachary A., Wang, Dayang, Flock, Genevieve H., Thompson-Witrick, Katherine A., Yu, Hengyong, and Zhang, Boce
Subjects: *SENSOR arrays, *ARRAIGNMENT, *ESCHERICHIA coli O157:H7, *PATHOGENIC bacteria, *SALMONELLA enteritidis, *IDENTIFICATION
Abstract: Biohazards, which may occur at all supply chain stages, pose significant threats to food safety and public health. Addressing these concerns and enhancing food safety necessitates a nondestructive pathogen surveillance approach capable of continuously and simultaneously detecting multiple pathogens. Detecting and differentiating low concentrations of pathogenic bacteria amid high background microflora levels in foods is challenging, requiring technology with high sensitivity and robust discriminatory capability. This study introduces an artificial neural network-driven paper chromogenic array sensor (ANN-PCA) technique developed for the nondestructive, continuous, and simultaneous detection of Salmonella Enteritidis (SE) and Escherichia coli O157:H7 (Ec) from a high background microflora in shredded cheddar cheese. This method enables accurate detection of SE and Ec in monoculture and cocktail culture while distinguishing them from a high level of background microflora (∼7.5 log CFU/g), with accuracies ranging from 72 ± 11% to 92 ± 3%. In addition, SE and Ec were successfully identified at concentrations as low as 1 log CFU/g within one day, with an accuracy of 72 ± 11%. This approach exhibits promising potential for integration into a digitalized, smart, and resilient nondestructive surveillance system for real-time pathogen detection in foods throughout the supply chain without enrichment, incubation, or other sample preparation steps. [Display omitted] • Development of a machine learning-driven paper chromogenic array sensor approach. • Non-destructive, continuous, and simultaneous detection of multiple pathogens. • Detection of pathogenic bacteria from a high level of background microflora. • The approach provided potential detection for pathogens throughout the food supply chain. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Versatile photo-sensing ability of paper based flexible 2D-Sb0.3Sn0.7Se2 photodetector and performance prediction with machine learning algorithm.

Author: Rawal, Kuntesh, Devendrabhai, Patel Dixita, Pataniya, Pratik, Jain, Prince, Joshi, Anand, Solanki, G.K., and Tannarana, Mohit
Subjects: *PHOTODETECTORS, *K-nearest neighbor classification, *STANDARD deviations, *MACHINE performance, *MACHINE learning, *CHEMICAL peel
Abstract: Present report demonstrates the application of Sb 0.3 Sn 0.7 Se 2 single crystal as a paper based flexible photodetector. Direct vapour transport grown bulk crystals of Sb 0.3 Sn 0.7 Se 2 has been converted to nanosheets by chemical assisted exfoliation process. The paper-based photodetector is fabricated and the switching action is studied. Tuning of photodetector has been carried out by 670 nm laser illumination for different bias voltages. Temporal photo-response is also studied under polychromatic light with different intensities in vacuum and open environment. The low temperature stability of photodetector has been studied for temperature range 300 K–180 K. Experimental results are obtained in terms of time resolved photocurrent under different illumination and atmospheric conditions. The flexibility and stability are also examined in detail for fabricated detector. Overall, the results suggest the application of Sb 0.3 Sn 0.7 Se 2 as a versatile flexible photodetector. Additionally, the machine learning (ML) model is trained and tested using an experimental photocurrent dataset that has a complex material design with variations in time, bias voltage, intensity, and temperature. The k-nearest neighbor algorithm exhibited outstanding performance, achieving the highest R2 value of 0.9986 when applied to a temperature dataset, with a test size of 0.4. Performance metrics such as mean absolute error and root mean squared error of various test sizes ranging from 0.4 to 0.6 are used to assess the model's accuracy and robustness in changing conditions. This comprehensive analysis not only establishes a platform for future experimental optimization of photodetector materials but also underscores the efficacy of ML regression techniques in developing high-performance photodetectors. [Display omitted] • Chemical assisted exfoliation process were used for the synthesis of SbSnSe2 nanosheets. • Synthesized nanosheets were decorated on paper for the fabrication of flexible photo-detector. • The photodetection application was demonstrated for different atmospheric conditions and different illuminations. • Low temperature stability of the detector was also studied. • KNN Machine learning model were applied to the experimental data for the prediction of photodetection behavior. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Surveillance of pathogenic bacteria on a food matrix using machine-learning-enabled paper chromogenic arrays.

Author: Jia, Zhen, Luo, Yaguang, Wang, Dayang, Holliday, Emma, Sharma, Arnav, Green, Madison M., Roche, Michelle R., Thompson-Witrick, Katherine, Flock, Genevieve, Pearlstein, Arne J., Yu, Hengyong, and Zhang, Boce
Subjects: *PATHOGENIC bacteria, *SALMONELLA, *ESCHERICHIA coli, *FOOD pathogens, *SENSOR arrays, *FOOD safety, *MACHINE learning, *ESCHERICHIA coli O157:H7, *FOOD microbiology
Abstract: Global food systems can benefit significantly from continuous monitoring of microbial food safety, a task for which tedious operations, destructive sampling, and the inability to monitor multiple pathogens remain challenging. This study reports significant improvements to a paper chromogenic array sensor - machine learning (PCA-ML) methodology sensing concentrations of volatile organic compounds (VOCs) emitted on a species-specific basis by pathogens by streamlining dye selection, sensor fabrication, database construction, and machine learning and validation. This approach enables noncontact, time-dependent, simultaneous monitoring of multiple pathogens (Listeria monocytogenes , Salmonella , and E. coli O157:H7) at levels as low as 1 log CFU/g with over 90% accuracy. The report provides theoretical and practical frameworks demonstrating that chromogenic response, including limits of detection, depends on time integrals of VOC concentrations. The paper also discusses the potential for implementing PCA-ML in the food supply chain for different food matrices and pathogens, with species- and strain-specific identification. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. Characterizing highly cited method and non-method papers using citation contexts: The role of uncertainty.

Author: Small, Henry
Subjects: CITATION analysis, MEDICAL periodicals, LOGISTIC regression analysis, MATHEMATICAL variables, MACHINE learning
Abstract: The top 1000 biomedical papers by number of citations are classified by method, type of method and non-methods by examination of citation contexts. Supervised machine learning is applied to the context data for a training sample of papers which is then used to classify the full list, revealing that words indicating utility are most important for the classification of methods. Further word analysis is carried out using corpus linguistics to uncover context words that characterize non-methods. Hedging words are found to play an important role for non-methods, and several are selected for further analysis with logistic regression. Other variables in the regression are a consensus variable based on the similarity of contexts for a paper and another variable based on whether citations come from “methods” sections of citing papers. Accuracy of predictions from logistic regression is comparable to machine learning. The results are interpreted in terms of the perceived certainty or uncertainty of the underlying knowledge, that is, methods and their outputs have higher certainty, and non-methods higher uncertainty. Evidence is found that hedging is inversely related to citation frequency. Implications of this work for the study of the development of science and the role of methods and tools in biomedical research are discussed. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

19. Comment on papers using machine learning for significant wave height time series prediction: Complex models do not outperform auto-regression.

Author: Jiang, Haoyu, Zhang, Yuan, Qian, Chengcheng, and Wang, Xuan
Subjects: *ARTIFICIAL neural networks, *TIME series analysis, *PREDICTION models, *ARTIFICIAL intelligence, *MACHINE learning, *DECOMPOSITION method
Abstract: • Five Machine Learning (ML) models compared for wave height time series prediction. • Complex ML models do not outperform simple AR in wave height time series prediction. • Comment to related papers: signal decomposition in test set series is WRONG. Significant Wave Height (SWH) is crucial in many aspect of ocean engineering. The accurate prediction of SWH has therefore been of immense practical value. Recently, Artificial Intelligence (AI) time series prediction methods have been widely used for single-point short-term SWH time-series forecasting, resulting in many AI-based models claiming to achieve good results. However, the extent to which these complex AI models can outperform traditional methods has largely been overlooked. This study compared five different models - AutoRegressive (AR), eXtreme Gradient Boosting (XGB), Artificial Neural Network (ANN), Long Short-Term Memory (LSTM), and WaveNet - for their performance on SWH time series prediction at 16 buoy locations. Surprisingly, the results suggest that the differences of performance among different models are negligible, indicating that all these AI models have only "learned" the linear auto-regression from the data. Additionally, we noticed that many recent studies used signal decomposition method for such time series prediction, and most of them decomposed the test sets, which is WRONG. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Intelligent computer vision system for segregating recyclable waste papers

Author: Rahman, Mohammad Osiur, Hussain, Aini, Scavino, Edgar, Basri, Hassan, and Hannan, M.A.
Subjects: *PAPER recycling, *COMPUTER vision, *IMAGE processing, *ARTIFICIAL intelligence, *SORTING (Electronic computers), *MACHINE learning, *PATTERN recognition systems, *PAPER chemicals
Abstract: Abstract: This article explores the application of image processing techniques in recyclable waste paper sorting. In recycling, waste papers are segregated into various grades as they are subjected to different recycling processes. Highly sorted paper streams facilitate high quality end products and save processing chemicals and energy. From 1932 to 2009, different mechanical and optical paper sorting methods have been developed to fill the paper sorting demand. Still, in many countries including Malaysia, waste papers are sorted into different grades using a manual sorting system. Because of inadequate throughput and some major drawbacks of mechanical paper sorting systems, the popularity of optical paper sorting systems has increased. Automated paper sorting systems offer significant advantages over human inspection in terms of worker fatigue, throughput, speed, and accuracy. This research attempts to develop a smart vision sensing system that is able to separate the different grades of paper using first-order features. To construct a template database, a statistical approach with intra-class and inter-class variation techniques are applied to the feature selection process. Finally, the K-nearest neighbor (KNN) algorithm is applied for paper object grade identification. The remarkable achievement obtained with the method is the accurate identification and dynamic sorting of all grades of papers using simple image processing techniques. [Copyright &y& Elsevier]
Published: 2011
Full Text: View/download PDF

21. Addressing the sample volume dependency of the colorimetric glucose measurement on microfluidic paper-based and thread/paper-based analytical devices using a novel low-cost analytical viewpoint.

Author: Derakhshani, Mohammad, Jahanshahi, Amir, and Ghourchian, Hedayatollah
Subjects: *IMAGE processing, *TIME measurements
Abstract: [Display omitted] • A novel analytical technique for colorimetric glucose determination on µPADs. • Non-invasive sweat glucose detection independent of sample volume. • Seamless operation: no extra gadgets nor smartphone camera is required. • Computationally inexpensive readout on portable Arm platforms demonstrated. The colorimetric method is widely exploited for glucose measurement on microfluidic paper-based and microfluidic thread/paper-based analytical devices, μPADs and μTPADs, respectively. However, two significant challenges still hinder the real-world applications: the variation of the generated color based on sample volume variation, and the consistent dependency on an imaging camera - usually a smartphone camera - for color readout. The latter also suffers from multiple limitations, such as the variable ambient light conditions and imaging variation among different smartphone brands and models. This manuscript suggests a new device demonstrating a novel viewpoint to the colorimetric method to address both aforementioned challenges. The presented device, Volume-Independent Autonomous box (VIA box), continuously monitors the transient color of the µPAD upon the introduction of the sample till color stabilization, in contrast to similar studies where merely the final stabilized color is read. The interpretation of the transient color profile with respect to time results in the measurement of the glucose level independent of sample volume. In addition, the VIA box monitors µPAD's color using an ordinary RGB sensor instead of an imaging camera in similar studies. Since merely RGB values are recorded in any measurement instance, the computational cost is extremely low compared to the relevant literature where image processing techniques are used. VIA box exploits a simple low-end microcontroller to continuously monitor the transient color, interpret the color profile, and show the glucose level. The samples used in this manuscript are made of artificial sweat samples, including the known glucose range in real sweat samples. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

22. Predicting moisture penetration dynamics in paper with machine learning approach.

Author: Alzweighi, Mossab, Mansour, Rami, Maass, Alexander, Hirn, Ulrich, and Kulachenko, Artem
Subjects: *MACHINE learning, *FEEDFORWARD neural networks, *HYGROTHERMOELASTICITY, *MACHINE dynamics, *RECURRENT neural networks, *MOISTURE
Abstract: • A machine learning approach was used to investigate moisture penetration in paper materials. • The study evaluated the capabilities of both the Feedforward Neural Network (FNN) and the Recurrent Neural Network (RNN). • Numerically generated data was employed for network training. • A continuum model incorporating anisotropic properties, creep behavior, viscoelasticity, and moisture dependency was used for simulations. • The RNN demonstrated superior predictive capability compared to the FNN. In this work, we predicted the gradient of the deformational moisture dynamics in a sized commercial paper by observing the curl deformation in response to the one-sided water application. The deformational moisture is a part of the applied liquid which ends up in the fibers causing swelling and subsequent mechanical response of the entire fiber network structure. The adapted approach combines traditional experimental procedures, advanced machine learning techniques and continuum modeling to provide insights into the complex phenomenon relevant to ink-jet digital printing in which the sized and coated paper is often used, meaning that not all the applied moisture will reach the fibers. Key material properties including elasticity, plastic parameters, viscoelasticity, creep, moisture dependent behavior, along with hygroexpansion coefficients are identified through extensive testing, providing vital data for subsequent simulation using a continuum model. Two machine learning models, a Feedforward Neural Network (FNN) and a Recurrent Neural Network (RNN), are probed in this study. Both models are trained using exclusively numerically generated moisture profile histories, showcasing the value of such data in contexts where experimental data acquisition is challenging. These two models are subsequently utilized to predict moisture profile history based on curl experimental measurements, with the RNN demonstrating superior accuracy due to its ability to account for temporal dependencies. The predicted moisture profiles are used as inputs for the continuum model to simulate the associated curl response comparing it to the experiment representing "never seen" data. The result of comparison shows highly predictive capability of the RNN. This study melds traditional experimental methods and innovative machine learning techniques, providing a robust technique for predicting moisture gradient dynamics that can be used for both optimizing the ink solution and paper structure to achieve desirable printing quality with lowest curl propensities during printing. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. Visible detection of chilled beef freshness using a paper-based colourimetric sensor array combining with deep learning algorithms.

Author: Lin, Yuandong, Ma, Ji, Cheng, Jun-Hu, and Sun, Da-Wen
Subjects: *MACHINE learning, *DEEP learning, *SENSOR arrays, *PATTERN recognition systems, *MULTIVARIATE analysis, *FEATURE extraction
Abstract: • Qualitative and quantitative detection of amine gases could be achieved by CSA. • A visible detection of beef freshness using the amine-responsive CSA was proposed. • ResNet34 had the best performance for beef freshness detection based on CSA. • T-SNE could further visualize and understand the classification process of DL. This study developed an innovative approach that combines a colourimetric sensor array (CSA) composed of twelve pH-response dyes with advanced algorithms, aiming to detect amine gases and assess the freshness of chilled beef. With the assistance of multivariate statistical analysis, the sensor array can effectively distinguish five amine gases and enable rapid quantification of trimethylamine vapour with a limit of detection (LOD) of 8.02 ppb and visually monitor the fresh levels of chilled beef. Moreover, the utilization of deep learning models (ResNet34, VGG16, and GoogleNet) for chilled beef freshness evaluation achieved an overall accuracy of 98.0 %. Furthermore, t -distributed stochastic neighbour embedding (t -SNE) visualized the feature extraction process and provided explanations to understand the classification process of deep learning. The results demonstrated that applying deep learning techniques in the process of pattern recognition of CSA can help in realizing the rapid, robust, and accurate assessment of chilled beef freshness. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Enhanced 3-D asynchronous correlation data preprocessing method for Raman spectroscopy of Chinese handmade paper.

Author: Yan, Chunsheng, Cheng, Zhongyi, Cao, Linquan, and Wen, Yingke
Subjects: *MACHINE learning, *RAMAN spectroscopy, *CONVOLUTIONAL neural networks, *HILBERT transform, *DATA structures
Abstract: [Display omitted] • 3D-ACM involves two rounds of Hilbert transform and tensor product operations. • It significantly enhances the equivalent frequency points and sample numbers. • The R-squared values for PLS-LR, KNN, RF and CNN models approach or equal 1. • These supervised models are comparable to unsupervised models such as PCA-LR. We have developed a novel 3D asynchronous correlation method (3D-ACM) designed for the classification and identification of Chinese handmade paper samples using Raman spectra and machine learning. The 3D-ACM approach involves two rounds of tensor product and Hilbert transform operations. In the tensor product process, the outer product of the spectral data from different samples within the same category is computed, establishing inner connections among all samples within that category. The Hilbert transform introduces a 90-degree phase shift, resulting in a true three-dimensional spectral data structure. This expansion significantly increases the number of equivalent frequency points and samples within each category. This enhancement substantially boosts spectral resolution and reveals more hidden information within the spectral data. To maximize the potential of 3D-ACM, we employed six machine learning models: principal component analysis (PCA) with linear regression (LR), support vector machine (SVM) with LR, k-Nearest Neighbors (KNN), random forest (RF), and convolutional neural network (CNN). When applied to the 3D-ACM data preprocessing method, R-squared values of PLS-LR, KNN, RF and CNN supervised models, approached or equaled 1. This indicates exceptional performance comparable to unsupervised models like PCA. 3D-ACM stands as a versatile mathematical technique not confined to spectral data. It also eliminates the necessity for additional experimental setups or external control conditions, distinct from traditional two-dimensional correlation spectroscopy. Moreover, it preserves the original experimental data, setting it apart from conventional data preprocessing methods. This positions 3D-ACM as a promising tool for future material classification and identification in conjunction with machine learning. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Identifying potential breakthrough research: A machine learning method using scientific papers and Twitter data.

Author: Li, Xin, Wen, Yang, Jiang, Jiaojiao, Daim, Tugrul, and Huang, Lucheng
Subjects: MACHINE learning, RESOURCE allocation, DATA mining, DISRUPTIVE innovations, GREEN technology
Abstract: Breakthrough research may signal shifts in science, technology, and innovation systems. Early identification of breakthrough research is important not only for scientists, but also for policy makers and R&D experts in developing R&D strategies and allocating R&D resources. Researchers mostly use scientific papers data to identify potential breakthrough research, but they rarely make use of Twitter data related to scientific research and machine learning methods. Analysis of Twitter data is of great significance for us to understand the public's perception of potential breakthrough research and to identify potential breakthrough research. Machine learning methods can assist us in predicting the trend of events by utilizing prior knowledge and experience. Therefore, this paper proposes a framework for identifying potential breakthrough research using machine learning methods with scientific papers and Twitter data. We select solar cells as a case study to verify the valid and flexible of this framework. In this case, we use machine learning method to discover potential breakthrough research from scientific papers, and we use Twitter data mining to analyze Twitter users' sense of and response to the discovered potential breakthrough research, which aims to achieve a more extensive and diverse assessment of the discovered potential breakthrough research. This paper contributes to identifying potential breakthrough research, as well as understanding the emergence and development of breakthrough research. It will be of interest to R&D experts in the field of solar cell technology. • We proposed a framework for identifying potential breakthrough research using machine learning method. • We found 8 potential breakthrough researches in the field of solar cell technology in 2015. • Twitter data mining could be used to assist in identifying potential breakthrough research. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

26. Machine learning-assisted photoluminescent sensor array based on gold nanoclusters for the discrimination of antibiotics with test paper.

Author: Xu, Jinming, Chen, Xihang, Zhou, Huangmei, Zhao, Yu, Cheng, Yuchi, Wu, Ying, Zhang, Jie, Chen, Jinquan, and Zhang, Sanjun
Subjects: *GOLD clusters, *PHOTOLUMINESCENT polymers, *SENSOR arrays, *FISHER discriminant analysis, *ANTIBIOTIC residues, *ANTIBIOTICS
Abstract: Antibiotic residues accumulation in the environment endangers ecosystems and human health. There is an urgent need for a facile and efficient strategy to detect antibiotics. Here, we report a photoluminescent sensor array based on protein-stabilized gold nanoclusters (AuNCs) for the detection of two families of antibiotics, tetracyclines and quinolones. The nanoclusters were synthesized with bovine serum albumin (BSA) and ovalbumin (OVA), respectively. They had different interactions with seven kinds of antibiotics and exhibited diverse photoluminescence (PL) responses, which were analyzed by linear discriminant analysis and ExtraTrees algorithms. The sensor array performed well in both classification and quantification of seven antibiotics. And the quantitative results of all antibiotics obtained R2 of no less than 0.99 at 0–100 μM when using suitable regression models. Additionally, the sensor array was able to distinguish antibiotic mixtures and multiple interfering substances, and it also kept 100% classification accuracy in river water samples. Moreover, test paper assisted by a smartphone was applied for quick detection of antibiotics, with good performance in both HEPES buffer and river water. These studies reveal great potential for the point-of-use analysis of antibiotics in environmental monitoring. [Display omitted] • The first nanoclusters-based photoluminescent sensor array for quinolones is developed. • Seven antibiotics of tetracyclines and quinolones are successfully identified. • ExtraTrees algorithm is first employed for antibiotics detection. • Test paper assisted by a smartphone is used to identify antibiotics in river water. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Paper-based multiplexed colorimetric biosensing of cardiac and lipid biomarkers integrated with machine learning for accurate acute myocardial infarction early diagnosis and prognosis.

Author: Low, Joyce Siew Yong, Thevarajah, T. Malathi, Chang, Siow Wee, and Khor, Sook Mei
Subjects: *MYOCARDIAL infarction, *EARLY diagnosis, *LIPIDS, *MACHINE learning, *HIGH density lipoproteins, *PROGNOSIS
Abstract: This study demonstrates how a colorimetric biosensor based on microfluidic paper can swiftly diagnose a disease and predict its prognosis to triage patients effectively. This was the first biosensor to quantify the gold standard cardiac troponin (cTnI) and lipid biomarkers, including high-density lipoprotein (HDL) and low-density lipoprotein (LDL) simultaneously. Prior research encountered obstacles or limitations, such as measuring a single biomarker or total cholesterol, which cannot distinguish between LDL and HDL. CatBoost, an advanced machine learning (ML) technique used for diagnosis that combines the predictive power of ML algorithms obtained an impressive area under the receiver operating curves (AUROC) of 0.97 ± 0.018 for all possible classification thresholds. CatBoost is a brand-new ensemble framework based on the interaction of multiple health parameters that can generate AUROC values of 0.897 ± 0.047 for the accurate prognosis of recurrent acute myocardial infarction (AMI), demonstrating remarkably accurate AMI diagnosis and prognosis. In addition, this paper-based analytical device (µPAD) biosensor employs an electrophoretic method to overcome the challenges posed by non-specific adsorption. This is accomplished by isolating non-specific biomolecules based on differences in their isoelectric points and removing non-specifically adsorbing colorimetric markers. The limits of detection (LoD) for cTnI in AMI were lower than their respective clinical cutoff values. This study also demonstrated that the proposed ML framework produced significantly better results than conventional statistical analysis. High correlation filtering and (t-SNE) dimensionality reduction were utilized for a limited number of data points. The respectable accuracy and AUROC of this method were also validated using cross-validation. [Display omitted] • First ML-integrated colorimetric biosensing for accurate AMI diagnosis and prognosis. • Cardiac and lipid biomarkers simultaneous detection improves diagnostic reliability. • The electrophoretic method omits the washing steps to allow rapid diagnosis. • The new ML framework significantly enhances outcomes compared to statistical analysis. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

28. An explainable artificial-intelligence-based approach to investigating factors that influence the citation of papers.

Author: Ha, Taehyun
Subjects: MACHINE learning, ARTIFICIAL intelligence, CITATION analysis, STRATEGIC planning, BIG data, BIBLIOMETRICS
Abstract: The number of citations is often used to estimate the impact of a study. Previous studies have investigated what factors of publications affect citations and how they affect citations. However, the findings of the studies were unable to reach a consensus because of the limited sample size, domain, and measurement. This study reviewed previous studies that addressed factors influencing citations and then identified 14 measurable factors. Approximately 33 million publications from the Scopus database were used to train and validate a CatBoost model. A SHAP framework was used to interpret the trained model by focusing on how salient factors affect the number of citations. The results showed that the year is a significant factor affecting the citation but not the priority factor. A publication source was presented as the most important factor contributing to the citation. Several implications and strategic approaches to maximizing the impact of a study were discussed. • This study examines 14 factors that can influence the citation of papers. • CatBoost model and SCOPUS dataset are used to examine the influences. • SHAP interprets the model and suggests how the factors contribute to the citation. • The results show that selecting the right journal/conference is the most important. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

29. Scientific papers citation analysis using textual features and SMOTE resampling techniques.

Author: Umer, Muhammad, Sadiq, Saima, Missen, Malik Muhammad Saad, Hameed, Zahid, Aslam, Zahid, Siddique, Muhammad Abubakar, and NAPPI, Michele
Subjects: *CITATION analysis, *CONTENT analysis, *MACHINE learning, *SENTIMENT analysis, *PATTERN recognition systems, *USER-generated content
Abstract: • Explore qualitative aspects of citations to measure the influence of a research article. • Apply a feature representation technique in combination with machine learning models to find the sentiment of citation. • Determine the sentiment of citation instances into positive, negative, or neutral. • Analyze the efficacy of SMOTE in balancing the citation sentiment dataset. Ascertaining the impact of research is significant for the research community and academia of all disciplines. The only prevalent measure associated with the quantification of research quality is the citation-count. Although a number of citations play a significant role in academic research, sometimes citations can be biased or made to discuss only the weaknesses and shortcomings of the research. By considering the sentiment of citations and recognizing patterns in text can aid in understanding the opinion of the peer research community and will also help in quantifying the quality of research articles. Efficient feature representation combined with machine learning classifiers has yielded significant improvement in text classification. However, the effectiveness of such combinations has not been analyzed for citation sentiment analysis. This study aims to investigate pattern recognition using machine learning models in combination with frequency-based and prediction-based feature representation techniques with and without using Synthetic Minority Oversampling Technique (SMOTE) on publicly available citation sentiment dataset. Sentiment of citation instances are classified into positive, negative or neutral. Results indicate that the Extra tree classifier in combination with Term Frequency-Inverse Document Frequency achieved 98.26% accuracy on the SMOTE-balanced dataset. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

30. Identification accuracy improvement of non-uniform paper samples.

Author: Sugawara, Shigeru, Nakayama, Yoshihiko, Taniguchi, Hideya, Kawashima, Natsumi, and Ishimaru, Ichiro
Subjects: *SPECTRAL imaging, *ABSORPTION spectra, *ACCURACY, *IDENTIFICATION, *MACHINE learning, *PAPER
Abstract: Graphical abstract Highlights • Hyper spectral imaging was used to get many spectra in a short time. • The identification of eight paper types was done by using machine learning. • Even papers with very similar spectra could be distinguished with high accuracy. • For 68,400 spectra per sample, the accuracy was 87.8%. Abstract Paper is non-uniform; thus, it is difficult to ascertain whether the slight differences in near-infrared absorption spectra are caused by the difference in paper type or the paper's non-uniformity. We used near-infrared hyperspectral imaging to measure the spectra of many different points quickly for eight different paper samples. Next, we analysed the data using machine learning and constructed a model to ascertain whether the differences in the spectra are due to the difference in paper type or the paper's non-uniformity. Even papers with very similar spectral shapes were distinguished with high accuracy with this method. As the number of spectra used to construct the classification model increased, the accuracy of the classification model increased. With 6840 spectra per sample, the accuracy exceeded 85%, and with 68,400 spectra per sample, the accuracy for identification was 87.8%. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

31. Tensor product based 2-D correlation data preprocessing methods for Raman spectroscopy of Chinese handmade paper.

Author: Yan, Chunsheng, Luo, Si, Cao, Linquan, Cheng, Zhongyi, and Zhang, Hui
Subjects: *TENSOR products, *RAMAN spectroscopy, *SUPPORT vector machines, *K-nearest neighbor classification, *PRINCIPAL components analysis, *MACHINE learning
Abstract: [Display omitted] • The 2-D correlation methods do not require external perturbation variables. • They are pure mathematical methods that utilize the tensor product of spectral data. • The R2 values of KNN and RF for TDACM are close to 1, indicating nearly 100% improvement. The paper introduces two new methods, namely the cross correlation method (CCM) and two-dimensional correlation method (TDCM), for preprocessing Raman spectroscopy data for analyzing Chinese handmade paper samples. CCM expands the spectral dimension from 1 × N to 1 × 2 N - 1 by taking cross-correlation between two spectral data of the same category. TDCM includes two-dimensional synchronous correlation method (TDSCM) and two-dimensional asynchronous correlation method (TDACM), which expand the spectral dimension from 1 × N to N × N by taking tensor products between two spectral data and between one spectral data and the Hilbert transformation of the other spectral data of the same category, respectively. The experimental data were preprocessed using baseline removal, CCM, TDSCM, and TDACM methods. Four machine learning models were employed to evaluate the effects of these methods: principal component analysis (PCA) combined with linear regression (LR), support vector machine (SVM) combined with LR, k-Nearest Neighbors (KNN), and random forest (RF). The results show that the R-squared values for the PCA model were nearly 1 for all types of data, indicating high accuracy. However, for SVM-LR, KNN, and RF models, the R-squared values were sorted in the order of raw data, baseline removal data, CCM, TDSCM, and TDACM preprocessed data. The R-squared values of KNN and RF machine learning models for TDACM preprocessed data were approaching 1, indicating that the accuracy of machine learning was significantly improved by nearly 100%. This has led to a remarkable improvement in the accuracy of supervised models such as KNN and RF, bringing them closer to the level of unsupervised models such as PCA. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

32. Detection of poor controller tuning with Gramian Angular Field (GAF) and StackAutoencoder (SAE).

Author: Memarian, Amirreza, Damarla, Seshu Kumar, Memarian, Alireza, and Huang, Biao
Subjects: *PAPER pulp, *PRODUCT quality, *OSCILLATIONS
Abstract: Efficient control loop performance is pivotal in process industries to ensure optimal production, maintain product quality, and adhere to regulatory standards. Poorly tuned controllers can disrupt these objectives, necessitating accurate detection methods. This paper introduces a novel approach for detecting poor controller tuning through advanced techniques: the Gramian Angular Field (GAF) and Stack Auto-Encoder (SAE). Unlike manual methods, this automated system promptly identifies poorly tuned controllers, offering real-time monitoring and timely alerts to operators. The proposed methodology is substantiated through two case studies: the ISDB dataset and the pulp and paper dataset. The outcomes illustrate that the proposed approach correctly determines the appropriate outcome for the majority of the analyzed control loops across diverse industries. • New method detects poorly tuned controllers via Gramian angular field and SAE. • PV and OP images help SAE distinguish poor tuning from other oscillation causes. • Transfer learning has been used to improve methodology's effectiveness. • Tested on benchmark control loops, yielding accurate verdicts for most cases. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. 3D plasmonic hexaplex paper sensor for label-free human saliva sensing and machine learning-assisted early-stage lung cancer screening.

Author: Linh, Vo Thi Nhat, Kim, Hongyoon, Lee, Min-Young, Mun, Jungho, Kim, Yeseul, Jeong, Byeong-Ho, Park, Sung-Gyu, Kim, Dong-Ho, Rho, Junsuk, and Jung, Ho Sang
Subjects: *PLASMONICS, *MACHINE learning, *EARLY detection of cancer, *MEDICAL screening, *LUNG cancer, *SERS spectroscopy
Abstract: A label-free detection method for noninvasive biofluids enables rapid on-site disease screening and early-stage cancer diagnosis by analyzing metabolic alterations. Herein, we develop three-dimensional plasmonic hexaplex nanostructures coated on a paper substrate (3D-PHP). This flexible and highly absorptive 3D-PHP sensor is integrated with commercial saliva collection tube to create an efficient on-site sensing platform for lung cancer screening via surface-enhanced Raman scattering (SERS) measurement of human saliva. The multispike hexaplex-shaped gold nanostructure enhances contact with saliva viscosity, enabling effective sampling and SERS enhancement. Through testing patient salivary samples, the 3D-PHP sensor demonstrates successful lung cancer detection and diagnosis. A logistic regression-based machine learning model successfully classifies benign and malignant patients, exhibiting high clinical sensitivity and specificity. Additionally, important Raman peak positions related to different lung cancer stages are investigated, suggesting insights for early-stage cancer diagnosis. Integrating 3D-PHP senor with the conventional saliva collection tube platform is expected to offer promising practicality for rapid on-site disease screening and diagnosis, and significant advancements in cancer detection and patient care. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. Choice modelling in the age of machine learning - Discussion paper.

Author: van Cranenburgh, Sander, Wang, Shenhao, Vij, Akshay, Pereira, Francisco, and Walker, Joan
Subjects: MACHINE learning, POLLINATION
Abstract: Since its inception, the choice modelling field has been dominated by theory-driven modelling approaches. Machine learning offers an alternative data-driven approach for modelling choice behaviour and is increasingly drawing interest in our field. Cross-pollination of machine learning models, techniques and practices could help overcome problems and limitations encountered in the current theory-driven modelling paradigm, such as subjective labour-intensive search processes for model selection, and the inability to work with text and image data. However, despite the potential benefits of using the advances of machine learning to improve choice modelling practices, the choice modelling field has been hesitant to embrace machine learning. This discussion paper aims to consolidate knowledge on the use of machine learning models, techniques and practices for choice modelling, and discuss their potential. Thereby, we hope not only to make the case that further integration of machine learning in choice modelling is beneficial, but also to further facilitate it. To this end, we clarify the similarities and differences between the two modelling paradigms; we review the use of machine learning for choice modelling; and we explore areas of opportunities for embracing machine learning models and techniques to improve our practices. To conclude this discussion paper, we put forward a set of research questions which must be addressed to better understand if and how machine learning can benefit choice modelling. • Clarifies the similarities and differences between theory and data-driven paradigms. • Reviews the use of machine learning for choice modelling. • Explores opportunities for embracing machine learning to benefit choice modelling. • Puts forward research agenda. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

35. Chaotic time series prediction for the game, Rock-Paper-Scissors.

Author: Salvetti, Franco, Patelli, Paolo, and Nicolo, Simone
Subjects: GAME theory, MACHINE learning, DIFFERENTIAL equations, ERGODIC theory
Abstract: Abstract: Two players of Rock-Paper-Scissors are modeled as adaptive agents which use a reinforcement learning algorithm and exhibit chaotic behavior in terms of trajectories of probability in mixed strategies space. This paper demonstrates that an external super-agent can exploit the behavior of the other players to predict favorable moments to play against one of the other players the symbol suggested by a sub-optimal strategy. This third agent does not affect the learning process of the other two players, whose only goal is to beat each other. The choice of the best moment to play is based on a threshold associated with the Local Lyapunov Exponent or the Entropy, each computed by using the time series of symbols played by one of the other players. A method for automatically adapting such a threshold is presented and evaluated. The results show that these techniques can be used effectively by a super-agent in a game involving adaptive agents that exhibit collective chaotic behavior. [Copyright &y& Elsevier]
Published: 2007
Full Text: View/download PDF

36. Capillary flow velocity profile analysis on paper-based microfluidic chips for screening oil types using machine learning.

Author: Chung, Soo, Loh, Andrew, Jennings, Christian M., Sosnowski, Katelyn, Ha, Sung Yong, Yim, Un Hyuk, and Yoon, Jeong-Yeol
Subjects: *CAPILLARY flow, *FLOW velocity, *MICROFLUIDIC analytical techniques, *MACHINE learning, *FISHER discriminant analysis, *HEAVY oil
Abstract: We conceived a novel approach to screen oil types on a wax-printed paper-based microfluidic platform. Various oil samples spontaneously flowed through a micrometer-scale channel via capillary action while their components were filtered and partitioned. The resulting capillary flow velocity profile fluctuated during the flow, which was used to screen oil types. Raspberry Pi camera captured the video clips, and a custom Python code analyzed them to obtain the capillary flow velocity profiles. 106 velocity profiles (each with 125 frames for 5 s) were recorded from various oil samples to build a training database. Principal component analysis (PCA), support vector machine (SVM), and linear discriminant analysis (LDA) were used to classify the oil types into heavy-to-medium crude, light crude, marine fuel, lubricant, and diesel oils. The second-order polynomial SVM model with PCA as a pre-processing step showed the highest accuracy: 90% in classifying crude oils and 81% in classifying non-crude oils. The assay took less than 30 s from the sample to answer, with 5 s of the capillary action-driven flow. This simple and effective assay will allow rapid preliminary screening of oil types, enable early tracking, and reduce the number of suspect samples to be analyzed by laboratory fingerprinting analysis. [Display omitted] • A novel approach to screen oil types on a paper microfluidic platform. • Raspberry Pi camera acquired capillary flow velocity profiles of diverse oil samples. • Various machine learning based classifications were tested, including PCA, SVM, and LDA. • 90% accuracy in classifying crude oil samples and 81% in non-crude oil samples. • < 30 s from the sample to answer without the need for laboratory equipment. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

37. Deep learning-assisted ultra-accurate smartphone testing of paper-based colorimetric ELISA assays.

Author: Duan, Sixuan, Cai, Tianyu, Zhu, Jia, Yang, Xi, Lim, Eng Gee, Huang, Kaizhu, Hoettges, Kai, Zhang, Quan, Fu, Hao, Guo, Qiang, Liu, Xinyu, Yang, Zuming, and Song, Pengfei
Subjects: *DEEP learning, *MACHINE learning, *SMARTPHONES, *ENZYME-linked immunosorbent assay, *MEDICAL screening, *MOBILE apps
Abstract: Smartphone has long been considered as one excellent platform for disease screening and diagnosis, especially when combined with microfluidic paper-based analytical devices (μPADs) that feature low cost, ease of use, and pump-free operations. In this paper, we report a deep learning-assisted smartphone platform for ultra-accurate testing of paper-based microfluidic colorimetric enzyme-linked immunosorbent assay (c-ELISA). Different from existing smartphone-based μPAD platforms, whose sensing reliability is suffered from uncontrolled ambient lighting conditions, our platform is able to eliminate those random lighting influences for enhanced sensing accuracy. We first constructed a dataset that contains c-ELISA results (n = 2048) of rabbit IgG as the model target on μPADs under eight controlled lighting conditions. Those images are then used to train four different mainstream deep learning algorithms. By training with these images, the deep learning algorithms can well eliminate the influences of lighting conditions. Among them, the GoogLeNet algorithm gives the highest accuracy (>97%) in quantitative rabbit IgG concentration classification/prediction, which also provides 4% higher area under curve (AUC) value than that of the traditional curve fitting results analysis method. In addition, we fully automate the whole sensing process and achieve the "image in, answer out" to maximize the convenience of the smartphone. A simple and user-friendly smartphone application has been developed that controls the whole process. This newly developed platform further enhances the sensing performance of μPADs for use by laypersons in low-resource areas and can be facilely adapted to the real disease protein biomarkers detection by c-ELISA on μPADs. [Display omitted] • This deep learning-assisted smartphone platform is unaffected by ambient lighting. • A fully automated "image in, answer out" operation fashion. • A 2048 custom image dataset is used to test 4 mainstream deep learning algorithms. • GoogLeNet provides >97% accuracy in quantitative rabbit IgG testing. • The area under the curve (AUC) is 4% higher than that of conventional methods. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

38. Rapid analysis the type of customs paper using Micro-NIR spectrometers and machine learning algorithms.

Author: Xia, Jingjing, Min, Shungeng, and Li, Jinyao
Subjects: *MACHINE learning, *SPECTROMETERS, *K-nearest neighbor classification, *DISCRIMINANT analysis, *CLASSIFICATION algorithms, *ATTENUATED total reflectance
Abstract: [Display omitted] • Micro-NIR spectrometer was proposed to identify the paper types. • Four machine learning algorithms were discussed in both Micro-NIR and ATR-FTIR. • The performance of Micro-NIR model was better than ATR-FTIR. Quick identification of paper types for customs is extremely crucial. Although there are a variety of researches focus on the discrimination of paper, these techniques either require complex preprocessing or large-scale instruments, which are not suitable for customs environments. In this study, we predicted the type of customs paper by using a Micro-NIR spectrometer, and compared the results with Attenuated Total Reflection-Fourier Transform Infrared Spectroscopy (ATR-FTIR). Four different classification algorithms, including linear and non-linear classifiers: K-nearest neighbor (KNN), soft independent modeling of class analogy (SIMCA), partial least squares discriminant analysis (PLS-DA), and least squares-support vector machine (LS-SVM) were employed to classify the type of paper. 20 groups of datasets were selected by Monte Carlo sampling. For Micro-NIR data, the performances of KNN and LS-SVM were outstanding than SIMCA and PLS-DA, with the average accuracies 96.06% and 98.91%, respectively. The outcome of SIMCA and PLS-DA were similar, with the average accuracies 93.00% and 93.97%. Based on the standard derivation, the best stability of models was LS-SVM (1.06%), followed by PLS-DA (1.12%), KNN (1.22%) and SIMCA (3.07%). Compared with ATR-FTIR, the effects of Micro-NIR were better, which were embodies in the better KNN and SIMCA models, and the comparable LS-SVM model. The result demonstrated that the Micro-NIR combined with machine learning algorithms was an effective method to classify the type of customs paper efficiently and quickly, even better than ATR-FTIR. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

39. Advances in artificial neural networks, machine learning and computational intelligence: Selected papers from the 26th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2018).

Author: Oneto, Luca, Bunte, Kerstin, and Schleif, Frank-Michael
Subjects: *ARTIFICIAL neural networks, *ARTIFICIAL intelligence, *COMPUTATIONAL intelligence, *MACHINE learning, *STATISTICAL learning, *TECHNOLOGY
Published: 2019
Full Text: View/download PDF

40. Functional analysis of generalized linear models under non-linear constraints with applications to identifying highly-cited papers.

Author: Chowdhury, K.P.
Subjects: SIMULATION methods & models, ARTIFICIAL neural networks, MANAGEMENT information systems, FUNCTIONAL analysis, LINEAR statistical models, ARTIFICIAL intelligence, CITATION indexes
Abstract: • Robust functional form contains true parameters far more often than popular models. • Matches/outperforms widely used regression and Neural Network models. • Finds appropriate balance between Model Fit, Inference and Prediction (MIPs). • Introduces new large-sample DGP test; can use to improve A.I. models. • For MIS field finds Popularity Parameter to be important for predicting citations. This article introduces a versatile functional form for Generalized Linear Models (GLMs) through a simple, yet effective, transformation of the current framework. The models are applied through a new hierarchical bayesian estimation procedure for logistic regression to highly-cited papers in the Management Information Systems (MIS) field. The results are uniformly better, in regards to model fit and inference for in-sample and out-of-sample data, for simulation studies and real-world data applications, requiring very little time to convergence to true population parameters. In simulation studies, I show that the method contains the true parameters nearly three times as often as widely used existing GLMs, and does so while having confidence intervals that are 54.50% smaller, while requiring around two-thirds the number of MCMC iterations as existing bayesian methods. In Scientometric applications the methodology is shown to be highly robust with predictive/classification accuracy, either equaling or exceeding existing methods for identifying highly-cited articles including Artificial Neural Networks (ANN). Thus, the method is shown to be robust to the amount of asymmetry (or symmetry) of the probability of success (or failure) and robust to unbalanced samples and varying Data Generating Processes. Further, the methodology is equivalent to current methods if the data support them and is therefore complementary to existing methods, without loss of interpretability of model parameters. For the MIS field it finds that Popularity Parameter (PP) of an article Keywords can predict whether a paper will be highly-cited (top 25% of highly-cited articles) between two to three years after publication and beyond. Furthermore, given the small number of iterations needed for convergence, the methodology can also be used as a baseline method in Big Data (BD) settings for both Artificial Intelligence (AI) and Machine Learning (ML) contexts as well. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

41. Optimizing feature selection in intrusion detection systems: Pareto dominance set approaches with mutual information and linear correlation.

Author: Barbosa, Guilherme Nunes Nasseh, Andreoni, Martin, and Mattos, Diogo Menezes Ferrazani
Subjects: FEATURE selection, INTRUSION detection systems (Computer security), MACHINE learning, SOCIAL dominance, PEARSON correlation (Statistics), FILTER paper
Abstract: In the realm of network intrusion detection using machine learning, feature selection aims for computational efficiency, enhanced performance, and model interpretability, preventing overfitting and optimizing data visualization. This paper proposes a filtering method for feature selection, which optimizes information quantity and linear correlation between resultant features. The method identifies Pareto dominant pairs of informative and correlated features, constructs a graph, and selects key features based on betweenness centrality in its connected components. The proposal yields a more concise and informative dataset representation. Experimental results, using three diverse datasets, demonstrate that the proposal achieves more than 95% accuracy in classifying network attacks with just 14% of the total number features in original datasets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. Nondestructive and multiplex differentiation of pathogenic microorganisms from spoilage microflora on seafood using paper chromogenic array and neural network.

Author: Yang, Manyun, Luo, Yaguang, Sharma, Arnav, Jia, Zhen, Wang, Shilong, Wang, Dayang, Lin, Sophia, Perreault, Whitney, Purohit, Sonia, Gu, Tingting, Dillow, Hyden, Liu, Xiaobo, Yu, Hengyong, and Zhang, Boce
Subjects: *SEAFOOD, *PATHOGENIC microorganisms, *SHEWANELLA putrefaciens, *FOOD pathogens, *FOOD safety, *VOLATILE organic compounds, *PATHOGENIC bacteria
Abstract: [Display omitted] • Paper chromogenic array (PCA) integrated with machine learning (ML) was developed. • PCA exhibit distinguishable pattern shifts when reacting volatile metabolites. • PCA pattern recognition achieved using a cross-validated neural network. • PCA accurately identify multiplexed pathogens from indigenous microflora. • The nondestructive PCA-ML holds great potential as a smart food safety system. Non-destructive detection of human foodborne pathogens is critical to ensuring food safety and public health. Here, we report a new method using a paper chromogenic array coupled with a machine learning neural network (PCA-NN) to detect viable pathogens in the presence of background microflora and spoilage microbe in seafood via volatile organic compounds sensing. Morganella morganii and Shewanella putrefaciens were used as the model pathogen and spoilage bacteria. The study evaluated microbial detection in monoculture and cocktail multiplex detection. The accuracy of PCA-NN detection was first assessed on standard media and later validated on cod and salmon as real seafood models with pathogenic and spoilage bacteria, as well as background microflora. In this study PCA-NN method successfully identified pathogenic microorganisms from microflora with or without the prevalent spoilage microbe, Shewanella putrefaciens in seafood, with accuracies ranging from 90% to 99%. This approach has the potential to advance smart packaging by achieving nondestructive pathogen surveillance on food without enrichment, incubation, or other sample preparation. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

43. Imitation learning for aerobatic maneuvering in fixed-wing aircraft.

Author: Freitas, Henrique, Camacho, Rui, and Castro Silva, Daniel
Subjects: MACHINE learning, MODEL airplanes, TRANSFER of training, CONFERENCE papers, AUTOMATIC pilot (Airplanes)
Abstract: This study focuses on the task of developing automated models for complex aerobatic aircraft maneuvers. The approach employed here utilizes Behavioral Cloning, a technique in which human pilots supply a series of sample maneuvers. These maneuvers serve as training data for a Machine Learning algorithm, enabling the system to generate control models for each maneuver. The optimal instances for each maneuver were chosen based on a set of objective evaluation criteria. By utilizing these selected sets of examples, resilient models were developed, capable of reproducing the maneuvers performed by the human pilots who supplied the examples. In certain instances, these models even exhibited superior performance compared to the pilots themselves, a phenomenon referred to as the "clean-up effect". We also explore the application of transfer learning to adapt the developed controllers to various airplane models, revealing compelling evidence that transfer learning is effective for refining them for targeted aircraft. A comprehensive set of intricate maneuvers was executed through a meta-controller capable of orchestrating the fundamental maneuvers acquired through imitation. This undertaking yielded promising outcomes, demonstrating the proficiency of several Machine Learning models in successfully executing highly intricate aircraft maneuvers. This paper is an extended version of the previously ICCS 2023 published conference paper [1]. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Explainable proactive control of industrial processes.

Author: Kuk, Edyta, Bobek, Szymon, and Nalepa, Grzegorz J.
Subjects: PROCESS control systems, ARTIFICIAL intelligence, MANUFACTURING processes, CONFERENCE papers, INDUSTRY 4.0
Abstract: One of the goals of Industry 4.0 is the adoption of data-driven models to enhance various aspects of the manufacturing process, such as monitoring equipment conditions, ensuring product quality, detecting failures, and preparing optimal maintenance plans. However, many machine-learning algorithms require a large amount of training data to reach desired performance. In numerous industrial applications, such data is either not available or its acquisition is a costly process. In such cases, simulation frameworks are employed to replicate the behavior of real-world facilities and generate data for further analysis. Simulation frameworks typically provide high-quality data but are often slow which can be problematic when real-time decision-making is required. Control approaches based on simulation-based data commonly face challenges related to inflexibility, particularly in dynamic production environments undergoing frequent reconfiguration and upgrades. This paper introduces a method that seeks to strike a balance between the reliance on simulated data and the limited robustness of simulation-based control methods. This balance is achieved by supplementing available data with additional expert knowledge, enabling the matching of similar data sources and their combination for reuse. Furthermore, we augment the methods with an explainability layer, facilitating collaboration between the human expert and the AI system, leading to informed and actionable decisions. The performance of the proposed solution is demonstrated through a case study on gas production from an underground reservoir resulting in reduced downtime, heightened process reliability, and enhanced overall performance. This paper builds upon our conference paper (Kuk et al., 2023), addressing the same problem with an extended, more generic methodology, and presenting entirely new results. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Identification of organic chemical indicators for tracking pollution sources in groundwater by machine learning from GC-HRMS-based suspect and non-target screening data.

Author: Ekpe, Okon Dominic, Choo, Gyojin, Kang, Jin-Kyu, Yun, Seong-Taek, and Oh, Jeong-Eun
Subjects: *INDICATORS & test-papers, *GROUNDWATER pollution, *MICROPOLLUTANTS, *MACHINE learning, *ORGANIC compounds, *DATA integrity, *FECAL contamination, *FEATURE selection
Abstract: • 252 chemicals were identified by SNTS in groundwater from four regions with diverse contamination histories. • A novel and robust systematic machine learning-based workflow for predicting chemical indicators was proposed. • The proposed workflow showed good predictive ability (Q2) of 0.897. • 51 chemical indicators for tracking groundwater contamination sources were suggested. In this study, the strong analytical power of gas chromatography coupled to a high resolution mass spectrometry (GC-HRMS) in suspect and non-target screening (SNTS) of organic micropollutants was combined with machine learning tools for proposing a novel and robust systematic environmental forensics workflow, focusing on groundwater contamination. Groundwater samples were collected from four different regions with diverse contamination histories (namely oil [OC], agricultural [AGR], industrial [IND], and landfill [LF]), and a total of 252 organic micropollutants were identified, including pharmaceuticals, personal care products, pesticides, polycyclic aromatic hydrocarbons, plasticizers, phenols, organophosphate flame retardants, transformation products, and others, with detection frequencies ranging from 3 % to 100 %. Amongst the SNTS identified compounds, a total of 51 chemical indicators (i.e., OC: 13, LF: 12, AGR: 19, IND: 7) which included level 1 and 2 SNTS identified chemicals were pinpointed across all sampling regions by integrating a bootstrapped feature selection method involving the bootfs algorithm and a partial least squares discriminant analysis (PLS-DA) model to determine potential prevalent contamination sources. The proposed workflow showed good predictive ability (Q2) of 0.897, and the suggested contamination sources were gasoline, diesel, and/or other light petroleum products for the OC region, anthropogenic activities for the LF region, agricultural and human activities for the AGR region, and industrial/human activities for the IND region. These results suggest that the proposed workflow can select a subset of the most diagnostic features in the chemical space that can best distinguish a specific contamination source class. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Combination of cellulose tissue paper and bleach-treated graphene in stiffness reinforcement of polyvinyl alcohol film.

Author: Abdullah, Abu Hannifa, Ismail, Zulhelmi, Idris, Wan Farhana W., Khusairi, Zulsyazwan Ahmad, and Zuhan, Mohd Khairul Nizam Mohd
Subjects: *GRAPHENE, *POLYVINYL alcohol, *CELLULOSE, *POLYMER films, *MACHINE learning, *ELASTIC modulus
Abstract: A pre-treatment of graphene with bleach is considered one of the possible purification methods after liquid-phase exfoliation. However, the effect of this treatment on the mechanical reinforcement strategy for polymer film is yet to be investigated to date. In this full work, the influence of the C/O ratio, I D /I G, and volume of graphene after combination with cellulose tissue on the resulting stiffness of polyvinyl alcohol (PVA) composite film has been extensively studied. It is noticed that the incorporation of 30 ml graphene that had been pre-treated for 3 h into PVA had produced the best increment in elastic modulus (1.6 GPa against 0.4 GPa) while a shorter pre-treatment duration of graphene (1 h) would require more graphene volume (40 ml) to match the previous stiffness improvement level. By using the collected experimental data (90 samples), we further modeled the effect of tissue and PVA mass, C/O ratio, I D /I G , and graphene volume on modulus using machine learning (ML) algorithms. [Display omitted] • Combination of cellulose tissue and graphene as filler hybrid to combat poor dispersibility of bleach-treated graphene • Mechanical reinforcement effect was observed for graphene treated at 3 h due to the well-balanced C/O and I D /I G. • Addition of more tissue/graphene mass is required for graphene with a lower C/O to enhance the stiffness. • Machine learning study shows k-nearest neighbours with k = 1 is the best prediction model for composite stiffness. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

47. PSRMTE: Paper submission recommendation using mixtures of transformer.

Author: Nguyen, Dac Huu, Huynh, Son Thanh, Dinh, Cuong Viet, Huynh, Phong Tan, and Nguyen, Binh Thanh
Subjects: *COMPUTATIONAL mathematics, *RECOMMENDER systems, *MACHINE learning, *COMPUTER science, *ELECTRONIC journals, *APPLIED mathematics
Abstract: Nowadays, there has been a rapidly increasing number of scientific submissions in multiple research domains. A large number of journals have various acceptance rates, impact factors, and rankings in different publishers. It becomes time-consuming for many researchers to select the most suitable journal to submit their work with the highest acceptance rate. A paper submission recommendation system is more critical for the research community and publishers as it gives scientists another support to complete their submission conveniently. This paper investigates the submission recommendation system for two main research topics: computer science and applied mathematics. Unlike the previous works (Wang et al., 2018; Son et al., 2020) that extract TF–IDF and statistical features as well as utilize numerous machine learning algorithms (logistics regression and multiple perceptrons) for building the recommendation engine, we present an efficient paper submission recommendation algorithm by using different bidirectional transformer encoders and the Mixture of Transformer Encoders technique. We compare the performance between our methodology and other approaches by one dataset from Wang et al. (2018) with 14012 papers in computer science and another dataset collected by us with 223,782 articles in 178 Springer applied mathematics journals in terms of top K accuracy (K = 1 , 3 , 5 , 10). The experimental results show that our proposed method extensively outperforms other state-of-the-art techniques with a significant margin in all top K accuracy for both two datasets. We publish all datasets collected and our implementation codes for further references. 1 1 https://github.com/BinhMisfit/PSRMTE. • Bidirectional transformer encoders can improve the performance of the paper submission recommendation system. • The Mixture of Transformer Encoders framework shows the efficiency in the paper submission recommendation problem. • Proposed techniques can surpass other recent techniques on two datasets related. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

48. Nondestructive multiplex detection of foodborne pathogens with background microflora and symbiosis using a paper chromogenic array and advanced neural network.

Author: Jia, Zhen, Luo, Yaguang, Wang, Dayang, Dinh, Quynh N., Lin, Sophia, Sharma, Arnav, Block, Ethan M., Yang, Manyun, Gu, Tingting, Pearlstein, Arne J., Yu, Hengyong, and Zhang, Boce
Subjects: *FOOD pathogens, *LISTERIA monocytogenes, *ESCHERICHIA coli O157:H7, *FEEDFORWARD neural networks, *SALMONELLA enteritidis, *VOLATILE organic compounds, *SYMBIOSIS
Abstract: We have developed an inexpensive, standardized paper chromogenic array (PCA) integrated with a machine learning approach to accurately identify single pathogens (Listeria monocytogenes , Salmonella Enteritidis, or Escherichia coli O157:H7) or multiple pathogens (either in multiple monocultures, or in a single cocktail culture), in the presence of background microflora on food. Cantaloupe, a commodity with significant volatile organic compound (VOC) emission and large diverse populations of background microflora, was used as the model food. The PCA was fabricated from a paper microarray via photolithography and paper microfluidics, into which 22 chromogenic dye spots were infused and to which three red/green/blue color-standard dots were taped. When exposed to VOCs emitted by pathogens of interest, dye spots exhibited distinguishable color changes and pattern shifts, which were automatically segmented and digitized into a ΔR/ΔG/ΔB database. We developed an advanced deep feedforward neural network with a learning rate scheduler, L 2 regularization, and shortcut connections. After training on the ΔR/ΔG/ΔB database, the network demonstrated excellent performance in identifying pathogens in single monocultures, multiple monocultures, and in cocktail culture, and in distinguishing them from the background signal on cantaloupe, providing accuracy of up to 93% and 91% under ambient and refrigerated conditions, respectively. With its combination of speed, reliability, portability, and low cost, this nondestructive approach holds great potential to significantly advance culture-free pathogen detection and identification on food, and is readily extendable to other food commodities with complex microflora. • A paper chromogenic array (PCA) - machine learning approach was developed to accurately identify multiple pathogens in background microflora. • PCAs, fabricated via photolithography, react with volatile organic compounds to exhibit distinguishable color pattern shifts. • An advanced neural network demonstrated excellent performance with a learning rate schedule, L2 regularization, and shortcut connections. • This nondestructive approach holds great potential to significantly advance culture-free pathogen detection and identification on food. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

49. Machine learning based urinary pH sensing using polyaniline deposited paper device and integration of smart web app interface: Theory to application.

Author: Biswas, Souvik, Pal, Arijit, Chakraborty, Pratip, Chaudhury, Koel, and Das, Soumen
Subjects: *WEB-based user interfaces, *SMART devices, *POLYANILINES, *MACHINE learning, *MACHINE theory, *ELECTRON transport, *LOCAL area networks, *STANDARD deviations
Abstract: The present study employs density functional theory-based first principle calculation to investigate the electron transport properties of polyaniline following exposure to acidic and alkaline pH. In-situ deposited polyaniline-based paper device maintains emeraldine salt form while it is exposed to acidic pH and converts to emeraldine base when it is subjected to alkaline pH solutions. These structural changes at acidic and alkaline pH are validated experimentally by Raman spectra. Furthermore, the Raman spectra computed from density functional theory are validated with the experimental spectra. The changes in the theoretical energy band gap of polyaniline obtained from first principle calculations were correlated with the changes in the experimental impedimetric response of the sensor after exposure to acidic and alkaline solutions. Finally, the impedimetric responses were used to predict urine pH through a machine learning based smart and interactive web application. Different machine learning based regression models were implemented to acquire the best possible outcome. Gradient Boosting Regressor with least square loss model was selected as it showed lowest mean square, mean absolute, and root mean square error than other models. The smart sensing platform successfully predicts the unknown pH of urine samples with an average accuracy of more than 98%. The locally deployed smart web app can be accessed within a local area network by the end-user, which holds promise towards effective detection of urinary pH. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

50. Letter to the editor of radiotherapy and oncology regarding the paper titled "Impact on xerostomia for nasopharyngeal carcinoma patients treated with superficial parotid lobe-sparing intensity-modulated radiation therapy (SPLS-IMRT): A prospective phase II randomized controlled study." by Huang et al

Author: Sarode, Gargi S., Sarode, Sachin C., and Anand, Rahul
Subjects: *NASOPHARYNX cancer, *RADIOTHERAPY, *XEROSTOMIA, *ONCOLOGY, *DEEP learning
Published: 2022
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

17,301 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources