1,389 results
Search Results
2. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper.
- Author
-
Vinodkumar, Prasoon Kumar, Karabulut, Dogus, Avots, Egils, Ozcinar, Cagri, and Anbarjafari, Gholamreza
- Subjects
- *
DEEP learning , *COMPUTER vision , *GRAPH neural networks , *ARTIFICIAL intelligence , *MACHINE learning , *GENERATIVE adversarial networks - Abstract
The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Special issue on intelligent systems: ISMIS 2022 selected papers.
- Author
-
Ceci, Michelangelo, Flesca, Sergio, Manco, Giuseppe, and Masciari, Elio
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,DECISION support systems ,KNOWLEDGE representation (Information theory) ,COMPUTER vision ,DEEP learning - Abstract
This document is a special issue of the Journal of Intelligent Information Systems, focusing on the selected papers from the International Symposium on Methodologies for Intelligent Systems (ISMIS 2022). The symposium, held in Cosenza, Italy, showcased research on various topics related to artificial intelligence, including decision support, knowledge representation, machine learning, computer vision, and more. The special issue includes eleven papers that have undergone rigorous peer-reviewing and cover a wide range of research topics, such as deep learning, anomaly detection, malware detection, sentiment classification, and healthcare professionals' burnout. The authors express their gratitude to the contributors and reviewers for their valuable contributions. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
4. An interactive assessment framework for residential space layouts using pix2pix predictive model at the early-stage building design
- Author
-
Mostafavi, Fatemeh, Tahsildoost, Mohammad, Zomorodian, Zahra Sadat, and Shahrestani, Seyed Shayan
- Published
- 2024
- Full Text
- View/download PDF
5. Measurement in Machine Vision Editorial Paper.
- Author
-
Sergiyenko, Oleg, Flores-Fuentes, Wendy, Rodríguez-Quiñonez, Julio C., Mercorelli, Paolo, Kawabe, Tohru, and Bhateja, Vikrant
- Subjects
- *
COMPUTER vision , *CYBER physical systems , *INTERPOLATION algorithms , *ARTIFICIAL intelligence , *OPTICAL computing , *SENSORY memory , *DISPLACEMENT (Mechanics) - Abstract
Measurement related to different machine vision functions is the base for developing of cyber-physical systems able to see and make decisions. These kinds of systems are emerging in all areas of our daily lives. They can be found in the medical area, in industry, in the agriculture, in all those interconnected cloud computing-based systems related to flying/terrestrial robotics, navigation, automated surgery, smart cities, smart health monitoring, etc. All of them are extremely dependent on the same: adequate coordinates measurement, properly selected data processing and data fusion algorithms, evaluation procedures for performance analysis of measurement within Machine Vision systems, processes and algorithms (both traditional and artificial intelligence), mathematical models for 3D-measurement purposes (measurement of displacements, surface profiles, deformations, data augmentation/interpolation, etc.), and distributed visual measurement systems, as well as distributed memory and sensory part. Cyber-physical systems can be implemented on almost any application, especially on those dotted by robots and automated guided devices (from aerospace applications to domestic cleaners). The success of the measurement process depends on the kind of sensors and their optoelectronics characteristics and intrinsic parameters, as well as their respective operating and processing. The correct approach selection for the application, the data acquisition and collection efficiency, the data processing algorithms, the hardware processors response time, and the intelligent auto adaptability to changing environments or conditions. Recently, the emergence of artificial intelligence algorithms and the internet of things have powerful development of such systems, highlighting the importance and the impact of the measurement accuracy related to machine vision performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Synthetic images generation for semantic understanding in facility management
- Author
-
Rampini, Luca and Re Cecconi, Fulvio
- Published
- 2024
- Full Text
- View/download PDF
7. AI Machine Vision based Oven White Paper Color Classification and Label Position Real-time Monitoring System to Check Direction.
- Author
-
Hee-Chul Kim, Youn-Saup Yoon, and Yong-Mo Kim
- Subjects
COMPUTER vision ,DEEP learning ,JOB classification ,MANUFACTURING process automation ,ARTIFICIAL intelligence ,COLOR image processing - Abstract
We develop a vision system for batch inspection by oven white paper model color by manufacturing a machine vision system for the oven manufacturing automation process. In the vision system, white paper object detection (spring), color clustering, and histogram extraction are performed. In addition, for the automated process of home appliances, we intend to develop an automatic mold combination detection algorithm that inspects the label position and direction (angle/coordinate) using deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. INTRODUCTION TO THE SPECIAL ISSUE ON NEXT GENERATION PERVASIVE RECONFIGURABLE COMPUTING FOR HIGH PERFORMANCE REAL TIME APPLICATIONS.
- Author
-
VENKATESAN, C., YU-DONG ZHANG, CHOW CHEE ONN, and AND YONG SHI
- Subjects
MACHINE learning ,REINFORCEMENT learning ,HIGH performance computing ,COMPUTER vision ,ARTIFICIAL intelligence ,PARSING (Computer grammar) ,DEEP learning - Abstract
This document introduces a special issue of the journal "Scalable Computing: Practice & Experience" focused on next-generation pervasive reconfigurable computing for high-performance real-time applications. The authors discuss the importance of adaptable platforms for real-time tasks and highlight the benefits of reconfigurable computing in accelerating applications like image processing and machine learning. The special issue aims to explore recent advancements in this field and includes research papers on topics such as network security, malware detection, software reliability prediction, and optimization algorithms for wing design. The papers cover a range of computer science and technology topics, showcasing advancements and their potential impact on various computing domains. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
9. The emergence and evolution of urban AI.
- Author
-
Batty, Michael
- Subjects
ARTIFICIAL intelligence ,DEEP learning ,COMPUTER vision - Abstract
The fourth paper which is about "Emotional AI and Crime" takes the argument into the key area of how good or bad are AI techniques that are designed for facial and related recognition. To an extent, the focus on AI is wider than what might find in any discussion of AI in the narrower technical field for context is all important to see urban AI in context. Artificial intelligence (AI) emerged alongside the development of the digital computer more than 80 years ago during the second world war. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
10. OBJECT DETECTION FOR THE VISUALLY IMPAIRED.
- Author
-
CHIMWANGA, BRIGHTSON
- Subjects
OBJECT recognition (Computer vision) ,MACHINE learning ,ARTIFICIAL intelligence ,ASSISTIVE computer technology ,COMPUTER vision - Abstract
This paper presents the design and development of a mobile application, built using Flutter, that leverages object detection to enhance the lives of visually impaired individuals. The application addresses a crucial challenge faced by this community: the lack of real-time information about their surroundings. A solution is proposed that utilizes pre-trained machine learning models, potentially through TensorFlow Lite for on-device processing, to identify objects within the user's field of view as captured by the smartphone camera. The application goes beyond simple object recognition; detected objects are translated into natural language descriptions through text-to-speech functionality, providing crucial auditory cues about the environment. This real-time information stream empowers users to navigate their surroundings with greater confidence and independence. Accessibility is a core principle of this paper. The user interface will be designed for compatibility with screen readers, ensuring seamless interaction for users who rely on assistive technologies. Haptic feedback mechanisms will be incorporated to provide non-visual cues and enhance the user experience. The ultimate goal of this paper is to create a user-friendly and informative application that empowers visually impaired individuals to gain greater independence in their daily lives. The application has the potential to improve spatial awareness, foster a sense of security, and promote overall inclusion within society. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. A study on the application of convolutional neural networks for the maintenance of railway tracks.
- Author
-
Pappaterra, Mauro José, Pappaterra, María Lucía, and Flammini, Francesco
- Subjects
CONVOLUTIONAL neural networks ,COMPUTER vision ,LITERATURE reviews ,JOINT use of railroad facilities ,INTELLIGENT sensors ,BALLAST (Railroads) - Abstract
This paper provides an overview of the applications of Convolutional Neural Networks (CNN) in the railway maintenance industry. Our research covers specifically the subdomain of railway track maintenance. In this study, we have analyzed the state-of-the-art of CNNs applied to railway track maintenance by conducting an extensive literature review, summarizing different tasks and problems related to the topic and presenting solutions based on CNNs with a special emphasis on the data used to create these models. The results of our research show different applications of CNNs within the scope, including the detection of defects in the surface of railway rails and railway track components, such as fasteners, joints, sleepers, switches and crossings, as well as the recognition of track components, and the continuous monitoring of railway tracks. The architecture of CNNs is fitting to learning spatial hierarchies of features directly from the input data, making them of great use for Computer Vision and other applications related to the topic at hand. The implementation of IoT devices and smart sensors aid the collection of real-time data which can be used to feed powerful CNN models to recognize patterns and identify complex events related to the maintenance of railway tracks. This and more insights are discussed in detail within the contents of this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. IJAIT Topics and Papers in the Third Decade.
- Author
-
Liu, Alan
- Subjects
- *
DECISION support systems , *COMPUTER vision , *ARTIFICIAL intelligence , *TRAVELING salesman problem - Published
- 2022
- Full Text
- View/download PDF
13. ELEVATING HEALTHCARE: THE SYNERGY OF AI AND BIOSENSORS IN DISEASE MANAGEMENT.
- Author
-
ESWARAN, USHAA, ESWARAN, VIVEK, MURALI, KEERTHNA, and ESWARAN, VISHAL
- Subjects
ARTIFICIAL intelligence ,BIOSENSORS ,DISEASE management ,MEDICAL care ,MACHINE learning ,DRUG delivery systems ,ARTIFICIAL neural networks ,COMPUTER vision - Abstract
Biosensors integrated with artificial intelligence (AI) hold immense potential for transforming healthcare through rapid, automated diagnostics and precision therapeutics. This paper reviews the convergence of biosensing and AI towards developing smart biomedical systems. The fundamentals, historical evolution, and classification of biosensors are presented, highlighting key applications across infections, chronic illnesses, and environmental monitoring. Core AI concepts, including machine learning, neural networks, computer vision, and natural language processing, are discussed, along with their implementation to augment biosensor functionality, connectivity, point-of-care adoption, and laboratory automation. Promising research directions and real-world case studies applying AI-integrated biosensors for early diagnosis and drug delivery are discussed. The opportunities and challenges in advancing this synergistic technology are contemplated, underscoring the need for cross-disciplinary collaboration, clinical validation, ethical vigilance and supportive policy environments to successfully translate AI-biosensors into practical healthcare solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Multi-class waste segregation using computer vision and robotic arm.
- Author
-
Lahoti, Jayanti, Sn, Jathin, Krishna, M. Vamshi, Prasad, Mallika, BS, Rajeshwari, Mysore, Namratha, and Nayak, Jyothi S.
- Subjects
COMPUTER vision ,GLASS recycling ,WASTE products ,ROBOTICS ,WASTE management ,ARTIFICIAL intelligence - Abstract
Waste segregation is an essential aspect of a smoothly functioning waste management system. Usually, various recyclable waste types are disposed of together at the source, and this brings in the necessity to segregate them into their categories. Dry waste needs to be separated into its own categories to ensure that the proper procedures are implemented to treat and process it, which leads to an overall increased recycling rate and reduced landfill impact. Paper, plastics, metals, and glass are just a few examples of the many dry waste materials that can be recycled or recovered to create new goods or energy. Over the past years, much research has been conducted to devise effective and productive ways to achieve proper segregation for the waste that is being produced at an ever-increasing rate. This article introduces a multi-class garbage segregation system employing the YOLOv5 object detection model. Our final prototype demonstrates the capability of classifying dry waste categories and segregating them into their respective bins using a 3D-printed robotic arm. Within our controlled test environment, the system correctly segregated waste classes, mainly paper, plastic, metal, and glass, eight out of 10 times successfully. By integrating the principles of artificial intelligence and robotics, our approach simplifies and optimizes the traditional waste segregation process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Recent Advances in Computer Vision: Technologies and Applications.
- Author
-
Gao, Mingliang, Zou, Guofeng, Li, Yun, and Guo, Xiangyu
- Subjects
IMAGE recognition (Computer vision) ,COMPUTER vision ,NATURAL language processing ,ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning ,IMAGE segmentation - Abstract
This document is a special issue of the journal "Electronics" that focuses on recent advances in computer vision. The introduction explains how computer vision has transformed various industries and daily life by enabling machines to interpret and understand visual information. It also highlights the challenges that still exist in the field, such as model robustness and interpretability. The future of computer vision is discussed, including the development of multimodal models and advancements in areas like self-supervised learning. The special issue includes 10 papers that cover a range of topics, including stereo matching, low-light image enhancement, automated test grading, image segmentation, virtual clothing design, large-scale learning, camera pose estimation, few-shot segmentation, and image-to-audio conversion. The papers present novel studies, approaches, and reviews that contribute to the advancement of computer vision. The document concludes by emphasizing the importance of computer vision in addressing various challenges and the need for ongoing research and interdisciplinary collaboration to tackle complex real-world problems. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
16. Evaluation of Lie Detection Techniques: Overview.
- Author
-
Mohammed, Zena Tarik and Dahl, Ielaf O. Abdul Majjed
- Subjects
LIE detectors & detection ,FACIAL expression ,ARTIFICIAL vision ,ARTIFICIAL intelligence ,COMPUTER vision - Abstract
Recently, the need to separate truth from lies has motivated lie detection as a constant human endeavor; therefore there is a need to develop lie detection techniques and focus on the new area of lie detection utilizing facial expression. Human faces are a powerful repository of emotions in the complicated interaction between verbal and non-verbal clues that characterize human communication. From this micro-expression, the transitory emotion discloses the more prominent indicators that precede deceitful behavior, which makes the tapestry rich in information that can be harnessed to detect a lie. Historically, the development of deceiving lies passed through many developments to find the best way to get high performance, but the development of artificial intelligence and face recognition has further altered the landscape of lie detection. In this paper, the reason for lie detection is revealed with the techniques used to detect lies. The paper aims to present and survey the techniques with comparison used to detect lies, which will highlight the importance of this topic and urge researchers to develop current techniques or find other related techniques that serve the issue. The presentation of the techniques in this research revealed that the lie detection technique using facial expressions is considered the best technique to achieve the detection of lies. Facial expression is the most efficient because it does not require physical contact and because they are visual of real internal feelings and not voluntary movements, and computer vision and artificial intelligence have had an effective role in supporting this method and exploiting it optimally. Finally, the paper shows the limitations and achievements that the researchers found in their research to help researchers in this field. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. RGB‐D road segmentation based on cross‐modality feature maintenance and encouragement.
- Author
-
Yuan, Xia, Wu, Xinyi, Cui, Yanchao, and Zhao, Chunxia
- Subjects
ENCOURAGEMENT ,GAUSSIAN mixture models ,MAINTENANCE ,COMPUTER vision ,PRIOR learning - Abstract
Deep images can provide rich spatial structure information, which can effectively exclude the interference of illumination and road texture in road scene segmentation and make better use of the prior knowledge of road area. This paper first proposes a new cross‐modal feature maintenance and encouragement network. It includes a quantization statistics module as well as a maintenance and encouragement module for effective fusion between multimodal data. Meanwhile, for the problem that if the road segmentation is performed directly using a segmentation network, there will be a lack of supervised guidance with clear physical meaningful information and poor interpretability of learning features, this paper proposes two road segmentation models based on prior knowledge of deep image: disparity information and surface normal vector information. Then, a two‐branch neural network is used to process the colour image and the processed depth image separately, to achieve the full utilization of the complementary features of the two modalities. The experimental results on the KITTI road dataset and Cityscapes dataset show that the method in this paper has good road segmentation performance and high computational efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. 3D human motion prediction: A survey.
- Author
-
Lyu, Kedi, Chen, Haipeng, Liu, Zhenguang, Zhang, Beiqi, and Wang, Ruili
- Subjects
- *
ARTIFICIAL intelligence , *COMPUTER vision , *CONFERENCE papers , *FORECASTING , *HUMAN beings - Abstract
3D human motion prediction, predicting future poses from a given sequence, is an issue of great significance and challenge in computer vision and machine intelligence, which can help machines in understanding human behaviors. Due to the increasing development and understanding of Deep Neural Networks (DNNs) and the availability of large-scale human motion datasets, the human motion prediction has been remarkably advanced with a surge of interest among academia and industrial community. In this context, a comprehensive survey on 3D human motion prediction is conducted for the purpose of retrospecting and analyzing relevant works from existing released literature. In addition, a pertinent taxonomy is constructed to categorize these existing approaches for 3D human motion prediction. In this survey, relevant methods are categorized into three categories: human pose representation , network structure design , and prediction target. We systematically review all relevant journal and conference papers in the field of human motion prediction since 2015, which are presented in detail based on proposed categorizations in this survey. Furthermore, the outline for the public benchmark datasets, evaluation criteria, and performance comparisons are respectively presented in this paper. The limitations of the state-of-the-art methods are discussed as well, hoping for paving the way for future explorations. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Extracting entity relations for "problem-solving" knowledge graph of scientific domains using word analogy.
- Author
-
Chen, Guo, Peng, Jiabin, Xu, Tianxiang, and Xiao, Lu
- Subjects
KNOWLEDGE graphs ,SCIENTIFIC knowledge ,PROBLEM solving ,ARTIFICIAL intelligence ,ANALOGY ,WORD problems (Mathematics) ,COMPUTER vision ,GRAPH algorithms - Abstract
Purpose: Problem-solving" is the most crucial key insight of scientific research. This study focuses on constructing the "problem-solving" knowledge graph of scientific domains by extracting four entity relation types: problem-solving, problem hierarchy, solution hierarchy and association. Design/methodology/approach: This paper presents a low-cost method for identifying these relationships in scientific papers based on word analogy. The problem-solving and hierarchical relations are represented as offset vectors of the head and tail entities and then classified by referencing a small set of predefined entity relations. Findings: This paper presents an experiment with artificial intelligence papers from the Web of Science and achieved good performance. The F1 scores of entity relation types problem hierarchy, problem-solving and solution hierarchy, which were 0.823, 0.815 and 0.748, respectively. This paper used computer vision as an example to demonstrate the application of the extracted relations in constructing domain knowledge graphs and revealing historical research trends. Originality/value: This paper uses an approach that is highly efficient and has a good generalization ability. Instead of relying on a large-scale manually annotated corpus, it only requires a small set of entity relations that can be easily extracted from external knowledge resources. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
20. Advances in Tangible and Embodied Interaction for Virtual and Augmented Reality.
- Author
-
Cardoso, Jorge C. S., Perrotta, André, Silva, Paula Alexandra, and Martins, Pedro
- Subjects
VIRTUAL reality ,AUGMENTED reality ,HAPTIC devices ,COMPUTER vision ,OBJECT recognition algorithms ,ARTIFICIAL intelligence ,INDOOR positioning systems - Abstract
"An Interactive Augmented Reality Graph Visualization for Chinese Painters" [[2]]: This paper describes an interactive AR system that allows Chinese painters to explore and visualise complex graphs in an intuitive and immersive way. VR and AR technologies have seen tremendous progress in recent years, enabling novel and exciting ways to interact within virtual environments. Virtual Reality (VR) and Augmented Reality (AR) technologies have the potential to revolutionise the way we interact with digital content. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
21. Literature Review of Deep-Learning-Based Detection of Violence in Video.
- Author
-
Negre, Pablo, Alonso, Ricardo S., González-Briones, Alfonso, Prieto, Javier, and Rodríguez-González, Sara
- Subjects
LITERATURE reviews ,VIOLENCE ,ARTIFICIAL intelligence ,URBAN planning ,VIDEOS - Abstract
Physical aggression is a serious and widespread problem in society, affecting people worldwide. It impacts nearly every aspect of life. While some studies explore the root causes of violent behavior, others focus on urban planning in high-crime areas. Real-time violence detection, powered by artificial intelligence, offers a direct and efficient solution, reducing the need for extensive human supervision and saving lives. This paper is a continuation of a systematic mapping study and its objective is to provide a comprehensive and up-to-date review of AI-based video violence detection, specifically in physical assaults. Regarding violence detection, the following have been grouped and categorized from the review of the selected papers: 21 challenges that remain to be solved, 28 datasets that have been created in recent years, 21 keyframe extraction methods, 16 types of algorithm inputs, as well as a wide variety of algorithm combinations and their corresponding accuracy results. Given the lack of recent reviews dealing with the detection of violence in video, this study is considered necessary and relevant. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. A COMPARATIVE EXPLORATION OF ACTIVATION FUNCTIONS FOR IMAGE CLASSIFICATION IN CONVOLUTIONAL NEURAL NETWORKS.
- Author
-
MAKHDOOM, FAIZA and RAHMAN, JAMSHAID UL
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL intelligence ,MACHINE learning ,DIGITAL image processing ,COMPUTER vision - Abstract
Activation functions play a crucial role in enabling neural networks to carry out tasks with increased flexibility by introducing non-linearity. The selection of appropriate activation functions becomes even more crucial, especially in the context of deeper networks where the objective is to learn more intricate patterns. Among various deep learning tools, Convolutional Neural Networks (CNNs) stand out for their exceptional ability to learn complex visual patterns. In practice, ReLu is commonly employed in convolutional layers of CNNs, yet other activation functions like Swish can demonstrate superior training performance while maintaining good testing accuracy on different datasets. This paper presents an optimally refined strategy for deep learning-based image classification tasks by incorporating CNNs with advanced activation functions and an adjustable setting of layers. A thorough analysis has been conducted to support the effectiveness of various activation functions when coupled with the favorable softmax loss, rendering them suitable for ensuring a stable training process. The results obtained on the CIFAR-10 dataset demonstrate the favorability and stability of the adopted strategy throughout the training process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Guest Editorial: AI‐enabled intelligent network for 5G and beyond.
- Author
-
Tseng, Fan‐Hsun, Chen, Chi‐Yuan, Malekian, Reza, Nakano, Tadashi, and Zhang, Zhenjiang
- Subjects
5G networks ,COMPUTER engineering ,COMPUTER vision ,ARTIFICIAL intelligence ,COMPUTER science ,PARTICLE swarm optimization ,INTELLIGENT networks - Abstract
Artificial intelligence (AI), machine learning (ML) and deep learning (DL) techniques have made a breakthrough in many different research fields, such as computer vision, natural language processing, network management and mobile communication. One of the primary goals in 5G network pursues high efficiency, higher spectrum and energy efficiency, higher system capacity and data rates, and lower latency with cost reduction. The papers on the topic of "AI-enabled intelligent networks for 5G and beyond" contain novel network architectures and mechanisms by using AI/ML/DL approaches for 5G/B5G networks. [Extracted from the article]
- Published
- 2022
- Full Text
- View/download PDF
24. Analysis of developments and hotspots of international research on sports AI.
- Author
-
Li, Jian, Li, Meiyue, and Lin, Hao
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,SPORTS sciences ,WEARABLE technology ,MACHINE learning - Abstract
In this paper, 1,538 papers retrieved with the keywords "sports artificial intelligence (AI)" on the Web of Science database since 2007 were taken as the data source, and the Cite Space V software was used to visualize and analyze them. A visual knowledge graph was used to streamline the countries, institutions and authors conducting sports AI research, discipline distribution, research hotspots and development trends in the past 15 years. Subsequently, its development direction and research progress were discussed. Sports AI was widely distributed, with the US, China and the UK leading the way. The most prolific authors and teams in research on sports AI were concentrated in American universities. Their main research direction is to develop and improve smart wearable devices based on machine learning and deep learning technologies for different groups of people. Research on sports AI involved multiple disciplines, which mainly applied and referred to research methodologies and theories on engineering, computer science and sports science. It could be seen from the frequency and centrality of keywords that in the current field of sports AI, machine learning is the main direction, artificial neural networks is the main algorithm, and practical and empirical research based on data mining is the focus. The research hotspots were divided into three major clusters: physical health promotion, sports injury prevention and control, and athletic performance enhancement. How to introduce intelligent technology into sports for a perfect integration still has an arduous and long way to go. Future development requires joint efforts and participation of scientific researchers, professionals and common people. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
25. Pathological test type and chemical detection using deep neural networks: a case study using ELISA and LFA assays.
- Author
-
Hoque Tania, Marzia, Kaiser, M. Shamim, Abu-Hassan, Kamal, and Hossain, M. A.
- Subjects
ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,CHEMICAL testing ,MEDICAL personnel ,DEEP learning - Abstract
Purpose: The gradual increase in geriatric issues and global imbalance of the ratio between patients and healthcare professionals have created a demand for intelligent systems with the least error-prone diagnosis results to be used by less medically trained persons and save clinical time. This paper aims at investigating the development of image-based colourimetric analysis. The purpose of recognising such tests is to support wider users to begin a colourimetric test to be used at homecare settings, telepathology and so on. Design/methodology/approach: The concept of an automatic colourimetric assay detection is delivered by utilising two cases. Training deep learning (DL) models on thousands of images of these tests using transfer learning, this paper (1) classifies the type of the assay and (2) classifies the colourimetric results. Findings: This paper demonstrated that the assay type can be recognised using DL techniques with 100% accuracy within a fraction of a second. Some of the advantages of the pre-trained model over the calibration-based approach are robustness, readiness and suitability to deploy for similar applications within a shorter period of time. Originality/value: To the best of the authors' knowledge, this is the first attempt to provide colourimetric assay type classification (CATC) using DL. Humans are capable to learn thousands of visual classifications in their life. Object recognition may be a trivial task for humans, due to photometric and geometric variabilities along with the high degree of intra-class variabilities, it can be a challenging task for machines. However, transforming visual knowledge into machines, as proposed, can support non-experts to better manage their health and reduce some of the burdens on experts. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Computer vision-based visualization and quantification of body skeletal movements for investigation of traditional skills: the production of Kizumi winnowing baskets.
- Author
-
Yang, Peng, Furukawa, Yuka, Imaishi, Migiwa, Kubo, Mitsunori, and Ueda, Akira
- Subjects
ARTIFICIAL intelligence ,HUMAN mechanics ,TRADITIONAL farming ,HUMAN body ,GEOGRAPHIC names ,COMPUTER vision - Abstract
This paper explores the application of computer vision and mathematical modeling to analyze the intricate movements involved in weaving a traditional farming tool, the winnowing basket. By utilizing OpenPose algorithms, the study simplifies and visualizes the craftsmen's motions, particularly focusing on wrist movements. Video data of craftsmen in Chiba, Japan, creating Kizumi (place name) winnowing baskets is used as the basis for analysis. The extracted information is used to generate 2D motion trajectories of the wrist, allowing a comparison between beginners who watched parsed videos and those who watched the original videos in terms of skill acquisition and learning time. By visualizing human body behavior and combining statistical results, this study demonstrates the potential of artificial intelligence techniques such as computer vision for observing repetitive human movement and inheriting traditional skills. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Fractional differentiation based image enhancement for automatic detection of malignant melanoma.
- Author
-
Anber, Basmah and Yurtkan, Kamil
- Subjects
EDGE detection (Image processing) ,IMAGE recognition (Computer vision) ,ARTIFICIAL intelligence ,COMPUTER vision ,MELANOMA - Abstract
Recent improvements in artificial intelligence and computer vision make it possible to automatically detect abnormalities in medical images. Skin lesions are one broad class of them. There are types of lesions that cause skin cancer, again with several types. Melanoma is one of the deadliest types of skin cancer. Its early diagnosis is at utmost importance. The treatments are greatly aided with artificial intelligence by the quick and precise diagnosis of these conditions. The identification and delineation of boundaries inside skin lesions have shown promise when using the basic image processing approaches for edge detection. Further enhancements regarding edge detections are possible. In this paper, the use of fractional differentiation for improved edge detection is explored on the application of skin lesion detection. A framework based on fractional differential filters for edge detection in skin lesion images is proposed that can improve automatic detection rate of malignant melanoma. The derived images are used to enhance the input images. Obtained images then undergo a classification process based on deep learning. A well-studied dataset of HAM10000 is used in the experiments. The system achieves 81.04% accuracy with EfficientNet model using the proposed fractional derivative based enhancements whereas accuracies are around 77.94% when using original images. In almost all the experiments, the enhanced images improved the accuracy. The results show that the proposed method improves the recognition performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Computer vision and machine learning approaches for metadata enrichment to improve searchability of historical newspaper collections.
- Author
-
Ali, Dilawar, Milleville, Kenzo, Verstockt, Steven, Van de Weghe, Nico, Chambers, Sally, and Birkholz, Julie M.
- Subjects
COMPUTER vision ,DATA mining ,ARTIFICIAL intelligence ,IMAGE analysis ,MACHINE learning ,DIGITAL libraries ,DIGITAL humanities - Abstract
Purpose: Historical newspaper collections provide a wealth of information about the past. Although the digitization of these collections significantly improves their accessibility, a large portion of digitized historical newspaper collections, such as those of KBR, the Royal Library of Belgium, are not yet searchable at article-level. However, recent developments in AI-based research methods, such as document layout analysis, have the potential for further enriching the metadata to improve the searchability of these historical newspaper collections. This paper aims to discuss the aforementioned issue. Design/methodology/approach: In this paper, the authors explore how existing computer vision and machine learning approaches can be used to improve access to digitized historical newspapers. To do this, the authors propose a workflow, using computer vision and machine learning approaches to (1) provide article-level access to digitized historical newspaper collections using document layout analysis, (2) extract specific types of articles (e.g. feuilletons – literary supplements from Le Peuple from 1938), (3) conduct image similarity analysis using (un)supervised classification methods and (4) perform named entity recognition (NER) to link the extracted information to open data. Findings: The results show that the proposed workflow improves the accessibility and searchability of digitized historical newspapers, and also contributes to the building of corpora for digital humanities research. The AI-based methods enable automatic extraction of feuilletons, clustering of similar images and dynamic linking of related articles. Originality/value: The proposed workflow enables automatic extraction of articles, including detection of a specific type of article, such as a feuilleton or literary supplement. This is particularly valuable for humanities researchers as it improves the searchability of these collections and enables corpora to be built around specific themes. Article-level access to, and improved searchability of, KBR's digitized newspapers are demonstrated through the online tool (https://tw06v072.ugent.be/kbr/). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Vehicle Re-Identification Method Based on Multi-Task Learning in Foggy Scenarios.
- Author
-
Gao, Wenchao, Chen, Yifan, Cui, Chuanrui, and Tian, Chi
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,AUTOMOBILE license plates ,COMPUTER vision ,BRANCHING processes ,DEEP learning - Abstract
Vehicle re-identification employs computer vision to determine the presence of specific vehicles in images or video sequences, often using vehicle appearance for identification due to the challenge of capturing complete license plate information. Addressing the performance issues caused by fog, such as image blur and loss of key positional information, this paper introduces a multi-task learning framework incorporating a multi-scale fusion defogging method (MsF). This method effectively mitigates image blur to produce clearer images, which are then processed by the re-identification branch. Additionally, a phase attention mechanism is introduced to adaptively preserve crucial details. Utilizing advanced artificial intelligence techniques and deep learning algorithms, the framework is evaluated on both synthetic and real datasets, showing significant improvements in mean average precision (mAP)—an increase of 2.5% to 87.8% on the synthetic dataset and 1.4% to 84.1% on the real dataset. These enhancements demonstrate the method's superior performance over the semi-supervised joint defogging learning (SJDL) model, particularly under challenging foggy conditions, thus enhancing vehicle re-identification accuracy and deepening the understanding of applying multi-task learning frameworks in adverse visual environments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Development of the Anthropomorphic Arm for Collaborative and Home Service Robot CHARMIE.
- Author
-
Syed, Fawad A., Lopes, Gil, and Ribeiro, A. Fernando
- Subjects
POSE estimation (Computer vision) ,INDUSTRIAL robots ,ARTIFICIAL intelligence ,SENSOR arrays ,COMPUTER vision - Abstract
Service robots are rapidly transitioning from concept to reality, making significant strides in development. Similarly, the field of prosthetics is evolving at an impressive pace, with both areas now being highly relevant in the industry. Advancements in these fields are continually pushing the boundaries of what is possible, leading to the increasing creation of individual arm and hand prosthetics, either as standalone units or combined packages. This trend is driven by the rise of advanced collaborative robots that seamlessly integrate with human counterparts in real-world applications. This paper presents an open-source, 3D-printed robotic arm that has been assembled and programmed using two distinct approaches. The first approach involves controlling the hand via teleoperation, utilizing a camera and machine learning-based hand pose estimation. This method details the programming techniques and processes required to capture data from the camera and convert it into hardware signals. The second approach employs kinematic control using the Denavit-Hartenbergmethod to define motion and determine the position of the end effector in 3D space. Additionally, this work discusses the assembly and modifications made to the arm and hand to create a cost-effective and practical solution. Typically, implementing teleoperation requires numerous sensors and cameras to ensure smooth and successful operation. This paper explores methods enabled by artificial intelligence (AI) that reduce the need for extensive sensor arrays and equipment. It investigates how AI-generated data can be translated into tangible hardware applications across various fields. The advancements in computer vision, combined with AI capable of accurately predicting poses, have the potential to revolutionize the way we control and interact with the world around us. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. A Comprehensive Review of AI Diagnosis Strategies for Age-Related Macular Degeneration (AMD).
- Author
-
Abd El-Khalek, Aya A., Balaha, Hossam Magdy, Sewelam, Ashraf, Ghazal, Mohammed, Khalil, Abeer T., Abo-Elsoud, Mohy Eldin A., and El-Baz, Ayman
- Subjects
MACULAR degeneration ,RETINAL diseases ,COMPUTER vision ,ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning - Abstract
The rapid advancement of computational infrastructure has led to unprecedented growth in machine learning, deep learning, and computer vision, fundamentally transforming the analysis of retinal images. By utilizing a wide array of visual cues extracted from retinal fundus images, sophisticated artificial intelligence models have been developed to diagnose various retinal disorders. This paper concentrates on the detection of Age-Related Macular Degeneration (AMD), a significant retinal condition, by offering an exhaustive examination of recent machine learning and deep learning methodologies. Additionally, it discusses potential obstacles and constraints associated with implementing this technology in the field of ophthalmology. Through a systematic review, this research aims to assess the efficacy of machine learning and deep learning techniques in discerning AMD from different modalities as they have shown promise in the field of AMD and retinal disorders diagnosis. Organized around prevalent datasets and imaging techniques, the paper initially outlines assessment criteria, image preprocessing methodologies, and learning frameworks before conducting a thorough investigation of diverse approaches for AMD detection. Drawing insights from the analysis of more than 30 selected studies, the conclusion underscores current research trajectories, major challenges, and future prospects in AMD diagnosis, providing a valuable resource for both scholars and practitioners in the domain. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Kinesiology-Inspired Assessment of Intrusion Risk Based on Human Motion Features.
- Author
-
Huang, He, Hu, Hao, Xu, Feng, and Zhang, Zhipeng
- Subjects
INTRUSION detection systems (Computer security) ,RISK assessment ,HUMAN kinematics ,CONTROL groups ,COMPUTER vision ,BUILDING sites ,ARTIFICIAL intelligence - Abstract
Intrusion behavior in hazardous areas is one of the major causes of construction safety accidents including falls from height and strikes by objects. Implementing automatic and preassessment of intrusions to enhance safety performance is of great importance in construction areas. Traditional behavioral safety management mainly relies on manual observation, which makes it difficult to accurately identify detailed changes in behavioral posture, while the results of risk analysis are susceptible to bias due to subjective factors. The emergence of artificial intelligence techniques and computer vision has provided new solutions for human behavior detection in recent years. Accurate vision-based skeleton extraction helps capture detailed behavioral information. Current studies generally focus on intrusion after the occurrence and rarely select metrics considering complex human motion features. It is difficult to accurately assess the potential intrusion risk, resulting in inefficient ex-ante safety management outcomes. This paper presents a novel intrusion assessment approach by integrating human kinematics to extract risk indicators and apply objective assessment methods for risk quantification. An indoor experiment with control groups was conducted by employing skeleton detection technology with safety knowledge to demonstrate its feasibility and effectiveness. The risk levels of the different activities were compared through a control group experimental analysis. The results show that a satisfying accuracy of intrusion assessment can be achieved for different workers. Appropriate warning and intervention methods can be implemented to mitigate the occurrence or reduce the severity of intrusions, thus reducing safety incidents on construction sites. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. A Comprehensive Survey of Deep Learning and Its Applications in Advancing Artificial Intelligence.
- Author
-
Sugil, A. Jasmine, Merriliance, K., and Lourdusamy, Mary Immaculate Sheela
- Subjects
DEEP learning ,ARTIFICIAL intelligence ,NATURAL language processing ,COMPUTER vision ,CONVOLUTIONAL neural networks ,BOLTZMANN machine - Abstract
Deep learning, a subset of machine learning, stands at the forefront of artificial intelligence, striving to bridge the gap to its ultimate goal. This paper employs summary and induction methodologies to research into the area of deep learning. It begins by surveying the global development and current landscape of deep learning. Next, it elucidates the structural principles, characteristics, and key models, including stacked auto encoders, deep belief networks, deep Boltzmann machines, and convolutional neural networks. Furthermore, it examines the latest advancements and applications of deep learning across diverse domains such as speech processing, computer vision, natural language processing, and medical diagnostics. Finally, the paper outlines the challenges and future research directions within the realm of deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Streamlined Shopping Experience with Futuristic Automated Billing - SmartPay.
- Author
-
Y. M., Santhosh, B. H., Hemamalini, Adil, Syed, S. B., Yogesh, and M., Rakshith
- Subjects
MACHINE learning ,COMPUTER vision ,DEEP learning ,TECHNOLOGICAL innovations ,INVOICES ,COMPUTER engineering - Abstract
Amazon Go is a new grocery store concept that eliminates the need for checkout lines and cashiers by using a combination of computer vision, sensors, and deep learning algorithms to automatically track what customers take off the shelves. With Amazon Go, customers can quickly enter the store, grab what they need, and leave without ever needing to wait in line or deal with a cashier. Amazon Go utilises advanced technologies such as computer vision, deep learning algorithms, and sensor fusion to create a unique shopping experience. Customers are able to enter the store and shop without having to wait in lines or interact with other customers, creating a streamlined and efficient shopping experience. Additionally, Amazon Go utilises artificial intelligence to track customer movement and automatically charges customers for items they take out of the store, which eliminates the need for traditional checkouts. This paper discusses the technological innovations behind Amazon Go and how they can be improved in the current scenario. [ABSTRACT FROM AUTHOR]
- Published
- 2024
35. Editorial: AI-empowered services for interconnected smart plant protection systems.
- Author
-
Xu Zheng
- Subjects
PLANT protection ,COMPUTER vision ,SMART devices ,BOTANY - Abstract
This document is an editorial from the journal Frontiers in Plant Science titled "AI-empowered services for interconnected smart plant protection systems." The editorial discusses a research topic focused on using artificial intelligence (AI) techniques to enhance smart plant protection systems. The research topic received 14 submissions, with six outstanding papers accepted. These papers cover various aspects of plant protection, including computer vision models for fruit and disease detection, as well as AI methods for insect pest density estimation and secure data-sharing strategies. The editorial highlights the innovative ideas and investigations presented in the research topic and emphasizes their potential for advancing smart plant protection techniques. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
36. A High-Precision Method for 100-Day-Old Classification of Chickens in Edge Computing Scenarios Based on Federated Computing.
- Author
-
Huang, Yikang, Yang, Xinze, Guo, Jiangyi, Cheng, Jia, Qu, Hao, Ma, Jie, and Li, Lin
- Subjects
EDGE computing ,CHICKENS ,ARTIFICIAL intelligence ,ARTIFICIAL vision ,COMPUTER vision ,CLASSIFICATION - Abstract
Simple Summary: Improving the accuracy of day-age detection of chickens is of great importance for chicken rearing. This paper focuses on the problem of classifying the age of chickens within 100 days. This paper proposes a high-precision federated learning-based model that can be applied to edge computing scenarios. Finally, our method can achieve an accuracy of 96.1%, which can fully meet the needs of application scenarios. Due to the booming development of computer vision technology and artificial intelligence algorithms, it has become more feasible to implement artificial rearing of animals in real production scenarios. Improving the accuracy of day-age detection of chickens is one of the examples and is of great importance for chicken rearing. This paper focuses on the problem of classifying the age of chickens within 100 days. Due to the huge amount of data and the different computing power of different devices in practical application scenarios, it is important to maximize the computing power of edge computing devices without sacrificing accuracy. This paper proposes a high-precision federated learning-based model that can be applied to edge computing scenarios. In order to accommodate different computing power in different scenarios, this paper proposes a dual-ended adaptive federated learning framework; in order to adapt to low computing power scenarios, this paper performs lightweighting operations on the mainstream model; and in order to verify the effectiveness of the model, this paper conducts a number of targeted experiments. Compared with AlexNet, VGG, ResNet and GoogLeNet, this model improves the classification accuracy to 96.1%, which is 14.4% better than the baseline model and improves the Recall and Precision by 14.8% and 14.2%, respectively. In addition, by lightening the network, our methods reduce the inference latency and transmission latency by 24.4 ms and 10.5 ms, respectively. Finally, this model is deployed in a real-world application and an application is developed based on the wechat SDK. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
37. Deep-Learning-CNN for Detecting Covered Faces with Niqab.
- Author
-
Alashbi, Abdulaziz A., Sunar, Mohd Shahrizal, and Alqahtani, Zieb
- Subjects
DEEP learning ,HUMAN facial recognition software ,COMPUTER vision ,COMPUTER algorithms ,ARTIFICIAL intelligence - Abstract
Detecting occluded faces is a non-trivial problem for face detection in computer vision. This challenge becomes more difficult when the occlusion covers majority of the face. Despite the high performance of current state-of-the-art face detection algorithms, the detection of occluded and covered faces is an unsolved problem and is still worthy of study. In this paper, a deep-learning-face-detection model Niqab-Face-Detector is proposed along with context-based labeling technique for detecting unconstrained veiled faces such as faces covered with niqab. An experimental test was conducted to evaluate the performances of the proposed model using the Niqab-Face dataset. The experiment showed encouraging results and improved accuracy compared with state-of-the-art face detection algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
38. Artificial Intelligence-Driven Innovations in Hydrogen Safety.
- Author
-
Patil, Ravindra R., Calay, Rajnish Kaur, Mustafa, Mohamad Y., and Thakur, Somil
- Subjects
MACHINE learning ,ARTIFICIAL neural networks ,WIRELESS sensor network security ,WIRELESS sensor networks ,COMPUTER vision ,ARTIFICIAL intelligence - Abstract
This review explores recent advancements in hydrogen gas (H
2 ) safety through the lens of artificial intelligence (AI) techniques. As hydrogen gains prominence as a clean energy source, ensuring its safe handling becomes paramount. The paper critically evaluates the implementation of AI methodologies, including artificial neural networks (ANN), machine learning algorithms, computer vision (CV), and data fusion techniques, in enhancing hydrogen safety measures. By examining the integration of wireless sensor networks and AI for real-time monitoring and leveraging CV for interpreting visual indicators related to hydrogen leakage issues, this review highlights the transformative potential of AI in revolutionizing safety frameworks. Moreover, it addresses key challenges such as the scarcity of standardized datasets, the optimization of AI models for diverse environmental conditions, etc., while also identifying opportunities for further research and development. This review foresees faster response times, reduced false alarms, and overall improved safety for hydrogen-related applications. This paper serves as a valuable resource for researchers, engineers, and practitioners seeking to leverage state-of-the-art AI technologies for enhanced hydrogen safety systems. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
39. Recent Advances on Deep Learning for Sign Language Recognition.
- Author
-
Yanqiong Zhang and Xianwei Jiang
- Subjects
DEEP learning ,SIGN language ,TRANSFORMER models ,ARTIFICIAL neural networks ,RECURRENT neural networks ,CONVOLUTIONAL neural networks - Abstract
Sign language, a visual-gestural language used by the deaf and hard-of-hearing community, plays a crucial role in facilitating communication and promoting inclusivity. Sign language recognition (SLR), the process of automatically recognizing and interpreting sign language gestures, has gained significant attention in recent years due to its potential to bridge the communication gap between the hearing impaired and the hearing world. The emergence and continuous development of deep learning techniques have provided inspiration and momentum for advancing SLR. This paper presents a comprehensive and up-to-date analysis of the advancements, challenges, and opportunities in deep learning-based sign language recognition, focusing on the past five years of research. We explore various aspects of SLR, including sign data acquisition technologies, sign language datasets, evaluation methods, and different types of neural networks. Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) have shown promising results in fingerspelling and isolated sign recognition. However, the continuous nature of sign language poses challenges, leading to the exploration of advanced neural network models such as the Transformer model for continuous sign language recognition (CSLR). Despite significant advancements, several challenges remain in the field of SLR. These challenges include expanding sign language datasets, achieving user independence in recognition systems, exploring different input modalities, effectively fusing features, modeling co-articulation, and improving semantic and syntactic understanding. Additionally, developing lightweight network architectures for mobile applications is crucial for practical implementation. By addressing these challenges, we can further advance the field of deep learning for sign language recognition and improve communication for the hearing-impaired community. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Diagnostics of Exercise-Induced Laryngeal Obstruction Using Machine Learning: A Narrative Review.
- Author
-
Mæstad, Rune, Kvidaland, Haakon Kristian, Clemm, Hege, Røksund, Ola Drange, and Arghandeh, Reza
- Subjects
COMPUTER vision ,IMAGE segmentation ,ARTIFICIAL intelligence ,APPLICATION software ,MACHINE learning - Abstract
Objective: This paper explores machine learning methods for exercise-induced laryngeal obstruction (EILO) diagnostics. Traditional diagnostic approaches like CLE scoring face subjectivity, limiting precise objective assessments. Machine learning is introduced as a theoretical solution to potentially overcome these limitations and improve diagnostic precision. Methods: A narrative review was conducted to explore the integration of machine learning techniques in the diagnostics of EILO. Result: Three machine learning methods for the segmentation of laryngeal images were discovered: fully convolutional network, Mask R-CNN, and 3D VOSNet. Our findings reveal that the integration of machine learning with EILO diagnostics remains a largely untapped research domain, providing significant room for further exploration. Conclusions: The integration of ML techniques for EILO diagnostics has the potential to be a helpful tool for clinicians. The application of computer vision ML methods, such as image segmentation, to delineate laryngeal structures paves the way for a more objective assessment. While challenges persist, especially in differences in patients' laryngeal anatomy, the synergy of ML and medical expertise is an important field to explore in the years to come. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Exploring Intervention Techniques for Alzheimer's Disease: Conventional Methods and the Role of AI in Advancing Care.
- Author
-
Subramanian, Karthikeyan, Hajamohideen, Faizal, Viswan, Vimbi, Shaffi, Noushath, and Mahmud, Mufti
- Subjects
ALZHEIMER'S disease ,ARTIFICIAL intelligence ,REMINISCENCE therapy ,DATA privacy ,COMPUTER vision ,DEEP learning - Abstract
Alzheimer's disease (AD) is a neurodegenerative condition characterized by cognitive decline and functional impairment. This study compares conventional intervention techniques with emerging artificial intelligence (AI) approaches to AD. Intervention technique refers to a specific method or approach employed to bring about positive change in a particular situation. In the context of AD, such techniques are crucial as they aim to slow down the progression of symptoms, alleviate behavioral challenges, and support patients and their caretakers in managing the complexities of the condition. Conventional intervention techniques, such as cognitive stimulation and reality orientation, have demonstrated benefits in improving cognitive function and emotional well-being. Conventional intervention approaches are widely preferred as they have a proven track record of effectiveness, personalized response, cost-effectiveness, and patient-centered care. Despite these benefits, they are limited by individual variability in response and long-term effectiveness. On the other hand, AI-based approaches such as computer vision and deep learning hold the potential to revolutionize Alzheimer's interventions. These technologies offer early detection, personalized care, and remote monitoring capabilities. They can provide tailored interventions, assist decision-making, and enhance caregiver support. Although AI-based interventions face challenges such as data privacy and implementation complexity, their potential to transform Alzheimer's care is significant. This research paper compares conventional and AI-based approaches. It reveals that while traditional techniques are well established and have proven benefits, AI-based interventions offer novel opportunities for personalized and advanced care. Combining the strengths of both approaches may lead to more comprehensive and effective interventions for individuals with AD. Continued research and collaboration are crucial to harness the full potential of AI in improving Alzheimer's care and enhancing the quality of life for affected individuals and their caregivers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Special issue on advances in pattern recognition and computer vision, applications and systems.
- Author
-
Irfan Uddin, M.
- Subjects
PATTERN recognition systems ,COMPUTER vision ,ARTIFICIAL intelligence - Abstract
We have encouraged our authors to present multidisciplinary research papers in this special issue and present papers which are directly or indirectly related to pattern recognition and computer vision. This special issue is on topics of pattern recognition and computer vision, along with their applications in different domains and systems in different environments. [Extracted from the article]
- Published
- 2022
- Full Text
- View/download PDF
43. Human Event Recognition in Smart Classrooms Using Computer Vision: A Systematic Literature Review.
- Author
-
Córdoba-Tlaxcalteco, M. L. and Benítez-Guerrero, E.
- Subjects
COMPUTER vision ,ASSISTIVE technology ,CLASSROOMS ,RECOGNITION (Psychology) ,COVID-19 pandemic ,RESEARCH personnel - Abstract
The field of human event recognition using visual data in smart environments has emerged as a fruitful and successful area of study, with extensive research and development efforts driving significant advancements. These advancements have not only provided valuable insights, but also led to practical applications in various domains. In this context, human actions, activities, interactions, and behaviors can all be considered as events of interest in smart environments. However, when it comes to smart classrooms, there is a lack of unified consensus on the definition of the term "human event." This lack of agreement presents a significant challenge for educators, researchers, and developers, as it hampers their ability to precisely identify and classify the specific situations that are relevant within the educational context. The aim of this paper is to address this challenge by conducting a systematic literature review of relevant events in smart classrooms, with a focus on their applications in assistive technology. The review encompasses a comprehensive analysis of 227 published documents spanning from 2012 to 2022. It delves into key algorithms, methodologies, and applications of vision-based event recognition in smart environments. As the primary outcome, the review identifies the most significant events, classifying them according to single person behavior, or multiple-person interactions, or object-person interactions. It also examines their practical applications within the educational context. The paper concludes with a discussion on the relevance and practicality of vision-based human event recognition in smart classrooms, particularly in the post-COVID era. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. A Complete Review on Image Denoising Techniques for Medical Images.
- Author
-
Kaur, Amandeep and Dong, Guanfang
- Subjects
IMAGE denoising ,DIAGNOSTIC imaging ,GENERATIVE adversarial networks ,ARTIFICIAL intelligence ,COMPUTER vision ,IMAGE enhancement (Imaging systems) - Abstract
Medical imaging methods, such as CT scans, MRI scans, X-rays, and ultrasound imaging, are widely used for diagnosis in the healthcare domain. However, these methods are often affected by noise, which can lead to incorrect diagnoses. Radiologists used to rely on visual features observed through various imaging techniques to diagnose diseases in patients, but now, intelligent machines and artificial intelligence offer more accurate and early diagnoses. Over the past few decades, the classical problem of image denoising in computer vision has been extensively studied. This survey paper discusses the various techniques applicable which have tried to remove the noise from medical images. A complete overview of the problem hypothesis is stated in the paper, with an in-depth discussion on types and sources of noise and the evaluation metrics deployed, followed by the discussion and implementation of various filtering and image enhancement techniques. The section is succeeded by a comprehensive literature review conducted on leading and state-of-the-art methods in broadly four domains—frequency domain, filtering, CNN-based, Generative Adversarial Networks (GAN)-based and Transformer-based approaches. The conclusion summarises the findings and proposes the importance of image denoising, focusing on Explainable AI (XAI). [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. Bird's nest defect detection of transmission lines based on domain knowledge and occlusion reasoning.
- Author
-
Dong, Na, Zhang, Wenjing, Chen, Ze, Feng, Haiyan, and Jia, Jiandong
- Subjects
ELECTRIC lines ,BIRD nests ,PATTERN recognition systems - Abstract
Bird's nest defect is an important cause of transmission line faults. To achieve accurate detection of bird nest defects in complex scenarios, a bird nest defect detection model for transmission lines was proposed that combines domain knowledge and occlusion reasoning networks. On the one hand, the model utilized the domain knowledge of the location of the bird's nest, using edge detection to obtain tower area information to constrain the location of candidate frames. This helps to reduce the false detection caused by complex backgrounds. On the other hand, on the basis of analyzing the occlusion characteristics of bird nests, the model employed occlusion reasoning networks that randomly erase features at the feature level to simulate the occlusion of bird nests in real scenes and improve the model's detection capability for occluded targets. Additionally, a multi‐scale feature fusion algorithm was designed in this paper to adapt the model to the scale variations of bird nests in aerial images. Experimental results demonstrate that the model outperforms advanced target detection models and other bird nest defect detection methods, with an AP50 of 78.8% and an AR10 of 72.4% for defect detection. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Artificial Intelligence in Photovoltaic Fault Identification and Diagnosis: A Systematic Review.
- Author
-
Islam, Mahmudul, Rashel, Masud Rana, Ahmed, Md Tofael, Islam, A. K. M. Kamrul, and Tlemçani, Mouhaydine
- Subjects
ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning ,CONVOLUTIONAL neural networks ,PROCESS capability ,PHOTOVOLTAIC power systems - Abstract
Photovoltaic (PV) fault detection is crucial because undetected PV faults can lead to significant energy losses, with some cases experiencing losses of up to 10%. The efficiency of PV systems depends upon the reliable detection and diagnosis of faults. The integration of Artificial Intelligence (AI) techniques has been a growing trend in addressing these issues. The goal of this systematic review is to offer a comprehensive overview of the recent advancements in AI-based methodologies for PV fault detection, consolidating the key findings from 31 research papers. An initial pool of 142 papers were identified, from which 31 were selected for in-depth review following the PRISMA guidelines. The title, objective, methods, and findings of each paper were analyzed, with a focus on machine learning (ML) and deep learning (DL) approaches. ML and DL are particularly suitable for PV fault detection because of their capacity to process and analyze large amounts of data to identify complex patterns and anomalies. This study identified several AI techniques used for fault detection in PV systems, ranging from classical ML methods like k-nearest neighbor (KNN) and random forest to more advanced deep learning models such as Convolutional Neural Networks (CNNs). Quantum circuits and infrared imagery were also explored as potential solutions. The analysis found that DL models, in general, outperformed traditional ML models in accuracy and efficiency. This study shows that AI methodologies have evolved and been increasingly applied in PV fault detection. The integration of AI in PV fault detection offers high accuracy and effectiveness. After reviewing these studies, we proposed an Artificial Neural Network (ANN)-based method for PV fault detection and classification. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
47. A Deep Learning-Based Programming and Creation Algorithm of NFT Artwork.
- Author
-
Wang, T.
- Subjects
DEEP learning ,GENERATIVE adversarial networks ,COMPUTER vision ,ALGORITHMS ,ARTIFICIAL intelligence ,IMAGE analysis - Abstract
In the field of computer vision, it is a very challenging task to use artificial intelligence deep learning method to realize the programming and creation of NFT artwork. With the continuous development and improvement of deep learning technology, this task has become a reality. The generative adversarial network model used in deep learning can generate new images based on the extraction and analysis of image data features and has become an important tool for NFT artwork image generation. In order to better realize the NFT artwork programming, this paper analyzes the working principle of the traditional adversarial generation method and then uses the StyleGAN model to edit the higher-level attributes of the image, which can effectively control the generated style and style of the NFT artwork image. Finally, in order to improve the quality of the generated images, this paper introduces a channel attention mechanism and a spatial attention mechanism to ensure that the generated images are more reasonable and realistic. Finally, through a large number of experiments, it is proved that the NFT artwork transmission programming algorithm based on artificial intelligence deep learning proposed in this paper can control the overall style of image generation according to the needs of the transmission, and the generated image features have good details and high visual quality. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. From language to algorithm: trans and non-binary identities in research on facial and gender recognition
- Author
-
Thieme, Katja, Saunders, Mary Ann S., and Ferreira, Laila
- Published
- 2024
- Full Text
- View/download PDF
49. Face expression recognition based on NGO-BILSTM model.
- Author
-
Jiarui Zhong, Tangxian Chen, and Liuhan Yi
- Subjects
GOSHAWK ,FACIAL expression ,ARTIFICIAL intelligence ,DEEP learning ,ARTIFICIAL vision ,COMPUTER vision - Abstract
Introduction: Facial expression recognition has always been a hot topic in computer vision and artificial intelligence. In recent years, deep learning models have achieved good results in accurately recognizing facial expressions. BILSTM network is such a model. However, the BILSTM network's performance depends largely on its hyperparameters, which is a challenge for optimization. Methods: In this paper, a Northern Goshawk optimization (NGO) algorithm is proposed to optimize the hyperparameters of BILSTM network for facial expression recognition. The proposed methods were evaluated and compared with other methods on the FER2013, FERplus and RAF-DB datasets, taking into account factors such as cultural background, race and gender. Results: The results show that the recognition accuracy of the model on FER2013 and FERPlus data sets is much higher than that of the traditional VGG16 network. The recognition accuracy is 89.72% on the RAF-DB dataset, which is 5.45, 9.63, 7.36, and 3.18% higher than that of the proposed facial expression recognition algorithms DLP-CNN, gACNN, pACNN, and LDL-ALSG in recent 2 years, respectively. Discussion: In conclusion, NGO algorithm effectively optimized the hyperparameters of BILSTM network, improved the performance of facial expression recognition, and provided a new method for the hyperparameter optimization of BILSTM network for facial expression recognition. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
50. Vision sensor‐based SLAM problem for small UAVs in dynamic indoor environments.
- Author
-
Zhou, Lanfeng, Kong, Mingyue, Liu, Ziwei, and Li, Ling
- Subjects
COMPUTER vision ,ARTIFICIAL intelligence ,POINT cloud ,VISION ,DRONE aircraft - Abstract
In recent years, with the rapid development of artificial intelligence, machine vision and other related technologies, there has been a demand for higher levels of intelligence in UAVs. There are many excellent SLAM systems available, but most of them assume that their working environment is static. When there are dynamic objects in the environment the localization and mapping accuracy of the SLAM system is reduced, and it can even cause its tracking to fail. To solve this problem. In this paper, we propose a target detection‐based SLAM algorithm for real‐time operation in crowded dynamic environments. The algorithm removes feature points of dynamic objects from key frames and then constructs a point cloud map of based on the state‐of‐the‐art SLAM system ORB‐SLAM3. Finally, the proposed method is validated and evaluated on multiple dynamic datasets and real environments. The results show that the algorithm in this paper outperforms other visual SLAM algorithms in terms of localization and map building accuracy in more highly dynamic datasets, while guaranteeing performance. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.