6,398 results
Search Results
2. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback
- Author
-
Chenshu Liu, Songbin Ben, Chongwen Liu, Xianchao Li, Qingxia Meng, Yilin Hao, Qian Jiao, and Pinyi Yang
- Subjects
Paper-based cultural relics ,Conservation ,Computer vision ,Deep learning ,Strain classification ,Web application ,Fine Arts ,Analytical chemistry ,QD71-142 - Abstract
Abstract Purpose Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms’ nutrient source. Mold not only can damage papers’ structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation.
- Published
- 2024
- Full Text
- View/download PDF
3. Speech recognition for Kazakh language: a research paper.
- Author
-
Kapyshev, Galym, Nurtas, Marat, and Altaibek, Aizhan
- Subjects
SPEECH perception ,AUTOMATIC speech recognition ,NATURAL language processing ,LANGUAGE research ,DEEP learning ,MARKOV processes - Abstract
In recent years, the research pertaining to speech recognition technology in the Kazakh language has gained significant importance. This is due to the increasing demand for natural language processing applications in the region where Kazakh is predominantly spoken. Thus, there exists an urgent requirement for precise and dependable speech recognition systems. The research study examines the application of sophisticated deep learning methodologies, such as Natural Language Processing (NLP) and Hidden Markov Model (HMM), in facilitating speech recognition for the Kazakh language. Additionally, the investigation delves into how various techniques, including data preprocessing, acoustic modeling, and language modeling, can aid in devising effective speech recognition systems. The article deliberates on the feasible uses of speech recognition technology in the geographic area where Kazakh language is spoken and outlines its future research prospects. The investigation underscores the significance of persistent inquiry in this realm to confront distinctive obstacles encountered in creating speech recognition systems for languages with restricted resources. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback.
- Author
-
Liu, Chenshu, Ben, Songbin, Liu, Chongwen, Li, Xianchao, Meng, Qingxia, Hao, Yilin, Jiao, Qian, and Yang, Pinyi
- Subjects
- *
CONVOLUTIONAL neural networks , *COMMUNITY involvement , *DEEP learning , *CLASSIFICATION , *CAMERA phones , *RELICS , *AUTOMATIC classification , *SECURITY classification (Government documents) - Abstract
Purpose: Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms' nutrient source. Mold not only can damage papers' structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods: Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion: In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries.
- Author
-
Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
- Subjects
- *
MEDICAL records , *ARTIFICIAL neural networks , *COMPUTER vision , *DIASTOLIC blood pressure , *MEDICAL personnel , *DEEP learning , *SYSTOLIC blood pressure - Abstract
Background: In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods: We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results: The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions: We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper
- Author
-
Prasoon Kumar Vinodkumar, Dogus Karabulut, Egils Avots, Cagri Ozcinar, and Gholamreza Anbarjafari
- Subjects
deep learning ,3D reconstruction ,3D augmentation ,3D registration ,point cloud ,voxel ,Science ,Astrophysics ,QB460-466 ,Physics ,QC1-999 - Abstract
The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future.
- Published
- 2024
- Full Text
- View/download PDF
7. Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open- Access Papers
- Author
-
Nils Hütten, Miguel Alves Gomes, Florian Hölken, Karlo Andricevic, Richard Meyes, and Tobias Meisen
- Subjects
automated visual inspection ,industrial applications ,deep learning ,computer vision ,convolutional neural network ,vision transformer ,Technology ,Applied mathematics. Quantitative methods ,T57-57.97 - Abstract
Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is therefore not surprising that the automation of visual inspection in manufacturing and maintenance is heavily researched and discussed. The use of artificial intelligence as an approach to visual inspection in industrial applications has been considered for decades. Recent successes, driven by advances in deep learning, present a possible paradigm shift and have the potential to facilitate automated visual inspection, even under complex environmental conditions. For this reason, we explore the question of to what extent deep learning is already being used in the field of automated visual inspection and which potential improvements to the state of the art could be realized utilizing concepts from academic research. By conducting an extensive review of the openly accessible literature, we provide an overview of proposed and in-use deep-learning models presented in recent years. Our survey consists of 196 open-access publications, of which 31.7% are manufacturing use cases and 68.3% are maintenance use cases. Furthermore, the survey also shows that the majority of the models currently in use are based on convolutional neural networks, the current de facto standard for image classification, object recognition, or object segmentation tasks. Nevertheless, we see the emergence of vision transformer models that seem to outperform convolutional neural networks but require more resources, which also opens up new research opportunities for the future. Another finding is that in 97% of the publications, the authors use supervised learning techniques to train their models. However, with the median dataset size consisting of 2500 samples, deep-learning models cannot be trained from scratch, so it would be beneficial to use other training paradigms, such as self-supervised learning. In addition, we identified a gap of approximately three years between approaches from deep-learning-based computer vision being published and their introduction in industrial visual inspection applications. Based on our findings, we additionally discuss potential future developments in the area of automated visual inspection.
- Published
- 2024
- Full Text
- View/download PDF
8. Special Issue "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023".
- Author
-
Kim, Byung-Seo, Afzal, Muhammad Khalil, and Ullah, Rehmat
- Subjects
- *
MULTICASTING (Computer networks) , *INFORMATION technology , *SENSOR networks , *ARTIFICIAL neural networks , *DEEP learning , *BEAM steering , *INTEGRATED circuit design , *COMPUTER network security - Abstract
This document is a summary of a special issue of the journal Sensors, titled "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023." The special issue features selected papers from the 10th and 11th International Conferences on Green and Human Information Technology (ICGHITs), which were held in Korea and Thailand. The conferences focused on the theme of "Emerging Artificial Intelligent (AI)+X technology" and "Hyper Automation + Human AI" respectively. The selected papers cover various topics such as network security, routing protocols, signal detection, and clustering mechanisms, all incorporating AI-based methods. The issue also includes papers on topics like secure authentication, distance estimation in RFID systems, energy optimization in smart homes, blockchain technology, and radar signal detection. The authors emphasize the importance of both technology and humanity in advancing green and information technologies. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
9. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper.
- Author
-
Vinodkumar, Prasoon Kumar, Karabulut, Dogus, Avots, Egils, Ozcinar, Cagri, and Anbarjafari, Gholamreza
- Subjects
- *
DEEP learning , *COMPUTER vision , *GRAPH neural networks , *ARTIFICIAL intelligence , *MACHINE learning , *GENERATIVE adversarial networks - Abstract
The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. SiLK-SLAM: accurate, robust and versatile visual SLAM with simple learned keypoints
- Author
-
Yao, Jianjun and Li, Yingzhao
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.