35,376 results
Search Results
2. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback
- Author
-
Chenshu Liu, Songbin Ben, Chongwen Liu, Xianchao Li, Qingxia Meng, Yilin Hao, Qian Jiao, and Pinyi Yang
- Subjects
Paper-based cultural relics ,Conservation ,Computer vision ,Deep learning ,Strain classification ,Web application ,Fine Arts ,Analytical chemistry ,QD71-142 - Abstract
Abstract Purpose Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms’ nutrient source. Mold not only can damage papers’ structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation.
- Published
- 2024
- Full Text
- View/download PDF
3. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries
- Author
-
Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
- Published
- 2024
- Full Text
- View/download PDF
4. Automated assessment of pen and paper tests using computer vision
- Author
-
Jocovic, Vladimir, Marinkovic, Milan, Stojanovic, Sasa, and Nikolic, Bosko
- Published
- 2024
- Full Text
- View/download PDF
5. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries
- Author
-
Ryan D. Folks, Bhiken I. Naik, Donald E. Brown, and Marcel E. Durieux
- Subjects
Computer vision ,Computer extraction of time series ,Document analysis ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries.
- Published
- 2024
- Full Text
- View/download PDF
6. NSTU-BDTAKA: An open dataset for Bangladeshi paper currency detection and recognition
- Author
-
Md. Jubayar Alam Rafi, Mohammad Rony, and Nazia Majadi
- Subjects
Computer vision ,Deep learning ,Image analysis ,Taka detection ,Taka recognition ,YOLOv5 model ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Science (General) ,Q1-390 - Abstract
One of the most popular and well-established forms of payment in use today is paper money. Handling paper money might be challenging for those with vision impairments. Assistive technology has been reinventing itself throughout time to better serve the elderly and disabled people. To detect paper currency and extract other useful information from them, image processing techniques and other advanced technologies, such as Artificial Intelligence, Deep Learning, etc., can be used. In this paper, we present a meticulously curated and comprehensive dataset named ‘NSTU-BDTAKA’ tailored for the simultaneous detection and recognition of a specific object of cultural significance - the Bangladeshi paper currency (in Bengali it is called ‘Taka’). This research aims to facilitate the development and evaluation of models for both taka detection and recognition tasks, offering a rich resource for researchers and practitioners alike. The dataset is divided into two distinct components: (i) taka detection, and (ii) taka recognition. The taka detection subset comprises 3,111 high-resolution images, each meticulously annotated with rectangular bounding boxes that encompass instances of the taka. These annotations serve as ground truth for training and validating object detection models, and we adopt the state-of-the-art YOLOv5 architecture for this purpose. In the taka recognition subset, the dataset has been extended to include a vast collection of 28,875 images, each showcasing various instances of the taka captured in diverse contexts and environments. The recognition dataset is designed to address the nuanced task of taka recognition providing researchers with a comprehensive set of images to train, validate, and test recognition models. This subset encompasses challenges such as variations in lighting, scale, orientation, and occlusion, further enhancing the robustness of developed recognition algorithms. The dataset NSTU-BDTAKA not only serves as a benchmark for taka detection and recognition but also fosters advancements in object detection and recognition methods that can be extrapolated to other cultural artifacts and objects. We envision that the dataset will catalyze research efforts in the field of computer vision, enabling the development of more accurate, robust, and efficient models for both detection and recognition tasks.
- Published
- 2024
- Full Text
- View/download PDF
7. Composite Structure Detection Method for Surface Scratches on Textured Paper based on Photometric Stereoscopic Imaging.
- Author
-
Yaoshun Yue and Maohai Lin
- Subjects
COMPOSITE structures ,CCD cameras ,PHOTOMETRIC stereo ,IMAGE enhancement (Imaging systems) ,STEREO vision (Computer science) ,FEATURE extraction ,COMPUTER vision ,IMAGE intensifiers - Abstract
Along with the improvement of quality requirements industrial production, surface inspection of workpiece has gradually become an indispensable and important process in the production of the workpiece. Aiming at the traditional methods in textured paper inspection, there are problems of low efficiency and large error based on machine vision, we propose a "photometric stereo vision + fast Fourier enhancement + feature fusion" composite structure inspection method. First, as the traditional CCD camera produces obvious noise and scratches, which are difficult to distinguish from the background texture area, we propose combining the photometric stereo vision measurement algorithm to get the surface gradient information of the textured paper to obtain more gradient texture information; and then realize the secondary enhancement of the image through Fourier transform in spatial and frequency domains. Second, as the textured paper scratches are difficult to detect, the features are difficult to extract, and the threshold boundary is difficult to define, we propose dynamic threshold segmentation through multi-feature fusion to realize the surface scratch detection work of textured paper. We designed experiments using more than 300 different textured papers; and the results show that the composite structure detection method proposed in this paper is feasible and has advantages. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Facilitating free travel in the Schengen area—A position paper by the European Association for Biometrics
- Author
-
Christoph Busch, Farzin Deravi, Dinusha Frings, Els Kindt, Ralph Lessmann, Alexander Nouak, Jean Salomon, Mateus Achcar, Fernando Alonso‐Fernandez, Daniel Bachenheimer, David Bethell, Josef Bigun, Matthew Brawley, Guido Brockmann, Enrique Cabello, Patrizio Campisi, Aleksandrs Cepilovs, Miles Clee, Mickey Cohen, Christian Croll, Andrzej Czyżewski, Bernadette Dorizzi, Martin Drahansky, Pawel Drozdowski, Catherine Fankhauser, Julian Fierrez, Marta Gomez‐Barrero, Georg Hasse, Richard Guest, Ekaterina Komleva, Sebastien Marcel, Gian Luca Marcialis, Laurent Mercier, Emilio Mordini, Stefance Mouille, Pavlina Navratilova, Javier Ortega‐Garcia, Dijana Petrovska, Norman Poh, Istvan Racz, Ramachandra Raghavendra, Christian Rathgeb, Christophe Remillet, Uwe Seidel, Luuk Spreeuwers, Brage Strand, Sirra Toivonen, and Andreas Uhl
- Subjects
biometrics (access control) ,biometric template protection ,biometric applications ,computer vision ,data privacy ,image analysis for biometrics ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Due to migration, terror‐threats and the viral pandemic, various EU member states have re‐established internal border control or even closed their borders. European Association for Biometrics (EAB), a non‐profit organisation, solicited the views of its members on ways which biometric technologies and services may be used to help with re‐establishing open borders within the Schengen area while at the same time mitigating any adverse effects. From the responses received, this position paper was composed to identify ideas to re‐establish free travel between the member states in the Schengen area. The paper covers the contending needs for security, open borders and fundamental rights as well as legal constraints that any technological solution must consider. A range of specific technologies for direct biometric recognition alongside complementary measures are outlined. The interrelated issues of ethical and societal considerations are also highlighted. Provided a holistic approach is adopted, it may be possible to reach a more optimal trade‐off with regards to open borders while maintaining a high‐level of security and protection of fundamental rights. European Association for Biometrics and its members can play an important role in fostering a shared understanding of security and mobility challenges and their solutions.
- Published
- 2023
- Full Text
- View/download PDF
9. An Overview of Machine Learning in Orthopedic Surgery: An Educational Paper.
- Author
-
Padash, Sirwa, Mickley, John P., Vera Garcia, Diana V., Nugen, Fred, Khosravi, Bardia, Erickson, Bradley J., Wyles, Cody C., and Taunton, Michael J.
- Abstract
The growth of artificial intelligence combined with the collection and storage of large amounts of data in the electronic medical record collection has created an opportunity for orthopedic research and translation into the clinical environment. Machine learning (ML) is a type of artificial intelligence tool well suited for processing the large amount of available data. Specific areas of ML frequently used by orthopedic surgeons performing total joint arthroplasty include tabular data analysis (spreadsheets), medical imaging processing, and natural language processing (extracting concepts from text). Previous studies have discussed models able to identify fractures in radiographs, identify implant type in radiographs, and determine the stage of osteoarthritis based on walking analysis. Despite the growing popularity of ML, there are limitations including its reliance on "good" data, potential for overfitting, long life cycle for creation, and ability to only perform one narrow task. This educational article will further discuss a general overview of ML, discussing these challenges and including examples of successfully published models. [Display omitted] [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open- Access Papers
- Author
-
Nils Hütten, Miguel Alves Gomes, Florian Hölken, Karlo Andricevic, Richard Meyes, and Tobias Meisen
- Subjects
automated visual inspection ,industrial applications ,deep learning ,computer vision ,convolutional neural network ,vision transformer ,Technology ,Applied mathematics. Quantitative methods ,T57-57.97 - Abstract
Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is therefore not surprising that the automation of visual inspection in manufacturing and maintenance is heavily researched and discussed. The use of artificial intelligence as an approach to visual inspection in industrial applications has been considered for decades. Recent successes, driven by advances in deep learning, present a possible paradigm shift and have the potential to facilitate automated visual inspection, even under complex environmental conditions. For this reason, we explore the question of to what extent deep learning is already being used in the field of automated visual inspection and which potential improvements to the state of the art could be realized utilizing concepts from academic research. By conducting an extensive review of the openly accessible literature, we provide an overview of proposed and in-use deep-learning models presented in recent years. Our survey consists of 196 open-access publications, of which 31.7% are manufacturing use cases and 68.3% are maintenance use cases. Furthermore, the survey also shows that the majority of the models currently in use are based on convolutional neural networks, the current de facto standard for image classification, object recognition, or object segmentation tasks. Nevertheless, we see the emergence of vision transformer models that seem to outperform convolutional neural networks but require more resources, which also opens up new research opportunities for the future. Another finding is that in 97% of the publications, the authors use supervised learning techniques to train their models. However, with the median dataset size consisting of 2500 samples, deep-learning models cannot be trained from scratch, so it would be beneficial to use other training paradigms, such as self-supervised learning. In addition, we identified a gap of approximately three years between approaches from deep-learning-based computer vision being published and their introduction in industrial visual inspection applications. Based on our findings, we additionally discuss potential future developments in the area of automated visual inspection.
- Published
- 2024
- Full Text
- View/download PDF
11. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper.
- Author
-
Vinodkumar, Prasoon Kumar, Karabulut, Dogus, Avots, Egils, Ozcinar, Cagri, and Anbarjafari, Gholamreza
- Subjects
- *
DEEP learning , *COMPUTER vision , *GRAPH neural networks , *ARTIFICIAL intelligence , *MACHINE learning , *GENERATIVE adversarial networks - Abstract
The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Editorial for the Special Issue on "Feature Papers in Section AI in Imaging".
- Author
-
Fernández-Caballero, Antonio
- Subjects
GENERATIVE artificial intelligence ,COMPUTER vision ,ARTIFICIAL intelligence ,INTELLIGENT agents ,COMPUTER graphics ,DEEP learning ,EXPERT systems - Published
- 2024
- Full Text
- View/download PDF
13. Research and Evaluation on an Optical Automatic Detection System for the Defects of the Manufactured Paper Cups.
- Author
-
Wang, Ping, Lee, Yang-Han, Tseng, Hsien-Wei, and Yang, Cheng-Fu
- Subjects
- *
COMPUTER vision , *SURFACE defects , *LIGHT sources , *IMAGE sensors , *IMAGE processing ,RESEARCH evaluation - Abstract
In this paper, the paper cups were used as the research objects, and the machine vision detection technology was combined with different image processing techniques to investigate a non-contact optical automatic detection system to identify the defects of the manufactured paper cups. The combined ring light was used as the light source, an infrared (IR) LED matrix panel was used to provide the IR light to constantly highlight the outer edges of the detected objects, and a multi-grid pixel array was used as the image sensor. The image processing techniques, including the Gaussian filter, Sobel operator, Binarization process, and connected component, were used to enhance the inspection and recognition of the defects existing in the produced paper cups. There were three different detection processes for paper cups, which were divided into internal, external, and bottom image acquisition processes. The present study demonstrated that all the detection processes could clearly detect the surface defect features of the manufactured paper cups, such as dirt, burrs, holes, and uneven thickness. Our study also revealed that the average time for the investigated Automatic Optical Detection to detect the defects on the paper cups was only 0.3 s. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Method and Installation for Efficient Automatic Defect Inspection of Manufactured Paper Bowls.
- Author
-
Yu, Shaoyong, Lee, Yang-Han, Chen, Cheng-Wen, Gao, Peng, Xu, Zhigang, Chen, Shunyi, and Yang, Cheng-Fu
- Subjects
COMPUTER vision ,AUTOMATIC optical inspection ,SEMICONDUCTOR detectors ,IMAGE processing ,PIXELS ,SENSOR arrays ,LIGHT sources - Abstract
Various techniques were combined to optimize an optical inspection system designed to automatically inspect defects in manufactured paper bowls. A self-assembled system was utilized to capture images of defects on the bowls. The system employed an image sensor with a multi-pixel array that combined a complementary metal-oxide semiconductor and a photo detector. A combined ring light served as the light source, while an infrared (IR) LED matrix panel was used to provide constant IR light to highlight the outer edges of the objects being inspected. The techniques employed in this study to enhance defect inspections on produced paper bowls included Gaussian filtering, Sobel operators, binarization, and connected components. Captured images were processed using these technologies. Once the non-contact inspection system's machine vision method was completed, defects on the produced paper bowls were inspected using the system developed in this study. Three inspection methods were used in this study: internal inspection, external inspection, and bottom inspection. All three methods were able to inspect surface features of produced paper bowls, including dirt, burrs, holes, and uneven thickness. The results of our study showed that the average time required for machine vision inspections of each paper bowl was significantly less than the time required for manual inspection. Therefore, the investigated machine vision system is an efficient method for inspecting defects in fabricated paper bowls. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. Special issue on intelligent systems: ISMIS 2022 selected papers.
- Author
-
Ceci, Michelangelo, Flesca, Sergio, Manco, Giuseppe, and Masciari, Elio
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,DECISION support systems ,KNOWLEDGE representation (Information theory) ,COMPUTER vision ,DEEP learning - Abstract
This document is a special issue of the Journal of Intelligent Information Systems, focusing on the selected papers from the International Symposium on Methodologies for Intelligent Systems (ISMIS 2022). The symposium, held in Cosenza, Italy, showcased research on various topics related to artificial intelligence, including decision support, knowledge representation, machine learning, computer vision, and more. The special issue includes eleven papers that have undergone rigorous peer-reviewing and cover a wide range of research topics, such as deep learning, anomaly detection, malware detection, sentiment classification, and healthcare professionals' burnout. The authors express their gratitude to the contributors and reviewers for their valuable contributions. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
16. Valmet to Supply IQ Web Inspection System to Yueyang Forest & Paper Co., Ltd.
- Subjects
- *
PAPER industry , *PRODUCT quality , *LED lighting , *COMPUTER vision , *INFORMATION retrieval - Published
- 2024
17. Facilitating free travel in the Schengen area—A position paper by the European Association for Biometrics.
- Author
-
Busch, Christoph, Deravi, Farzin, Frings, Dinusha, Kindt, Els, Lessmann, Ralph, Nouak, Alexander, Salomon, Jean, Achcar, Mateus, Alonso‐Fernandez, Fernando, Bachenheimer, Daniel, Bethell, David, Bigun, Josef, Brawley, Matthew, Brockmann, Guido, Cabello, Enrique, Campisi, Patrizio, Cepilovs, Aleksandrs, Clee, Miles, Cohen, Mickey, and Croll, Christian
- Subjects
- *
BIOMETRY , *DATA privacy , *BORDER security , *NONPROFIT organizations , *CIVIL rights - Abstract
Due to migration, terror‐threats and the viral pandemic, various EU member states have re‐established internal border control or even closed their borders. European Association for Biometrics (EAB), a non‐profit organisation, solicited the views of its members on ways which biometric technologies and services may be used to help with re‐establishing open borders within the Schengen area while at the same time mitigating any adverse effects. From the responses received, this position paper was composed to identify ideas to re‐establish free travel between the member states in the Schengen area. The paper covers the contending needs for security, open borders and fundamental rights as well as legal constraints that any technological solution must consider. A range of specific technologies for direct biometric recognition alongside complementary measures are outlined. The interrelated issues of ethical and societal considerations are also highlighted. Provided a holistic approach is adopted, it may be possible to reach a more optimal trade‐off with regards to open borders while maintaining a high‐level of security and protection of fundamental rights. European Association for Biometrics and its members can play an important role in fostering a shared understanding of security and mobility challenges and their solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Image processing based quality control of coated paper folding.
- Author
-
Pál, Magdolna, Novaković, Dragoljub, Dedijer, Sandra, Koltai, László, Jurič, Ivana, Vladić, Gojko, and Kašiković, Nemanja
- Subjects
- *
PAPER coatings , *IMAGE processing , *QUALITY control , *PAPER arts , *STRAINS & stresses (Mechanics) , *COMPUTER vision - Abstract
During the folding process substrates are exposed to high-localized stresses, which in the case of coated papers and boards, can lead to decreased aesthetic features or complete loss of functionality. Production efficiency of the folding process could be improved by an automated, computer vision-based inspection system. For such a task, different existing computer-aided fold-crack evaluation approaches were analyzed. A detailed research was conducted to propose an image processing based fold cracking assessment via finding optimal sample preparation and digitization techniques and developing an algorithm for the digital image analysis and feature extraction. The analysis of the applicability of different sample preparation and digitization parameters, as well as the proposed digital image feature, was done by correlation evaluation, one-way analysis of variance (ANOVA) and corresponding post hoc tests. The results indicated that the developed algorithm fulfils the set requirements and the proposed feature of digitized samples faithfully describes the analyzed fold-cracks. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
19. Closer Looking: Computer Vision in Material Studies of Art.
- Author
-
Mintie, Katherine, Messier, Paul, and Crockett, Damon
- Subjects
COMPUTER vision ,PRINTS ,ARTISTS ,PHOTOGRAPHIC printing ,SURFACE texture ,ART reproduction - Abstract
The article discusses the use of computer vision in analyzing the material nature of art. Topics mentioned include the importance of understanding how artworks are made and the inspirations of artists, the examination of surface textures of photographic prints, and the photomechanical reproduction of images.
- Published
- 2024
- Full Text
- View/download PDF
20. Q3D: a complete solution for quality control and inspection in additive manufacturing processes
- Author
-
Kladovasilakis, Nikolaos, Charalampous, Paschalis, Kostavelis, Ioannis, and Tzovaras, Dimitrios
- Published
- 2024
- Full Text
- View/download PDF
21. VIBRANT-WALK: An algorithm to detect plagiarism of figures in academic papers.
- Author
-
Parmar, Shashank and Jain, Bhavya
- Subjects
- *
PLAGIARISM , *COMPUTER algorithms , *ALGORITHMS , *COMPUTER vision , *RANDOM walks - Abstract
Detecting plagiarism in academic papers is crucial for maintaining academic integrity, preserving the originality of published work, and safeguarding intellectual property. While existing applications excel at text plagiarism detection, they fall short when it comes to image plagiarism. This paper introduces a novel algorithm, named "VIBRANT-WALK," designed to detect image plagiarism in academic manuscripts. The challenge of identifying plagiarized images is formidable, requiring a unique approach. Traditional Computer Vision algorithms, proficient in image similarity tasks, face limitations in determining whether an image has been previously used in an article. To address this, the proposed algorithm leverages a repository of all published article pages, focusing on absolute identicality rather than image similarity. The algorithm comprises two stages. In the first stage, a "Vibrancy Matrix" is created through image preprocessing, aiding in contour determination. The second stage involves pixel-by-pixel comparison with images from published manuscripts. To enhance efficiency, the algorithm initiates comparisons from the pixel with the highest score in the Vibrancy Matrix, followed by pixel comparisons through random walks, significantly reducing complexity. To conduct the study, a custom dataset was compiled from 69 research articles, capturing snapshots of each page and figure. Overall, we present 485 unique test cases where we can test the accuracy and efficiency of the algorithm. The lack of publicly available datasets necessitated this approach. The proposed algorithm outperformed the existing models and algorithms in this field by achieving an overall accuracy of 94.8% on the collated dataset, identifying 460 instances of plagiarism out of the 485 test cases. The algorithm also demonstrated a 100% accuracy rate in avoiding false positives. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. PaperNet: A Dataset and Benchmark for Fine-Grained Paper Classification.
- Author
-
Yue, Tan, Li, Yong, Shi, Xuzhao, Qin, Jiedong, Fan, Zijiao, and Hu, Zonghai
- Subjects
NATURAL language processing ,COMPUTER vision ,VISUAL fields ,CLASSIFICATION - Abstract
Document classification is an important area in Natural Language Processing (NLP). Because a huge amount of scientific papers have been published at an accelerating rate, it is beneficial to carry out intelligent paper classifications, especially fine-grained classification for researchers. However, a public scientific paper dataset for fine-grained classification is still lacking, so the existing document classification methods have not been put to the test. To fill this vacancy, we designed and collected the PaperNet-Dataset that consists of multi-modal data (texts and figures). PaperNet 1.0 version contains hierarchical categories of papers in the fields of computer vision (CV) and NLP, 2 coarse-grained and 20 fine-grained (7 in CV and 13 in NLP). We ran current mainstream models on the PaperNet-Dataset, along with a multi-modal method that we propose. Interestingly, none of these methods reaches an accuracy of 80% in fine-grained classification, showing plenty of room for improvement. We hope that PaperNet-Dataset will inspire more work in this challenging area. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Citation impact analysis of research papers that appear in oral and poster sessions.
- Author
-
Ke, Shih-Wen, Lin, Wei-Chao, Tsai, Chih-Fong, and Hu, Ya-Han
- Subjects
COMPUTER vision ,CONFERENCE papers ,CITATION analysis ,POSTER presentations ,COMPUTER science conferences ,CONFERENCES & conventions - Abstract
Purpose -- Conference publications are an important aspect of research activities. There are generally both oral presentations and poster sessions at large international conferences. One can hypothesise that, for the same conferences, the papers presented in oral sessions should have a higher research impact than the papers presented in poster sessions. However, there has been no related study examining the validity of this hypothesis. In other words, the difference of research impact between papers presented orally or during poster sessions has not been discussed in literature. Therefore, the purpose of this paper is to conduct a citation analysis to compare the research impact of papers presented in oral and poster sessions. Design/methodology/approach -- In this paper, data from three leading conferences in the field of computer vision are examined, namely CVPR (2011 and 2012), ICCV (2011) and ECCV (2012). Several types of citation-related statistics are collected, including the number of highly cited papers (i.e. high number of citations) presented in oral and poster sessions, the total citations of both types of papers, the average citations of oral and poster papers, and the average citations of each frequently cited paper of both types. Findings -- There are three main findings. First, a larger proportion of highly cited papers are from oral sessions than poster sessions. Second, the average number of citations per paper is larger for those presented in oral sessions than poster sessions. Third, the average number of citations for highly cited papers presented in oral sessions is not necessarily greater than for the ones presented in poster sessions. Originality/value -- The originality of this paper is that it is the first attempt to examine the differences of citation impacts of conference papers presented in oral and poster sessions. The findings of this study will allow future bibliometrics research to further explore this related issue for longer periods and different fields. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
24. Interactive paper tearing.
- Author
-
Schreck, Camille, Rohmer, Damien, and Hahmann, Stefanie
- Subjects
- *
PAPER textiles , *TEXTURE analysis (Image processing) , *COMPUTER graphics , *COMPUTER vision , *ANIMATION (Cinematography) - Abstract
We propose an efficient method to model paper tearing in the context of interactive modeling. The method uses geometrical information to automatically detect potential starting points of tears. We further introduce a new hybrid geometrical and physical-based method to compute the trajectory of tears while procedurally synthesizing high resolution details of the tearing path using a texture based approach. The results obtained are compared with real paper and with previous studies on the expected geometric paths of paper that tears. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
25. Watching plants grow – a position paper on computer vision and Arabidopsis thaliana
- Author
-
Jonathan Bell and Hannah M. Dee
- Subjects
computer vision ,Arabidopsis thaliana ,image analysis ,image segmentation ,image sequences ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Computer software ,QA76.75-76.765 - Abstract
The authors present a comprehensive overview of image processing and analysis work done to support research into the model flowering plant Arabidopsis thaliana. Beside the plant's importance in biological research, using image analysis to obtain experimental measurements of it is an interesting vision problem in its own right, involving the segmentation and analysis of sequences of images of objects whose shape varies between individual specimens and also changes over time. While useful measurements can be obtained by segmenting a whole plant from the background, they suggest that the increased range and precision of measurements made available by leaf‐level segmentation makes this a problem well worth solving. A variety of approaches have been tried by biologists as well as computer vision researchers. This is an interdisciplinary area and the computer vision community has an important contribution to make. They suggest that there is a need for publicly available datasets with ground truth annotations to enable the evaluation of new approaches and to support the building of training data for modern data‐driven computer vision approaches, which are those most likely to result in the kind of fully automated systems that will be of use to biologists.
- Published
- 2017
- Full Text
- View/download PDF
26. Measurement in Machine Vision Editorial Paper.
- Author
-
Sergiyenko, Oleg, Flores-Fuentes, Wendy, Rodríguez-Quiñonez, Julio C., Mercorelli, Paolo, Kawabe, Tohru, and Bhateja, Vikrant
- Subjects
- *
COMPUTER vision , *CYBER physical systems , *INTERPOLATION algorithms , *ARTIFICIAL intelligence , *OPTICAL computing , *SENSORY memory , *DISPLACEMENT (Mechanics) - Abstract
Measurement related to different machine vision functions is the base for developing of cyber-physical systems able to see and make decisions. These kinds of systems are emerging in all areas of our daily lives. They can be found in the medical area, in industry, in the agriculture, in all those interconnected cloud computing-based systems related to flying/terrestrial robotics, navigation, automated surgery, smart cities, smart health monitoring, etc. All of them are extremely dependent on the same: adequate coordinates measurement, properly selected data processing and data fusion algorithms, evaluation procedures for performance analysis of measurement within Machine Vision systems, processes and algorithms (both traditional and artificial intelligence), mathematical models for 3D-measurement purposes (measurement of displacements, surface profiles, deformations, data augmentation/interpolation, etc.), and distributed visual measurement systems, as well as distributed memory and sensory part. Cyber-physical systems can be implemented on almost any application, especially on those dotted by robots and automated guided devices (from aerospace applications to domestic cleaners). The success of the measurement process depends on the kind of sensors and their optoelectronics characteristics and intrinsic parameters, as well as their respective operating and processing. The correct approach selection for the application, the data acquisition and collection efficiency, the data processing algorithms, the hardware processors response time, and the intelligent auto adaptability to changing environments or conditions. Recently, the emergence of artificial intelligence algorithms and the internet of things have powerful development of such systems, highlighting the importance and the impact of the measurement accuracy related to machine vision performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. A field test of computer-vision-based gaze estimation in psychology.
- Author
-
Valtakari, Niilo V., Hessels, Roy S., Niehorster, Diederick C., Viktorsson, Charlotte, Nyström, Pär, Falck-Ytter, Terje, Kemner, Chantal, and Hooge, Ignace T. C.
- Subjects
GAZE ,COMPUTER science conferences ,TECHNICAL literature ,PSYCHOLOGY ,VIDEO recording ,CONFERENCE papers - Abstract
Computer-vision-based gaze estimation refers to techniques that estimate gaze direction directly from video recordings of the eyes or face without the need for an eye tracker. Although many such methods exist, their validation is often found in the technical literature (e.g., computer science conference papers). We aimed to (1) identify which computer-vision-based gaze estimation methods are usable by the average researcher in fields such as psychology or education, and (2) evaluate these methods. We searched for methods that do not require calibration and have clear documentation. Two toolkits, OpenFace and OpenGaze, were found to fulfill these criteria. First, we present an experiment where adult participants fixated on nine stimulus points on a computer screen. We filmed their face with a camera and processed the recorded videos with OpenFace and OpenGaze. We conclude that OpenGaze is accurate and precise enough to be used in screen-based experiments with stimuli separated by at least 11 degrees of gaze angle. OpenFace was not sufficiently accurate for such situations but can potentially be used in sparser environments. We then examined whether OpenFace could be used with horizontally separated stimuli in a sparse environment with infant participants. We compared dwell measures based on OpenFace estimates to the same measures based on manual coding. We conclude that OpenFace gaze estimates may potentially be used with measures such as relative total dwell time to sparse, horizontally separated areas of interest, but should not be used to draw conclusions about measures such as dwell duration. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. AI Machine Vision based Oven White Paper Color Classification and Label Position Real-time Monitoring System to Check Direction.
- Author
-
Hee-Chul Kim, Youn-Saup Yoon, and Yong-Mo Kim
- Subjects
COMPUTER vision ,DEEP learning ,JOB classification ,MANUFACTURING process automation ,ARTIFICIAL intelligence ,COLOR image processing - Abstract
We develop a vision system for batch inspection by oven white paper model color by manufacturing a machine vision system for the oven manufacturing automation process. In the vision system, white paper object detection (spring), color clustering, and histogram extraction are performed. In addition, for the automated process of home appliances, we intend to develop an automatic mold combination detection algorithm that inspects the label position and direction (angle/coordinate) using deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. Intelligent computer vision system for segregating recyclable waste papers
- Author
-
Rahman, Mohammad Osiur, Hussain, Aini, Scavino, Edgar, Basri, Hassan, and Hannan, M.A.
- Subjects
- *
PAPER recycling , *COMPUTER vision , *IMAGE processing , *ARTIFICIAL intelligence , *SORTING (Electronic computers) , *MACHINE learning , *PATTERN recognition systems , *PAPER chemicals - Abstract
Abstract: This article explores the application of image processing techniques in recyclable waste paper sorting. In recycling, waste papers are segregated into various grades as they are subjected to different recycling processes. Highly sorted paper streams facilitate high quality end products and save processing chemicals and energy. From 1932 to 2009, different mechanical and optical paper sorting methods have been developed to fill the paper sorting demand. Still, in many countries including Malaysia, waste papers are sorted into different grades using a manual sorting system. Because of inadequate throughput and some major drawbacks of mechanical paper sorting systems, the popularity of optical paper sorting systems has increased. Automated paper sorting systems offer significant advantages over human inspection in terms of worker fatigue, throughput, speed, and accuracy. This research attempts to develop a smart vision sensing system that is able to separate the different grades of paper using first-order features. To construct a template database, a statistical approach with intra-class and inter-class variation techniques are applied to the feature selection process. Finally, the K-nearest neighbor (KNN) algorithm is applied for paper object grade identification. The remarkable achievement obtained with the method is the accurate identification and dynamic sorting of all grades of papers using simple image processing techniques. [Copyright &y& Elsevier]
- Published
- 2011
- Full Text
- View/download PDF
30. 基于机器视觉的瓦楞纸压制送料 机器人的控制.
- Author
-
陈贤 and 夏建春
- Subjects
ROBOT motion ,CUTTING machines ,PAPER arts ,COMPUTER vision ,PREDICTION models ,CARDBOARD ,ROBOT industry - Abstract
Copyright of China Pulp & Paper is the property of China Pulp & Paper Magazines Publisher and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2021
- Full Text
- View/download PDF
31. An interactive assessment framework for residential space layouts using pix2pix predictive model at the early-stage building design
- Author
-
Mostafavi, Fatemeh, Tahsildoost, Mohammad, Zomorodian, Zahra Sadat, and Shahrestani, Seyed Shayan
- Published
- 2024
- Full Text
- View/download PDF
32. Synthetic images generation for semantic understanding in facility management
- Author
-
Rampini, Luca and Re Cecconi, Fulvio
- Published
- 2024
- Full Text
- View/download PDF
33. Evaluation and decision-making framework for concrete surface quality based on computer vision and ontology
- Author
-
Chai, Ying Tao and Wang, Ting-Kwei
- Published
- 2023
- Full Text
- View/download PDF
34. Enhancing IoT Network Security: Unveiling the Power of Self-Supervised Learning against DDoS Attacks.
- Author
-
Almaraz-Rivera, Josue Genaro, Cantoral-Ceballos, Jose Antonio, and Botero, Juan Felipe
- Subjects
SUPERVISED learning ,DENIAL of service attacks ,COMPUTER network security ,INTRUSION detection systems (Computer security) ,COMPUTER network traffic ,INTERNET of things ,ELECTRONIC paper - Abstract
The Internet of Things (IoT), projected to exceed 30 billion active device connections globally by 2025, presents an expansive attack surface. The frequent collection and dissemination of confidential data on these devices exposes them to significant security risks, including user information theft and denial-of-service attacks. This paper introduces a smart, network-based Intrusion Detection System (IDS) designed to protect IoT networks from distributed denial-of-service attacks. Our methodology involves generating synthetic images from flow-level traffic data of the Bot-IoT and the LATAM-DDoS-IoT datasets and conducting experiments within both supervised and self-supervised learning paradigms. Self-supervised learning is identified in the state of the art as a promising solution to replace the need for massive amounts of manually labeled data, as well as providing robust generalization. Our results showcase that self-supervised learning surpassed supervised learning in terms of classification performance for certain tests. Specifically, it exceeded the F1 score of supervised learning for attack detection by 4.83% and by 14.61% in accuracy for the multiclass task of protocol classification. Drawing from extensive ablation studies presented in our research, we recommend an optimal training framework for upcoming contrastive learning experiments that emphasize visual representations in the cybersecurity realm. This training approach has enabled us to highlight the broader applicability of self-supervised learning, which, in some instances, outperformed supervised learning transferability by over 5% in precision and nearly 1% in F1 score. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
35. A Surveillance-and-Blockchain-based Tracking System for Mitigation of Baggage Mishandling at Smart Airports.
- Author
-
Jiang, Yirui, Tran, Trung Hieu, and Williams, Leon
- Subjects
LUGGAGE ,COMPUTER vision ,RADIO frequency identification systems ,BLOCKCHAINS ,ELECTRONIC paper - Abstract
Baggage mishandling has received much attention by airport operators. Traditional baggage tracking methods (e.g., manual, barcode, and radio-frequency identification) have not been able to deal with the challenge of baggage mishandling due to their unreliable and inefficient performance. Baggage data comprises sensitive personal information, linking individuals to their personal details and travel history, with the potential to expose security vulnerabilities. This paper proposes a smart baggage tracking system based on surveillance and blockchain technology for mitigation of baggage mishandling at airports. Surveillance including a network of airport cameras is used to recognize and monitor locations of baggage and passengers. Blockchain technology is utilized to manage and process baggage and passenger databases, guaranteeing the security, privacy, and transparency of baggage information. Surveillance-captured images of baggage and passengers undergo processing through computer vision algorithms to determine the current whereabouts of baggage, subsequently synchronized and updated within the blockchain storage. Additionally, a user interface is developed to present real-time baggage tracking information. Preliminary experiments have demonstrated the applicability of the smart baggage tracking system for airports. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
36. 基于机器视觉的纸病诊断系统 光源设计方案研究.
- Author
-
汤伟, 王锦韫, and 冯波
- Subjects
LIGHT sources ,COMPUTER vision ,OPTICAL properties ,EXPERIMENTAL design - Abstract
Copyright of China Pulp & Paper is the property of China Pulp & Paper Magazines Publisher and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2022
- Full Text
- View/download PDF
37. Application of computer vision for construction progress monitoring: a qualitative investigation
- Author
-
Moragane, H.P.M.N.L.B., Perera, B.A.K.S., Palihakkara, Asha Dulanjalie, and Ekanayake, Biyanka
- Published
- 2024
- Full Text
- View/download PDF
38. Improved HardNet and Stricter Outlier Filtering to Guide Reliable Matching.
- Author
-
Meng Xu, Chen Shen, Jun Zhang, Zhipeng Wang, Zhiwei Ruan, Stefan Poslad, and Pengfei Xu
- Subjects
DEEP learning ,IMAGE registration ,COMPUTER vision ,IMAGE retrieval ,CONFERENCE papers ,STATISTICAL sampling ,MICROPOLAR elasticity - Abstract
As the fundamental problem in the computer vision area, image matching has wide applications in pose estimation, 3D reconstruction, image retrieval, etc. Suffering from the influence of external factors, the process of image matching using classical local detectors, e.g., scale-invariant feature transform (SIFT), and the outlier filtering approaches, e.g., Random sample consensus (RANSAC), show high computation speed and pool robustness under changing illumination and viewpoints conditions, while image matching approaches with deep learning strategy (such as HardNet, OANet) display reliable achievements in large-scale datasets with challenging scenes. However, the past learning-based approaches are limited to the distinction and quality of the dataset and the training strategy in the image-matching approaches. As an extension of the previous conference paper, this paper proposes an accurate and robust image matching approach using fewer training data in an end-to-end manner, which could be used to estimate the pose error This research first proposes a novel dataset cleaning and construction strategy to eliminate the noise and improve the training efficiency; Secondly, a novel loss named quadratic hinge triplet loss (QHT) is proposed to gather more effective and stable feature matching; Thirdly, in the outlier filtering process, the stricter OANet and bundle adjustment are applied for judging samples by adding the epipolar distance constraint and triangulation constraint to generate more outstanding matches; Finally, to recall the matching pairs, dynamic guided matching is used and then submit the inliers after the PyRANSAC process. Multiple evaluation metrics are used and reported in the 1st place in the Track1 of CVPR Image-Matching Challenge Workshop. The results show that the proposed method has advanced performance in large-scale and challenging Phototourism benchmark. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
39. Intelligent Noncontact Structural Displacement Detection Method Based on Computer Vision and Deep Learning.
- Author
-
Liu, Hongbo, Zhang, Fan, Ma, Rui, Wang, Longxuan, Chen, Zhihua, Zhang, Qian, and Guo, Liulu
- Subjects
DISPLACEMENT (Psychology) ,COMPUTER vision ,EUCLIDEAN distance ,COMPUTER simulation ,DEEP learning ,BAMBOO - Abstract
Accurate identification of structural displacements is important for structural state assessment and performance evaluation. This paper proposes a real-time structural displacement detection model based on computer vision and deep learning. The model consists of three stages: identification, tracking, and displacement resolution. First, the displacement target is identified and tracked by the improved YOLO v7 algorithm and the improved DeepSORT algorithm. Then, the Euclidean distance method based on inverse perspective mapping (IPM-ED) is proposed for the analytical conversion of the displacement. Next, the accuracy and effectiveness of this displacement detection model are evaluated through four groups of bamboo axial compression tests. A comparative analysis is conducted between the IPM-ED displacement analysis method and the commonly used ED displacement analysis method. Finally, the robustness of this method is tested by using a cable breakage test of a cable dome structure as an application case. The research results demonstrate that the maximum average error of the four groups of bamboo displacement tests is only 3.10 mm, and the maximum relative error of peak displacement is only 6.54 mm. The RMSE basically stays around 3.5 mm. The maximum displacement error in the application case is only 4.91 mm, with a maximum MAPE of 4.94%. In addition, the error percentage under the IPM-ED algorithm is basically within 5%, while the error percentage of the ED algorithm is more than 10%. The method in this paper achieves efficient and intelligent identification of structural displacements in a non-contact manner. The proposed method is suitable for environments where the contact displacement sensor is easily affected by vibration, the site is complex and requires additional displacement sensor fixing equipment, the displacement sensor with super-high structure is unsafe to deploy, and the contact displacement sensor in narrow space is inconvenient to deploy, so it has broad application prospects. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Pantograph Slider Detection Architecture and Solution Based on Deep Learning.
- Author
-
Guo, Qichang, Tang, Anjie, and Yuan, Jiabin
- Subjects
IMAGE processing ,TECHNICAL specifications ,COMPUTER vision ,DEEP learning ,PANTOGRAPH - Abstract
Railway transportation has been integrated into people's lives. According to the "Notice on the release of the General Technical Specification of High-speed Railway Power Supply Safety Testing (6C System) System" issued by the National Railway Administration of China in 2012, it is required to install pantograph and slide monitoring devices in high-speed railway stations, station throats and the inlet and exit lines of high-speed railway sections, and it is required to detect the damage of the slider with high precision. It can be seen that the good condition of the pantograph slider is very important for the normal operation of the railway system. As a part of providing power for high-speed rail and subway, the pantograph must be paid attention to in railway transportation to ensure its integrity. The wear of the pantograph is mainly due to the contact power supply between the slide block and the long wire during high-speed operation, which inevitably produces scratches, resulting in depressions on the upper surface of the pantograph slide block. During long-term use, because the depression is too deep, there is a risk of fracture. Therefore, it is necessary to monitor the slider regularly and replace the slider with serious wear. At present, most of the traditional methods use automation technology or simple computer vision technology for detection, which is inefficient. Therefore, this paper introduces computer vision and deep learning technology into pantograph slide wear detection. Specifically, this paper mainly studies the wear detection of the pantograph slider based on deep learning and the main purpose is to improve the detection accuracy and improve the effect of segmentation. From a methodological perspective, this paper employs a linear array camera to enhance the quality of the data sets. Additionally, it integrates an attention mechanism to improve segmentation performance. Furthermore, this study introduces a novel image stitching method to address issues related to incomplete images, thereby providing a comprehensive solution. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. A deep learning counting model applied to quality control
- Author
-
Jaramillo, Juan R.
- Published
- 2023
- Full Text
- View/download PDF
42. Review Paper on Enhancing Communication: Machine Learning for Live Sign-to-Text Translation.
- Author
-
Rangari, Aditya, Bhide, Devendra, More, Vaibhav, Wahurwagh, Kunal, Shirbhate, Dhiraj, and Andhare, Chetan
- Subjects
MACHINE learning ,INTERPERSONAL communication ,NATURAL language processing ,CONVOLUTIONAL neural networks ,SIGN language ,COMPUTER vision - Abstract
For those who are deaf or hard of hearing, sign language is essential as their main form of communication. using ease, sign language gestures may be translated into written or spoken words in real time using the Sign Language Translator, and vice versa. This system interprets and communicates sign language gestures by utilising computer vision and natural language processing (NLP). Given that sign language uses a wide range of hand movements to communicate meaning, it might be difficult to identify certain motions by looking for patterns. Individuals communicate and engage using a variety of gestures. In this study, a human-computer interface that can recognise motions in sign language and properly translate them into text is shown. The suggested method improves interpersonal communication by using convolutional neural networks and long short-term memory networks for gesture interpretation and detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
43. Pathological test type and chemical detection using deep neural networks: a case study using ELISA and LFA assays
- Author
-
Hoque Tania, Marzia, Kaiser, M. Shamim, Abu-Hassan, Kamal, and Hossain, M. A.
- Published
- 2023
- Full Text
- View/download PDF
44. Automated paper impurities evaluation using feature representations based on ADMM sparse codes.
- Author
-
Qizi, Huangpeng, Huang, Wenwei, and Shi, Hanyi
- Subjects
- *
COMPUTER vision , *SPARSE approximations , *SPARSE graphs , *MATHEMATICAL optimization , *INFORMATION resources management - Abstract
To automatic detect and characterize paper impurities with computer vision, we present a novel two parts evaluation procedure with feature representations using Alternating Direction Method of Multipliers (ADMM) sparse codes. The method is based on an offline training step to obtain sparse coefficients and codebooks via learning extracted features with ADMM optimization, followed by an online detection step to use linear SVM classifier to assess defective paper samples from no-defective ones. Our approach bridges the gap between paper impurities evaluation and sparse feature representations, taking advantages of existing ADMM algorithms to handle sparse codes problem. We compare different feature descriptors and sparse code methods to implement the procedure and experimentally validate it on a dataset of 11 paper classes. Experiment results show that the proposed method is competitive and effective in terms of evaluation accuracy and speed. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
45. 基于木桶理论的纸病检测系统 架构设计.
- Author
-
汤伟, 成爽爽, 冯波, 曲蕴慧, and 王孟效
- Subjects
COMPUTER vision ,ARCHITECTURAL designs ,CONSTRUCTION planning ,CAMERAS ,EQUILIBRIUM - Abstract
Copyright of China Pulp & Paper is the property of China Pulp & Paper Magazines Publisher and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2021
- Full Text
- View/download PDF
46. Irregular Rotation Deformation from Paper Scanning: An Investigation.
- Author
-
Nasrudin, Mohammad Faidzul, Wahdan, Omar M., and Omar, Khairuddin
- Subjects
SCANNING systems ,ARTIFICIAL intelligence ,DIGITAL images ,COMPUTER vision ,COMPUTER science ,DIGITAL computer simulation - Abstract
Abstract: Image acquisition has great influence on the performance of any computer vision application. Different methods can be utilized to acquire the digital image of a paper, whilst scanning scheme is among the most attractive methods. This attractiveness is because of the fewer types of potential deformations and the low cost of the scanning devices, e.g. flatbed scanners. However, paper is commonly placed imperfectly on the scanner. This slight rotation is not usually based on a pivot around the paper''s geometrical center (the well known regular rotation) but instead it is based on a pivot placed at the corner of the paper. Thus, the result is a digital image that is deformed with an “irregular rotation”. The characteristic of this deformation phenomenon is currently unknown to computer vision scientists. In this paper we provide an extensive investigation of this deformation. In addition, a new set of equations that sway and measure the transformation is proposed. Our investigation leads to the conclusion that the “irregular rotation” phenomenon produces a shear transformation. Furthermore, the experimental results confirm the theoretical findings. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
47. The people behind the papers – Thomas Naert and Soeren Lienkamp.
- Subjects
- *
DEEP learning , *POLYCYSTIC kidney disease , *CYSTIC kidney disease , *DEVELOPMENTAL biology , *COMPUTER vision - Published
- 2021
- Full Text
- View/download PDF
48. Apply or Die: On the Role and Assessment of Application Papers in Visualization.
- Author
-
Weber, Gunther H., Carpendale, Sheelagh, Ebert, David, Fisher, Brian, Hagen, Hans, Shneiderman, Ben, and Ynnerman, Anders
- Subjects
DATA visualization ,COMPUTER graphics ,DATA modeling ,COMPUTER vision ,VISUALIZATION ,COMPUTER software - Abstract
Application-oriented papers provide an important way to invigorate and cross-pollinate the visualization field, but the exact criteria for judging an application paper's merit remain an open question. This article builds on a panel at the 2016 IEEE Visualization Conference entitled "Application Papers: What Are They, and How Should They Be Evaluated?" that sought to gain a better understanding of prevalent views in the visualization community. This article surveys current trends that favor application papers, reviews the benefits and contributions of this paper type, and discusses their assessment in the review process. It concludes with recommendations to ensure that the visualization community is more inclusive to application papers. [ABSTRACT FROM PUBLISHER]
- Published
- 2017
- Full Text
- View/download PDF
49. Automated serial sectioning applied to 3D paper structure analysis.
- Author
-
WILTSCHE, M., DONOSER, M., KRITZINGER, J., and BAUER, W.
- Subjects
AUTOMATED library serials contol systems ,MICROSCOPY ,AUTOMATION ,COMPUTER vision ,SPATIAL analysis (Statistics) ,MORPHOLOGY ,FIBERS - Abstract
A better understanding of paper properties requires a detailed knowledge about the spatial arrangement of its constituent materials in its structure. This paper presents a novel approach for the analysis of the three-dimensional paper structure at the fibre level. A technique combining a rotary microtome and an optical microscopy was developed allowing serial sectioning of hundreds of cuts. The microscope is fixed on a moveable stage and mounted in front of a microtome. Repeatedly, thin slices are cut off an embedded paper sample and the cut block surface is scanned in a fully automated process. The prototype built is able to digitize paper samples with a size of more than 1 cm at a possible three-dimensional resolution below 1 μm. Advanced computer vision methods are applied to extract relevant information from the digitized samples. Currently, the most important applications are the analysis of pigment coating layers on the paper surfaces and the analysis of fibre transverse morphology. Besides the analysis of paper structures, this technique is also suited for the spatial analysis of other materials, if the structural features are accessible with light optical microscopy. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
50. INTRODUCTION TO THE SPECIAL ISSUE ON NEXT GENERATION PERVASIVE RECONFIGURABLE COMPUTING FOR HIGH PERFORMANCE REAL TIME APPLICATIONS.
- Author
-
VENKATESAN, C., YU-DONG ZHANG, CHOW CHEE ONN, and AND YONG SHI
- Subjects
MACHINE learning ,REINFORCEMENT learning ,HIGH performance computing ,COMPUTER vision ,ARTIFICIAL intelligence ,PARSING (Computer grammar) ,DEEP learning - Abstract
This document introduces a special issue of the journal "Scalable Computing: Practice & Experience" focused on next-generation pervasive reconfigurable computing for high-performance real-time applications. The authors discuss the importance of adaptable platforms for real-time tasks and highlight the benefits of reconfigurable computing in accelerating applications like image processing and machine learning. The special issue aims to explore recent advancements in this field and includes research papers on topics such as network security, malware detection, software reliability prediction, and optimization algorithms for wing design. The papers cover a range of computer science and technology topics, showcasing advancements and their potential impact on various computing domains. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.