105,491 results
Search Results
2. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback
- Author
-
Chenshu Liu, Songbin Ben, Chongwen Liu, Xianchao Li, Qingxia Meng, Yilin Hao, Qian Jiao, and Pinyi Yang
- Subjects
Paper-based cultural relics ,Conservation ,Computer vision ,Deep learning ,Strain classification ,Web application ,Fine Arts ,Analytical chemistry ,QD71-142 - Abstract
Abstract Purpose Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms’ nutrient source. Mold not only can damage papers’ structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation.
- Published
- 2024
- Full Text
- View/download PDF
3. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries
- Author
-
Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
- Published
- 2024
- Full Text
- View/download PDF
4. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries
- Author
-
Ryan D. Folks, Bhiken I. Naik, Donald E. Brown, and Marcel E. Durieux
- Subjects
Computer vision ,Computer extraction of time series ,Document analysis ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries.
- Published
- 2024
- Full Text
- View/download PDF
5. NSTU-BDTAKA: An open dataset for Bangladeshi paper currency detection and recognition
- Author
-
Md. Jubayar Alam Rafi, Mohammad Rony, and Nazia Majadi
- Subjects
Computer vision ,Deep learning ,Image analysis ,Taka detection ,Taka recognition ,YOLOv5 model ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Science (General) ,Q1-390 - Abstract
One of the most popular and well-established forms of payment in use today is paper money. Handling paper money might be challenging for those with vision impairments. Assistive technology has been reinventing itself throughout time to better serve the elderly and disabled people. To detect paper currency and extract other useful information from them, image processing techniques and other advanced technologies, such as Artificial Intelligence, Deep Learning, etc., can be used. In this paper, we present a meticulously curated and comprehensive dataset named ‘NSTU-BDTAKA’ tailored for the simultaneous detection and recognition of a specific object of cultural significance - the Bangladeshi paper currency (in Bengali it is called ‘Taka’). This research aims to facilitate the development and evaluation of models for both taka detection and recognition tasks, offering a rich resource for researchers and practitioners alike. The dataset is divided into two distinct components: (i) taka detection, and (ii) taka recognition. The taka detection subset comprises 3,111 high-resolution images, each meticulously annotated with rectangular bounding boxes that encompass instances of the taka. These annotations serve as ground truth for training and validating object detection models, and we adopt the state-of-the-art YOLOv5 architecture for this purpose. In the taka recognition subset, the dataset has been extended to include a vast collection of 28,875 images, each showcasing various instances of the taka captured in diverse contexts and environments. The recognition dataset is designed to address the nuanced task of taka recognition providing researchers with a comprehensive set of images to train, validate, and test recognition models. This subset encompasses challenges such as variations in lighting, scale, orientation, and occlusion, further enhancing the robustness of developed recognition algorithms. The dataset NSTU-BDTAKA not only serves as a benchmark for taka detection and recognition but also fosters advancements in object detection and recognition methods that can be extrapolated to other cultural artifacts and objects. We envision that the dataset will catalyze research efforts in the field of computer vision, enabling the development of more accurate, robust, and efficient models for both detection and recognition tasks.
- Published
- 2024
- Full Text
- View/download PDF
6. Brain Tumor Synthetic Data Generation with Adaptive StyleGANs
- Author
-
Tariq, Usama, Qureshi, Rizwan, Zafar, Anas, Aftab, Danyal, Wu, Jia, Alam, Tanvir, Shah, Zubair, Ali, Hazrat, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Longo, Luca, editor, and O’Reilly, Ruairi, editor
- Published
- 2023
- Full Text
- View/download PDF
7. Facilitating free travel in the Schengen area—A position paper by the European Association for Biometrics
- Author
-
Christoph Busch, Farzin Deravi, Dinusha Frings, Els Kindt, Ralph Lessmann, Alexander Nouak, Jean Salomon, Mateus Achcar, Fernando Alonso‐Fernandez, Daniel Bachenheimer, David Bethell, Josef Bigun, Matthew Brawley, Guido Brockmann, Enrique Cabello, Patrizio Campisi, Aleksandrs Cepilovs, Miles Clee, Mickey Cohen, Christian Croll, Andrzej Czyżewski, Bernadette Dorizzi, Martin Drahansky, Pawel Drozdowski, Catherine Fankhauser, Julian Fierrez, Marta Gomez‐Barrero, Georg Hasse, Richard Guest, Ekaterina Komleva, Sebastien Marcel, Gian Luca Marcialis, Laurent Mercier, Emilio Mordini, Stefance Mouille, Pavlina Navratilova, Javier Ortega‐Garcia, Dijana Petrovska, Norman Poh, Istvan Racz, Ramachandra Raghavendra, Christian Rathgeb, Christophe Remillet, Uwe Seidel, Luuk Spreeuwers, Brage Strand, Sirra Toivonen, and Andreas Uhl
- Subjects
biometrics (access control) ,biometric template protection ,biometric applications ,computer vision ,data privacy ,image analysis for biometrics ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Due to migration, terror‐threats and the viral pandemic, various EU member states have re‐established internal border control or even closed their borders. European Association for Biometrics (EAB), a non‐profit organisation, solicited the views of its members on ways which biometric technologies and services may be used to help with re‐establishing open borders within the Schengen area while at the same time mitigating any adverse effects. From the responses received, this position paper was composed to identify ideas to re‐establish free travel between the member states in the Schengen area. The paper covers the contending needs for security, open borders and fundamental rights as well as legal constraints that any technological solution must consider. A range of specific technologies for direct biometric recognition alongside complementary measures are outlined. The interrelated issues of ethical and societal considerations are also highlighted. Provided a holistic approach is adopted, it may be possible to reach a more optimal trade‐off with regards to open borders while maintaining a high‐level of security and protection of fundamental rights. European Association for Biometrics and its members can play an important role in fostering a shared understanding of security and mobility challenges and their solutions.
- Published
- 2023
- Full Text
- View/download PDF
8. Automatic Vehicle Ego Body Extraction for Reducing False Detections in Automated Driving Applications
- Author
-
Hogan, Ciarán, Sistu, Ganesh, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Longo, Luca, editor, and O’Reilly, Ruairi, editor
- Published
- 2023
- Full Text
- View/download PDF
9. Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open- Access Papers
- Author
-
Nils Hütten, Miguel Alves Gomes, Florian Hölken, Karlo Andricevic, Richard Meyes, and Tobias Meisen
- Subjects
automated visual inspection ,industrial applications ,deep learning ,computer vision ,convolutional neural network ,vision transformer ,Technology ,Applied mathematics. Quantitative methods ,T57-57.97 - Abstract
Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is therefore not surprising that the automation of visual inspection in manufacturing and maintenance is heavily researched and discussed. The use of artificial intelligence as an approach to visual inspection in industrial applications has been considered for decades. Recent successes, driven by advances in deep learning, present a possible paradigm shift and have the potential to facilitate automated visual inspection, even under complex environmental conditions. For this reason, we explore the question of to what extent deep learning is already being used in the field of automated visual inspection and which potential improvements to the state of the art could be realized utilizing concepts from academic research. By conducting an extensive review of the openly accessible literature, we provide an overview of proposed and in-use deep-learning models presented in recent years. Our survey consists of 196 open-access publications, of which 31.7% are manufacturing use cases and 68.3% are maintenance use cases. Furthermore, the survey also shows that the majority of the models currently in use are based on convolutional neural networks, the current de facto standard for image classification, object recognition, or object segmentation tasks. Nevertheless, we see the emergence of vision transformer models that seem to outperform convolutional neural networks but require more resources, which also opens up new research opportunities for the future. Another finding is that in 97% of the publications, the authors use supervised learning techniques to train their models. However, with the median dataset size consisting of 2500 samples, deep-learning models cannot be trained from scratch, so it would be beneficial to use other training paradigms, such as self-supervised learning. In addition, we identified a gap of approximately three years between approaches from deep-learning-based computer vision being published and their introduction in industrial visual inspection applications. Based on our findings, we additionally discuss potential future developments in the area of automated visual inspection.
- Published
- 2024
- Full Text
- View/download PDF
10. Automatic Unsupervised Clustering of Videos of the Intracytoplasmic Sperm Injection (ICSI) Procedure
- Author
-
Storås, Andrea M., Riegler, Michael A., Haugen, Trine B., Thambawita, Vajira, Hicks, Steven A., Hammer, Hugo L., Kakulavarapu, Radhika, Halvorsen, Pål, Stensen, Mette H., Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Zouganeli, Evi, editor, Yazidi, Anis, editor, Mello, Gustavo, editor, and Lind, Pedro, editor
- Published
- 2022
- Full Text
- View/download PDF
11. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper.
- Author
-
Vinodkumar, Prasoon Kumar, Karabulut, Dogus, Avots, Egils, Ozcinar, Cagri, and Anbarjafari, Gholamreza
- Subjects
- *
DEEP learning , *COMPUTER vision , *GRAPH neural networks , *ARTIFICIAL intelligence , *MACHINE learning , *GENERATIVE adversarial networks - Abstract
The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Editorial for the Special Issue on "Feature Papers in Section AI in Imaging".
- Author
-
Fernández-Caballero, Antonio
- Subjects
GENERATIVE artificial intelligence ,COMPUTER vision ,ARTIFICIAL intelligence ,INTELLIGENT agents ,COMPUTER graphics ,DEEP learning ,EXPERT systems - Published
- 2024
- Full Text
- View/download PDF
13. Method and Installation for Efficient Automatic Defect Inspection of Manufactured Paper Bowls.
- Author
-
Yu, Shaoyong, Lee, Yang-Han, Chen, Cheng-Wen, Gao, Peng, Xu, Zhigang, Chen, Shunyi, and Yang, Cheng-Fu
- Subjects
COMPUTER vision ,AUTOMATIC optical inspection ,SEMICONDUCTOR detectors ,IMAGE processing ,PIXELS ,SENSOR arrays ,LIGHT sources - Abstract
Various techniques were combined to optimize an optical inspection system designed to automatically inspect defects in manufactured paper bowls. A self-assembled system was utilized to capture images of defects on the bowls. The system employed an image sensor with a multi-pixel array that combined a complementary metal-oxide semiconductor and a photo detector. A combined ring light served as the light source, while an infrared (IR) LED matrix panel was used to provide constant IR light to highlight the outer edges of the objects being inspected. The techniques employed in this study to enhance defect inspections on produced paper bowls included Gaussian filtering, Sobel operators, binarization, and connected components. Captured images were processed using these technologies. Once the non-contact inspection system's machine vision method was completed, defects on the produced paper bowls were inspected using the system developed in this study. Three inspection methods were used in this study: internal inspection, external inspection, and bottom inspection. All three methods were able to inspect surface features of produced paper bowls, including dirt, burrs, holes, and uneven thickness. The results of our study showed that the average time required for machine vision inspections of each paper bowl was significantly less than the time required for manual inspection. Therefore, the investigated machine vision system is an efficient method for inspecting defects in fabricated paper bowls. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Enabling GPU-Enhanced Computer Vision and Machine Learning Research Using Containers
- Author
-
Michel, Martial, Burnett, Nicholas, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Weiland, Michèle, editor, Juckeland, Guido, editor, Alam, Sadaf, editor, and Jagode, Heike, editor
- Published
- 2019
- Full Text
- View/download PDF
15. Valmet to Supply IQ Web Inspection System to Yueyang Forest & Paper Co., Ltd.
- Subjects
- *
PAPER industry , *PRODUCT quality , *LED lighting , *COMPUTER vision , *INFORMATION retrieval - Published
- 2024
16. Research and Evaluation on an Optical Automatic Detection System for the Defects of the Manufactured Paper Cups.
- Author
-
Wang, Ping, Lee, Yang-Han, Tseng, Hsien-Wei, and Yang, Cheng-Fu
- Subjects
- *
COMPUTER vision , *SURFACE defects , *LIGHT sources , *IMAGE sensors , *IMAGE processing ,RESEARCH evaluation - Abstract
In this paper, the paper cups were used as the research objects, and the machine vision detection technology was combined with different image processing techniques to investigate a non-contact optical automatic detection system to identify the defects of the manufactured paper cups. The combined ring light was used as the light source, an infrared (IR) LED matrix panel was used to provide the IR light to constantly highlight the outer edges of the detected objects, and a multi-grid pixel array was used as the image sensor. The image processing techniques, including the Gaussian filter, Sobel operator, Binarization process, and connected component, were used to enhance the inspection and recognition of the defects existing in the produced paper cups. There were three different detection processes for paper cups, which were divided into internal, external, and bottom image acquisition processes. The present study demonstrated that all the detection processes could clearly detect the surface defect features of the manufactured paper cups, such as dirt, burrs, holes, and uneven thickness. Our study also revealed that the average time for the investigated Automatic Optical Detection to detect the defects on the paper cups was only 0.3 s. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. Facilitating free travel in the Schengen area—A position paper by the European Association for Biometrics.
- Author
-
Busch, Christoph, Deravi, Farzin, Frings, Dinusha, Kindt, Els, Lessmann, Ralph, Nouak, Alexander, Salomon, Jean, Achcar, Mateus, Alonso‐Fernandez, Fernando, Bachenheimer, Daniel, Bethell, David, Bigun, Josef, Brawley, Matthew, Brockmann, Guido, Cabello, Enrique, Campisi, Patrizio, Cepilovs, Aleksandrs, Clee, Miles, Cohen, Mickey, and Croll, Christian
- Subjects
- *
BIOMETRY , *DATA privacy , *BORDER security , *NONPROFIT organizations , *CIVIL rights - Abstract
Due to migration, terror‐threats and the viral pandemic, various EU member states have re‐established internal border control or even closed their borders. European Association for Biometrics (EAB), a non‐profit organisation, solicited the views of its members on ways which biometric technologies and services may be used to help with re‐establishing open borders within the Schengen area while at the same time mitigating any adverse effects. From the responses received, this position paper was composed to identify ideas to re‐establish free travel between the member states in the Schengen area. The paper covers the contending needs for security, open borders and fundamental rights as well as legal constraints that any technological solution must consider. A range of specific technologies for direct biometric recognition alongside complementary measures are outlined. The interrelated issues of ethical and societal considerations are also highlighted. Provided a holistic approach is adopted, it may be possible to reach a more optimal trade‐off with regards to open borders while maintaining a high‐level of security and protection of fundamental rights. European Association for Biometrics and its members can play an important role in fostering a shared understanding of security and mobility challenges and their solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. VIBRANT-WALK: An algorithm to detect plagiarism of figures in academic papers.
- Author
-
Parmar, Shashank and Jain, Bhavya
- Subjects
- *
PLAGIARISM , *COMPUTER algorithms , *ALGORITHMS , *COMPUTER vision , *RANDOM walks - Abstract
Detecting plagiarism in academic papers is crucial for maintaining academic integrity, preserving the originality of published work, and safeguarding intellectual property. While existing applications excel at text plagiarism detection, they fall short when it comes to image plagiarism. This paper introduces a novel algorithm, named "VIBRANT-WALK," designed to detect image plagiarism in academic manuscripts. The challenge of identifying plagiarized images is formidable, requiring a unique approach. Traditional Computer Vision algorithms, proficient in image similarity tasks, face limitations in determining whether an image has been previously used in an article. To address this, the proposed algorithm leverages a repository of all published article pages, focusing on absolute identicality rather than image similarity. The algorithm comprises two stages. In the first stage, a "Vibrancy Matrix" is created through image preprocessing, aiding in contour determination. The second stage involves pixel-by-pixel comparison with images from published manuscripts. To enhance efficiency, the algorithm initiates comparisons from the pixel with the highest score in the Vibrancy Matrix, followed by pixel comparisons through random walks, significantly reducing complexity. To conduct the study, a custom dataset was compiled from 69 research articles, capturing snapshots of each page and figure. Overall, we present 485 unique test cases where we can test the accuracy and efficiency of the algorithm. The lack of publicly available datasets necessitated this approach. The proposed algorithm outperformed the existing models and algorithms in this field by achieving an overall accuracy of 94.8% on the collated dataset, identifying 460 instances of plagiarism out of the 485 test cases. The algorithm also demonstrated a 100% accuracy rate in avoiding false positives. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Machine-vision-based Detection of Paper Roll Core Eccentricity : Fast and Robust On-Line Measurement Using Circular Hough Transform
- Author
-
Sehlstedt, Erik
- Subjects
Defect detection ,Hough transform ,Datorseende och robotik (autonoma system) ,On-line measurement ,Quality control ,Computer vision ,Core eccentricity ,Circular Hough transform ,Paper roll ,Machine vision ,Computer Vision and Robotics (Autonomous Systems) ,Eccentricity - Abstract
The field of computer vision offers tools that allow machines to derive meaningful infor-mation from video and images and consequently make decisions based on visual inputs. In the paper industry, implementation of machine vision (MV) can be used to automate and speed up processes that require visual inspection, particularly certain segments of quality control – one such application being detection and measurement of paper roll core eccentricity. Core eccentricity is a roll build error in which the roll core is offset from the geometric roll center, potentially causing runnability issues. This particular project aims to improve the detection of paper roll core eccentricity at the Mondi Dynäs integrated pulp and paper mill through creation, calibration and evaluation of a machine-vision-based tool for on-line core eccentricity measurement. The tool utilizes the Hough Transform (HT), since HT is a simple yet fast and robust algorithm when it comes to identification of basic shapes such as lines and circles. The proposed solution was evaluated in two ways; firstly by determining at what level of accuracy the measurements could be provided, accounting for how well the solution deals with correction of systematic error caused by environmental factors, and secondly by analyzing how well characteristic roll features could be accurately identified in large sets of data, necessary to consistently perform measurements. The evaluation of the proposed solution showed a 99.9% detection rate for characteristic paper roll features, and a 98.1% detection rate of laser lines used for correction of position and orientation induced error. Assessment of the measurement accuracy following successful detection was on par with the current optical measurement method, and the proposed solution was able to classify distinctive features with a 96.8% accuracy. Lastly, several improvement actions to address faulty detection were identified, and factors to be considered for future installment were highlighted.
- Published
- 2022
20. [Paper] Quality Improvement for Real-time Free Viewpoint Video Using View-dependent Shape Refinement
- Author
-
Masaru Sugano, Keisuke Nonaka, Tatsuya Kobayashi, Ryosuke Watanabe, Kato Haruhisa, and Tomoaki Konno
- Subjects
business.industry ,Computer science ,Signal Processing ,3D reconstruction ,Media Technology ,View dependent ,Image processing ,Computer vision ,Paper quality ,Artificial intelligence ,business ,Computer Graphics and Computer-Aided Design ,Camera resectioning - Published
- 2021
21. Special issue on intelligent systems: ISMIS 2022 selected papers.
- Author
-
Ceci, Michelangelo, Flesca, Sergio, Manco, Giuseppe, and Masciari, Elio
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,DECISION support systems ,KNOWLEDGE representation (Information theory) ,COMPUTER vision ,DEEP learning - Abstract
This document is a special issue of the Journal of Intelligent Information Systems, focusing on the selected papers from the International Symposium on Methodologies for Intelligent Systems (ISMIS 2022). The symposium, held in Cosenza, Italy, showcased research on various topics related to artificial intelligence, including decision support, knowledge representation, machine learning, computer vision, and more. The special issue includes eleven papers that have undergone rigorous peer-reviewing and cover a wide range of research topics, such as deep learning, anomaly detection, malware detection, sentiment classification, and healthcare professionals' burnout. The authors express their gratitude to the contributors and reviewers for their valuable contributions. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
22. PaperNet: A Dataset and Benchmark for Fine-Grained Paper Classification.
- Author
-
Yue, Tan, Li, Yong, Shi, Xuzhao, Qin, Jiedong, Fan, Zijiao, and Hu, Zonghai
- Subjects
NATURAL language processing ,COMPUTER vision ,VISUAL fields ,CLASSIFICATION - Abstract
Document classification is an important area in Natural Language Processing (NLP). Because a huge amount of scientific papers have been published at an accelerating rate, it is beneficial to carry out intelligent paper classifications, especially fine-grained classification for researchers. However, a public scientific paper dataset for fine-grained classification is still lacking, so the existing document classification methods have not been put to the test. To fill this vacancy, we designed and collected the PaperNet-Dataset that consists of multi-modal data (texts and figures). PaperNet 1.0 version contains hierarchical categories of papers in the fields of computer vision (CV) and NLP, 2 coarse-grained and 20 fine-grained (7 in CV and 13 in NLP). We ran current mainstream models on the PaperNet-Dataset, along with a multi-modal method that we propose. Interestingly, none of these methods reaches an accuracy of 80% in fine-grained classification, showing plenty of room for improvement. We hope that PaperNet-Dataset will inspire more work in this challenging area. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Watching plants grow – a position paper on computer vision and Arabidopsis thaliana
- Author
-
Jonathan Bell and Hannah M. Dee
- Subjects
computer vision ,Arabidopsis thaliana ,image analysis ,image segmentation ,image sequences ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Computer software ,QA76.75-76.765 - Abstract
The authors present a comprehensive overview of image processing and analysis work done to support research into the model flowering plant Arabidopsis thaliana. Beside the plant's importance in biological research, using image analysis to obtain experimental measurements of it is an interesting vision problem in its own right, involving the segmentation and analysis of sequences of images of objects whose shape varies between individual specimens and also changes over time. While useful measurements can be obtained by segmenting a whole plant from the background, they suggest that the increased range and precision of measurements made available by leaf‐level segmentation makes this a problem well worth solving. A variety of approaches have been tried by biologists as well as computer vision researchers. This is an interdisciplinary area and the computer vision community has an important contribution to make. They suggest that there is a need for publicly available datasets with ground truth annotations to enable the evaluation of new approaches and to support the building of training data for modern data‐driven computer vision approaches, which are those most likely to result in the kind of fully automated systems that will be of use to biologists.
- Published
- 2017
- Full Text
- View/download PDF
24. Apply of digital speckle projection in measurement of paper sheet thickness
- Author
-
Li Weixian, Wu Fan, Zhu Junyi, Dong Mingli, Zhang Yumeng, Sha Di, and WU Sijin
- Subjects
Paper sheet ,Speckle pattern ,business.industry ,Computer science ,Computer vision ,Artificial intelligence ,Projection (set theory) ,business ,Atomic and Molecular Physics, and Optics - Published
- 2019
25. ECG Paper Record Digitization and Diagnosis Using Deep Learning
- Author
-
Vruddhi Shah, Sharath Dinesh, Ninad Mehendale, Siddharth Mishra, Darsh Parmar, Gaurav Khatwani, Prathamesh Daphal, Darshan Sapariya, and Rupali Patil
- Subjects
Computer science ,0206 medical engineering ,Biomedical Engineering ,Image processing ,02 engineering and technology ,Data_CODINGANDINFORMATIONTHEORY ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,0302 clinical medicine ,Diagnosis ,medicine ,ComputerSystemsOrganization_SPECIAL-PURPOSEANDAPPLICATION-BASEDSYSTEMS ,Computer vision ,Paper ECG ,Digitization ,Left bundle branch block ,business.industry ,Deep learning ,Pattern recognition ,General Medicine ,Right bundle branch block ,medicine.disease ,020601 biomedical engineering ,ComputingMethodologies_PATTERNRECOGNITION ,Original Article ,Artificial intelligence ,Ecg signal ,business - Abstract
Purpose Electrocardiogram (ECG) is one of the most essential tools for detecting heart problems. Till today most of the ECG records are available in paper form. It can be challenging and time-consuming to manually assess the ECG paper records. Hence, automated diagnosis and analysis are possible if we digitize such paper ECG records. Methods The proposed work aims to convert ECG paper records into a 1-D signal and generate an accurate diagnosis of heart-related problems using deep learning. Camera-captured ECG images or scanned ECG paper records are used for the proposed work. Effective pre-processing techniques are used for the removal of shadow from the images. A deep learning model is used to get a threshold value that separates ECG signal from its background and after applying various image processing techniques threshold ECG image gets converted into digital ECG. These digitized 1-D ECG signals are then passed to another deep learning model for the automated diagnosis of heart diseases into different classes such as ST-segment elevation myocardial infarction (STEMI), Left Bundle Branch Block (LBBB), Right Bundle Branch Block (RBBB), and T-wave abnormality. Results The accuracy of deep learning-based binarization is 97%. Further deep learning-based diagnosis approach of such digitized paper ECG records was having an accuracy of 94.4%. Conclusions The digitized ECG signals can be useful to various research organizations because the trends in heart problems can be determined and diagnosed from preserved paper ECG records. This approach can be easily implemented in areas where such expertise is not available. Supplementary Information The online version contains supplementary material available at 10.1007/s40846-021-00632-0.
- Published
- 2020
26. AI Machine Vision based Oven White Paper Color Classification and Label Position Real-time Monitoring System to Check Direction.
- Author
-
Hee-Chul Kim, Youn-Saup Yoon, and Yong-Mo Kim
- Subjects
COMPUTER vision ,DEEP learning ,JOB classification ,MANUFACTURING process automation ,ARTIFICIAL intelligence ,COLOR image processing - Abstract
We develop a vision system for batch inspection by oven white paper model color by manufacturing a machine vision system for the oven manufacturing automation process. In the vision system, white paper object detection (spring), color clustering, and histogram extraction are performed. In addition, for the automated process of home appliances, we intend to develop an automatic mold combination detection algorithm that inspects the label position and direction (angle/coordinate) using deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. Intelligent computer vision system for segregating recyclable waste papers
- Author
-
Rahman, Mohammad Osiur, Hussain, Aini, Scavino, Edgar, Basri, Hassan, and Hannan, M.A.
- Subjects
- *
PAPER recycling , *COMPUTER vision , *IMAGE processing , *ARTIFICIAL intelligence , *SORTING (Electronic computers) , *MACHINE learning , *PATTERN recognition systems , *PAPER chemicals - Abstract
Abstract: This article explores the application of image processing techniques in recyclable waste paper sorting. In recycling, waste papers are segregated into various grades as they are subjected to different recycling processes. Highly sorted paper streams facilitate high quality end products and save processing chemicals and energy. From 1932 to 2009, different mechanical and optical paper sorting methods have been developed to fill the paper sorting demand. Still, in many countries including Malaysia, waste papers are sorted into different grades using a manual sorting system. Because of inadequate throughput and some major drawbacks of mechanical paper sorting systems, the popularity of optical paper sorting systems has increased. Automated paper sorting systems offer significant advantages over human inspection in terms of worker fatigue, throughput, speed, and accuracy. This research attempts to develop a smart vision sensing system that is able to separate the different grades of paper using first-order features. To construct a template database, a statistical approach with intra-class and inter-class variation techniques are applied to the feature selection process. Finally, the K-nearest neighbor (KNN) algorithm is applied for paper object grade identification. The remarkable achievement obtained with the method is the accurate identification and dynamic sorting of all grades of papers using simple image processing techniques. [Copyright &y& Elsevier]
- Published
- 2011
- Full Text
- View/download PDF
28. Stone, Paper, Scissors Mini-Game for AI Pet Robot
- Author
-
Aditya Aspat, Abhishek Ghoshal, and Elton Lemos
- Subjects
Background subtraction ,Identification (information) ,Action (philosophy) ,Computer science ,Simple (abstract algebra) ,business.industry ,Robot ,Computer vision ,Emotion recognition ,Artificial intelligence ,business ,GeneralLiterature_MISCELLANEOUS - Abstract
The Artificial Intelligence (AI) Pet Robot is a combination of various fields of computer science. This paper showcases the various functionalities of our AI Pet. Most of the functionalities showcased use the immage processing modules made available through OpenCV. The pet robot has various features such as emotion recognition, follow routine, mini-game etc. This paper discusses the mini-game aspect of the robot. The game has been developed by using VGG16 convolutional network for identification of the action performed by the user. To improve the accuracy we have made use of background subtraction which gives removes all the unwanted objects from the background and gives a simple cutout of the users hand.
- Published
- 2021
29. On Microstructure Estimation Using Flatbed Scanners for Paper Surface-Based Authentication
- Author
-
Chau-Wai Wong and Runze Liu
- Subjects
021110 strategic, defence & security studies ,Authentication ,Computer Networks and Communications ,Computer science ,business.industry ,Feature extraction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,0211 other engineering and technologies ,Word error rate ,02 engineering and technology ,Fingerprint ,Feature (computer vision) ,Heightmap ,Key (cryptography) ,Computer vision ,Specular reflection ,Artificial intelligence ,Safety, Risk, Reliability and Quality ,business - Abstract
Paper surfaces under the microscopic view are observed to be formed by intertwisted wood fibers. Such structures of paper surfaces are unique from one location to another and are almost impossible to duplicate. Previous work used microscopic surface normals to characterize such intrinsic structures as a “fingerprint” of paper for security and forensic applications. In this work, we examine several key research questions of feature extraction in both scientific and engineering aspects to facilitate the deployment of paper surface-based authentication when flatbed scanners are used as the acquisition device. We analytically show that, under the unique optical setup of flatbed scanners, the specular reflection does not play a role in norm map estimation. We verify, using a larger dataset than prior work, that the scanner-acquired norm maps, although blurred, are consistent with those measured by confocal microscopes. We confirm that, when choosing an authentication feature, high spatial-frequency subbands of the heightmap are more powerful than the norm map. Finally, we show that it is possible to empirically calculate the physical dimensions of the paper patch needed to achieve a certain authentication performance in equal error rate (EER). We analytically show that log(EER) is decreasing linearly in the edge length of a paper patch.
- Published
- 2021
30. A paper based colorimeter using smartphone light sensor
- Author
-
Bebeh Wahid Nuryadin, U Umairoh, and Ade Yeti Nuryantini
- Subjects
Computer science ,business.industry ,Colorimeter ,Photodetector ,Computer vision ,Artificial intelligence ,Paper based ,business - Abstract
A paper-based colorimeter for absorbance and concentration measurement of the food colouring dye is proposed. The paper-based colorimeter system consists of a white LED as light source, paper-based cuvette holder, and smartphone light sensor. The paper-based colorimeter with smartphone light sensor is low-cost, mobile and real-time for the detection of colouring dye concentration. The detection response of the paper-based colorimeter system was found to be linear with the colouring dye concentration in the range from 0 to 0.025 g/mL with a correlation coefficient (R2) 0.89±0.04. The experimental results show that this paper-based colorimeter system is highly sensitive and have a potential application, from student labs to small industries.
- Published
- 2021
31. 7.3: Invited Paper: Integration of large‐area optical imagers for biometric recognition and touch in displays
- Author
-
Richard van de Ketterij, Tung Huei Ke, Auke Jisk Kronemeijer, Santhosh Shanmugam, Gerwin H. Gelinck, Ilias Katsouras, Eric Meulenkamp, Daniel Tordera, Ezequiel Delvitto, Leslye Ugalde Lopez, Luis Moreno Hagelsieb, Florian De Roose, H. B. Akkerman, Pawel E. Malinowski, Albert J. J. M. van Breemen, and Bart Peeters
- Subjects
Biometrics ,Computer science ,business.industry ,Computer vision ,Artificial intelligence ,Fingerprint recognition ,business - Published
- 2021
32. A Survey Paper on Image forgery detection Using Pseudo Zernike Moment
- Author
-
Brijesh Patel and Sheshang Degadwala
- Subjects
Computer science ,Zernike polynomials ,business.industry ,020206 networking & telecommunications ,02 engineering and technology ,Moment (mathematics) ,symbols.namesake ,0202 electrical engineering, electronic engineering, information engineering ,symbols ,Image forgery ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,business - Abstract
Photographs are taken as valid evidences in various scenarios of our day to day life. Because of the developments in the field of Image Processing, altering images according to ones need is not a difficult task. Techniques of Image Forensics play its crucial role at this juncture. One of the mostly found types of image tampering is Copy-Move forgery. A copy-move forgery is performed by copying a region in an image and pasting it on another region in the same image, mostly after some form of post-processing like rotation, scaling, blurring, noise addition, JPEG compression etc. Two types of copy-move forgery detection techniques exist in literature. They are the Block based methods and Key-point based methods. Both the methods have their own advantages and limitations. This paper presents a survey on the recent developments in block based methods. As forgeries have become popular, the importance of forgery detection is much increased. Copy-move forgery, one of the most commonly used methods, copies a part of the image and pastes it into another part of the image. In this paper, we propose a detection method of copy-move forgery that localizes duplicated regions using Zernike moments. Since the magnitude of Zernike moments is algebraically invariant against rotation, the proposed method can detect a forged region even though it is rotated. Our scheme is also resilient to the intentional distortions such as additive white Gaussian noise, JPEG compression, and blurring. Experimental results demonstrate that the proposed scheme is appropriate to identify the forged region by copy-rotate-move forgery.
- Published
- 2020
33. Image-Based Quantitative Analysis of Foxing Stains on Old Printed Paper Documents
- Author
-
Gyuho Kim, Jung Gon Kim, Kitaek Kang, and Woo Sik Yoo
- Subjects
Archeology ,Image quality ,Computer science ,Materials Science (miscellaneous) ,Conservation ,Image processing software ,01 natural sciences ,foxing stains ,image analysis ,parasitic diseases ,0601 history and archaeology ,Computer vision ,lcsh:CC1-960 ,old paper document ,060102 archaeology ,Pixel ,quantitative analysis ,010405 organic chemistry ,business.industry ,Foxing ,Color analysis ,06 humanities and the arts ,0104 chemical sciences ,Quantitative analysis (finance) ,software development ,RGB color model ,lcsh:Archaeology ,Artificial intelligence ,business ,Image based - Abstract
We studied the feasibility of image-based quantitative analysis of foxing stains on collections of old (16th&ndash, 20th century) European books stored in the Rare Book Library of the Seoul National University in Korea. We were able to quantitatively determine the foxing affected areas on books from their photographs using a newly developed image processing software (PicMan) including cultural property characterization applications, specifically. Dimensional and color analysis of photographs were successfully done quantitatively. Histograms of RGB (red, green, blue) pixels of photographs clearly showed the change in color distribution of foxing stains compared to the other areas of the photographs. Several sample images of quantitative measurement of foxing stains and virtually restored images were generated to provide easy visual inspection and comparison between restored images and the original photographs. Image quality, resolution, and digital file format requirements for quantitative analysis are described. Image-based quantitative analysis of foxing stains on paper documents are found to be very promising towards automation for objective characterization of photographs of cultural properties. This technique can be used to create a cultural property digital database. Quantitative and statistical analysis techniques can be introduced to monitor the effect of storage and conservation environment on the cultural properties.
- Published
- 2019
- Full Text
- View/download PDF
34. Position paper on ethical, legal and social challenges linked to audio- and video-based AAL solutions
- Author
-
Ake-Kob, Alin, Aleksic, Slavisa, Alexin, Zoltán, Blaževičienė, Aurelija, Čartolovni, Anto, Colonna, Liane, Dantas, Carina, Fedosov, Anton, Fosch-Villaronga, Eduard, Florez-Revuelta, Francisco, He, Zhicheng, Jevremović, Aleksandar, Klimczuk, Andrzej, Kuźmicz, Maksymilian, Lambrinos, Lambros, Lutz, Christoph, Malešević, Anamaria, Mekovec, Renata, Miguel, Cristina, Mujirishvili, Tamar, Pajalic, Zada, Perez Vega, Rodrigo, Pierscionek, Barbara, Ravi, Siddharth, Sarf, Pika, Solanas, Agusti, and Tamò-Larrieux, Aurelia
- Subjects
J14 ,History ,O33 ,Polymers and Plastics ,M14 ,L86 ,O18 ,privacy ,audio processing ,ethics ,Industrial and Manufacturing Engineering ,computer vision ,silver economy ,actice assisted living ,active ageing ,active assisted living ,ddc:330 ,Business and International Management - Abstract
Cost Action 19121: GoodBrother The European Cooperation in Science and Technology (COST) is a funding organisation for the creation of research networks, called COST Actions (CA). These networks offer an open space for collaboration among scientists across Europe (and beyond) and thereby give impetus to research advancements and innovation. Many institutions around Europe participate actively in the CA19121 - Network on Privacy-Aware Audio- and Video-Based Applications for Active and Assisted Living, also called GoodBrother. Europe faces crucial challenges regarding health and social care due to the demographic change and current economic context. Active Assisted Living (AAL) technologies are a possible solution to support tackling them. AAL technologies aim at improving the health, quality of life, and wellbeing of older, impaired, and frail people. AAL systems use different sensors to monitor the environment and its dwellers. Cameras and microphones are being more frequently used for AAL. They monitor an environment and gather information, being the most straightforward and natural way of describing events, persons, objects, actions, and interactions. Recent advances have given these devices the ability to ‘see’ and ‘hear.’ However, their use can be seen as intrusive by some end-users such as assisted persons and professional and informal caregivers. GoodBrother aims to increase the awareness of the ethical, legal, and privacy issues associated with audio- and video-based monitoring and to propose privacy-aware working solutions for assisted living by creating an interdisciplinary community of researchers and industrial partners from different fields (computing, engineering, healthcare, law, sociology) and other stakeholders (users, policymakers, public services), stimulating new research and innovation. GoodBrother will offset the “Big Brother” sense of continuous monitoring by increasing user acceptance, exploiting these new solutions, and improving market reach. Working Group 1 on Social Responsibility: Ethical, legal, social, data protection and privacy issues Experts from diverse disciplines are analysing the ethical, legal, data protection and privacy issues associated with the use of cameras and microphones in private spaces, and how to manage multi-party privacy preferences. They also study the differences according to gender and cultural/societal background in the perception of these issues. This WG aims to establish the core requirements that AAL solutions must fulfil to consider ethico-legal issues and to integrate privacy by design and by default. Those requirements will set up the guidelines for the technical WGs (WG2, WG3 and WG4). The Workgroup goals are: Review the current European and international legislation and the ethical issues that underpin this on the use of audio- and video-based monitoring in private environments. Study the differences in the perception of privacy depending on the culture, society, gender and age of the users, and analyse the situations and conditions in later life, i.e. occurrence of a fall, which may affect that perception. Investigate the potential benefits and barriers of AAL technology adoption for people in need of care. Support the development of privacy-aware monitoring systems by a continuous exchange of knowledge with technological participants in the Action. Promote the consideration of ethical, legal, privacy and gender matters in the design of AAL solutions. Inform other WGs on the ethico-legal requirements in the design and development of AAL solutions. About this Position Paper In this position paper, we have used Alan Cooper’s persona technique to illustrate the utility of audio- and video-based AAL technologies. Therefore, two primary examples of potential audio- and video-based AAL users, Anna and Irakli, serve as reference points for describing salient ethical, legal and social challenges related to use of AAL. These challenges are presented on three levels: individual, societal, and regulatory. For each challenge, a set of policy recommendations is suggested., This publication is based upon work from COST Action GoodBrother – Network on Privacy-Aware Audio- and Video- Based Applications for Active and Assisted Living, supported by COST (European Cooperation in Science and Technology). COST (European Cooperation in Science and Technology) is a funding agency for research and innovation networks. Our Actions help connect research initiatives across Europe and enable scientists to grow their ideas by sharing them with their peers. This boosts their research, career and innovation. www.cost.eu
- Published
- 2022
35. A Survey Paper on System Frameworks for Image Processing
- Author
-
Vishwajit Gaikwad
- Subjects
Computer science ,business.industry ,Computer vision ,Image processing ,Artificial intelligence ,business - Published
- 2021
36. Watching plants grow – a position paper on computer vision andArabidopsis thaliana
- Author
-
Hannah Dee and Jonathan Bell
- Subjects
0106 biological sciences ,0301 basic medicine ,Ground truth ,business.industry ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Image processing ,Image segmentation ,01 natural sciences ,Variety (cybernetics) ,03 medical and health sciences ,Range (mathematics) ,030104 developmental biology ,Market segmentation ,Position paper ,Computer vision ,Segmentation ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software ,010606 plant biology & botany - Abstract
The authors present a comprehensive overview of image processing and analysis work done to support research into the model flowering plant Arabidopsis thaliana. Beside the plant's importance in biological research, using image analysis to obtain experimental measurements of it is an interesting vision problem in its own right, involving the segmentation and analysis of sequences of images of objects whose shape varies between individual specimens and also changes over time. While useful measurements can be obtained by segmenting a whole plant from the background, they suggest that the increased range and precision of measurements made available by leaf-level segmentation makes this a problem well worth solving. A variety of approaches have been tried by biologists as well as computer vision researchers. This is an interdisciplinary area and the computer vision community has an important contribution to make. They suggest that there is a need for publicly available datasets with ground truth annotations to enable the evaluation of new approaches and to support the building of training data for modern data-driven computer vision approaches, which are those most likely to result in the kind of fully automated systems that will be of use to biologists.
- Published
- 2017
37. Application of convolutional neural network models for personality prediction from social media images and citation prediction for academic papers
- Author
-
Dave, Akshat
- Subjects
Artificial intelligence ,Computer science ,Computer Vision ,Neural Networks - Abstract
Inspired by the success of convolutional neural networks in image classification, and other higher level vision tasks, we explore two applications of such deep convolutional neural networks to model tasks typically involving human assessment, viz. i) prediction of personality from social media images, and ii) prediction of citations from the visual elements of an academic paper. The aim in this context is to discover if there is any predictable and learnable signal in the input data. As an extension, we attempt to discover what aspects of the signal are indeed learnt that lead to the results presented. For instance, if personality can be predicted, what aspects of the image are causing that? Similarly if an academic paper is highly cited, what are the characteristic visual elements that cause this? We employ convolutional neural networks in order to understand what imputable attributes we may derive that are simpler to reason.
- Published
- 2016
38. [Paper] HEVC-based Light-field Coding using Basis Images and Frame Reordering
- Author
-
Kota Imaeda, Yukihiro Bandoh, Hideaki Kimata, Keita Takahashi, Seishi Takamura, and Toshiaki Fujii
- Subjects
Basis (linear algebra) ,business.industry ,Computer science ,Compression (functional analysis) ,Signal Processing ,Frame (networking) ,Media Technology ,Computer vision ,Artificial intelligence ,business ,Computer Graphics and Computer-Aided Design ,Light field ,Coding (social sciences) - Published
- 2021
39. [Paper] Sports Camera Calibration using Flexible Intersection Selection and Refinement
- Author
-
Ryosuke Watanabe, Tomoaki Konno, Sei Naito, Keisuke Nonaka, and Hiroki Tsurusaki
- Subjects
Intersection ,Computer science ,business.industry ,Signal Processing ,Media Technology ,Computer vision ,Artificial intelligence ,business ,Computer Graphics and Computer-Aided Design ,Selection (genetic algorithm) ,Camera resectioning - Published
- 2021
40. [Paper] Development of System to Classify Speckle Images for Visual Inspection of Cutlery
- Author
-
Ryosuke Harakawa, Masahiro Iwahashi, Tadaaki Isobe, and Yuya Takimoto
- Subjects
Contextual image classification ,Computer science ,Machine vision ,business.industry ,Feature extraction ,Cutlery ,Computer Graphics and Computer-Aided Design ,Visual inspection ,Speckle pattern ,Signal Processing ,Media Technology ,Computer vision ,Artificial intelligence ,business - Published
- 2021
41. [Invite Paper] Functional Imaging with Multi-tap CMOS Pixels
- Author
-
Keiichiro Kagawa
- Subjects
Functional imaging ,Cmos pixels ,Computer science ,business.industry ,Wide field imaging ,Signal Processing ,Media Technology ,Computer vision ,Artificial intelligence ,business ,Computer Graphics and Computer-Aided Design ,Active illumination ,Structured light - Published
- 2021
42. Preparation of Papers for IFAC Conferences & Symposia: Computer Vision-enabled Human-Cyber-Physical Workstation for Proactive Ergonomic Risks Mitigation
- Author
-
George Q. Huang, Danqi Yan, Xuefeng Zhao, Yiming Rong, Yuquan Leng, Daqiang Guo, and Shiquan Ling
- Subjects
Development environment ,Functional validation ,Ergonomic risk ,Adaptive control ,Workstation ,business.industry ,Computer science ,media_common.quotation_subject ,Cyber-physical system ,Human factors and ergonomics ,law.invention ,Control and Systems Engineering ,law ,Quality (business) ,Computer vision ,Artificial intelligence ,business ,media_common - Abstract
In production, poor ergonomic environments not only lead to increased workloads and health hazards for employees but also tend to reduce efficiency and quality. Recently, the human-cyber-physical (HCP) system has been proposed and widely studies to meet human capabilities and limitations. However, most existing frameworks are still not adaptive enough to integrate humans into a smart production environment due to a lack of real-time individual human factors digitalization. This research proposes an HCP workstation model for comprehensive assembly resources digitalization and autonomous interaction by CPS enabling technologies. Based on this, an adaptive control system has been developed for proactive ergonomic risk mitigation. Computer vision is deployed for real-time individual ergonomic evaluation and a prototype has been set up for functional validation.
- Published
- 2021
43. [Paper] CFA Handling and Quality Analysis for Compressive Light Field Camera
- Author
-
Yasutaka Inagaki, Hajime Nagahara, Kohei Sakai, Toshiaki Fujii, and Keita Takahashi
- Subjects
Light-field camera ,Computer science ,business.industry ,Computer Graphics and Computer-Aided Design ,Convolutional neural network ,law.invention ,Computational photography ,Quality (physics) ,law ,Signal Processing ,Media Technology ,Computer vision ,Artificial intelligence ,business ,Light field - Published
- 2021
44. [Paper] Noise Bias Compensation for Color Images after Tone Mapping
- Author
-
Ryosuke Harakawa, Sayaka Minewaki, Yo Umeki, and Masahiro Iwahashi
- Subjects
Noise ,business.industry ,Computer science ,Noise reduction ,Signal Processing ,Bayesian probability ,Media Technology ,Computer vision ,Artificial intelligence ,Tone mapping ,business ,Computer Graphics and Computer-Aided Design ,Compensation (engineering) - Published
- 2021
45. A Review Paper on Automatic Number Plate Recognition System
- Author
-
Shally Gupta, Rajesh Shyam Singh, and Hardwari Lal Mandoria
- Subjects
0209 industrial biotechnology ,020901 industrial engineering & automation ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,0202 electrical engineering, electronic engineering, information engineering ,Recognition system ,020201 artificial intelligence & image processing ,Computer vision ,02 engineering and technology ,Artificial intelligence ,business - Abstract
Number plate recognition brings a drastic improvement for the city traffic enhancement. It provides the direction in which the steps should be taken for working of an effective intelligent transportation system. ANPR have become necessity for traffic control management due to rapid increment of vehicles. The main aim of ANPR is to monitor traffic and for security purpose. Recognition of number plate uses image processing techniques and latest technology in detecting characters on vehicle license plates automatically. In recent years, there are many technological developments in recognizing license plate area of research. Image processing protocols like OCR technology allow the traffic surveillance to deal with several problems that occurs in criminal investigation, toll collection, monitoring traffic, controlling speed, parking management etc. For efficient management of traffic and mass surveillance in transportation system, an ANPR system is essential. With the aid of image processing algorithms and vehicle images dataset, it becomes possible to monitor traffic at a large scale. Vehicle images are helpful for recognizing characters on license plates by performing image segmentation, feature extraction and character recognition. Data collected through captured images are utilized in the commercial applications, law enforcement, traffic applications etc. The software examines the vehicle picture as an input image that results in displaying the plate numbers. The system with image processing used reliably for traffic detection where modification of the technologies enables an accurate acknowledgement of vehicle number plates. There are several ANPR systems developed, working on character recognition of LP by the help of image processing technique. This paper reviews the performance by researchers in this particular area towards meeting goals of transportation system. It also provides major issues and challenges in this field.
- Published
- 2020
46. Research on image processing algorithm of immune colloidal gold test paper detection
- Author
-
Guang Yang, Tiefeng Wang, and Peng Zhang
- Subjects
CMOS sensor ,business.industry ,Computer science ,Materials Science (miscellaneous) ,Template matching ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Image processing ,HSL and HSV ,Industrial and Manufacturing Engineering ,Image (mathematics) ,Color rendering index ,Position (vector) ,Line (geometry) ,Computer vision ,Artificial intelligence ,Business and International Management ,business - Abstract
In order to better solve the problem of automatic identification of quality control line and detection line in the detection of gold standard test strip, this paper proposes to collect the image information of gold standard test strip after color rendering through CMOS sensor, preprocess the obtained information, transform RGB image into gray image, build cloud model in the CIELAB/HSV/HSL space, and apply the improved AdaBoost algorithm to determine the position of detection line and quality control line Place. Compared with the traditional template matching method, it improves the accuracy and accuracy of recognition.
- Published
- 2020
47. Experiences in digitizing and digitally measuring a paper-based ECG archive
- Author
-
Tuomas Kerola, Tuomas Kenttä, Markku Heliövaara, Aapo L. Aro, Arttu Holkeri, Kai Noponen, Harri Rissanen, Antti Eranti, Heikki V. Huikuri, M. Anette E. Haukilahti, Tapio Seppänen, M. Juhani Junttila, Paul Knekt, HYKS erva, Clinicum, Department of Medicine, Kardiologian yksikkö, and HUS Heart and Lung Center
- Subjects
Paper ,Medical Records Systems, Computerized ,Computer science ,Wave form ,electrocardiography ,Information Storage and Retrieval ,030204 cardiovascular system & hematology ,ELECTROCARDIOGRAM ,QT interval ,03 medical and health sciences ,QRS complex ,Electrocardiography ,0302 clinical medicine ,Humans ,Computer vision ,030212 general & internal medicine ,PR interval ,conversion ,Digitization ,Finland ,Measurement method ,business.industry ,Signal Processing, Computer-Assisted ,Conversion ,Paper based ,Health Surveys ,digitization ,3121 General medicine, internal medicine and other clinical medicine ,Health survey ,Artificial intelligence ,Cardiology and Cardiovascular Medicine ,business ,WAVE-FORM ,Software - Abstract
Background: No established method for digitizing and digital measuring of paper electrocardiograms (ECG) exists. We describe a paper ECG digitizing and digital measuring process, and report comparability to manual measurements. Methods: A paper ECG was recorded from 7203 health survey participants in 1978-1980. With specific software, the ECGs were digitized (ECG Trace Tool), and measured digitally (EASE). A sub-sample of 100 ECGs was selected for manual measurements. Results: The measurement methods showed good agreement. The mean global (EASE)-(manual) differences were 1.4 ms (95% CI 0.5-2.2) for PR interval, 1.0 ms (95% CI 1.5 [-0.5]) for QRS duration, and 11.6 ms (95% CI 10.5-12.7) for QT interval. The mean inter-method amplitude differences of RampV5, RampV6, SampV1, TampII and TampV5 ranged from 0.03 mV to 0.01 mV. Conclusions: The presented paper-to-digital conversion and digital measurement process is an accurate and reliable method, enabling efficient storing and analysis of paper ECGs. (C) 2017 Elsevier Inc. All rights reserved.
- Published
- 2018
48. Rorschach Inkblot Image Stylization using Silhouette Matching and Paper Texture Simulation
- Author
-
Jong Chul Yoon
- Subjects
Matching (statistics) ,business.industry ,Computer science ,General Engineering ,Computer vision ,Artificial intelligence ,business ,Texture (geology) ,Image (mathematics) ,Rorschach test ,Silhouette - Published
- 2020
49. [Paper] Multiple Human Tracking with Alternately Updating Trajectories and Multi-Frame Action Features
- Author
-
Hitoshi Nishimura, Kazuyuki Tasaka, Hiroshi Murase, and Yasutomo Kawanishi
- Subjects
Action (philosophy) ,business.industry ,Computer science ,Signal Processing ,Media Technology ,Action recognition ,Computer vision ,Artificial intelligence ,Tracking (particle physics) ,business ,Computer Graphics and Computer-Aided Design ,Drone ,Multi frame - Published
- 2020
50. Compensation of elements in the image background on the digitized papers of old books
- Author
-
I. Z. Myklushka, O. B. Havrylyshyn, and V. B. Repeta
- Subjects
Computer science ,business.industry ,Computer vision ,General Medicine ,Artificial intelligence ,business ,Image (mathematics) ,Compensation (engineering) - Published
- 2020
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.