Assisting the Visually Impaired in Multi-object Scene Description Using OWA-Based Fusion of CNN Models

Authors :: Yakoub Bazi
Naif Alajlan
Haikel Alhichri
Source :: Arabian Journal for Science and Engineering. 45:10511-10527
Publication Year :: 2020
Publisher :: Springer Science and Business Media LLC, 2020.
Abstract: Advances in technology can provide substantial support for visually impaired (VI) persons. In particular, computer vision and machine learning can provide solutions for object detection and recognition. In this work, we propose a multi-label image classification solution that helps a VI person recognize the presence of multiple objects in a scene. The solution fuses the outputs of two pre-trained deep CNN models, VGG16 and SqueezeNet, using the induced ordered weighted averaging (OWA) approach. The induced OWA approach requires a confidence measure for the outputs of the two base CNN models. To this end, we propose the residual error between the predicted output and the true output as a measure of confidence. We estimate this residual error using another dedicated CNN model that is trained on the residual errors computed from the main CNN models. The OWA technique then uses these estimated residual errors as confidence measures and fuses the decisions of the two main CNN models. When tested on four image datasets of indoor environments from two separate locations, the proposed method improves the detection accuracy compared to both base CNN models. The results are also significantly better than state-of-the-art methods reported in the literature.
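
The fusion step described in the abstract can be illustrated with a minimal sketch of induced OWA applied to two per-label score vectors. This is not the authors' implementation: the function names, the OWA weights (0.7, 0.3), the 0.5 decision threshold, and the example scores are all assumptions chosen for illustration, and the estimated residual errors are passed in as plain numbers rather than produced by a dedicated CNN.

```python
def induced_owa(values, inducers, weights):
    """Induced OWA: reorder values by decreasing inducer, then take a weighted sum."""
    if not (len(values) == len(inducers) == len(weights)):
        raise ValueError("values, inducers and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    # Sort (inducer, value) pairs so the most confident source comes first.
    pairs = sorted(zip(inducers, values), key=lambda p: p[0], reverse=True)
    return sum(w * v for w, (_, v) in zip(weights, pairs))


def fuse_predictions(probs_a, probs_b, resid_a, resid_b,
                     weights=(0.7, 0.3), threshold=0.5):
    """Fuse two models' per-label scores (hypothetical helper, not from the paper).

    resid_a / resid_b are the estimated residual errors per label. A smaller
    residual is treated as higher confidence, so it is negated to serve as
    the order-inducing variable.
    """
    fused = [
        induced_owa([pa, pb], [-ra, -rb], weights)
        for pa, pb, ra, rb in zip(probs_a, probs_b, resid_a, resid_b)
    ]
    labels = [int(score >= threshold) for score in fused]
    return fused, labels


# Made-up scores for three object labels from two base models.
probs_a = [0.9, 0.2, 0.3]
probs_b = [0.4, 0.8, 0.6]
resid_a = [0.1, 0.5, 0.1]   # assumed output of a residual-estimation model
resid_b = [0.3, 0.1, 0.4]

fused_scores, fused_labels = fuse_predictions(probs_a, probs_b, resid_a, resid_b)
print(fused_labels)  # [1, 1, 0]
```

For each label, the score from the model with the smaller estimated residual receives the larger weight, so the fused decision leans toward whichever base model is expected to be more reliable for that label.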

Subjects :: Multidisciplinary
Contextual image classification
Computer science
business.industry
Pattern recognition
02 engineering and technology
Residual
Object (computer science)
Base (topology)
Measure (mathematics)
Object detection
Image (mathematics)
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Fuse (electrical)
020201 artificial intelligence & image processing
Artificial intelligence
business

Details

ISSN :: 2191-4281 and 2193-567X
Volume :: 45
Database :: OpenAIRE
Journal :: Arabian Journal for Science and Engineering
Accession number :: edsair.doi...........6aa27273528330c74001ba57e4a5e77d
Full Text :: https://doi.org/10.1007/s13369-020-04799-7
