Back to Search Start Over

A Metaverse text recognition model based on character-level contrastive learning.

Authors :
Sun, Le
Li, Huiyun
Muhammad, Ghulam
Source :
Applied Soft Computing; Dec2023:Part A, Vol. 149, pN.PAG-N.PAG, 1p
Publication Year :
2023

Abstract

Accurate and efficient text recognition can enhance the accuracy and time efficiency of human–computer interaction and information exchange in noise Metaverse scenarios. For example, it can improve the safety of digital twin-based intelligent transportation and the intelligence of Non-Player Characters. Robust features play a vital role in the performance of scene text recognition models in noise Metaverse situations. To extract robust features, and improve the accuracy and time efficiency of scene text recognition, we propose a Chara cter level text recognition model in Meta verse applications, called MetaChara. It contains two main components: a lightweight text feature extraction module (LightFeature), and a robust character recognition module (RobChara). LightFeature leverages the advantage of global feature aggregation in the primitive representation learning network to handle irregular text images.RobChara incorporates the capability of contrastive learning from the momentum contrast method, improving the robustness of feature extraction in MetaChara. It structures a feature queue for organized storage. By optimizing the similarity of intra-character features and maximizing inter-character differences, it makes the model better adapted to scene text recognition tasks in Metaverse. Experiment results demonstrate that MetaChara is light with 29.14 million parameters and time efficient with an average recognition speed of 1.73 s. It also achieves excellent performance in terms of FLoating-point Operations (FLOPs), registering only 59.60 billion times for each operation. MetaChara achieves an average accuracy of 0.969 for character recognition. We present a case study where MetaChara quickly and accurately recognizes scene texts within the context of autonomous driving in the Metaverse. This demonstrates how MetaChara enhances safety and improves time efficiency for intelligent transportation systems. • Introducing MetaChara, a contrastive learning model for scene text in the Metaverse. • MetaChara blends GCN with MoCo for faster and accurate text recognition. • Our vision strategy boosts noise-free image expansion for character strength. • MetaChara is fast and light with high accuracy in noisy Metaverse scenarios. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15684946
Volume :
149
Database :
Supplemental Index
Journal :
Applied Soft Computing
Publication Type :
Academic Journal
Accession number :
173726268
Full Text :
https://doi.org/10.1016/j.asoc.2023.110969