Author: "Jie, Bo" / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Jie, Bo"' showing total 10 results

Start Over Author "Jie, Bo" Topic computer science

10 results on '"Jie, Bo"'

1. Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism

Author: Jie-Bo Hou, Long-Huang Wu, Xiaobin Zhu, Chang Liu, Xu-Cheng Yin, Hongfa Wang, and Chun Yang
Subjects: Ground truth, Pixel, Computer science, Orientation (computer vision), Mechanical Engineering, Feature extraction, computer.software_genre, Object detection, Computer Science Applications, Active appearance model, Robustness (computer science), ComputerApplications_MISCELLANEOUS, Automotive Engineering, Data mining, Intelligent transportation system, computer
Abstract: Text detection in complex scene images is a challenging task for intelligent transportation. Recently, anchor mechanisms are widely utilized in scene text detection tasks. However, in existing methods, anchors are generally predefined empirically, degrading robustness to complex scenarios with various sizes and orientation variations. In this paper, we propose a novel Attention Anchor Mechanism (AAM), especially targeting at predicting appropriate anchors for each pixel. To be concrete, we regard a series of predefined anchors as basic anchors and utilize an attention model to predict weights corresponding to basic anchors. Consequently, the weighted sum of basic anchors in each pixel can obtain a predicted anchor. In this way, the gap between the predicted anchors and the corresponding ground truth boxes could be narrowed, making the network easier to regress. For facilitating the design of basic anchors, we adopt a dimension-decomposition mechanism to predict width, height, and angle of anchors, respectively. Extensive experiments on several public datasets demonstrate that our method achieves state-of-the-art performance.
Published: 2021
Full Text: View/download PDF

2. Multi-orientation scene text detection with scale-guided regression

Author: Xiaobin Zhu, Xu-Cheng Yin, Liang Min, Jingyan Qin, Chun Yang, and Jie-Bo Hou
Subjects: Series (mathematics), Scale (ratio), business.industry, Computer science, Orientation (computer vision), Cognitive Neuroscience, Pattern recognition, Regression, Computer Science Applications, Artificial Intelligence, Margin (machine learning), Bounding overwatch, Feature (machine learning), Key (cryptography), Artificial intelligence, business
Abstract: Existing multi-orientation scene text detection methods generally contain two crucial components: regression prediction for text bounding boxes and classification prediction for text/non-text. However, these methods always regard classification prediction and regression prediction as two independent procedures, neglecting fully exploring their mutual relations. Based on this key observation, we propose an innovative Scale-Guided Regression Module (SRM), specially for multi-orientation scene text detection. Equipped with width-guided kernels and height-guided kernels of different sizes, our SRM can generate a series of scale feature maps of candidate texts by capturing their shape information in classification prediction. The scale feature maps are used to predict the width and height of candidate texts, which can serve as guides for regressing bounding boxes. In this way, the procedures of classification and regression can be coherently integrated. In addition, we adopt IoU loss to train our network and then integrate IoU loss and l 1 -smooth loss for fine-tuning. Extensive experiments on publicly available datasets demonstrate the state-of-the-art performance of our method. Notably, our method achieves significant improvement of performance on long texts, e.g., on MSRA-TD500, our method outperforms Basemodel with a great margin (4.86 % in terms of Recall).
Published: 2021
Full Text: View/download PDF

3. GCCNet: Grouped channel composition network for scene text detection

Author: Xiaobin Zhu, Lei Xiao, Long-Huang Wu, Jie-Bo Hou, Chun Yang, Xu-Cheng Yin, and Chang Liu
Subjects: 0209 industrial biotechnology, Ground truth, Computer science, Intersection (set theory), Cognitive Neuroscience, 02 engineering and technology, Text detection, Composition (combinatorics), computer.software_genre, Computer Science Applications, Weighting, 020901 industrial engineering & automation, Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Data mining, computer, Block (data storage), Communication channel
Abstract: Anchor mechanism is widely applied in scene text detection methods and demonstrates promising performance. However, existing anchor mechanisms have two major limitations, namely handcrafted anchor design and hard-wired anchor assignment. We propose a novel Grouped Channels Composition(GCC) block to achieve the data-driven anchor design and adaptive anchor assignment. To be more specific, our GCC block uses optimizable anchor functions rather than handcrafted ones to achieve data-drive anchor design. In our GCC block, an adaptive anchor assignment is achieved with the attention mechanism instead of empirically assigning anchor according to the Intersection Over Union (IoU) between ground truth and targets. We then build a corresponding network named GCCNet with our GCC blocks. We also propose a Unified Loss Weighting module to alleviate the inconsistency between classification score and localization accuracy. Experiments conducted on publicly available datasets demonstrate the state-of-the-art performance of our methods.
Published: 2021
Full Text: View/download PDF

4. HAM: Hidden Anchor Mechanism for Scene Text Detection

Author: Hongfa Wang, Chang Liu, Xiaobin Zhu, Kekai Sheng, Xu-Cheng Yin, Long-Huang Wu, and Jie-Bo Hou
Subjects: Computer science, 02 engineering and technology, Text detection, Image segmentation, computer.software_genre, Computer Graphics and Computer-Aided Design, Electronic mail, Minimum bounding box, 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Data mining, computer, Software
Abstract: Direct regression and anchor are the two mainly effective and prevailing mechanisms in the paradigm of scene text detection. However, the use of direct regression-based methods may be challenging during optimization without the help of anchors as references. Unfortunately, the anchor-based methods always suffer from the careful design of the anchors, degrading the robustness to complex scenes. To address the above-mentioned problems, we propose a novel hidden anchor mechanism (HAM) especially for scene text detection. The predictions of anchors are innovatively regarded as hidden layers, and the weighted sum of the predictions is integrated into a direct regression-based network. Hence, the architecture of our HAM still has the characteristic of simplicity as with direct regression-based methods. Moreover, it is easier to optimize anchors as references with this type of method than with direct regression-based methods. In this way, our network can take advantage of both direct regression and anchor mechanisms. In addition, we decouple three kinds of one-dimensional anchors from three-dimensional anchors, greatly reducing the number of anchors in text bounding box matching without performance degradation. We also propose a post-processing technique for long text detection, named iterative regression box (IRB), which takes a few additional computational costs and can be easily generalized to other methods. Experiments on several public datasets demonstrate that the proposed method achieves state-of-the-art performance. Code is available at https://github.com/hjbplayer/HAM .
Published: 2020
Full Text: View/download PDF

5. Self-Adaptive Aspect Ratio Anchor for Oriented Object Detection in Remote Sensing Images

Author: Xu-Cheng Yin, Xiaobin Zhu, and Jie-Bo Hou
Subjects: remote sensing images, object detection, aspect ratio, anchor, Orientation (computer vision), Computer science, Science, Gaussian, 0211 other engineering and technologies, 02 engineering and technology, ENCODE, Aspect ratio (image), Object detection, symbols.namesake, Remote sensing (archaeology), Feature (computer vision), 0202 electrical engineering, electronic engineering, information engineering, symbols, General Earth and Planetary Sciences, 020201 artificial intelligence & image processing, Pyramid (image processing), 021101 geological & geomatics engineering, Remote sensing
Abstract: Object detection is a significant and challenging problem in the study of remote sensing. Since remote sensing images are typically captured with a bird’s-eye view, the aspect ratios of objects in the same category may obey a Gaussian distribution. Generally, existing object detection methods ignore exploring the distribution character of aspect ratios for improving performance in remote sensing tasks. In this paper, we propose a novel Self-Adaptive Aspect Ratio Anchor (SARA) to explicitly explore aspect ratio variations of objects in remote sensing images. To be concrete, our SARA can self-adaptively learn an appropriate aspect ratio for each category. In this way, we can only utilize a simple squared anchor (related to the strides of feature maps in Feature Pyramid Networks) to regress objects in various aspect ratios. Finally, we adopt an Oriented Box Decoder (OBD) to align the feature maps and encode the orientation information of oriented objects. Our method achieves a promising mAP value of 79.91% on the DOTA dataset.
Published: 2021
Full Text: View/download PDF

6. Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

Author: Xu-Cheng Yin, Shi-Xue Zhang, Chang Liu, Chun Yang, Xiaobin Zhu, Hongfa Wang, and Jie-Bo Hou
Subjects: FOS: Computer and information sciences, business.industry, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, Text detection, 010501 environmental sciences, 01 natural sciences, Convolutional neural network, Graph, 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Artificial intelligence, business, Relational reasoning, 0105 earth and related environmental sciences
Abstract: Arbitrary shape text detection is a challenging task due to the high variety and complexity of scenes texts. In this paper, we propose a novel unified relational reasoning graph network for arbitrary shape text detection. In our method, an innovative local graph bridges a text proposal model via Convolutional Neural Network (CNN) and a deep relational reasoning network via Graph Convolutional Network (GCN), making our network end-to-end trainable. To be concrete, every text instance will be divided into a series of small rectangular components, and the geometry attributes (e.g., height, width, and orientation) of the small components will be estimated by our text proposal model. Given the geometry attributes, the local graph construction model can roughly establish linkages between different text components. For further reasoning and deducing the likelihood of linkages between the component and its neighbors, we adopt a graph-based network to perform deep relational reasoning on local graphs. Experiments on public available datasets demonstrate the state-of-the-art performance of our method., Comment: 10 pages, Accepted by CVPR2020 Oral
Published: 2020
Full Text: View/download PDF

7. Scene Video Text Tracking With Graph Matching

Author: Shu Tian, Jie-Bo Hou, Xu-Cheng Yin, Li-Yu Meng, Chun Yang, and Wei-Yi Pei
Subjects: graph matching, General Computer Science, Matching (graph theory), Computer science, Feature extraction, 02 engineering and technology, 01 natural sciences, Text tracking, Robustness (computer science), Histogram, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, 010301 acoustics, template matching, business.industry, Template matching, General Engineering, Sorting, Pattern recognition, Tracking system, Object (computer science), 020201 artificial intelligence & image processing, lcsh:Electrical engineering. Electronics. Nuclear engineering, Artificial intelligence, business, lcsh:TK1-9971
Abstract: Video has become one of the dominant data resources with the development of the Internet. As a result, the structured sorting of videos, which can be used for storage and extraction, represents a growing concern in the community. In particular, the text within videos can carry rich semantic information, leading to many novel studies wherein text tracking and recognition are performed. One essential step in text tracking involves template matching. In general, the adjacent matrices are modeled to represent the extracted tracking object features. Then, often, the Hungarian algorithm is applied to find the correspondence pairs between consecutive frames. In many works, text features are extracted based on morphological features such as color histograms and aspect ratios. However, under those features, similar text objects are not sufficiently distinguishable to make a distinction between them. To address this issue, we regard the template matching task as a graph matching problem. The main novelty involves a graph matching approach that utilizes the relationship between two trajectories or two objects, whereby a graph matching solver can be readily used in our tracking system. By utilizing the content information, the mismatch between the same object among different frames is effectively reduced. The experimental results demonstrate that the tracker with the graph matching method tends to increase the valid correspondence of trajectories and candidate objects.
Published: 2018
Full Text: View/download PDF

8. Detecting Text in News Images with Similarity Embedded Proposals

Author: Xu-Cheng Yin, Xiaobin Zhu, Chun Yang, Jie-Bo Hou, and Miaotong Jiang
Subjects: Computer science, business.industry, 02 engineering and technology, Construct (python library), 010501 environmental sciences, computer.software_genre, 01 natural sciences, Similarity (network science), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Natural language processing, 0105 earth and related environmental sciences
Abstract: Text extraction plays an important role in news images analysis tasks. Howerver, the conglutination of subtitles and station logos makes text detection challenging. In this paper, we develop an effective news text detection framework by introducing a novel similarity embedded proposal mechanism. The main idea is to predict similarity for each fine-scale coarse proposal to help construct text bounding boxes. Specifically, a CNN and bi-directional LSTM based network is used to produce vectors embedded in coarse proposals provided by Connectionist Text Proposal Network (CTPN). Notablely, similarity embedded proposal mechanism can be generalized to other sub-text level text detection models. Comparing to the state-of-the-art method (CTPN), our framework improves F-measure by 25.2% on our Private News Dataset and 8.9% on ICDAR 2013 benchmarks, respectively.
Published: 2019
Full Text: View/download PDF

9. Application of Variable Frequency Control Technology in Solid Waste Treatment System of Nuclear Power Plant

Author: Cong Xue, Jing-Jie Bo, and Li Zhou
Subjects: Waste treatment, Treatment system, Municipal solid waste, Salient, Computer science, law, Transmission line, Nuclear power plant, Drum, Automotive engineering, law.invention, Variable frequency control
Abstract: Based on the brief introduction of the variable frequency control technology, this paper elaborates the design and implementation of the variable frequency control strategy in the transmission line of the cementation facility, and describes the salient features of the control strategy. A frequency converter switches the control of two motors. And the reactor is used to achieve long-distance transmission of the frequency signal. In this project, the frequency regulation mainly affects the operation speed of the roller, and the shift of the roller speed is to take into account both the efficiency of waste treatment and the positioning accuracy of metal drum. The successful implementation of the project provides a good theoretical basis and example for the massive, standardized application of variable frequency control technology in the nuclear power plant.
Published: 2017
Full Text: View/download PDF

10. Computer–Aided Simulation of Mastoidectomy

Author: Wen Wei–ping, Guo Jie–bo, Chen He–xin, Xu Geng, Ma Zhi–chao, and Wang Zhang–feng
Subjects: Sigmoid sinus, medicine.medical_specialty, Computer science, medicine.medical_treatment, Mastoidectomy, Computer platform, Auditory canal, Otorhinolaryngology, Jugular bulb, Temporal bone, medicine, Computer aided simulation, Radiology, Surgical simulation
Abstract: Objective To establish a three–dimensional model of the temporal bone using CT scan images for study of temporal bone structures and simulation of mastoidectomy procedures. Methods CT scan images from 6 individuals (12 temporal bones) were used to reconstruct the Fallopian canal, internal auditory canal, cochlea, semicircular canals, sigmoid sinus, posterior fossa floor and jugular bulb on a computer platform. Their anatomical relations within the temporal bone were restored in the computed model. The same model was used to simulate mastoidectomy procedures. Results The reconstructed computer model provided accurate and clear three–dimensional images of temporal bone structures. Simulation of mastoidectomy using these images provided procedural experiences closely mimicking the real surgical procedure. Conclusion Computer–aided three dimensional reconstruction of temporal bone structures using CT scan images is a useful tool in surgical simulation and can aid surgical procedure planning. Key words three–dimension reconstruction, CT scan, surgery simulation
Published: 2008
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Jie, Bo"'

1. Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism

2. Multi-orientation scene text detection with scale-guided regression

3. GCCNet: Grouped channel composition network for scene text detection

4. HAM: Hidden Anchor Mechanism for Scene Text Detection

5. Self-Adaptive Aspect Ratio Anchor for Oriented Object Detection in Remote Sensing Images

6. Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

7. Scene Video Text Tracking With Graph Matching

8. Detecting Text in News Images with Similarity Embedded Proposals

9. Application of Variable Frequency Control Technology in Solid Waste Treatment System of Nuclear Power Plant

10. Computer–Aided Simulation of Mastoidectomy

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

10 results on '"Jie, Bo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources