Author: "Li, Gongyang" / Publication Year Range: Last 50 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Li, Gongyang"' showing total 111 results

Start Over Author "Li, Gongyang" Publication Year Range Last 50 years

111 results on '"Li, Gongyang"'

1. Context-Aware Interaction Network for RGB-T Semantic Segmentation

Author: Lv, Ying, Liu, Zhi, and Li, Gongyang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: RGB-T semantic segmentation is a key technique for autonomous driving scenes understanding. For the existing RGB-T semantic segmentation methods, however, the effective exploration of the complementary relationship between different modalities is not implemented in the information interaction between multiple levels. To address such an issue, the Context-Aware Interaction Network (CAINet) is proposed for RGB-T semantic segmentation, which constructs interaction space to exploit auxiliary tasks and global context for explicitly guided learning. Specifically, we propose a Context-Aware Complementary Reasoning (CACR) module aimed at establishing the complementary relationship between multimodal features with the long-term context in both spatial and channel dimensions. Further, considering the importance of global contextual and detailed information, we propose the Global Context Modeling (GCM) module and Detail Aggregation (DA) module, and we introduce specific auxiliary supervision to explicitly guide the context interaction and refine the segmentation map. Extensive experiments on two benchmark datasets of MFNet and PST900 demonstrate that the proposed CAINet achieves state-of-the-art performance. The code is available at https://github.com/YingLv1106/CAINet., Comment: 13 pages, 7 figures, Accepted by IEEE Transactions on Multimedia 2024
Published: 2024
Full Text: View/download PDF

2. Texture-Semantic Collaboration Network for ORSI Salient Object Detection

Author: Li, Gongyang, Bai, Zhen, and Liu, Zhi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Salient object detection (SOD) in optical remote sensing images (ORSIs) has become increasingly popular recently. Due to the characteristics of ORSIs, ORSI-SOD is full of challenges, such as multiple objects, small objects, low illuminations, and irregular shapes. To address these challenges, we propose a concise yet effective Texture-Semantic Collaboration Network (TSCNet) to explore the collaboration of texture cues and semantic cues for ORSI-SOD. Specifically, TSCNet is based on the generic encoder-decoder structure. In addition to the encoder and decoder, TSCNet includes a vital Texture-Semantic Collaboration Module (TSCM), which performs valuable feature modulation and interaction on basic features extracted from the encoder. The main idea of our TSCM is to make full use of the texture features at the lowest level and the semantic features at the highest level to achieve the expression enhancement of salient regions on features. In the TSCM, we first enhance the position of potential salient regions using semantic features. Then, we render and restore the object details using the texture features. Meanwhile, we also perceive regions of various scales, and construct interactions between different regions. Thanks to the perfect combination of TSCM and generic structure, our TSCNet can take care of both the position and details of salient objects, effectively handling various scenes. Extensive experiments on three datasets demonstrate that our TSCNet achieves competitive performance compared to 14 state-of-the-art methods. The code and results of our method are available at https://github.com/MathLee/TSCNet., Comment: 5 pages, 3 figures, Accepted by IEEE Transactions on Circuits and Systems II: Express Briefs 2023
Published: 2023
Full Text: View/download PDF

3. Salient Object Detection in Optical Remote Sensing Images Driven by Transformer

Author: Li, Gongyang, Bai, Zhen, Liu, Zhi, Zhang, Xinpeng, and Ling, Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing methods for Salient Object Detection in Optical Remote Sensing Images (ORSI-SOD) mainly adopt Convolutional Neural Networks (CNNs) as the backbone, such as VGG and ResNet. Since CNNs can only extract features within certain receptive fields, most ORSI-SOD methods generally follow the local-to-contextual paradigm. In this paper, we propose a novel Global Extraction Local Exploration Network (GeleNet) for ORSI-SOD following the global-to-local paradigm. Specifically, GeleNet first adopts a transformer backbone to generate four-level feature embeddings with global long-range dependencies. Then, GeleNet employs a Direction-aware Shuffle Weighted Spatial Attention Module (D-SWSAM) and its simplified version (SWSAM) to enhance local interactions, and a Knowledge Transfer Module (KTM) to further enhance cross-level contextual interactions. D-SWSAM comprehensively perceives the orientation information in the lowest-level features through directional convolutions to adapt to various orientations of salient objects in ORSIs, and effectively enhances the details of salient objects with an improved attention mechanism. SWSAM discards the direction-aware part of D-SWSAM to focus on localizing salient objects in the highest-level features. KTM models the contextual correlation knowledge of two middle-level features of different scales based on the self-attention mechanism, and transfers the knowledge to the raw features to generate more discriminative features. Finally, a saliency predictor is used to generate the saliency map based on the outputs of the above three modules. Extensive experiments on three public datasets demonstrate that the proposed GeleNet outperforms relevant state-of-the-art methods. The code and results of our method are available at https://github.com/MathLee/GeleNet., Comment: 13 pages, 6 figures, Accepted by IEEE Transactions on Image Processing 2023
Published: 2023
Full Text: View/download PDF

4. No-Service Rail Surface Defect Segmentation via Normalized Attention and Dual-scale Interaction

Author: Li, Gongyang, Han, Chengjun, and Liu, Zhi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: No-service rail surface defect (NRSD) segmentation is an essential way for perceiving the quality of no-service rails. However, due to the complex and diverse outlines and low-contrast textures of no-service rails, existing natural image segmentation methods cannot achieve promising performance in NRSD images, especially in some unique and challenging NRSD scenes. To this end, in this paper, we propose a novel segmentation network for NRSDs based on Normalized Attention and Dual-scale Interaction, named NaDiNet. Specifically, NaDiNet follows the enhancement-interaction paradigm. The Normalized Channel-wise Self-Attention Module (NAM) and the Dual-scale Interaction Block (DIB) are two key components of NaDiNet. NAM is a specific extension of the channel-wise self-attention mechanism (CAM) to enhance features extracted from low-contrast NRSD images. The softmax layer in CAM will produce very small correlation coefficients which are not conducive to low-contrast feature enhancement. Instead, in NAM, we directly calculate the normalized correlation coefficient between channels to enlarge the feature differentiation. DIB is specifically designed for the feature interaction of the enhanced features. It has two interaction branches with dual scales, one for fine-grained clues and the other for coarse-grained clues. With both branches working together, DIB can perceive defect regions of different granularities. With these modules working together, our NaDiNet can generate accurate segmentation map. Extensive experiments on the public NRSD-MN dataset with man-made and natural NRSDs demonstrate that our proposed NaDiNet with various backbones (i.e., VGG, ResNet, and DenseNet) consistently outperforms 10 state-of-the-art methods. The code and results of our method are available at https://github.com/monxxcn/NaDiNet., Comment: 10 pages, 6 figures, Accepted by IEEE Transactions on Instrumentation and Measurement 2023
Published: 2023

5. Lightweight Salient Object Detection in Optical Remote-Sensing Images via Semantic Matching and Edge Alignment

Author: Li, Gongyang, Liu, Zhi, Zhang, Xinpeng, and Lin, Weisi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, relying on convolutional neural networks (CNNs), many methods for salient object detection in optical remote sensing images (ORSI-SOD) are proposed. However, most methods ignore the huge parameters and computational cost brought by CNNs, and only a few pay attention to the portability and mobility. To facilitate practical applications, in this paper, we propose a novel lightweight network for ORSI-SOD based on semantic matching and edge alignment, termed SeaNet. Specifically, SeaNet includes a lightweight MobileNet-V2 for feature extraction, a dynamic semantic matching module (DSMM) for high-level features, an edge self-alignment module (ESAM) for low-level features, and a portable decoder for inference. First, the high-level features are compressed into semantic kernels. Then, semantic kernels are used to activate salient object locations in two groups of high-level features through dynamic convolution operations in DSMM. Meanwhile, in ESAM, cross-scale edge information extracted from two groups of low-level features is self-aligned through L2 loss and used for detail enhancement. Finally, starting from the highest-level features, the decoder infers salient objects based on the accurate locations and fine details contained in the outputs of the two modules. Extensive experiments on two public datasets demonstrate that our lightweight SeaNet not only outperforms most state-of-the-art lightweight methods but also yields comparable accuracy with state-of-the-art conventional methods, while having only 2.76M parameters and running with 1.7G FLOPs for 288x288 inputs. Our code and results are available at https://github.com/MathLee/SeaNet., Comment: 11 pages, 4 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2023
Published: 2023
Full Text: View/download PDF

6. RGB-T Semantic Segmentation with Location, Activation, and Sharpening

Author: Li, Gongyang, Wang, Yike, Liu, Zhi, Zhang, Xinpeng, and Zeng, Dan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Semantic segmentation is important for scene understanding. To address the scenes of adverse illumination conditions of natural images, thermal infrared (TIR) images are introduced. Most existing RGB-T semantic segmentation methods follow three cross-modal fusion paradigms, i.e. encoder fusion, decoder fusion, and feature fusion. Some methods, unfortunately, ignore the properties of RGB and TIR features or the properties of features at different levels. In this paper, we propose a novel feature fusion-based network for RGB-T semantic segmentation, named \emph{LASNet}, which follows three steps of location, activation, and sharpening. The highlight of LASNet is that we fully consider the characteristics of cross-modal features at different levels, and accordingly propose three specific modules for better segmentation. Concretely, we propose a Collaborative Location Module (CLM) for high-level semantic features, aiming to locate all potential objects. We propose a Complementary Activation Module for middle-level features, aiming to activate exact regions of different objects. We propose an Edge Sharpening Module (ESM) for low-level texture features, aiming to sharpen the edges of objects. Furthermore, in the training phase, we attach a location supervision and an edge supervision after CLM and ESM, respectively, and impose two semantic supervisions in the decoder part to facilitate network convergence. Experimental results on two public datasets demonstrate that the superiority of our LASNet over relevant state-of-the-art methods. The code and results of our method are available at https://github.com/MathLee/LASNet., Comment: 12 pages, 7 figures, Accepted by IEEE Transactions on Circuits and Systems for Video Technology 2022
Published: 2022
Full Text: View/download PDF

7. Masked feature regeneration based asymmetric student–teacher network for anomaly detection

Author: Gu, Haocheng, Li, Gongyang, and Liu, Zhi
Published: 2024
Full Text: View/download PDF

8. Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images

Author: Li, Gongyang, Liu, Zhi, Zeng, Dan, Lin, Weisi, and Ling, Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Salient object detection (SOD) in optical remote sensing images (RSIs), or RSI-SOD, is an emerging topic in understanding optical RSIs. However, due to the difference between optical RSIs and natural scene images (NSIs), directly applying NSI-SOD methods to optical RSIs fails to achieve satisfactory results. In this paper, we propose a novel Adjacent Context Coordination Network (ACCoNet) to explore the coordination of adjacent features in an encoder-decoder architecture for RSI-SOD. Specifically, ACCoNet consists of three parts: an encoder, Adjacent Context Coordination Modules (ACCoMs), and a decoder. As the key component of ACCoNet, ACCoM activates the salient regions of output features of the encoder and transmits them to the decoder. ACCoM contains a local branch and two adjacent branches to coordinate the multi-level features simultaneously. The local branch highlights the salient regions in an adaptive way, while the adjacent branches introduce global information of adjacent levels to enhance salient regions. Additionally, to extend the capabilities of the classic decoder block (i.e., several cascaded convolutional layers), we extend it with two bifurcations and propose a Bifurcation-Aggregation Block to capture the contextual information in the decoder. Extensive experiments on two benchmark datasets demonstrate that the proposed ACCoNet outperforms 22 state-of-the-art methods under nine evaluation metrics, and runs up to 81 fps on a single NVIDIA Titan X GPU. The code and results of our method are available at https://github.com/MathLee/ACCoNet., Comment: 13 pages, 7 figures, Accepted by IEEE Transactions on Cybernetics 2022
Published: 2022

9. Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation

Author: Li, Gongyang, Liu, Zhi, Bai, Zhen, Lin, Weisi, and Ling, and Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Salient object detection in optical remote sensing images (ORSI-SOD) has been widely explored for understanding ORSIs. However, previous methods focus mainly on improving the detection accuracy while neglecting the cost in memory and computation, which may hinder their real-world applications. In this paper, we propose a novel lightweight ORSI-SOD solution, named CorrNet, to address these issues. In CorrNet, we first lighten the backbone (VGG-16) and build a lightweight subnet for feature extraction. Then, following the coarse-to-fine strategy, we generate an initial coarse saliency map from high-level semantic features in a Correlation Module (CorrM). The coarse saliency map serves as the location guidance for low-level features. In CorrM, we mine the object location information between high-level semantic features through the cross-layer correlation operation. Finally, based on low-level detailed features, we refine the coarse saliency map in the refinement subnet equipped with Dense Lightweight Refinement Blocks, and produce the final fine saliency map. By reducing the parameters and computations of each component, CorrNet ends up having only 4.09M parameters and running with 21.09G FLOPs. Experimental results on two public datasets demonstrate that our lightweight CorrNet achieves competitive or even better performance compared with 26 state-of-the-art methods (including 16 large CNN-based methods and 2 lightweight methods), and meanwhile enjoys the clear memory and run time efficiency. The code and results of our method are available at https://github.com/MathLee/CorrNet., Comment: 11 pages, 6 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2022
Published: 2022
Full Text: View/download PDF

10. Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images

Author: Li, Gongyang, Liu, Zhi, Lin, Weisi, and Ling, Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: In the computer vision community, great progresses have been achieved in salient object detection from natural scene images (NSI-SOD); by contrast, salient object detection in optical remote sensing images (RSI-SOD) remains to be a challenging emerging topic. The unique characteristics of optical RSIs, such as scales, illuminations and imaging orientations, bring significant differences between NSI-SOD and RSI-SOD. In this paper, we propose a novel Multi-Content Complementation Network (MCCNet) to explore the complementarity of multiple content for RSI-SOD. Specifically, MCCNet is based on the general encoder-decoder architecture, and contains a novel key component named Multi-Content Complementation Module (MCCM), which bridges the encoder and the decoder. In MCCM, we consider multiple types of features that are critical to RSI-SOD, including foreground features, edge features, background features, and global image-level features, and exploit the content complementarity between them to highlight salient regions over various scales in RSI features through the attention mechanism. Besides, we comprehensively introduce pixel-level, map-level and metric-aware losses in the training phase. Extensive experiments on two popular datasets demonstrate that the proposed MCCNet outperforms 23 state-of-the-art methods, including both NSI-SOD and RSI-SOD methods. The code and results of our method are available at https://github.com/MathLee/MCCNet., Comment: 12 pages, 7 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2021
Published: 2021
Full Text: View/download PDF

11. Spatio-Temporal Self-Attention Network for Video Saliency Prediction

Author: Wang, Ziqiang, Liu, Zhi, Li, Gongyang, Wang, Yang, Zhang, Tianhong, Xu, Lihua, and Wang, Jijun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 3D convolutional neural networks have achieved promising results for video tasks in computer vision, including video saliency prediction that is explored in this paper. However, 3D convolution encodes visual representation merely on fixed local spacetime according to its kernel size, while human attention is always attracted by relational visual features at different time. To overcome this limitation, we propose a novel Spatio-Temporal Self-Attention 3D Network (STSANet) for video saliency prediction, in which multiple Spatio-Temporal Self-Attention (STSA) modules are employed at different levels of 3D convolutional backbone to directly capture long-range relations between spatio-temporal features of different time steps. Besides, we propose an Attentional Multi-Scale Fusion (AMSF) module to integrate multi-level features with the perception of context in semantic and spatio-temporal subspaces. Extensive experiments demonstrate the contributions of key components of our method, and the results on DHF1K, Hollywood-2, UCF, and DIEM benchmark datasets clearly prove the superiority of the proposed model compared with all state-of-the-art models.
Published: 2021
Full Text: View/download PDF

12. Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

Author: Li, Gongyang, Liu, Zhi, Shi, Ran, Hu, Zheng, Wei, Weijie, Wu, Yong, Huang, Mengke, and Ling, Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: As a natural way for human-computer interaction, fixation provides a promising solution for interactive image segmentation. In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction. In particular, we first construct a new PFOS dataset by carefully collecting pixel-level binary annotation data over an existing fixation prediction dataset, such dataset is expected to greatly facilitate the study along the line. Then, considering characteristics of personal fixations, we propose a novel network based on Object Localization and Boundary Preservation (OLBP) to segment the gazed objects. Specifically, the OLBP network utilizes an Object Localization Module (OLM) to analyze personal fixations and locates the gazed objects based on the interpretation. Then, a Boundary Preservation Module (BPM) is designed to introduce additional boundary information to guard the completeness of the gazed objects. Moreover, OLBP is organized in the mixed bottom-up and top-down manner with multiple types of deep supervision. Extensive experiments on the constructed PFOS dataset show the superiority of the proposed OLBP network over 17 state-of-the-art methods, and demonstrate the effectiveness of the proposed OLM and BPM components. The constructed PFOS dataset and the proposed OLBP network are available at https://github.com/MathLee/OLBPNet4PFOS., Comment: Accepted by IEEE TIP. Code: https://github.com/MathLee/OLBPNet4PFOS
Published: 2021
Full Text: View/download PDF

13. Audio-visual saliency prediction with multisensory perception and integration

Author: Xie, Jiawei, Liu, Zhi, Li, Gongyang, and Song, Yingjie
Published: 2024
Full Text: View/download PDF

14. EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation

Author: Chen, Jianlin, Li, Gongyang, Zhang, Zhijiang, and Zeng, Dan
Published: 2024
Full Text: View/download PDF

15. Global semantic-guided network for saliency prediction

Author: Xie, Jiawei, Liu, Zhi, Li, Gongyang, Lu, Xiaofeng, and Chen, Tao
Published: 2024
Full Text: View/download PDF

16. Cross-Modal Weighting Network for RGB-D Salient Object Detection

Author: Li, Gongyang, Liu, Zhi, Ye, Linwei, Wang, Yang, and Ling, Haibin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Depth maps contain geometric clues for assisting Salient Object Detection (SOD). In this paper, we propose a novel Cross-Modal Weighting (CMW) strategy to encourage comprehensive interactions between RGB and depth channels for RGB-D SOD. Specifically, three RGB-depth interaction modules, named CMW-L, CMW-M and CMW-H, are developed to deal with respectively low-, middle- and high-level cross-modal information fusion. These modules use Depth-to-RGB Weighing (DW) and RGB-to-RGB Weighting (RW) to allow rich cross-modal and cross-scale interactions among feature layers generated by different network blocks. To effectively train the proposed Cross-Modal Weighting Network (CMWNet), we design a composite loss function that summarizes the errors between intermediate predictions and ground truth over different scales. With all these novel components working together, CMWNet effectively fuses information from RGB and depth channels, and meanwhile explores object localization and details across scales. Thorough evaluations demonstrate CMWNet consistently outperforms 15 state-of-the-art RGB-D SOD methods on seven popular benchmarks., Comment: Accepted in ECCV2020. Code: https://github.com/MathLee/CMWNet
Published: 2020

17. Global–local–global context-aware network for salient object detection in optical remote sensing images

Author: Bai, Zhen, Li, Gongyang, and Liu, Zhi
Published: 2023
Full Text: View/download PDF

18. Exploring viewport features for semi-supervised saliency prediction in omnidirectional images

Author: Huang, Mengke, Li, Gongyang, Liu, Zhi, Wu, Yong, Gong, Chen, Zhu, Linchao, and Yang, Yi
Published: 2023
Full Text: View/download PDF

19. Cross-Scale Edge Purification Network for salient object detection of steel defect images

Author: Ding, Tuo, Li, Gongyang, Liu, Zhi, and Wang, Yike
Published: 2022
Full Text: View/download PDF

20. Texture-Semantic Collaboration Network for ORSI Salient Object Detection

Author: Li, Gongyang, primary, Bai, Zhen, additional, and Liu, Zhi, additional
Published: 2024
Full Text: View/download PDF

21. Circular Complement Network for RGB-D Salient Object Detection

Author: Bai, Zhen, Liu, Zhi, Li, Gongyang, Ye, Linwei, and Wang, Yang
Published: 2021
Full Text: View/download PDF

22. Personalized image observation behavior learning in fixation based personalized salient object segmentation

Author: Shi, Ran, Li, Gongyang, Wei, Weijie, Zhou, Xiaofei, and Liu, Zhi
Published: 2021
Full Text: View/download PDF

23. Weakly supervised instance segmentation using multi-stage erasing refinement and saliency-guided proposals ordering

Author: Hu, Zheng, Liu, Zhi, Li, Gongyang, Ye, Linwei, Zhou, Lei, and Wang, Yang
Published: 2020
Full Text: View/download PDF

24. Attention-guided RGBD saliency detection using appearance information

Author: Zhou, Xiaofei, Li, Gongyang, Gong, Chen, Liu, Zhi, and Zhang, Jiyong
Published: 2020
Full Text: View/download PDF

25. Light Field Salient Object Detection With Sparse Views via Complementary and Discriminative Interaction Network

Author: Chen, Yilei, primary, Li, Gongyang, additional, An, Ping, additional, Liu, Zhi, additional, Huang, Xinpeng, additional, and Wu, Qiang, additional
Published: 2024
Full Text: View/download PDF

26. Context-Aware Interaction Network for RGB-T Semantic Segmentation

Author: Lv, Ying, primary, Liu, Zhi, additional, and Li, Gongyang, additional
Published: 2024
Full Text: View/download PDF

27. Video Saliency Detection Using Deep Convolutional Neural Networks

Author: Zhou, Xiaofei, Liu, Zhi, Gong, Chen, Li, Gongyang, Huang, Mengke, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Lai, Jian-Huang, editor, Liu, Cheng-Lin, editor, Chen, Xilin, editor, Zhou, Jie, editor, Tan, Tieniu, editor, Zheng, Nanning, editor, and Zha, Hongbin, editor
Published: 2018
Full Text: View/download PDF

28. Constrained fixation point based segmentation via deep neural network

Author: Li, Gongyang, Liu, Zhi, Shi, Ran, and Wei, Weijie
Published: 2019
Full Text: View/download PDF

29. EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation

Author: Chen, Jianlin, primary, Li, Gongyang, additional, Zhang, Zhijiang, additional, and Zeng, Dan, additional
Published: 2023
Full Text: View/download PDF

30. Global semantic-guided network for saliency prediction

Author: Xie, Jiawei, primary, Liu, Zhi, additional, Li, Gongyang, additional, Lu, Xiaofeng, additional, and Chen, Tao, additional
Published: 2023
Full Text: View/download PDF

31. Lightweight Distortion-Aware Network for Salient Object Detection in Omnidirectional Images

Author: Huang, Mengke, primary, Li, Gongyang, additional, Liu, Zhi, additional, and Zhu, Linchao, additional
Published: 2023
Full Text: View/download PDF

32. Cross-Modal Weighting Network for RGB-D Salient Object Detection

Author: Li, Gongyang, primary, Liu, Zhi, additional, Ye, Linwei, additional, Wang, Yang, additional, and Ling, Haibin, additional
Published: 2020
Full Text: View/download PDF

33. Effective online refinement for video object segmentation

Author: Li, Gongyang, Liu, Zhi, and Zhou, Xiaofei
Published: 2019
Full Text: View/download PDF

34. Video Saliency Detection Using Deep Convolutional Neural Networks

Author: Zhou, Xiaofei, primary, Liu, Zhi, additional, Gong, Chen, additional, Li, Gongyang, additional, and Huang, Mengke, additional
Published: 2018
Full Text: View/download PDF

35. Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images

Author: Li, Gongyang, primary, Liu, Zhi, additional, Zeng, Dan, additional, Lin, Weisi, additional, and Ling, Haibin, additional
Published: 2023
Full Text: View/download PDF

36. RINet: Relative Importance-Aware Network for Fixation Prediction

Author: Song, Yingjie, primary, Liu, Zhi, additional, Li, Gongyang, additional, Zeng, Dan, additional, Zhang, Tianhong, additional, Xu, Lihua, additional, and Wang, Jijun, additional
Published: 2023
Full Text: View/download PDF

37. Salient Object Detection in Optical Remote Sensing Images Driven by Transformer

Author: Li, Gongyang, primary, Bai, Zhen, additional, Liu, Zhi, additional, Zhang, Xinpeng, additional, and Ling, Haibin, additional
Published: 2023
Full Text: View/download PDF

38. Lightweight Salient Object Detection in Optical Remote-Sensing Images via Semantic Matching and Edge Alignment

Author: Li, Gongyang, primary, Liu, Zhi, additional, Zhang, Xinpeng, additional, and Lin, Weisi, additional
Published: 2023
Full Text: View/download PDF

39. Spatio-Temporal Self-Attention Network for Video Saliency Prediction

Author: Wang, Ziqiang, primary, Liu, Zhi, additional, Li, Gongyang, additional, Wang, Yang, additional, Zhang, Tianhong, additional, Xu, Lihua, additional, and Wang, Jijun, additional
Published: 2023
Full Text: View/download PDF

40. Adaptive Group-Wise Consistency Network for Co-Saliency Detection

Author: Bai, Zhen, primary, Liu, Zhi, additional, Li, Gongyang, additional, and Wang, Yang, additional
Published: 2023
Full Text: View/download PDF

41. SGFNet: Semantic-Guided Fusion Network for RGB-Thermal Semantic Segmentation

Author: Wang, Yike, Li, Gongyang, and Liu, Zhi
Abstract: Recently, semantic segmentation based on RGB and thermal infrared (TIR) images has become a research hotspot because of its stability in the weak light environment. However, most of the current methods ignore the differences between the two modalities of data and do not use semantic information in multi-modal fusion. In this paper, we propose a novel Semantic-Guided Fusion Network (SGFNet) for RGB-Thermal semantic segmentation, which makes full use of semantic information in the multi-modal fusion. Our SGFNet consists of an asymmetric encoder with TIR branch and RGB branch and a decoder. We concentrate on enhancing the multi-modal feature representation in the encoder with a pattern of fusion and enhancement. Specifically, considering that TIR images are stable under weak light conditions, we first propose a Semantic Guidance Head to extract semantic information in the TIR branch. In the RGB branch, we propose a Multi-modal Coordination and Distillation Unit to fuse multi-modal features first. Then, we propose a Cross-level and Semantic-guided Enhancement Unit to enhance the fused features with cross-level information and semantic information. We arrange these two units at all stages of the RGB branch to generate features with strong representation abilities at different levels. For the decoder, to obtain large receptive fields and fine edges, we improve the Lawin ASPP decoder by introducing edge information extracted from the low-level features, proposing the edge-aware Lawin ASPP decoder. With our encoder and decoder working together, our SGFNet can identify objects accurately and segment objects finely. Extensive experiments on the MFNet dataset demonstrate the superior performance of the proposed SGFNet compared with state-of-the-art methods. The code and results of our method are available at https://github.com/kw717/SGFNet.
Published: 2023
Full Text: View/download PDF

42. Gaze Estimation via Modulation-Based Adaptive Network With Auxiliary Self-Learning

Author: Wu, Yong, primary, Li, Gongyang, additional, Liu, Zhi, additional, Huang, Mengke, additional, and Wang, Yang, additional
Published: 2022
Full Text: View/download PDF

43. Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images

Author: Li, Gongyang, primary, Liu, Zhi, additional, Lin, Weisi, additional, and Ling, Haibin, additional
Published: 2022
Full Text: View/download PDF

44. Two-Stage Edge Reuse Network for Salient Object Detection of Strip Steel Surface Defects

Author: Han, Chengjun, primary, Li, Gongyang, additional, and Liu, Zhi, additional
Published: 2022
Full Text: View/download PDF

45. Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation

Author: Li, Gongyang, primary, Liu, Zhi, additional, Bai, Zhen, additional, Lin, Weisi, additional, and Ling, Haibin, additional
Published: 2022
Full Text: View/download PDF

46. RGB-T Semantic Segmentation with Location, Activation, and Sharpening

Author: Li, Gongyang, primary, Wang, Yike, additional, Liu, Zhi, additional, Zhang, Xinpeng, additional, and Zeng, Dan, additional
Published: 2022
Full Text: View/download PDF

47. RGB-T Semantic Segmentation With Location, Activation, and Sharpening

Author: Li, Gongyang, Wang, Yike, Liu, Zhi, Zhang, Xinpeng, and Zeng, Dan
Abstract: Semantic segmentation is important for scene understanding. To address the scenes of adverse illumination conditions of natural images, thermal infrared (TIR) images are introduced. Most existing RGB-T semantic segmentation methods follow three cross-modal fusion paradigms, i. e., encoder fusion, decoder fusion, and feature fusion. Some methods, unfortunately, ignore the properties of RGB and TIR features or the properties of features at different levels. In this paper, we propose a novel feature fusion-based network for RGB-T semantic segmentation, named LASNet, which follows three steps of location, activation, and sharpening. The highlight of LASNet is that we fully consider the characteristics of cross-modal features at different levels, and accordingly propose three specific modules for better segmentation. Concretely, we propose a Collaborative Location Module (CLM) for high-level semantic features, aiming to locate all potential objects. We propose a Complementary Activation Module for middle-level features, aiming to activate exact regions of different objects. We propose an Edge Sharpening Module (ESM) for low-level texture features, aiming to sharpen the edges of objects. Furthermore, in the training phase, we attach a location supervision and an edge supervision after CLM and ESM, respectively, and impose two semantic supervisions in the decoder part to facilitate network convergence. Experimental results on two public datasets demonstrate that the superiority of our LASNet over relevant state-of-the-art methods. The code and results of our method are available at https://github.com/MathLee/LASNet.
Published: 2023
Full Text: View/download PDF

48. Fixations based personal target objects segmentation

Author: Shi, Ran, primary, Li, Gongyang, additional, Wei, Weijie, additional, and Liu, Zhi, additional
Published: 2021
Full Text: View/download PDF

49. Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection

Author: Li, Gongyang, primary, Liu, Zhi, additional, Chen, Minyu, additional, Bai, Zhen, additional, Lin, Weisi, additional, and Ling, Haibin, additional
Published: 2021
Full Text: View/download PDF

50. Personal Fixations-Based Object Segmentation With Object Localization and Boundary Preservation

Author: Li, Gongyang, primary, Liu, Zhi, additional, Shi, Ran, additional, Hu, Zheng, additional, Wei, Weijie, additional, Wu, Yong, additional, Huang, Mengke, additional, and Ling, Haibin, additional
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

111 results on '"Li, Gongyang"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources