Author: "Wang, Zengfu" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Zengfu"' showing total 424 results

Start Over Author "Wang, Zengfu" Publication Year Range Last 10 years

424 results on '"Wang, Zengfu"'

1. An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking

Author: Hao, Yuhang, Wang, Zengfu, Fu, Jing, and Pan, Quan
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Machine Learning
Abstract: In solving the non-myopic radar scheduling for multiple smart target tracking within an active and passive radar network, we need to consider both short-term enhanced tracking performance and a higher probability of target maneuvering in the future with active tracking. Acquiring the long-term tracking performance while scheduling the beam resources of active and passive radars poses a challenge. To address this challenge, we model this problem as a Markov decision process consisting of parallel restless bandit processes. Each bandit process is associated with a smart target, of which the estimation state evolves according to different discrete dynamic models for different actions - whether or not the target is being tracked. The discrete state is defined by the dynamic mode. The problem exhibits the curse of dimensionality, where optimal solutions are in general intractable. We resort to heuristics through the famous restless multi-armed bandit techniques. It follows with efficient scheduling policies based on the indices that are real numbers representing the marginal rewards of taking different actions. For the inevitable practical case with unknown transition matrices, we propose a new method that utilizes the forward Sarsa and backward Q-learning to approximate the indices through adapting the state-action value functions, or equivalently the Q-functions, and propose a new policy, namely ISQ, aiming to maximize the long-term tracking rewards. Numerical results demonstrate that the proposed ISQ policy outperforms conventional Q-learning-based methods and rapidly converges to the well-known Whittle index policy with revealed state transition models, which is considered the benchmark., Comment: 11 pages
Published: 2024

2. Joint State Estimation and Noise Identification Based on Variational Optimization

Author: Lan, Hua, Zhao, Shijie, Hu, Jinjie, Wang, Zengfu, and Fu, Jing
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In this article, the state estimation problems with unknown process noise and measurement noise covariances for both linear and nonlinear systems are considered. By formulating the joint estimation of system state and noise parameters into an optimization problem, a novel adaptive Kalman filter method based on conjugate-computation variational inference, referred to as CVIAKF, is proposed to approximate the joint posterior probability density function of the latent variables. Unlike the existing adaptive Kalman filter methods utilizing variational inference in natural-parameter space, CVIAKF performs optimization in expectation-parameter space, resulting in a faster and simpler solution. Meanwhile, CVIAKF divides optimization objectives into conjugate and non-conjugate parts of nonlinear dynamical models, whereas conjugate computations and stochastic mirror-descent are applied, respectively. Remarkably, the reparameterization trick is used to reduce the variance of stochastic gradients of the non-conjugate parts. The effectiveness of CVIAKF is validated through synthetic and real-world datasets of maneuvering target tracking., Comment: 13 pages
Published: 2023

3. Non-myopic Beam Scheduling for Multiple Smart Target Tracking in Phased Array Radar Network

Author: Hao, Yuhang, Wang, Zengfu, Niño-Mora, José, Fu, Jing, Yang, Min, and Pan, Quan
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: A smart target, also referred to as a reactive target, can take maneuvering motions to hinder radar tracking. We address beam scheduling for tracking multiple smart targets in phased array radar networks. We aim to mitigate the performance degradation in previous myopic tracking methods and enhance the system performance, which is measured by a discounted cost objective related to the tracking error covariance (TEC) of the targets. The scheduling problem is formulated as a restless multi-armed bandit problem (RMABP) with state variables, following the Markov decision process. In particular, the problem consists of parallel bandit processes. Each bandit process is associated with a target and evolves with different transition rules for different actions, i.e., either the target is tracked or not. We propose a non-myopic, scalable policy based on Whittle indices for selecting the targets to be tracked at each time. The proposed policy has a linear computational complexity in the number of targets and the truncated time horizon in the index computation, and is hence applicable to large networks with a realistic number of targets. We present numerical evidence that the model satisfies sufficient conditions for indexability (existence of the Whittle index) based upon partial conservation laws, and, through extensive simulations, we validate the effectiveness of the proposed policy in different scenarios., Comment: 14 pages
Published: 2023

4. Classification-Aided Robust Multiple Target Tracking Using Neural Enhanced Message Passing

Author: Bai, Xianglong, Wang, Zengfu, Pan, Quan, Yun, Tao, and Lan, Hua
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Systems and Control
Abstract: We address the challenge of tracking an unknown number of targets in strong clutter environments using measurements from a radar sensor. Leveraging the range-Doppler spectra information, we identify the measurement classes, which serve as additional information to enhance clutter rejection and data association, thus bolstering the robustness of target tracking. We first introduce a novel neural enhanced message passing approach, where the beliefs obtained by the unified message passing are fed into the neural network as additional information. The output beliefs are then utilized to refine the original beliefs. Then, we propose a classification-aided robust multiple target tracking algorithm, employing the neural enhanced message passing technique. This algorithm is comprised of three modules: a message-passing module, a neural network module, and a Dempster-Shafer module. The message-passing module is used to represent the statistical model by the factor graph and infers target kinematic states, visibility states, and data associations based on the spatial measurement information. The neural network module is employed to extract features from range-Doppler spectra and derive beliefs on whether a measurement is target-generated or clutter-generated. The Dempster-Shafer module is used to fuse the beliefs obtained from both the factor graph and the neural network. As a result, our proposed algorithm adopts a model-and-data-driven framework, effectively enhancing clutter suppression and data association, leading to significant improvements in multiple target tracking performance. We validate the effectiveness of our approach using both simulated and real data scenarios, demonstrating its capability to handle challenging tracking scenarios in practical radar applications., Comment: 15 pages
Published: 2023

5. Coordinated Multi-Agent Patrolling with State-Dependent Cost Rates -- Asymptotically Optimal Policies for Large-Scale Systems

Author: Fu, Jing, Wang, Zengfu, and Chen, Jie
Subjects: Mathematics - Optimization and Control, Mathematics - Probability, 90B36 (primary) 90B80, 93E20 (secondary), G.3
Abstract: We study a large-scale patrol problem with state-dependent costs and multi-agent coordination.We consider heterogeneous agents, rather general reward functions, and the capabilities of tracking agents' trajectories.Given the complexity and uncertainty of the practical situations for patrolling, we model the problem as a discrete-time Markov decision process (MDP) that consists of a large number of parallel stochastic processes.We aim to minimize the cumulative patrolling cost over a finite time horizon.The problem exhibits an excessively large size of state space, which increases exponentially in the number of agents and the size of geographical region for patrolling.To reach practical solutions, we relax the dependencies between these parallel stochastic processes by randomizing all the state and action variables.In this context, the entire problem can be decomposed into a number of sub-problems, each of which has a much smaller state space and can be solved independently. The solutions of these sub-problems can lead to efficient heuristics.Unlike the past systems assuming relatively simple structure of the underlying stochastic process, here, tracking the patrol trajectories involves strong dependencies between the stochastic processes, leading to entirely different state and action spaces, transition kernels, and behaviours of processes,rendering the existing methods inapplicable or impractical.Further more, we prove that the performance deviation between the proposed policies and the possible optimal solution diminishes exponentially in the problem size, which also establishes the fact that the policies converge asymptotically at an exponential rate., Comment: 51 pages, 11 figures
Published: 2023

6. Exploring Part-Informed Visual-Language Learning for Person Re-Identification

Author: Lin, Yin, Liu, Cong, Chen, Yehansen, Hu, Jinshui, Yin, Bing, Yin, Baocai, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, visual-language learning has shown great potential in enhancing visual-based person re-identification (ReID). Existing visual-language learning-based ReID methods often focus on whole-body scale image-text feature alignment, while neglecting supervisions on fine-grained part features. This choice simplifies the learning process but cannot guarantee within-part feature semantic consistency thus hindering the final performance. Therefore, we propose to enhance fine-grained visual features with part-informed language supervision for ReID tasks. The proposed method, named Part-Informed Visual-language Learning ($\pi$-VL), suggests that (i) a human parsing-guided prompt tuning strategy and (ii) a hierarchical fusion-based visual-language alignment paradigm play essential roles in ensuring within-part feature semantic consistency. Specifically, we combine both identity labels and parsing maps to constitute pixel-level text prompts and fuse multi-stage visual features with a light-weight auxiliary head to perform fine-grained image-text alignment. As a plug-and-play and inference-free solution, our $\pi$-VL achieves substantial improvements over previous state-of-the-arts on four common-used ReID benchmarks, especially reporting 90.3% Rank-1 and 76.5% mAP for the most challenging MSMT17 database without bells and whistles., Comment: 11 pages, 5 figures
Published: 2023

7. Combinatorial-restless-bandit-based Transmitter-Receiver Online Selection for Distributed MIMO Radars With Non-Stationary Channels

Author: Hao, Yuhang, Wang, Zengfu, Fu, Jing, Bai, Xianglong, Li, Can, and Pan, Quan
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: We track moving targets with a distributed multiple-input multiple-output (MIMO) radar, for which the transmitters and receivers are appropriately paired and selected with a limited number of radar stations. We aim to maximize the sum of the signal-to-interference-plus-noise ratios (SINRs) of all the targets by sensibly selecting the transmitter-receiver pairs during the tracking period. A key is to model the optimization problem of selecting the transmitter-receiver pairs by a restless multi-armed bandit (RMAB) model that is able to formulate the time-varying signals of the transceiver channels whenever the channels are being probed or not. We regard the estimated mean reward (i.e., SINR) as the state of an arm. If an arm is probed, the estimated mean reward of the arm is the weighted sum of the observed reward and the predicted mean reward; otherwise, it is the predicted mean reward. We associate the predicted mean reward with the estimated mean reward at the previous time slot and the state of the target, which is estimated via the interacting multiple model-unscented Kalman filter (IMM-UKF). The optimized selection of transmitter-receiver pairs at each time is accomplished by using Binary Particle Swarm Optimization (BPSO) based on indexes of arms, each of which is designed by the upper confidence bound (UCB1) algorithm. Above all, a multi-group combinatorial-restless-bandit technique taking into account of different combinations of transmitters and receivers and the closed-loop scheme between transmitter-receiver pair selection and target state estimation, namely MG-CRB-CL, is developed to achieve a near-optimal selection strategy and improve multi-target tracking performance. Simulation results for different scenarios are provided to verify the effectiveness and superior performance of our MG-CRB-CL algorithm., Comment: 13 pages
Published: 2023

8. A Sea-Land Clutter Classification Framework for Over-the-Horizon-Radar Based on Weighted Loss Semi-supervised GAN

Author: Zhang, Xiaoxuan, Wang, Zengfu, Lu, Kun, Pan, Quan, and Li, Yang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Systems and Control
Abstract: Deep convolutional neural network has made great achievements in sea-land clutter classification for over-the-horizon-radar (OTHR). The premise is that a large number of labeled training samples must be provided for a sea-land clutter classifier. In practical engineering applications, it is relatively easy to obtain label-free sea-land clutter samples. However, the labeling process is extremely cumbersome and requires expertise in the field of OTHR. To solve this problem, we propose an improved generative adversarial network, namely weighted loss semi-supervised generative adversarial network (WL-SSGAN). Specifically, we propose a joint feature matching loss by weighting the middle layer features of the discriminator of semi-supervised generative adversarial network. Furthermore, we propose the weighted loss of WL-SSGAN by linearly weighting standard adversarial loss and joint feature matching loss. The semi-supervised classification performance of WL-SSGAN is evaluated on a sea-land clutter dataset. The experimental results show that WL-SSGAN can improve the performance of the fully supervised classifier with only a small number of labeled samples by utilizing a large number of unlabeled sea-land clutter samples. Further, the proposed weighted loss is superior to both the adversarial loss and the feature matching loss. Additionally, we compare WL-SSGAN with conventional semi-supervised classification methods and demonstrate that WL-SSGAN achieves the highest classification accuracy., Comment: 9 pages
Published: 2023

9. Variational Nonlinear Kalman Filtering with Unknown Process Noise Covariance

Author: Lan, Hua, Hu, Jinjie, Wang, Zengfu, and Cheng, Qiang
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Machine Learning
Abstract: Motivated by the maneuvering target tracking with sensors such as radar and sonar, this paper considers the joint and recursive estimation of the dynamic state and the time-varying process noise covariance in nonlinear state space models. Due to the nonlinearity of the models and the non-conjugate prior, the state estimation problem is generally intractable as it involves integrals of general nonlinear functions and unknown process noise covariance, resulting in the posterior probability distribution functions lacking closed-form solutions. This paper presents a recursive solution for joint nonlinear state estimation and model parameters identification based on the approximate Bayesian inference principle. The stochastic search variational inference is adopted to offer a flexible, accurate, and effective approximation of the posterior distributions. We make two contributions compared to existing variational inference-based noise adaptive filtering methods. First, we introduce an auxiliary latent variable to decouple the latent variables of dynamic state and process noise covariance, thereby improving the flexibility of the posterior inference. Second, we split the variational lower bound optimization into conjugate and non-conjugate parts, whereas the conjugate terms are directly optimized that admit a closed-form solution and the non-conjugate terms are optimized by natural gradients, achieving the trade-off between inference speed and accuracy. The performance of the proposed method is verified on radar target tracking applications by both simulated and real-world data., Comment: 11 pages
Published: 2023

10. Chinese text recognition enhanced by glyph and character semantic information

Author: Wu, Shilian, Li, Yongrui, and Wang, Zengfu
Published: 2024
Full Text: View/download PDF

11. Data Augmentation and Classification of Sea-Land Clutter for Over-the-Horizon Radar Using AC-VAEGAN

Author: Zhang, Xiaoxuan, Wang, Zengfu, Lu, Kun, and Pan, Quan
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: In the sea-land clutter classification of sky-wave over-the-horizon-radar (OTHR), the imbalanced and scarce data leads to a poor performance of the deep learning-based classification model. To solve this problem, this paper proposes an improved auxiliary classifier generative adversarial network~(AC-GAN) architecture, namely auxiliary classifier variational autoencoder generative adversarial network (AC-VAEGAN). AC-VAEGAN can synthesize higher quality sea-land clutter samples than AC-GAN and serve as an effective tool for data augmentation. Specifically, a 1-dimensional convolutional AC-VAEGAN architecture is designed to synthesize sea-land clutter samples. Additionally, an evaluation method combining both traditional evaluation of GAN domain and statistical evaluation of signal domain is proposed to evaluate the quality of synthetic samples. Using a dataset of OTHR sea-land clutter, both the quality of the synthetic samples and the performance of data augmentation of AC-VAEGAN are verified. Further, the effect of AC-VAEGAN as a data augmentation method on the classification performance of imbalanced and scarce sea-land clutter samples is validated. The experiment results show that the quality of samples synthesized by AC-VAEGAN is better than that of AC-GAN, and the data augmentation method with AC-VAEGAN is able to improve the classification performance in the case of imbalanced and scarce sea-land clutter samples., Comment: 13 pages, 16 figures
Published: 2023
Full Text: View/download PDF

12. Robust Multitarget Tracking in Interference Environments: A Message-Passing Approach

Author: Bai, Xianglong, Lan, Hua, Wang, Zengfu, Pan, Quan, Hao, Yuhang, and Li, Can
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Multitarget tracking in the interference environments suffers from the nonuniform, unknown and time-varying clutter, resulting in dramatic performance deterioration. We address this challenge by proposing a robust multitarget tracking algorithm, which estimates the states of clutter and targets simultaneously by the message-passing (MP) approach. We define the non-homogeneous clutter with a finite mixture model containing a uniform component and multiple nonuniform components. The measured signal strength is utilized to estimate the mean signal-to-noise ratio (SNR) of targets and the mean clutter-to-noise ratio (CNR) of clutter, which are then used as additional feature information of targets and clutter to improve the performance of discrimination of targets from clutter. We also present a hybrid data association which can reason over correspondence between targets, clutter, and measurements. Then, a unified MP algorithm is used to infer the marginal posterior probability distributions of targets, clutter, and data association by splitting the joint probability distribution into a mean-field approximate part and a belief propagation part. As a result, a closed-loop iterative optimization of the posterior probability distribution can be obtained, which can effectively deal with the coupling between target tracking, clutter estimation and data association. Simulation results demonstrate the performance superiority and robustness of the proposed multitarget tracking algorithm compared with the probability hypothesis density (PHD) filter and the cardinalized PHD (CPHD) filter., Comment: 21 pages, 21 figures
Published: 2022

13. SText-DETR: End-to-End Arbitrary-Shaped Text Detection with Scalable Query in Transformer

Author: Liao, Pujin, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Qingshan, editor, Wang, Hanzi, editor, Ma, Zhanyu, editor, Zheng, Weishi, editor, Zha, Hongbin, editor, Chen, Xilin, editor, Wang, Liang, editor, and Ji, Rongrong, editor
Published: 2024
Full Text: View/download PDF

14. SText-DETR: End-to-End Arbitrary-Shaped Text Detection with Scalable Query in Transformer

Author: Liao, Pujin, primary and Wang, Zengfu, additional
Published: 2023
Full Text: View/download PDF

15. SSR-HEF: Crowd Counting with Multi-Scale Semantic Refining and Hard Example Focusing

Author: Chen, Jiwei, Wang, Kewei, Su, Wen, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Crowd counting based on density maps is generally regarded as a regression task.Deep learning is used to learn the mapping between image content and crowd density distribution. Although great success has been achieved, some pedestrians far away from the camera are difficult to be detected. And the number of hard examples is often larger. Existing methods with simple Euclidean distance algorithm indiscriminately optimize the hard and easy examples so that the densities of hard examples are usually incorrectly predicted to be lower or even zero, which results in large counting errors. To address this problem, we are the first to propose the Hard Example Focusing(HEF) algorithm for the regression task of crowd counting. The HEF algorithm makes our model rapidly focus on hard examples by attenuating the contribution of easy examples.Then higher importance will be given to the hard examples with wrong estimations. Moreover, the scale variations in crowd scenes are large, and the scale annotations are labor-intensive and expensive. By proposing a multi-Scale Semantic Refining (SSR) strategy, lower layers of our model can break through the limitation of deep learning to capture semantic features of different scales to sufficiently deal with the scale variation. We perform extensive experiments on six benchmark datasets to verify the proposed method. Results indicate the superiority of our proposed method over the state-of-the-art methods. Moreover, our designed model is smaller and faster., Comment: Accepted by IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Published: 2022
Full Text: View/download PDF

16. Crowd counting with segmentation attention convolutional neural network

Author: Chen, Jiwei and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Deep learning occupies an undisputed dominance in crowd counting. In this paper, we propose a novel convolutional neural network (CNN) architecture called SegCrowdNet. Despite the complex background in crowd scenes, the proposeSegCrowdNet still adaptively highlights the human head region and suppresses the non-head region by segmentation. With the guidance of an attention mechanism, the proposed SegCrowdNet pays more attention to the human head region and automatically encodes the highly refined density map. The crowd count can be obtained by integrating the density map. To adapt the variation of crowd counts, SegCrowdNet intelligently classifies the crowd count of each image into several groups. In addition, the multi-scale features are learned and extracted in the proposed SegCrowdNet to overcome the scale variations of the crowd. To verify the effectiveness of our proposed method, extensive experiments are conducted on four challenging datasets. The results demonstrate that our proposed SegCrowdNet achieves excellent performance compared with the state-of-the-art methods., Comment: Accepted by IET Image Processing
Published: 2022
Full Text: View/download PDF

17. Crowd counting with crowd attention convolutional neural network

Author: Chen, Jiwei, Su, Wen, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Crowd counting is a challenging problem due to the scene complexity and scale variation. Although deep learning has achieved great improvement in crowd counting, scene complexity affects the judgement of these methods and they usually regard some objects as people mistakenly; causing potentially enormous errors in the crowd counting result. To address the problem, we propose a novel end-to-end model called Crowd Attention Convolutional Neural Network (CAT-CNN). Our CAT-CNN can adaptively assess the importance of a human head at each pixel location by automatically encoding a confidence map. With the guidance of the confidence map, the position of human head in estimated density map gets more attention to encode the final density map, which can avoid enormous misjudgements effectively. The crowd count can be obtained by integrating the final density map. To encode a highly refined density map, the total crowd count of each image is classified in a designed classification task and we first explicitly map the prior of the population-level category to feature maps. To verify the efficiency of our proposed method, extensive experiments are conducted on three highly challenging datasets. Results establish the superiority of our method over many state-of-the-art methods., Comment: Accepted by Neurocomputing
Published: 2022
Full Text: View/download PDF

18. Submarine Cable Network Design for Regional Connectivity

Author: Wang, Tianjiao, Wang, Zengfu, Moran, Bill, and Zukerman, Moshe
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: This paper optimizes path planning for a trunkand-branch topology network in an irregular 2-dimensional manifold embedded in 3-dimensional Euclidean space with application to submarine cable network planning. We go beyond our earlier focus on the costs of cable construction (including labor, equipment and materials) together with additional cost to enhance cable resilience, to incorporate the overall cost of branching units (again including material, construction and laying) and the choice of submarine cable landing stations, where such a station can be anywhere on the coast in a connected region. These are important issues for the economics of cable laying and significantly change the model and the optimization process. We pose the problem as a variant of the Steiner tree problem, but one in which the Steiner nodes can vary in number, while incurring a penalty. We refer to it as the weighted Steiner node problem. It differs from the Euclidean Steiner tree problem, where Steiner points are forced to have degree three; this is no longer the case, in general, when nodes incur a cost. We are able to prove that our algorithm is applicable to Steiner nodes with degree greater than three, enabling optimization of network costs in this context. The optimal solution is achieved in polynomialtime using dynamic programming.
Published: 2022

19. A Restless Bandit Model for Energy-Efficient Job Assignments in Server Farms

Author: Fu, Jing, Wang, Xinyu, Wang, Zengfu, and Zukerman, Moshe
Subjects: Mathematics - Optimization and Control, Computer Science - Distributed, Parallel, and Cluster Computing, 68M20 (primary), 90B22, 90B36 (secondary), G.1.6, G.3
Abstract: We aim to maximize the energy efficiency, gauged as average energy cost per job, in a large-scale server farm with various storage or/and computing components modeled as parallel abstracted servers. Each server operates in multiple power modes characterized by potentially different service and energy consumption rates. The heterogeneity of servers and multiple power modes complicate the maximization problem, where optimal solutions are generally intractable. Relying on the Whittle relaxation technique, we resort to a near-optimal, scalable job-assignment policy. Under a mild condition related to the service and energy consumption rates of the servers, we prove that our proposed policy approaches optimality as the size of the entire system tends to infinity; that is, it is asymptotically optimal. For the non-asymptotic regime, we show the effectiveness of the proposed policy through numerical simulations, where the policy outperforms all the tested baselines, and we numerically demonstrate its robustness against heavy-tailed job-size distributions., Comment: 55 pages, 10 figures
Published: 2021

20. On Exploring and Improving Robustness of Scene Text Detection Models

Author: Wu, Shilian, Zhai, Wei, Li, Yongrui, Wang, Kewei, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: It is crucial to understand the robustness of text detection models with regard to extensive corruptions, since scene text detection techniques have many practical applications. For systematically exploring this problem, we propose two datasets from which to evaluate scene text detection models: ICDAR2015-C (IC15-C) and CTW1500-C (CTW-C). Our study extends the investigation of the performance and robustness of the proposed region proposal, regression and segmentation-based scene text detection frameworks. Furthermore, we perform a robustness analysis of six key components: pre-training data, backbone, feature fusion module, multi-scale predictions, representation of text instances and loss function. Finally, we present a simple yet effective data-based method to destroy the smoothness of text regions by merging background and foreground, which can significantly increase the robustness of different text detection networks. We hope that this study will provide valid data points as well as experience for future research. Benchmark, code and data will be made available at \url{https://github.com/wushilian/robust-scene-text-detection-benchmark}.
Published: 2021

21. A sea–land clutter classification framework for over-the-horizon radar based on weighted loss semi-supervised generative adversarial network

Author: Zhang, Xiaoxuan, Wang, Zengfu, Ji, Mingyue, Li, Yang, Pan, Quan, and Lu, Kun
Published: 2024
Full Text: View/download PDF

22. Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation

Author: Li, Jia, Xiang, Linhua, Chen, Jiwei, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a simple yet reliable bottom-up approach with a good trade-off between accuracy and efficiency for the problem of multi-person pose estimation. Given an image, we employ an Hourglass Network to infer all the keypoints from different persons indiscriminately as well as the guiding offsets connecting the adjacent keypoints belonging to the same persons. Then, we greedily group the candidate keypoints into multiple human poses (if any), utilizing the predicted guiding offsets. And we refer to this process as greedy offset-guided keypoint grouping (GOG). Moreover, we revisit the encoding-decoding method for the multi-person keypoint coordinates and reveal some important facts affecting accuracy. Experiments have demonstrated the obvious performance improvements brought by the introduced components. Our approach is comparable to the state of the art on the challenging COCO dataset under fair conditions. The source code and our pre-trained model are publicly available online., Comment: 5 pages, 2 figures, code available at https://github.com/hellojialee/OffsetGuided
Published: 2021

23. A Length-Sensitive Language-Bound Recognition Network for Multilingual Text Recognition

Author: Gao, Ming, Wu, Shilian, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Dang-Nguyen, Duc-Tien, editor, Gurrin, Cathal, editor, Larson, Martha, editor, Smeaton, Alan F., editor, Rudinac, Stevan, editor, Dao, Minh-Son, editor, Trattner, Christoph, editor, and Chen, Phoebe, editor
Published: 2023
Full Text: View/download PDF

24. Multi-Branch Network with Ensemble Learning for Text Removal in the Wild

Author: Hou, Yujie, Chen, Jiwei, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wang, Lei, editor, Gall, Juergen, editor, Chin, Tat-Jun, editor, Sato, Imari, editor, and Chellappa, Rama, editor
Published: 2023
Full Text: View/download PDF

25. Optimal Tree Topology for a Submarine Cable Network With Constrained Internodal Latency

Author: Wang, Tianjiao, Wang, Zengfu, Moran, Bill, and Zukerman, Moshe
Subjects: Electrical Engineering and Systems Science - Systems and Control, Electrical Engineering and Systems Science - Signal Processing
Abstract: This paper provides an optimized cable path planning solution for a tree-topology network in an irregular 2D manifold in a 3D Euclidean space, with an application to the planning of submarine cable networks. Our solution method is based on total cost minimization, where the individual cable costs are assumed to be linear to the length of the corresponding submarine cables subject to latency constraints between pairs of nodes. These latency constraints limit the cable length and number of hops between any pair of nodes. Our method combines the Fast Marching Method (FMM) and a new Integer Linear Programming (ILP) formulation for Minimum Spanning Tree (MST) where there are constraints between pairs of nodes. We note that this problem of MST with constraints is NP-complete. Nevertheless, we demonstrate that ILP running time is adequate for the great majority of existing cable systems. For cable systems for which ILP is not able to find the optimal solution within an acceptable time, we propose an alternative heuristic algorithm based on Prim's algorithm. In addition, we apply our FMM/ILP-based algorithm to a real-world cable path planning example and demonstrate that it can effectively find an MST with latency constraints between pairs of nodes., Comment: 11 pages, 7 figures
Published: 2020
Full Text: View/download PDF

26. OTHR multitarget tracking with a GMRF model of ionospheric parameters

Author: Guo, Zhen, Wang, Zengfu, Lan, Hua, Pan, Quan, and Lu, Kun
Subjects: Electrical Engineering and Systems Science - Systems and Control, Electrical Engineering and Systems Science - Signal Processing
Abstract: The ionosphere is the propagation medium for radio waves transmitted by an over-the-horizon radar (OTHR). Ionospheric parameters, typically, virtual ionospheric heights (VIHs), are required to perform coordinate registration for OTHR multitarget tracking and localization. The inaccuracy of ionospheric parameters has a significant deleterious effect on the target localization of OTHR. Therefore, to improve the localization accuracy of OTHR, it is important to develop accurate models and estimation methods of ionospheric parameters and the corresponding target tracking algorithms. In this paper, we consider the variation of the ionosphere with location and the spatial correlation of the ionosphere in OTHR target tracking. We use a Gaussian Markov random field (GMRF) to model the VIHs, providing a more accurate representation of the VIHs for OTHR target tracking. Based on expectation-conditional maximization and GMRF modeling of the VIHs, we propose a novel joint optimization solution, called ECM-GMRF, to perform target state estimation, multipath data association and VIHs estimation simultaneously. In ECM-GMRF, the measurements from both ionosondes and OTHR are exploited to estimate the VIHs, leading to a better estimation of the VIHs which improves the accuracy of data association and target state estimation, and vice versa. The simulation indicates the effectiveness of the proposed algorithm., Comment: 16 pages
Published: 2020

27. Measurement-Level Fusion for OTHR Network Using Message Passing

Author: Lan, Hua, Wang, Zengfu, Bai, Xianglong, Pan, Quan, and Lu, Kun
Subjects: Electrical Engineering and Systems Science - Signal Processing, Electrical Engineering and Systems Science - Systems and Control
Abstract: Tracking an unknown number of targets based on multipath measurements provided by an over-the-horizon radar (OTHR) network with a statistical ionospheric model is complicated, which requires solving four subproblems: target detection, target tracking, multipath data association and ionospheric height identification. A joint solution is desired since the four subproblems are highly correlated, but suffering from the intractable inference problem of high-dimensional latent variables. In this paper, a unified message passing approach, combining belief propagation (BP) and mean-field (MF) approximation, is developed for simplifying the intractable inference. Based upon the factor graph corresponding to a factorization of the joint probability distribution function (PDF) of the latent variables and a choice for a separation of this factorization into BP region and MF region, the posterior PDFs of continuous latent variables including target kinematic state, target visibility state, and ionospheric height, are approximated by MF due to its simple MP update rules for conjugate-exponential models. With regard to discrete multipath data association which contains one-to-one frame (hard) constraints, its PDF is approximated by loopy BP. Finally, the approximated posterior PDFs are updated iteratively in a closed-loop manner, which is effective for dealing with the coupling issue among target detection, target tracking, multipath data association, and ionospheric height identification. Meanwhile, the proposed approach has the measurement-level fusion architecture due to the direct processing of the raw multipath measurements from an OTHR network, which is benefit to improving target tracking performance. Its performance is demonstrated on a simulated OTHR network multitarget tracking scenario., Comment: 40 pages, 23 figures
Published: 2020

28. Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation

Author: Li, Jia, Su, Wen, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We rethink a well-know bottom-up approach for multi-person pose estimation and propose an improved one. The improved approach surpasses the baseline significantly thanks to (1) an intuitional yet more sensible representation, which we refer to as body parts to encode the connection information between keypoints, (2) an improved stacked hourglass network with attention mechanisms, (3) a novel focal L2 loss which is dedicated to hard keypoint and keypoint association (body part) mining, and (4) a robust greedy keypoint assignment algorithm for grouping the detected keypoints into individual poses. Our approach not only works straightforwardly but also outperforms the baseline by about 15% in average precision and is comparable to the state of the art on the MS-COCO test-dev dataset. The code and pre-trained models are publicly available online., Comment: Accepted by AAAI 2020 (the Thirty-Fourth AAAI Conference on Artificial Intelligence)
Published: 2019

29. A Message Passing Approach for Multiple Maneuvering Target Tracking

Author: Lan, Hua, Ma, Jirong, Wang, Zengfu, Pan, Quan, and Xu, Xiong
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: This paper considers the problem of detecting and tracking multiple maneuvering targets, which suffers from the intractable inference of high-dimensional latent variables that include target kinematic state, target visibility state, motion mode-model association, and data association. A unified message passing algorithm that combines belief propagation (BP) and mean-field (MF) approximation is proposed for simplifying the intractable inference. By assuming conjugate-exponential priors for target kinematic state, target visibility state, and motion mode-model association, the MF approximation decouples the joint inference of target kinematic state, target visibility state, motion mode-model association into individual low-dimensional inference, yielding simple message passing update equations. The BP is exploited to approximate the probabilities of data association events since it is compatible with hard constraints. Finally, the approximate posterior probability distributions are updated iteratively in a closed-loop manner, which is effective for dealing with the coupling issue between the estimations of target kinematic state and target visibility state and decisions on motion mode-model association and data association. The performance of the proposed algorithm is demonstrated by comparing with the well-known multiple maneuvering target tracking algorithms, including interacting multiple model joint probabilistic data association, interacting multiple model hypothesis-oriented multiple hypothesis tracker and multiple model generalized labeled multi-Bernoulli.
Published: 2019

30. Fuel optimization schemes for formation reconfiguration in satellite formation flying

Author: Wang, Zengfu, Tian, Jiarui, and Fu, Jing
Published: 2023
Full Text: View/download PDF

31. Multi-task semi-supervised crowd counting via global to local self-correction

Author: Chen, Jiwei and Wang, Zengfu
Published: 2023
Full Text: View/download PDF

32. Grand Challenge of 106-Point Facial Landmark Localization

Author: Liu, Yinglu, Shen, Hao, Si, Yue, Wang, Xiaobo, Zhu, Xiangyu, Shi, Hailin, Hong, Zhibin, Guo, Hanqi, Guo, Ziyuan, Chen, Yanqin, Li, Bi, Xi, Teng, Yu, Jun, Xie, Haonian, Xie, Guochen, Li, Mengyan, Lu, Qing, Wang, Zengfu, Lai, Shenqi, Chai, Zhenhua, and Wei, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Facial landmark localization is a very crucial step in numerous face related applications, such as face recognition, facial pose estimation, face image synthesis, etc. However, previous competitions on facial landmark localization (i.e., the 300-W, 300-VW and Menpo challenges) aim to predict 68-point landmarks, which are incompetent to depict the structure of facial components. In order to overcome this problem, we construct a challenging dataset, named JD-landmark. Each image is manually annotated with 106-point landmarks. This dataset covers large variations on pose and expression, which brings a lot of difficulties to predict accurate landmarks. We hold a 106-point facial landmark localization competition1 on this dataset in conjunction with IEEE International Conference on Multimedia and Expo (ICME) 2019. The purpose of this competition is to discover effective and robust facial landmark localization approaches., Comment: This paper is accepted at ICME2019 Grand Challenge. The JD-landmark dataset has been released and can be downloaded from https://sites.google.com/view/hailin-shi
Published: 2019

33. Multi-Branch Network with Ensemble Learning for Text Removal in the Wild

Author: Hou, Yujie, primary, Chen, Jiwei, additional, and Wang, Zengfu, additional
Published: 2023
Full Text: View/download PDF

34. A Length-Sensitive Language-Bound Recognition Network for Multilingual Text Recognition

Author: Gao, Ming, primary, Wu, Shilian, additional, and Wang, Zengfu, additional
Published: 2023
Full Text: View/download PDF

35. Least-Squares Estimation of Keypoint Coordinate for Human Pose Estimation

Author: Xiang, Linhua, Li, Jia, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Yu, Shiqi, editor, Zhang, Zhaoxiang, editor, Yuen, Pong C., editor, Han, Junwei, editor, Tan, Tieniu, editor, Guo, Yike, editor, Lai, Jianhuang, editor, and Zhang, Jianguo, editor
Published: 2022
Full Text: View/download PDF

36. CA-Net: Collaborative Attention Network for Multi-modal Diagnosis of Gliomas

Author: Yin, Baocai, Cheng, Hu, Wang, Fengyan, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Crimi, Alessandro, editor, and Bakas, Spyridon, editor
Published: 2022
Full Text: View/download PDF

37. AFA: adversarial frequency alignment for domain generalized lung nodule detection

Author: Yin, Baocai, Sun, Mei, Zhang, Jing, Liu, Wenchao, Liu, Cong, and Wang, Zengfu
Published: 2022
Full Text: View/download PDF

38. Monocular depth estimation with spatially coherent sliced network

Author: Su, Wen, Zhang, Haifeng, Su, Yuan, Yu, Jun, and Wang, Zengfu
Published: 2022
Full Text: View/download PDF

39. Video super-resolution with inverse recurrent net and hybrid local fusion

Author: Li, Dingyi, Wang, Zengfu, and Yang, Jian
Published: 2022
Full Text: View/download PDF

40. SimplePose V2: Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation

Author: Li, Jia, Xiang, Linhua, Chen, Jiwei, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Ma, Huimin, editor, Wang, Liang, editor, Zhang, Changshui, editor, Wu, Fei, editor, Tan, Tieniu, editor, Wang, Yaonan, editor, Lai, Jianhuang, editor, and Zhao, Yao, editor
Published: 2021
Full Text: View/download PDF

41. Color Multi-focus Image Fusion Using Quaternion Morphological Gradient and Improved KNN Matting

Author: Liu, Wei, Zheng, Zhong, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Peng, Yuxin, editor, Hu, Shi-Min, editor, Gabbouj, Moncef, editor, Zhou, Kun, editor, Elad, Michael, editor, and Xu, Kun, editor
Published: 2021
Full Text: View/download PDF

42. Brain Tumor Classification Based on MRI Images and Noise Reduced Pathology Images

Author: Yin, Baocai, Cheng, Hu, Wang, Fengyan, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Crimi, Alessandro, editor, and Bakas, Spyridon, editor
Published: 2021
Full Text: View/download PDF

43. Joint Target Detection and Tracking in Multipath Environment: A Variational Bayesian Approach

Author: Lan, Hua, Sun, Shuai, Wang, Zengfu, Pan, Quan, and Zhang, Zhishan
Subjects: Computer Science - Computer Vision and Pattern Recognition, 65K10
Abstract: We consider multitarget detection and tracking problem for a class of multipath detection system where one target may generate multiple measurements via multiple propagation paths, and the association relationship among targets, measurements and propagation paths is unknown. In order to effectively utilize multipath measurements from one target to improve detection and tracking performance, a tracker has to handle high-dimensional estimation of latent variables including target active/dormant meta-state, target kinematic state, and multipath data association. Based on variational Bayesian inference, we propose a novel joint detection and tracking algorithm that incorporates multipath data association, target detection and target state estimation in a unified Bayesian framework. The posterior probabilities of these latent variables are derived in a closed-form iterative manner, which is effective for reducing the performance deterioration caused by the coupling between estimation errors and identification errors. Loopy belief propagation is exploited to approximately calculate the probability of multipath data association, saving the computational cost significantly. Simulation results of over-the-horizon radar multitarget tracking show that the proposed algorithm outperforms multihypothesis multipath track fusion and multi-detection (hypothesis-oriented) multiple hypothesis tracker, especially under low signal-to-noise ratio circumstance.
Published: 2016

44. Nighttime Haze Removal with Illumination Correction

Author: Zhang, Jing, Cao, Yang, and Wang, Zengfu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Haze removal is important for computational photography and computer vision applications. However, most of the existing methods for dehazing are designed for daytime images, and cannot always work well in the nighttime. Different from the imaging conditions in the daytime, images captured in nighttime haze condition may suffer from non-uniform illumination due to artificial light sources, which exhibit low brightness/contrast and color distortion. In this paper, we present a new nighttime hazy imaging model that takes into account both the non-uniform illumination from artificial light sources and the scattering and attenuation effects of haze. Accordingly, we propose an efficient dehazing algorithm for nighttime hazy images. The proposed algorithm includes three sequential steps. i) It enhances the overall brightness by performing a gamma correction step after estimating the illumination from the original image. ii) Then it achieves a color-balance result by performing a color correction step after estimating the color characteristics of the incident light. iii) Finally, it remove the haze effect by applying the dark channel prior and estimating the point-wise environmental light based on the previous illumination-balance result. Experimental results show that the proposed algorithm can achieve illumination-balance and haze-free results with good color rendition ability., Comment: 14 pages, 18 figures
Published: 2016

45. OTHR multitarget tracking with a GMRF model of ionospheric parameters

Author: Guo, Zhen, Wang, Zengfu, Lan, Hua, Pan, Quan, and Lu, Kun
Published: 2021
Full Text: View/download PDF

46. CA-Net: Collaborative Attention Network for Multi-modal Diagnosis of Gliomas

Author: Yin, Baocai, primary, Cheng, Hu, additional, Wang, Fengyan, additional, and Wang, Zengfu, additional
Published: 2022
Full Text: View/download PDF

47. Least-Squares Estimation of Keypoint Coordinate for Human Pose Estimation

Author: Xiang, Linhua, primary, Li, Jia, additional, and Wang, Zengfu, additional
Published: 2022
Full Text: View/download PDF

48. Robust multi-focus image fusion using lazy random walks with multiscale focus measures

Author: Liu, Wei, Zheng, Zhong, and Wang, Zengfu
Published: 2021
Full Text: View/download PDF

49. A Neural Network-Based Whittle Index Policy for Beam Resource Allocation in Multitarget Tracking

Author: Hao, Yuhang, Wang, Zengfu, Fu, Jing, and Pan, Quan
Abstract: In a colocated multiple-input multiple-output (MIMO) radar system for multitarget tracking (MTT), the non-myopic beam allocation schemes based on conventional programming approaches result in large-scale state space and action space. This article formulates the beam allocation problem through a restless multi-armed bandit (RMAB) model and leverages the computationally efficient Whittle index policy. The optimization objective is defined as the infinite-horizon discounted reward, which is evaluated based on the Bayesian Cramér-Rao lower bounds (BCRLBs) of the targets. In this approach, each target is treated as an arm, and the joint multi-dimensional state of each target comprises the BCRLB and the dynamic state. However, it is intractable to exactly compute the Whittle index of each target with the convoluted transition process of the joint state. This article combines the Whittle index policy and deep reinforcement learning (DRL), seeking to approximate the Whittle index by leveraging its threshold property. Since the BCRLB metric update depends on the Jacobian matrix of the nonlinear measurement equation that is related to dynamic states, the two-channel neural network is constructed to approximate the Whittle index on both BCRLB states and dynamic states for each target. In this architecture, the inputs of networks are preprocessed joint state features. Subsequently, DRL techniques are employed to train the neural network. Above all, the neural network-based Whittle index (NNWI) policy is proposed to achieve non-myopic tracking performance for multiple targets. Numerical results demonstrate that the optimization performance of the proposed NNWI policy outperforms that of myopic policies and other DRL algorithms.
Published: 2024
Full Text: View/download PDF

50. A Novel Multi-focus Image Fusion Based on Lazy Random Walks

Author: Liu, Wei, Wang, Zengfu, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Zhao, Yao, editor, Barnes, Nick, editor, Chen, Baoquan, editor, Westermann, Rüdiger, editor, Kong, Xiangwei, editor, and Lin, Chunyu, editor
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

424 results on '"Wang, Zengfu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources