Author: "Hongying Meng" / Topic: pattern recognition - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hongying Meng"' showing total 28 results

Start Over Author "Hongying Meng" Topic pattern recognition

28 results on '"Hongying Meng"'

1. Self-Supervised Representation Learning for Videos by Segmenting via Sampling Rate Order Prediction

Author: Wenming Yang, Qicong Wang, Hongying Meng, Yan Huang, and Jing Huang
Subjects: business.industry, Computer science, feature extraction, streaming media, Feature extraction, motion segmentation, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Pattern recognition, data mining, Image segmentation, Expression (mathematics), View synthesis, Discriminative model, task analysis, Media Technology, Feature (machine learning), Artificial intelligence, Electrical and Electronic Engineering, Representation (mathematics), business, semantics, image segmentation, Feature learning
Abstract: Self-supervised representation learning for videos has been very attractive recently because these methods exploit the information inherently obtained from the video itself instead of annotated labels that is quite time-consuming. However, existing methods ignore the importance of global observation while performing spatio-temporal transformation perception, which highly limits the expression capabilities of the video representation. This paper proposes a novel pretext task that combines the temporal information perception of the video with the motion amplitude perception of moving objects to learn the spatio-temporal representation of the video. Specifically, given a video clip containing several video segments, each video segment is sampled by different sampling rates and the order of video segments is disrupted. Then, the network is used to regress the sampling rate of each video segment and classify the order of input video segments. In the pre-training stage, the network can learn rich spatio-temporal semantic information where content-related contrastive learning is introduced to make the learned video representation more discriminative. To alleviate the appearance dependency caused by contrastive learning, we design a novel and robust vector similarity measurement approach, which can take feature alignment into consideration. Moreover, a view synthesis framework is proposed to further improve the performance of contrastive learning by automatically generating reasonable transformed views. We conduct benchmark experiments with several 3D backbone networks on two datasets. The results show that our proposed method outperforms the existing state-of-the-art methods across the three backbones on two downstream tasks of human action recognition and video retrieval.
Published: 2022

2. Data-Fusion-Based Two-Stage Cascade Framework for Multimodality Face Anti-Spoofing

Author: Hongying Meng, Tao Lei, Asoke K. Nandi, Xiaokang Wei, Xingwu Wang, and Weihua Liu
Subjects: Spoofing attack, Computer science, business.industry, Deep learning, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Construct (python library), Sensor fusion, Artificial Intelligence, Feature (computer vision), Face (geometry), 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), 020201 artificial intelligence & image processing, Artificial intelligence, Representation (mathematics), business, Software
Abstract: Existing face anti-spoofing models using deep learning for multi-modality data suffer from low generalization in the case of using variety of presentation attacks such as 2D printing and high-precision 3D face masks. One of the main reasons is that the non-linearity of multi-spectral information used to preserve the intrinsic attributes between a real and a fake face are not well extracted. To address this issue, we propose a multi-modility data based two-stage cascade framework for face anti-spoofing. The proposed framework has two advantages. Firstly, we design a two-stage cascade architecture that can selectively fuse low-level and high-level features from different modalities to improve feature representation. Secondly, we use multi-modality data to construct a distance-free spectral on RGB and infrared (IR) to augment the non-linearity of data. The presented data fusion strategy is different from popular fusion approaches, since it can strengthen discrimination ability of network models on physical attribute features than identity structure features under certain constraints. In addition, a multi-scale patch based weighted fine-tuning strategy is designed to learn each specific local face region. Experimental results show that the proposed framework achieves better performance than other state-of-the-art methods on both benchmark datasets and self-established datasets, especially on multi-material masks spoofing.
Published: 2022

3. Triplet interactive attention network for cross-modality person re-identification

Author: Chenrui Zhang, Hongying Meng, Ping Chen, and Tao Lei
Subjects: Backbone network, Bridging (networking), Matching (graph theory), business.industry, Computer science, Pattern recognition, Tian, Constraint (information theory), Artificial Intelligence, Signal Processing, Pairwise comparison, Computer Vision and Pattern Recognition, Artificial intelligence, Focus (optics), business, Feature learning, Software
Abstract: Cross-modality (RGB-Infrared) person re-identification (ReID) has extreme superiority on low illumination ReID over conventional RGB-RGB matching due to its excellent capability on bridging cross-modality discrepancy. Previous works often focus on extracting modality-invariant global features, whereas the interaction among pairwise samples provides the most important clues for identifying persons. To explore the interactive clues among cross-modality pedestrian images, we propose a Triplet Interactive Attention Network (TIAN) to imitate the pairwise interactions and generate the attention scores for triplet samples. It can discover the fine-grained difference between pedestrian images and is optimized by successive attention losses to strengthen the feature learning capability of the backbone network. Meanwhile, the triplet interactive attention loss further alleviates the cross-modality discrepancy, based on the maximum mean discrepancy constraint. To demonstrate the effectiveness of our proposed TIAN, experiments on SYSU-MM01 and RegDB datasets show obvious superiority over popular cross-modality ReID methods, and the evaluation of main modules also reveals the novelties of our TIAN method.
Published: 2021

4. Automatic Fuzzy Clustering Framework for Image Segmentation

Author: Peng Liu, Hongying Meng, Tao Lei, Asoke K. Nandi, Xuande Zhang, and Xiaohong Jia
Subjects: Fuzzy clustering, density peak (DP) algorithm, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, superpixel, 02 engineering and technology, Fuzzy logic, Artificial Intelligence, Histogram, 0202 electrical engineering, electronic engineering, information engineering, Entropy (information theory), Segmentation, Cluster analysis, image segmentation, Pixel, business.industry, Applied Mathematics, Pattern recognition, Image segmentation, Computational Theory and Mathematics, Control and Systems Engineering, Computer Science::Computer Vision and Pattern Recognition, fuzzy clustering, 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: Clustering algorithms by minimizing an objective function share a clear drawback of having to set the number of clusters manually. Although density peak clustering is able to find the number of clusters, it suffers from memory overflow when it is used for image segmentation because a moderate-size image usually includes a large number of pixels leading to a huge similarity matrix. To address this issue, here we proposed an automatic fuzzy clustering framework (AFCF) for image segmentation. The proposed framework has threefold contributions. First, the idea of superpixel is used for the density peak (DP) algorithm, which efficiently reduces the size of the similarity matrix and thus improves the computational efficiency of the DP algorithm. Second, we employ a density balance algorithm to obtain a robust decision-graph that helps the DP algorithm achieve fully automatic clustering. Finally, a fuzzy c-means clustering based on prior entropy is used in the framework to improve image segmentation results. Because the spatial neighboring information of both the pixels and membership are considered, the final segmentation result is improved effectively. Experiments show that the proposed framework not only achieves automatic image segmentation, but also provides better segmentation results than state-of-the-art algorithms. 10.13039/501100001809-National Natural Science Foundation of China; 10.13039/501100002858-China Postdoctoral Science Foundation
Published: 2020

5. Fast and Automatic Image Segmentation Using Superpixel-Based Graph Clustering

Author: Xiaohong Jia, Dinghua Xue, Hongying Meng, Tao Lei, Peng Liu, and Asoke K. Nandi
Subjects: Density peak (DP) algorithm, Fuzzy clustering, density peak (DP) algorithm, General Computer Science, Structuring element, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Iterative reconstruction, 030218 nuclear medicine & medical imaging, Image (mathematics), 03 medical and health sciences, 0302 clinical medicine, Graph clustering, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, Segmentation, Cluster analysis, Clustering coefficient, Image segmentation, business.industry, General Engineering, Pattern recognition, graph clustering, fuzzy clustering, 020201 artificial intelligence & image processing, lcsh:Electrical engineering. Electronics. Nuclear engineering, Artificial intelligence, business, lcsh:TK1-9971
Abstract: Although automatic fuzzy clustering framework (AFCF) based on improved density peak clustering is able to achieve automatic and efficient image segmentation, the framework suffers from two problems. The first one is that the adaptive morphological reconstruction (AMR) employed by the AFCF is easily influenced by the initial structuring element. The second one is that the improved density peak clustering using a density balance strategy is complex for finding potential clustering centers. To address these two problems, we propose a fast and automatic image segmentation algorithm using superpixel-based graph clustering (FAS-SGC). The proposed algorithm has two major contributions. First, the AMR based on regional minimum removal (AMR-RMR) is presented to improve the superpixel result generated by the AMR. The binary morphological reconstruction is performed on a regional minimum image, which overcomes the problem that the initial structuring element of the AMR is chosen empirically, since the geometrical information of images is effectively explored and utilized. Second, we use an eigenvalue gradient clustering (EGC) instead of improved density peak (DP) algorithms to obtain potential clustering centers, since the EGC is faster and requires fewer parameters than the DP algorithm. Experiments show that the proposed algorithm is able to achieve automatic image segmentation, providing better segmentation results while requiring less execution time than other state-of-the-art algorithms.
Published: 2020

6. Superpixel-Based Fast Fuzzy C-Means Clustering for Color Image Segmentation

Author: Hongying Meng, Tao Lei, Shigang Liu, Xiaohong Jia, Asoke K. Nandi, and Yanning Zhang
Subjects: Morphological gradient, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Grayscale, fuzzy c-means (FCM) clustering, Artificial Intelligence, Color image segmentation, 0202 electrical engineering, electronic engineering, information engineering, Segmentation, Superpixel, Cluster analysis, Pixel, business.industry, Color image, Applied Mathematics, Pattern recognition, Image segmentation, Real image, Morphological reconstruction, Computational Theory and Mathematics, Control and Systems Engineering, Computer Science::Computer Vision and Pattern Recognition, 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: A great number of improved fuzzy c-means (FCM) clustering algorithms have been widely used for grayscale and color image segmentation. However, most of them are time-consuming and unable to provide desired segmentation results for color images due to two reasons. The first one is that the incorporation of local spatial information often causes a high computational complexity due to the repeated distance computation between clustering centers and pixels within a local neighboring window. The other one is that a regular neighboring window usually breaks up the real local spatial structure of images and thus leads to a poor segmentation. In this work, we propose a superpixel-based fast FCM clustering algorithm (SFFCM) that is significantly faster and more robust than state-of-the-art clustering algorithms for color image segmentation. To obtain better local spatial neighborhoods, we firstly define a multiscale morphological gradient reconstruction (MMGR) operation to obtain a superpixel image with accurate contour. In contrast to traditional neighboring window of fixed size and shape, the superpixel image provides better adaptive and irregular local spatial neighborhoods that are helpful for improving color image segmentation. Secondly, based on the obtained superpixel image, the original color image is simplified efficiently and its histogram is computed easily by counting the number of pixels in each region of the superpixel image. Finally, we implement FCM with histogram parameter on the superpixel image to obtain the final segmentation result. Experiments performed on synthetic images and real images demonstrate that the proposed algorithm provides better segmentation results and takes less time than state-of-the-art clustering algorithms for color image segmentation. China Postdoctoral Science Foundation; National Natural Science Foundation of China; National Science Foundation of Shanghai
Published: 2019

7. Significantly Fast and Robust Fuzzy C-Means Clustering Algorithm Based on Morphological Reconstruction and Membership Filtering

Author: Lifeng He, Asoke K. Nandi, Hongying Meng, Xiaohong Jia, Tao Lei, and Yanning Zhang
Subjects: Pixel, Computational complexity theory, Computer science, business.industry, Applied Mathematics, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, Pattern recognition, 02 engineering and technology, Image segmentation, Fuzzy logic, Electronic mail, Computational Theory and Mathematics, Artificial Intelligence, Control and Systems Engineering, Robustness (computer science), Computer Science::Computer Vision and Pattern Recognition, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Cluster analysis, Spatial analysis
Abstract: As fuzzy c-means clustering (FCM) algorithm is sensitive to noise, local spatial information is often introduced to an objective function to improve the robustness of the FCM algorithm for image segmentation. However, the introduction of local spatial information often leads to a high computational complexity, arising out of an iterative calculation of the distance between pixels within local spatial neighbors and clustering centers. To address this issue, an improved FCM algorithm based on morphological reconstruction and membership filtering (FRFCM) that is significantly faster and more robust than FCM is proposed in this paper. First, the local spatial information of images is incorporated into FRFCM by introducing morphological reconstruction operation to guarantee noise-immunity and image detail-preservation. Second, the modification of membership partition, based on the distance between pixels within local spatial neighbors and clustering centers, is replaced by local membership filtering that depends only on the spatial neighbors of membership partition. Compared with state-of-the-art algorithms, the proposed FRFCM algorithm is simpler and significantly faster, since it is unnecessary to compute the distance between pixels within local spatial neighbors and clustering centers. In addition, it is efficient for noisy image segmentation because membership filtering are able to improve membership partition matrix efficiently. Experiments performed on synthetic and real-world images demonstrate that the proposed algorithm not only achieves better results, but also requires less time than the state-of-the-art algorithms for image segmentation.
Published: 2018

8. Robust Self-Sparse Fuzzy Clustering for Image Segmentation

Author: Asoke K. Nandi, Hongying Meng, Tao Lei, Xiaogang Du, Shigang Liu, and Xiaohong Jia
Subjects: 0209 industrial biotechnology, Fuzzy clustering, computational efficiency, General Computer Science, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Fuzzy logic, Image (mathematics), 020901 industrial engineering & automation, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, Cluster analysis, image segmentation, Spatial analysis, business.industry, General Engineering, noise measurement, Pattern recognition, Image segmentation, Fuzzy c-means clustering (FCM), sparse membership, ComputingMethodologies_PATTERNRECOGNITION, Computer Science::Computer Vision and Pattern Recognition, Metric (mathematics), Outlier, over-segmentation, 020201 artificial intelligence & image processing, lcsh:Electrical engineering. Electronics. Nuclear engineering, Artificial intelligence, Microsoft Windows, business, lcsh:TK1-9971
Abstract: Traditional fuzzy clustering algorithms suffer from two problems in image segmentations. One is that these algorithms are sensitive to outliers due to the non-sparsity of fuzzy memberships. The other is that these algorithms often cause image over-segmentation due to the loss of image local spatial information. To address these issues, we propose a robust self-sparse fuzzy clustering algorithm (RSSFCA) for image segmentation. The proposed RSSFCA makes two contributions. The first concerns a regularization under Gaussian metric that is integrated into the objective function of fuzzy clustering algorithms to obtain fuzzy membership with sparsity, which reduces a proportion of noisy features and improves clustering results. The second concerns a connected-component filtering based on area density balance strategy (CCF-ADB) that is proposed to address the problem of image over-segmentation. Compared to the integration of local spatial information into the objective functions, the presented CCF-ADB is simpler and faster for the removal of small areas. Experimental results show that the proposed RSSFCA addresses two problems in current fuzzy clustering algorithms, i.e., the outlier sensitivity and the over-segmentation, and it provides better image segmentation results than state-of-the-art algorithms. 10.13039/501100001809-National Natural Science Foundation of China (Grant Number: 61871259, 61811530325 (International Exchange Cooperation (IEC)\NSFC\170396, Royal Society, U.K.), 61871260, 61672333 and 61861024)
Published: 2020

9. Automatic Modulation Recognition: A Few-Shot Learning Method Based on the Capsule Network

Author: Qianqian Cheng, Junsheng Huang, Lixin Li, Hongying Meng, and Zhu Han
Subjects: Computer science, business.industry, Few-Shot learning, Deep learning, Feature extraction, Convolutional neural network (CNN), Network structure, 020206 networking & telecommunications, 020302 automobile design & engineering, Pattern recognition, 02 engineering and technology, Automatic modulation recognition (AMR), Convolutional neural network, Convolution, Signal-to-noise ratio, 0203 mechanical engineering, Control and Systems Engineering, Modulation, Deep learning (DL), 0202 electrical engineering, electronic engineering, information engineering, Learning methods, Artificial intelligence, Electrical and Electronic Engineering, business, Capsule network (CapsNet)
Abstract: With the rapid development of deep learning (DL) in recent years, automatic modulation recognition (AMR) with DL has achieved high accuracy. However, aiming to obtain higher classification accuracy, DL requires numerous training samples. In order to solve this problem, it is a challenge to study how to efficiently use DL for AMR in the case of few samples. In this letter, inspired by the capsule network (CapsNet), we propose a new network structure named AMR-CapsNet to achieve higher classification accuracy of modulation signals with fewer samples, and further analyze the adaptability of DL models in the case of few samples. The simulation results demonstrate that when 3% of the dataset is used to train and the signal-to-noise ratio (SNR) is greater than 2 dB, the overall classification accuracy of the AMR-CapsNet is greater than 80%. Compared with convolutional neural network (CNN), the classification accuracy is improved by 20%.
Published: 2020

10. Adaptive Morphological Reconstruction for Seeded Image Segmentation

Author: Xiaohong Jia, Asoke K. Nandi, Hongying Meng, Tao Lei, Shigang Liu, and Tongliang Liu
Subjects: FOS: Computer and information sciences, Watershed, Structuring element, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Iterative reconstruction, Mathematical morphology, 0202 electrical engineering, electronic engineering, information engineering, Segmentation, Artificial Intelligence & Image Processing, Image segmentation, business.industry, Pattern recognition, Filter (signal processing), Computer Graphics and Computer-Aided Design, Maxima and minima, 020201 artificial intelligence & image processing, Seeded segmentation, Artificial intelligence, business, Software, Spectral segmentation
Abstract: Morphological reconstruction (MR) is often employed by seeded image segmentation algorithms such as watershed transform and power watershed as it is able to filter seeds (regional minima) to reduce over-segmentation. However, MR might mistakenly filter meaningful seeds that are required for generating accurate segmentation and it is also sensitive to the scale because a single-scale structuring element is employed. In this paper, a novel adaptive morphological reconstruction (AMR) operation is proposed that has three advantages. Firstly, AMR can adaptively filter useless seeds while preserving meaningful ones. Secondly, AMR is insensitive to the scale of structuring elements because multiscale structuring elements are employed. Finally, AMR has two attractive properties: monotonic increasingness and convergence that help seeded segmentation algorithms to achieve a hierarchical segmentation. Experiments clearly demonstrate that AMR is useful for improving algorithms of seeded image segmentation and seed-based spectral segmentation. Compared to several state-of-the-art algorithms, the proposed algorithms provide better segmentation results requiring less computing time. Source code is available at https://github.com/SUST-reynole/AMR.
Published: 2019

11. End-to-end Change Detection Using a Symmetric Fully Convolutional Network for Landslide Mapping

Author: Hongying Meng, Tao Lei, Dinghua Xue, Asoke K. Nandi, Tao Chen, and Qi Zhang
Subjects: Computer science, business.industry, Deep learning, Pooling, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 0211 other engineering and technologies, Context (language use), Pattern recognition, 02 engineering and technology, Image segmentation, Convolutional neural network, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Pyramid (image processing), business, Change detection, 021101 geological & geomatics engineering
Abstract: In this paper, we propose a novel approach based on a symmetric fully convolutional network within pyramid pooling (FCN-PP) for landslide mapping (LM). The proposed approach has three advantages. Firstly, this approach is automatic and insensitive to noise because multivariate morphological reconstruction (MMR) is used for image preprocessing. Secondly, it is able to take into account features from multiple convolutional layers and explore efficiently the context of images, which leads to a good tradeoff between wider receptive field and the use of context. Finally, the selected pyramid pooling module addresses the drawback of single-scale pooling employed by convolutional neural network (CNN), fully convolutional network (FCN), U-Net, etc. Experimental results show that the proposed FCN-PP is effective for LM, and it outperforms state-of-the-art approaches in terms of four metrics, Precision, Recall, F -score, and Accuracy.
Published: 2019

12. Learning Spectral and Spatial Features Based on Generative Adversarial Network for Hyperspectral Image Super-Resolution

Author: Ang Gao, Jiang Ruituo, Xu Li, Hongying Meng, Lixin Li, Lei Zhang, and Shigang Yue
Subjects: business.industry, Computer science, Feature extraction, 0211 other engineering and technologies, Hyperspectral imaging, Pattern recognition, 02 engineering and technology, Function (mathematics), Superresolution, Convolution, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Least absolute deviations, Artificial intelligence, business, Image resolution, 021101 geological & geomatics engineering
Abstract: Three-dimensional (3D) convolutional networks have been proven to be able to explore spatial context and spectral information simultaneously for super-resolution (SR). However, such kind of network can’t be practically designed very ‘deep’ due to the long training time and GPU memory limitations involved in 3D convolution. Instead, in this paper, spatial context and spectral information in hyperspectral images (HSIs) are explored using Two-dimensional (2D) and One-dimenional (1D) convolution, separately. Therefore, a novel 2D-1D generative adversarial network architecture (2D-1D-HSRGAN) is proposed for SR of HSIs. Specifically, the generator network consists of a spatial network and a spectral network, in which spatial network is trained with the least absolute deviations loss function to explore spatial context by 2D convolution and spectral network is trained with the spectral angle mapper (SAM) loss function to extract spectral information by 1D convolution. Experimental results over two real HSIs demonstrate that the proposed 2D-1D-HSRGAN clearly outperforms several state-of-the-art algorithms.
Published: 2019
Full Text: View/download PDF

13. Diffusion Tensor Image Segmentation Based on Multi-Atlas Active Shape Model

Author: Hongying Meng, Min Qi, Yangyu Fan, Yi Zhao, Yi Wang, and Zhe Guo
Subjects: Computer Networks and Communications, business.industry, Computer science, Active Shape Model, STAPLE, Multi atlas, 020207 software engineering, Pattern recognition, ComputingMilieux_LEGALASPECTSOFCOMPUTING, 02 engineering and technology, Image segmentation, ComputingMilieux_GENERAL, Hardware and Architecture, Region of interest, Active shape model, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, Segmentation, Artificial intelligence, Diffusion tensor magnetic resonance imaging, business, image segmentation, Software, Diffusion MRI
Abstract: Active Shape Model (ASM) has been successfully applied in the segmentation of Diffusion Tensor Magnetic Resonance Image (DT-MRI, referred to as DTI) of brain. However, due to multiple anatomical structure types, irregular shapes, small gray-scale and large amount of these images, perfect segmentation performance could not be achieved. Especially, it is sensitive to initial values with high computational complexity. In this paper, we introduce the gray information of multiple atlases and the prior information of target shapes into the ASM and propose the Multi-Atlas Active Shape Model (referred to as MA-ASM) approach for DTI segmentation. It was evaluated in a manually labeled database with 7 Region of Interest (ROI)s for each of 20 subjects. In comparison with the state of art method of STAPLE (Simultaneous Truth Performance Level Estimation), the proposed algorithm was closer to the manual segmentation shape by subjective visual effects, and had higher overlap rates and lower error detection rates on quantitative analysis than STAPLE.
Published: 2019

14. An Ensemble Classifier Based on Three-Way Decisions for Social Touch Gesture Recognition

Author: Yubin Shi, Gangqiang Zhang, Qun Liu, and Hongying Meng
Subjects: Modalities, business.industry, Computer science, 05 social sciences, 050301 education, Pattern recognition, 02 engineering and technology, three-way decisions, ensemble classifier, Social relation, Human–robot interaction, touch gesture recognition, ComputingMethodologies_PATTERNRECOGNITION, Gesture recognition, 0202 electrical engineering, electronic engineering, information engineering, Preprocessor, 020201 artificial intelligence & image processing, Artificial intelligence, Data pre-processing, business, preprocessing, 0503 education, Classifier (UML), Gesture
Abstract: Social touch is an important form of social interaction. In Human Robot Interaction (HRI), touch can provide additional information to other modalities, such as audio, visual. One of the application is the robot therapy that has great social significance. In this paper, an ensemble classifier based on threeway decisions is proposed to recognize touch gestures. Firstly, features are extracted from on six perspectives and four classifiers are constructed on different scales with different pre-processing methods. . Then an ensemble classifier is used to combine the four classifiers to classify the gestures. The proposed method is tested on the public Corpus of Social Touch (Cost) dataset. The experiments results not only verify the validity of our method but also show the better accuracy of our ensemble classifier.
Published: 2018

15. Accurate Facial Parts Localization and Deep Learning for 3D Facial Expression Recognition

Author: Hongying Meng, Asim Jan, Huaxiong Ding, Liming Chen, and Huibin Li
Subjects: FOS: Computer and information sciences, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Geometric shape, Texture (music), 0202 electrical engineering, electronic engineering, information engineering, Affective computing, ComputingMethodologies_COMPUTERGRAPHICS, business.industry, Deep learning, 020207 software engineering, Pattern recognition, 68T45, Expression (mathematics), Human-Computer Interaction, Facial expression recognition, Face (geometry), Affective Computing, 020201 artificial intelligence & image processing, Artificial intelligence, Depth perception, business
Abstract: Meaningful facial parts can convey key cues for both facial action unit detection and expression prediction. Textured 3D face scan can provide both detailed 3D geometric shape and 2D texture appearance cues of the face which are beneficial for Facial Expression Recognition (FER). However, accurate facial parts extraction as well as their fusion are challenging tasks. In this paper, a novel system for 3D FER is designed based on accurate facial parts extraction and deep feature fusion of facial parts. In particular, each textured 3D face scan is firstly represented as a 2D texture map and a depth map with one-to-one dense correspondence. Then, the facial parts of both texture map and depth map are extracted using a novel 4-stage process consists of facial landmark localization, facial rotation correction, facial resizing, facial parts bounding box extraction and post-processing procedures. Finally, deep fusion Convolutional Neural Networks (CNNs) features of all facial parts are learned from both texture maps and depth maps, respectively and nonlinear SVMs are used for expression prediction. Experiments are conducted on the BU-3DFE database, demonstrating the effectiveness of combing different facial parts, texture and depth cues and reporting the state-of-the-art results in comparison with all existing methods under the same setting., Comment: 7 pages, 3 figures
Published: 2018
Full Text: View/download PDF

16. Automated facial expression recognition app development on smart phones using cloud computing

Author: Hongying Meng, Veton Kepuska, and Humaid Alshamsi
Subjects: Facial expression, Computer science, business.industry, media_common.quotation_subject, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Confusion matrix, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Anger, Facial recognition system, Field (computer science), Sadness, Support vector machine, ComputingMethodologies_PATTERNRECOGNITION, Cog, Face (geometry), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, media_common
Abstract: Automated human emotion detection is a topic of significant interest in the field of computer vision. Over the past decade, much emphasis has been on using facial expression recognition (FER) to extract emotion from facial expressions. In this paper, the proposed system presents a novel method of facial recognition based on the cloud model, in combination with the traditional facial expression system. The process of predicting emotions from facial expression images contains several stages. The first stage of this system is the pre-processing stage, which is applied by detecting the face in images and then resizing the images. The second stage involves extracting features from facial expression images using Facial Landmarks and Center of Gravity (COG) feature extraction algorithms, which generate the training and testing datasets that contain the expressions of Anger, Disgust, Fear, Happiness, Neutrality, Sadness, and Surprise. Support Vector Machine (SVM) classifiers are then used for the classification stage in order to predict the emotion. In addition, a Confusion Matrix (CM) technique is used to evaluate the performance of these classifiers. The proposed system is tested on CK+, JAFFE, and KDEF databases. However, the proposed system achieved a prediction rate of 96.3% when Facial Landmarks and the Center of Gravity (COG)+SVM method are used.
Published: 2017

17. Real time automated facial expression recognition app development on smart phones

Author: Hongying Meng, Veton Kepuska, and Humaid Alshamsi
Subjects: business.industry, Computer science, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Facial recognition system, Support vector machine, Kernel (linear algebra), ComputingMethodologies_PATTERNRECOGNITION, Facial expression recognition, Mobile phone, Face (geometry), 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: Automated facial expression recognition (AFER) is a crucial technology to and a challenging task for human computer interaction. Previous methods of AFER have incorporated different features and classification methods and use basic testing approaches. In this paper, we employ the best feature descriptor for AFER by empirically evaluating the feature descriptors named the Facial Landmarks descriptor and the Center of Gravity descriptor. We examine each feature descriptor by considering one classification method, such as the Support Vector Machine (SVM) method, with three unique facial expression recognition (FER) datasets. In addition to test accuracies, we present confusion matrices of AFER. We also analyze the effect of using these feature and image resolutions on AFER performance. Our study indicates that the Facial Landmarks descriptor is the best choice to run AFER on mobile phones. The results of our study demonstrate that the proposed facial expression recognition on a mobile phone application is successful and provides up to 96.3% recognition accuracy.
Published: 2017

18. Automatic 3D facial expression recognition using geometric and textured feature fusion

Author: Asim Jan and Hongying Meng
Subjects: Histograms, Computer science, business.industry, Speech recognition, media_common.quotation_subject, Feature extraction, Intelligent decision support system, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Pattern recognition, Facial recognition system, Surprise, Databases, Feature (machine learning), Facial features, Three-dimensional face recognition, Three-dimensional displays, Artificial intelligence, Face recognition, Affective computing, business, media_common, Geometric data analysis
Abstract: 3D facial expression recognition has gained more and more interests from affective computing society due to issues such as pose variations and illumination changes caused by 2D imaging having been eliminated. There are many applications that can benefit from this research, such as medical applications involving the detection of pain and psychological effects in patients, in human-computer interaction tasks that intelligent systems use in today's world. In this paper, we look into 3D Facial Expression Recognition, by investigating many feature extraction methods used on the 2D textured images and 3D geometric data, fusing the 2 domains to increase the overall performance. A One Vs All Multi-class SVM Classifier has been adopted to recognize the expressions Angry, Disgust, Fear, Happy, Neutral, Sad and Surprise from the BU-3DFE and Bosphorus databases. The proposed approach displays an increase in performance when the features are fused together.
Published: 2015

19. Automatic affective dimension recognition from naturalistic facial expressions based on wavelet filtering and PLS regression

Author: Hongying Meng, Saeed Turabzadeh, Asim Jan, Yona Falinie A. Gaus, and Fan Zhang
Subjects: Facial expression, Computer science, business.industry, Speech recognition, Feature vector, Feature extraction, Testing, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Wavelet transform, Pattern recognition, Affect (psychology), Mel frequency cepstral coefficient, Videos, Wavelet transforms, ComputingMethodologies_PATTERNRECOGNITION, Partial least squares regression, Mel-frequency cepstrum, Artificial intelligence, Emotion recognition, Valence (psychology), business
Abstract: Automatic affective dimension recognition from facial expression continuously in naturalistic contexts is a very challenging research topic but very important in human-computer interaction. In this paper, an automatic recognition system was proposed to predict the affective dimensions such as Arousal, Valence and Dominance continuously in naturalistic facial expression videos. Firstly, visual and vocal features are extracted from image frames and audio segments in facial expression videos. Secondly, a wavelet transform based digital filtering method is applied to remove the irrelevant noise information in the feature space. Thirdly, Partial Least Squares regression is used to predict the affective dimensions from both video and audio modalities. Finally, two modalities are combined to boost overall performance in the decision fusion process. The proposed method is tested in the fourth international Audio/Visual Emotion Recognition Challenge (AVEC2014) dataset and compared to other state-of-the-art methods in the affect recognition sub-challenge with a good performance.
Published: 2015

20. Singularity Characteristics of Needle EMG IP Signals

Author: David Holder, Hongying Meng, E.W. Abel, and A. Forster
Subjects: Neuromuscular disorder diagnosis, Computer science, Speech recognition, Neuromuscular Junction, Biomedical Engineering, Action Potentials, Electromyography, Sensitivity and Specificity, Signal, EMG, Wavelet, Singularity, medicine, Humans, Diagnosis, Computer-Assisted, DSP, Muscle, Skeletal, Digital signal processing, Motor Neurons, Signal processing, medicine.diagnostic_test, business.industry, Reproducibility of Results, Wavelet transform, Pattern recognition, Neuromuscular Diseases, Electrodes, Implanted, Needles, Signal Processing, Artificial intelligence, Embedded Systems, business
Abstract: This is the post-print version of the Article - Copyright @ 2006 IEEE Clinical electromyography (EMG) interference pattern (IP) signals can reveal more diagnostic information than their constituents, the motor unit action potentials (MUAPs). Singularities and irregular structures typically characterize the mathematically defined content of information in signals. In this paper, a wavelet transform method is used to detect and quantify the singularity characteristics of EMG IP signals using the Lipschitz exponent (LE) and measures derived from it. The performance of the method is assessed in terms of its ability to discriminate healthy, myopathic and neuropathic subjects and how it compares with traditionally used Turns Analysis (TA) methods and a method recently developed by the authors, interscale wavelet maximum (ISWM). Highly significant intergroup differences were found using the LE method. Most of the singularity measures have a performance similar to that of ISWM and considerably better than that of TA. Some measures such as the ratio of the mean LE value to the number of singular points in the signal have considerably superior performance to both methods. These findings add weight to the view that wavelet analysis methods offer an effective way forward in the quantitative analysis of EMG IP signal to assist the clinician in the diagnosis of neuromuscular disorders. This work was funded by the Engineering and Physical Sciences Research Council, Swindon, U.K.
Published: 2006

21. Automatic Depression Scale Prediction using Facial Expression Dynamics and Regression

Author: Yona Falinie A. Gaus, Saeed Turabzadeh, Asim Jan, Hongying Meng, and Fan Zhang
Subjects: Facial expression, Audio signal, business.industry, Feature vector, Inventory, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Beck Depression Inventory, Affective computing, Depression recognition, Pattern recognition, Beck depression, ComputingMethodologies_PATTERNRECOGNITION, Mood, Histogram, Partial least squares regression, Challenge, Artificial intelligence, business, Psychology, Social psychology
Abstract: Depression is a state of low mood and aversion to activity that can affect a person's thoughts, behaviour, feelings and sense of well-being. In such a low mood, both the facial expression and voice appear different from the ones in normal states. In this paper, an automatic system is proposed to predict the scales of Beck Depression Inventory from naturalistic facial expression of the patients with depression. Firstly, features are extracted from corresponding video and audio signals to represent characteristics of facial and vocal expression under depression. Secondly, dynamic features generation method is proposed in the extracted video feature space based on the idea of Motion History Histogram (MHH) for 2-D video motion extraction. Thirdly, Partial Least Squares (PLS) and Linear regression are applied to learn the relationship between the dynamic features and depression scales using training data, and then to predict the depression scale for unseen ones. Finally, decision level fusion was done for combining predictions from both video and audio modalities. The proposed approach is evaluated on the AVEC2014 dataset and the experimental results demonstrate its effectiveness. The work by Asim Jan was supported by School of Engineering & Design/Thomas Gerald Gray PGR Scholarship. The work by Hongying Meng and Saeed Turabzadeh was partially funded by the award of the Brunel Research Initiative and Enterprise Fund (BRIEF). The work by Yona Falinie Binti Abd Gaus was supported by Majlis Amanah Rakyat (MARA) Scholarship.
Published: 2014

22. Affective state level recognition in naturalistic facial and vocal expressions

Author: Nadia Bianchi-Berthouze and Hongying Meng
Subjects: Computer science, Dimensional model of affect, Speech recognition, Feature extraction, Video Recording, Affect (psychology), Markov model, computer.software_genre, Models, Biological, Facial recognition system, Pattern Recognition, Automated, Machine Learning, Speech Production Measurement, Machine learning, Photography, Speech, Computer Simulation, Electrical and Electronic Engineering, HMM, Hidden Markov model, Affective computing, Audio signal processing, Facial expression, Models, Statistical, Training set, Contextual image classification, business.industry, Continuous emotion recognition, Naturalistic affective expressions, Pattern recognition, Computer Science Applications, Facial Expression, Human-Computer Interaction, Affect, ComputingMethodologies_PATTERNRECOGNITION, Control and Systems Engineering, Biometric Identification, Pattern recognition (psychology), Artificial intelligence, business, computer, Algorithms, Software, Information Systems
Abstract: This article is made available through the Brunel Open Access Publishing Fund and is available to view at the link below. Naturalistic affective expressions change at a rate much slower than the typical rate at which video or audio is recorded. This increases the probability that consecutive recorded instants of expressions represent the same affective content. In this paper, we exploit such a relationship to improve the recognition performance of continuous naturalistic affective expressions. Using datasets of naturalistic affective expressions (AVEC 2011 audio and video dataset, PAINFUL video dataset) continuously labeled over time and over different dimensions, we analyze the transitions between levels of those dimensions (e.g., transitions in pain intensity level). We use an information theory approach to show that the transitions occur very slowly and hence suggest modeling them as first-order Markov models. The dimension levels are considered to be the hidden states in the Hidden Markov Model (HMM) framework. Their discrete transition and emission matrices are trained by using the labels provided with the training set. The recognition problem is converted into a best path-finding problem to obtain the best hidden states sequence in HMMs. This is a key difference from previous use of HMMs as classifiers. Modeling of the transitions between dimension levels is integrated in a multistage approach, where the first level performs a mapping between the affective expression features and a soft decision value (e.g., an affective dimension level), and further classification stages are modeled as HMMs that refine that mapping by taking into account the temporal relationships between the output decision labels. The experimental results for each of the unimodal datasets show overall performance to be significantly above that of a standard classification system that does not take into account temporal relationships. In particular, the results on the AVEC 2011 audio dataset outperform all other systems presented at the international competition.
Published: 2013

23. Implementation and applications of tri-state self organising maps on FPGA

Author: Hongying Meng, Andrew Hunter, Patrick Dickinson, and Kofi Appiah
Subjects: Self-organizing map, Binary SOM, Contextual image classification, business.industry, Computer science, Character recognition, Cognitive neuroscience of visual object recognition, Pattern recognition, Object recognition, Object (computer science), Identification (information), Media Technology, Computer vision, Artificial intelligence, State (computer science), Electrical and Electronic Engineering, business, Cluster analysis, Field-programmable gate array, FPGA
Abstract: This is the post-print version of the Article. The official published version can be accessed from the link below - Copyright @ 2012 IEEE This paper introduces a tri-state logic self-organizing map (bSOM) designed and implemented on a field programmable gate array (FPGA) chip. The bSOM takes binary inputs and maintains tri-state weights. A novel training rule is presented. The bSOM is well suited to FPGA implementation, trains quicker than the original self-organizing map (SOM), and can be used in clustering and classification problems with binary input data. Two practical applications, character recognition and appearance-based object identification, are used to illustrate the performance of the implementation. The appearance-based object identification forms part of an end-to-end surveillance system implemented wholly on FPGA. In both applications, binary signatures extracted from the objects are processed by the bSOM. The system performance is compared with a traditional SOM with real-valued weights and a strictly binary weighted SOM.
Published: 2012

24. A binary Self-Organizing Map and its FPGA implementation

Author: Andrew Hunter, Mervyn Hobden, Kofi Appiah, Peter Hobden, Shigang Yue, Hongying Meng, Cy Pettit, and Nigel Priestley
Subjects: Self-organizing map, Theoretical computer science, Logic synthesis, business.industry, Computer science, Control system, Binary number, Pattern recognition, Hamming distance, Artificial intelligence, Chip, Field-programmable gate array, business
Abstract: A binary Self Organizing Map (SOM) has been designed and implemented on a Field Programmable Gate Array (FPGA) chip. A novel learning algorithm which takes binary inputs and maintains tri-state weights is presented. The binary SOM has the capability of recognizing binary input sequences after training. A novel tri-state rule is used in updating the network weights during the training phase. The rule implementation is highly suited to the FPGA architecture, and allows extremely rapid training. This architecture may be used in real-time for fast pattern clustering and classification of binary features.
Published: 2009

25. Descriptive temporal template features for visual motion recognition

Author: Nick Pears and Hongying Meng
Subjects: G740 Computer Vision, Motion analysis, Computer science, Feature vector, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, G700 Artificial Intelligence, Event recognition, Motion (physics), Gesture recognition, Artificial Intelligence, Histogram, Machine learning, Feature (machine learning), Computer vision, Representation (mathematics), business.industry, Embedded vision, Pattern recognition, Support vector machine, Signal Processing, Computer Vision and Pattern Recognition, Artificial intelligence, G440 Human-computer Interaction, business, Software
Abstract: Copyright © 2009 Elsevier B.V. All rights reserved. In this paper, a human action recognition system is proposed. The system is based on new, descriptive ‘temporal template’ features in order to achieve high-speed recognition in real-time, embedded applications. The limitations of the well-known ‘Motion History Image’ (MHI) temporal template are addressed and a new ‘Motion History Histogram’ (MHH) feature is proposed to capture more motion information in the video. MHH not only provides rich motion information, but also remains computationally inexpensive. To further improve classification performance, we combine both MHI and MHH into a low dimensional feature vector which is processed by a support vector machine (SVM). Experimental results show that our new representation can achieve a significant improvement in the performance of human action recognition over existing comparable methods, which use 2D temporal template based representations.
Published: 2009

26. Motion Feature Combination for Human Action Recognition in Video

Author: Hongying Meng, Chris Bailey, and Nick Pears
Subjects: Support vector machine, Dimension (vector space), Computer science, business.industry, Feature vector, Histogram, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Pattern recognition, Computer vision, Artificial intelligence, business, Motion (physics), Haar wavelet
Abstract: We study the human action recognition problem based on motion features directly extracted from video. In order to implement a fast human action recognition system, we select simple features that can be obtained from non-intensive computation. We propose to use the motion history image (MHI) as our fundamental representation of the motion. This is then further processed to give a histogram of the MHI and the Haar wavelet transform of the MHI. The combination of these two features is computed cheaply and has a lower dimension than the original MHI. The combined feature vector is tested in a Support Vector Machine (SVM) based human action recognition system and a significant performance improvement has been achieved. The system is efficient to be used in real-time human action classification systems.
Published: 2008

27. A Human Action Recognition System for Embedded Computer Vision Application

Author: Chris Bailey, Nick Pears, and Hongying Meng
Subjects: Contextual image classification, business.industry, Computer science, Feature vector, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Pattern recognition, Application software, computer.software_genre, Support vector machine, Gesture recognition, Histogram, Computer vision, Artificial intelligence, business, Classifier (UML), computer
Abstract: In this paper, we propose a human action recognition system suitable for embedded computer vision applications in security systems, human-computer interaction and intelligent environments. Our system is suitable for embedded computer vision application based on three reasons. Firstly, the system was based on a linear support vector machine (SVM) classifier where classification progress can be implemented easily and quickly in embedded hardware. Secondly, we use compacted motion features easily obtained from videos. We address the limitations of the well known motion history image (MHI) and propose a new hierarchical motion history histogram (HMHH) feature to represent the motion information. HMHH not only provides rich motion information, but also remains computationally inexpensive. Finally, we combine MHI and HMHH together and extract a low dimension feature vector to be used in the SVM classifiers. Experimental results show that our system achieves significant improvement on the recognition performance.
Published: 2007

28. Human Action Classification Using SVM_2K Classifier on Motion Features

Author: Hongying Meng, Nick Pears, and Chris Bailey
Subjects: Support vector machine, ComputingMethodologies_PATTERNRECOGNITION, Structured support vector machine, Computer science, business.industry, Speech recognition, Feature vector, Linear classifier, Pattern recognition, Artificial intelligence, Quadratic classifier, business, Classifier (UML)
Abstract: In this paper, we study the human action classification problem based on motion features directly extracted from video. In order to implement a fast classification system, we select simple features that can be obtained from non-intensive computation. We also introduce the new SVM_2K classifier that can achieve improved performance over a standard SVM by combining two types of motion feature vector together. After learning, classification can be implemented very quickly because SVM_2K is a linear classifier. Experimental results demonstrate the method to be efficient and may be used in real-time human action classification systems.
Published: 2006

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

28 results on '"Hongying Meng"'

1. Self-Supervised Representation Learning for Videos by Segmenting via Sampling Rate Order Prediction

2. Data-Fusion-Based Two-Stage Cascade Framework for Multimodality Face Anti-Spoofing

3. Triplet interactive attention network for cross-modality person re-identification

4. Automatic Fuzzy Clustering Framework for Image Segmentation

5. Fast and Automatic Image Segmentation Using Superpixel-Based Graph Clustering

6. Superpixel-Based Fast Fuzzy C-Means Clustering for Color Image Segmentation

7. Significantly Fast and Robust Fuzzy C-Means Clustering Algorithm Based on Morphological Reconstruction and Membership Filtering

8. Robust Self-Sparse Fuzzy Clustering for Image Segmentation

9. Automatic Modulation Recognition: A Few-Shot Learning Method Based on the Capsule Network

10. Adaptive Morphological Reconstruction for Seeded Image Segmentation

11. End-to-end Change Detection Using a Symmetric Fully Convolutional Network for Landslide Mapping

12. Learning Spectral and Spatial Features Based on Generative Adversarial Network for Hyperspectral Image Super-Resolution

13. Diffusion Tensor Image Segmentation Based on Multi-Atlas Active Shape Model

14. An Ensemble Classifier Based on Three-Way Decisions for Social Touch Gesture Recognition

15. Accurate Facial Parts Localization and Deep Learning for 3D Facial Expression Recognition

16. Automated facial expression recognition app development on smart phones using cloud computing

17. Real time automated facial expression recognition app development on smart phones

18. Automatic 3D facial expression recognition using geometric and textured feature fusion

19. Automatic affective dimension recognition from naturalistic facial expressions based on wavelet filtering and PLS regression

20. Singularity Characteristics of Needle EMG IP Signals

21. Automatic Depression Scale Prediction using Facial Expression Dynamics and Regression

22. Affective state level recognition in naturalistic facial and vocal expressions

23. Implementation and applications of tri-state self organising maps on FPGA

24. A binary Self-Organizing Map and its FPGA implementation

25. Descriptive temporal template features for visual motion recognition

26. Motion Feature Combination for Human Action Recognition in Video

27. A Human Action Recognition System for Embedded Computer Vision Application

28. Human Action Classification Using SVM_2K Classifier on Motion Features

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

28 results on '"Hongying Meng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources