39 results on '"Alan W. C. Tan"'
Search Results
2. Sparse representation and reproduction of speech signals in complex Fourier basis
- Author
-
Cheah-Heng Tan, Lee-Chung Kwek, Khaled A. Alaghbari, Heng Siong Lim, and Alan W. C. Tan
- Subjects
Human-Computer Interaction ,Linguistics and Language ,Computer science ,business.industry ,Reproduction (economics) ,Pattern recognition ,Basis function ,Computer Vision and Pattern Recognition ,Sparse approximation ,Artificial intelligence ,business ,Language and Linguistics ,Software - Published
- 2021
3. PSO optimization of synergetic neural classifier for multichannel emotion recognition.
- Author
-
Wee Ming Wong, Alan W. C. Tan, Chu Kiong Loo, and Wei Shiung Liew
- Published
- 2010
- Full Text
- View/download PDF
4. Receiver implementation for long codes in 3GPP.
- Author
-
Alan W. C. Tan, Moh Lim Sim, and Sze Wei Lee
- Published
- 2003
- Full Text
- View/download PDF
5. Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image
- Author
-
Chin Poo Lee, Kian Ming Lim, Alan W. C. Tan, and Shing Chiang Tan
- Subjects
Ground truth ,Computer Networks and Communications ,Computer science ,business.industry ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Sign language ,Tracking (particle physics) ,Convolutional neural network ,Hardware and Architecture ,Position (vector) ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Artificial intelligence ,Representation (mathematics) ,business ,Software ,Energy (signal processing) - Abstract
This paper presents an isolated sign language recognition system that comprises of two main phases: hand tracking and hand representation. In the hand tracking phase, an annotated hand dataset is used to extract the hand patches to pre-train Convolutional Neural Network (CNN) hand models. The hand tracking is performed by the particle filter that combines hand motion and CNN pre-trained hand models into a joint likelihood observation model. The predicted hand position corresponds to the location of the particle with the highest joint likelihood. Based on the predicted hand position, a square hand region centered around the predicted position is segmented and serves as the input to the hand representation phase. In the hand representation phase, a compact hand representation is computed by averaging the segmented hand regions. The obtained hand representation is referred to as “Hand Energy Image (HEI)”. Quantitative and qualitative analysis show that the proposed hand tracking method is able to predict the hand positions that are closer to the ground truth. Similarly, the proposed HEI hand representation outperforms other methods in the isolated sign language recognition.
- Published
- 2019
6. UAV Relay Flight Path Planning in the Presence of Jamming Signal
- Author
-
Meng-Chuan Mah, Heng Siong Lim, and Alan W. C. Tan
- Subjects
General Computer Science ,Computer science ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,Jamming ,Data_CODINGANDINFORMATIONTHEORY ,02 engineering and technology ,Interference (wave propagation) ,Communications system ,Signal ,law.invention ,Computer Science::Robotics ,0203 mechanical engineering ,unmanned aerial vehicle (UAV) ,Relay ,law ,Control theory ,Computer Science::Networking and Internet Architecture ,0202 electrical engineering, electronic engineering, information engineering ,Wireless ,General Materials Science ,jamming ,path planning ,Computer Science::Information Theory ,wireless relay network ,business.industry ,ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS ,General Engineering ,020302 automobile design & engineering ,020206 networking & telecommunications ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,Interference ,business ,lcsh:TK1-9971 - Abstract
The flight path of the unmanned aerial vehicle (UAV) relay can significantly affect the performance of the wireless relay communication system. In this paper, we investigate the optimization of the UAV flight path in a wireless relay communication system, when a jamming or interference signal is present. The UAV flight path is optimized to maximize the signal-to-interference-plus-noise ratio (SINR), and the required parameters are estimated. The simulation results show that the proposed method can achieve performance close to an ideal SINR optimization and also highlight the benefits of optimizing the SINR instead of the signal-to-noise ratio (SNR) when a jamming or interference signal is present.
- Published
- 2019
7. A four dukkha state-space model for hand tracking
- Author
-
Kian Ming Lim, Shing Chiang Tan, and Alan W. C. Tan
- Subjects
State-space representation ,American Sign Language ,Computer science ,business.industry ,Cognitive Neuroscience ,020206 networking & telecommunications ,02 engineering and technology ,Sign language ,language.human_language ,Computer Science Applications ,Artificial Intelligence ,Dukkha ,0202 electrical engineering, electronic engineering, information engineering ,language ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Gesture - Abstract
In this paper, we propose a hand tracking method which was inspired by the notion of the four dukkha: birth, aging, sickness and death (BASD) in Buddhism. Based on this philosophy, we formalize the hand tracking problem in the BASD framework, and apply it to hand track hand gestures in isolated sign language videos. The proposed BASD method is a novel nature-inspired computational intelligence method which is able to handle complex real-world tracking problem. The proposed BASD framework operates in a manner similar to a standard state-space model, but maintains multiple hypotheses and integrates hypothesis update and propagation mechanisms that resemble the effect of BASD. The survival of the hypothesis relies upon the strength, aging and sickness of existing hypotheses, and new hypotheses are birthed by the fittest pairs of parent hypotheses. These properties resolve the sample impoverishment problem of the particle filter. The estimated hand trajectories show promising results for the American sign language.
- Published
- 2017
8. Review on Vision-Based Gait Recognition: Representations, Classification Schemes and Datasets
- Author
-
Kian Ming Lim, Chin Poo Lee, and Alan W. C. Tan
- Subjects
Multidisciplinary ,Biometrics ,business.industry ,Computer science ,010401 analytical chemistry ,Feature extraction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pattern recognition ,02 engineering and technology ,01 natural sciences ,Motion (physics) ,0104 chemical sciences ,Set (abstract data type) ,Range (mathematics) ,ComputingMethodologies_PATTERNRECOGNITION ,Gait (human) ,0202 electrical engineering, electronic engineering, information engineering ,Benchmark (computing) ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Representation (mathematics) ,business - Abstract
Gait has unique advantage at a distance when other biometrics cannot be used since they are at too low resolution or obscured, as commonly observed in visual surveillance systems. This paper provides a survey of the technical advancements in vision-based gait recognition. A wide range of publications are discussed in this survey embracing different perspectives of the research in this area, including gait feature extraction, classification schemes and standard gait databases. There are two major groups of the state-of-the-art techniques in characterizing gait: Model-based and motion-free. The model-based approach obtains a set of body or motion parameters via human body or motion modeling. The model-free approach, on the other hand, derives a description of the motion without assuming any model. Each major category is further organized into several subcategories based on the nature of gait representation. In addition, some widely used classification schemes and benchmark databases for evaluating performance are also discussed.
- Published
- 2017
9. Block-based histogram of optical flow for isolated sign language recognition
- Author
-
Alan W. C. Tan, Kian Ming Lim, and Shing Chiang Tan
- Subjects
business.industry ,Feature vector ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Histogram matching ,Optical flow ,02 engineering and technology ,Sign language ,020204 information systems ,Histogram ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,020201 artificial intelligence & image processing ,Computer vision ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Image histogram ,Sign (mathematics) ,Mathematics ,Gesture - Abstract
A normalized histogram of optical flow as a hand representation of the sign language.Block-based histogram provides spatial information and local translation invariant.Block-based histogram of optical flow enables sign language length invariance. In this paper, we propose a block-based histogram of optical flow (BHOF) to generate hand representation in sign language recognition. Optical flow of the sign language video is computed in a region centered around the location of the detected hand position. The hand patches of optical flow are segmented into M spatial blocks, where each block is a cuboid of a segment of a frame across the entire sign gesture video. The histogram of each block is then computed and normalized by its sum. The feature vector of all blocks are then concatenated as the BHOF sign gesture representation. The proposed method provides a compact scale-invariant representation of the sign language. Furthermore, block-based histogram encodes spatial information and provides local translation invariance in the extracted optical flow. Additionally, the proposed BHOF also introduces sign language length invariancy into its representation, and thereby, produce promising recognition rate in signer independent problems.
- Published
- 2016
10. Robust Signal-to-Noise Ratio Estimation in Non-Gaussian Noise Channel
- Author
-
Heng Siong Lim, Ying-Siew Lo, and Alan W. C. Tan
- Subjects
business.industry ,Gaussian ,Estimator ,020302 automobile design & engineering ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Mixture model ,Computer Science Applications ,Gaussian random field ,symbols.namesake ,Additive white Gaussian noise ,0203 mechanical engineering ,Robustness (computer science) ,Gaussian noise ,0202 electrical engineering, electronic engineering, information engineering ,symbols ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Cramér–Rao bound ,Algorithm ,Computer Science::Information Theory ,Mathematics - Abstract
Signal-to-noise ratio (SNR) estimation available in the literature are designed based on the assumption of Gaussian noise models. These estimators may produce misleading results when the distribution of the noise deviates from Gaussian. This paper investigates the performance of existing SNR estimators in an additive non-Gaussian noise channel based on a Gaussian mixture model. An expectation---maximization (EM) based approach is proposed for optimum SNR estimation in the non-Gaussian noise channel. In addition, the Cramer---Rao bound is derived and used as a benchmark to assess the performance of the SNR estimators. Simulation results confirm the optimality and robustness of the proposed EM-based estimator in Gaussian and non-Gaussian noise channels.
- Published
- 2016
11. A feature covariance matrix with serial particle filter for isolated sign language recognition
- Author
-
Shing Chiang Tan, Alan W. C. Tan, and Kian Ming Lim
- Subjects
American Sign Language ,Computer science ,business.industry ,Covariance matrix ,Feature extraction ,General Engineering ,020207 software engineering ,02 engineering and technology ,Sign language ,language.human_language ,Computer Science Applications ,Artificial Intelligence ,Gesture recognition ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,language ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,business ,Particle filter ,Sign (mathematics) ,Gesture - Abstract
A fusion of median and mode filtering for better background model.A serial particle filter that can better detect and track the object of interest.A novel covariance matrix feature for isolated sign language representation. As is widely recognized, sign language recognition is a very challenging visual recognition problem. In this paper, we propose a feature covariance matrix based serial particle filter for isolated sign language recognition. At the preprocessing stage, the fusion of the median and mode filters is employed to extract the foreground and thereby enhances hand detection. We propose to serially track the hands of the signer, as opposed to tracking both hands at the same time, to reduce the misdirection of target objects. Subsequently, the region around the tracked hands is extracted to generate the feature covariance matrix as a compact representation of the tracked hand gesture, and thereby reduce the dimensionality of the features. In addition, the proposed feature covariance matrix is able to adapt to new signs due to its ability to integrate multiple correlated features in a natural way, without any retraining process. The experimental results show that the hand trajectories as obtained through the proposed serial hand tracking are closer to the ground truth. The sign gesture recognition based on the proposed methods yields a 87.33% recognition rate for the American Sign Language. The proposed hand tracking and feature extraction methodology is an important milestone in the development of expert systems designed for sign language recognition, such as automated sign language translation systems.
- Published
- 2016
12. Secrecy improvement via joint optimization of UAV relay flight path and transmit power
- Author
-
Meng-Chuan Mah, Alan W. C. Tan, and Heng Siong Lim
- Subjects
Computer science ,Particle swarm optimization ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,020206 networking & telecommunications ,Eavesdropping ,Data_CODINGANDINFORMATIONTHEORY ,02 engineering and technology ,010501 environmental sciences ,Transmitter power output ,01 natural sciences ,Signal ,law.invention ,Power (physics) ,Power optimization ,Relay ,law ,Control theory ,Automotive Engineering ,Secrecy ,0202 electrical engineering, electronic engineering, information engineering ,Electrical and Electronic Engineering ,0105 earth and related environmental sciences - Abstract
Eavesdropping occurs when an unauthorized receiver secretly intercepts a transmitted signal. In this paper, we investigate the optimum flight path and transmit power for an unmanned aerial vehicle (UAV) relay to improve secrecy performance in the presence of an eavesdropper. We propose an algorithm based on particle swarm optimization (PSO) where the UAV flight path and power are jointly optimized to maximize the secrecy capacity. Simulation results show that the proposed algorithm is capable of achieving a performance that is close to the ideal secrecy performance. Simulation results also highlight the benefits of power optimization.
- Published
- 2020
13. Gait recognition with Transient Binary Patterns
- Author
-
Chin Poo Lee, Alan W. C. Tan, and Shing Chiang Tan
- Subjects
Computer science ,business.industry ,Texture Descriptor ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pattern recognition ,Facial recognition system ,Silhouette ,Gait (human) ,Histogram ,Signal Processing ,Media Technology ,Computer vision ,Computer Vision and Pattern Recognition ,Noise (video) ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Focus (optics) ,Spatial analysis ,ComputingMethodologies_COMPUTERGRAPHICS - Abstract
Transient Binary Patterns of gait sequence.Inherently combines both appearance information and temporal information.Less sensitive to silhouette noise in individual frames. In this work, we present a combination of spatiotemporal approach and texture descriptors to extract the temporal patterns in gait cycles. Unlike most conventional methods that focus on spatial information while limiting temporal information captured, spatiotemporal methods preserve both spatial and temporal information. Inspired by the success of texture descriptors in face recognition, the proposed method likewise constructs texture descriptors of gait motion over time. For each gait cycle, the pixel-wise binary patterns along the temporal axis, referred to as the Transient Binary Patterns (TBP), is analyzed. These pixel-wise TBPs are then grouped into regional blocks from which we construct regional TBP histograms. These regional TBP histograms collectively form the global TBP histogram that represents both the distribution of temporal patterns and spatial location. Experimental results clearly show the superiority of the proposed approach over other considered methods.
- Published
- 2015
14. Fingerprint Ridge Distance Estimation: A Mathematical Modeling
- Author
-
Alan W. C. Tan, E. K. Wong, and Shing Chyi Chua
- Subjects
Computer science ,business.industry ,Speech recognition ,Fingerprint (computing) ,Ridge (meteorology) ,Pattern recognition ,Artificial intelligence ,business - Published
- 2015
15. Gait recognition using binarized statistical image features and histograms of oriented gradients
- Author
-
Chin Poo Lee, Kian Ming Lim, Alan W. C. Tan, and Jashila Nair Mogan
- Subjects
Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,ComputingMethodologies_PATTERNRECOGNITION ,Kernel (image processing) ,Feature (computer vision) ,Histogram ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,ComputingMethodologies_COMPUTERGRAPHICS - Abstract
This paper presents a gait recognition method using the combination of motion history image (MHI), binarized statistical image features (BSIF) and histograms of oriented gradients (HOG). The method first encodes the motion pattern and direction of the gait cycle in motion history image. Subsequently, performing convolution on the motion history image using pre-learnt filters as kernel, binarized statistical image features are generated by summing the convolution output images. Histograms of oriented gradients are then computed on binarized statistical image features. Gait signature of a gait cycle is attained by accumulating all the HOG descriptors. Experimental result shows that the proposed method performs promisingly in gait recognition.
- Published
- 2017
16. Real-Time Robust Voice Activity Detection Using the Upper Envelope Weighted Entropy Measure and the Dual-Rate Adaptive Nonlinear Filter
- Author
-
Wei Qing Ong, Thean Hai Ooi, Cheah Heng Tan, V. Vijayakumar Vengadasalam, and Alan W. C. Tan
- Subjects
Computational complexity theory ,Computer science ,General Physics and Astronomy ,weight factor ,lcsh:Astrophysics ,02 engineering and technology ,gammatone filter ,030507 speech-language pathology & audiology ,03 medical and health sciences ,Nonlinear filter ,Control theory ,lcsh:QB460-466 ,0202 electrical engineering, electronic engineering, information engineering ,Kernel adaptive filter ,Entropy (information theory) ,voice activity detector (VAD) ,lcsh:Science ,Voice activity detection ,asymmetric nonlinear filter ,entropy ,dual-rate adaptive nonlinear filter ,lcsh:QC1-999 ,Adaptive filter ,lcsh:Q ,020201 artificial intelligence & image processing ,False alarm ,0305 other medical science ,Gammatone filter ,lcsh:Physics - Abstract
Voice activity detection (VAD) is a vital process in voice communication systems to avoid unnecessary coding and transmission of noise. Most of the existing VAD algorithms continue to suffer high false alarm rates and low sensitivity when the signal-to-noise ratio (SNR) is low, at 0 dB and below. Others are developed to operate in offline mode or are impractical for implementation in actual devices due to high computational complexity. This paper proposes the upper envelope weighted entropy (UEWE) measure as a means to enable high separation of speech and non-speech segments in voice communication. The asymmetric nonlinear filter (ANF) is employed in UEWE to extract the adaptive weight factor that is subsequently used to compensate the noise effect. In addition, this paper also introduces a dual-rate adaptive nonlinear filter (DANF) with high adaptivity to rapid time-varying noise for computation of the decision threshold. Performance comparison with standard and recent VADs shows that the proposed algorithm is superior especially in real-time practical applications.
- Published
- 2017
- Full Text
- View/download PDF
17. Real-time Voice Activity Detector Using Gammatone Filter and Modified Long-Term Signal Variability
- Author
-
Cheah Heng Tan, L Vengadasalam, Thean Hai Ooi, Wei Qing Ong, and Alan W. C. Tan
- Subjects
symbols.namesake ,Additive white Gaussian noise ,Discriminative model ,Computer science ,Nonlinear filter ,Speech recognition ,Detector ,Signal variability ,symbols ,Entropy (information theory) ,TIMIT ,Gammatone filter - Abstract
In this paper, a real-time robust voice activity detector (VAD) is proposed. The proposed VAD adopts the gammatone filter and modify the existing long-term signal variability (LTSV) measure, i.e. known as GMLTSV in short. The proposed VAD is an improved version of existing VAD which used gammatone filter and entropy. The LTSV measure is modified to adapt to the amplitude envelopes extracted using gammatone filter by swapping entropy and variance used in LTSV measure to reduce noise effect in the extracted temporal envelopes and improve discriminative power of the extracted feature. The proposed algorithm also implements an adaptive threshold that is computed using a nonlinear filter to track short-term trend of the extracted feature in real-time. The proposed VAD using GMLTSV feature is tested against clean speech signals from TIMIT test corpus which are degraded at SNR ranged from -10dB to 20dB by non-stationary noise, eg. airport noise, babble noise, exhibition noise from Aurora-2 database, and stationary noise, eg. additive white Gaussian noise. Based on the evaluation, it is proven that the proposed GMLTSV-based VAD is robust in speech and non-speech detection even at low signal-to-noise ratio (SNR) and outperformed other existing voice activity detectors which are compared in the evaluation. The proposed VAD achieved satisfactory accuracy when compared to the impractical single frequency filtering based VAD while implementing real-time scheme for practical application.
- Published
- 2017
18. Gait recognition using temporal gradient patterns
- Author
-
Jashila Nair Mogan, Chin Poo Lee, and Alan W. C. Tan
- Subjects
021110 strategic, defence & security studies ,Current (mathematics) ,Pixel ,Computer science ,business.industry ,Frame (networking) ,0211 other engineering and technologies ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Bin ,Silhouette ,Image (mathematics) ,Matrix (mathematics) ,Gait (human) ,0202 electrical engineering, electronic engineering, information engineering ,Computer vision ,Artificial intelligence ,business - Abstract
In this paper, a Temporal Gradient Patterns (TGP) method is proposed for gait recognition. The method first computes the gradients of the silhouette in each image. Subsequently, the gradient of each pixel determines the bin number to which the pixel belongs to. The bin number of current and next frame jointly cast a vote to the corresponding index in the matrix of oriented gradients. The obtained matrix henceforth encodes the gradient pattern of each pixel in the gait cycle. The TGP method not only describes the spatial silhouette shapes but also implicitly captures the silhouette deformation in temporal axis. Experimental results show that the proposed approach attains a promising recognition rates.
- Published
- 2017
19. Gait probability image: An information-theoretic model of gait representation
- Author
-
Chin Poo Lee, Shing Chiang Tan, and Alan W. C. Tan
- Subjects
Pixel ,business.industry ,Computer science ,Physics::Medical Physics ,Probabilistic logic ,Effect of gait parameters on energetic cost ,Pattern recognition ,Computer Science::Robotics ,Preferred walking speed ,Binomial distribution ,Gait (human) ,Computer Science::Systems and Control ,Computer Science::Computer Vision and Pattern Recognition ,Gait analysis ,Signal Processing ,Media Technology ,Computer vision ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,Divergence (statistics) ,business - Abstract
In this paper, we propose a new probabilistic gait representation to characterize human walking for recognition by gait. The approach obtains the binomial distribution of every pixel in a gait cycle. Organizing the binomial distribution of all pixels in the gait image, we obtain the gait signature, which we denote as the Gait Probability Image (GPI). In the recognition stage, symmetric Kullback-Leibler divergence is used to measure the information theoretical distance between gait signatures. The experimental results reveal that GPI achieves promising recognition rates. Besides that, experiments on different walking speeds demonstrate that GPI is robust to slight variation in walking speed.
- Published
- 2014
20. Time-sliced averaged motion history image for gait recognition
- Author
-
Shing Chiang Tan, Chin Poo Lee, and Alan W. C. Tan
- Subjects
Computer science ,business.industry ,Motion (physics) ,Image (mathematics) ,Preferred walking speed ,Gait (human) ,Gait analysis ,Histogram ,Signal Processing ,Media Technology ,Computer vision ,Transient (computer programming) ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Representation (mathematics) - Abstract
In this paper, we propose a time-sliced averaged motion history image (TAMHI) alongside the histograms of oriented gradients (HOG) to generate gait signatures in a gait recognition problem. Building on the motion history image (MHI), TAMHI divides the gait cycle into several regular time windows to generate the same number of TAMHI composite images. HOG descriptors are then calculated on these composite images to obtain the gait signature. The time-slicing procedure to produce multi-composite images preserve more detailed transient information of gait cycles. Additionally, time-normalization also introduces gait length invariancy into the representation, hence, offering a better recognition rate to slight changes in walking speed.
- Published
- 2014
21. Improved Channel Estimation for MIMO Interference Cancellation
- Author
-
Meng Chuan Mah, Alan W. C. Tan, and Heng Siong Lim
- Subjects
Computer science ,Orthogonal frequency-division multiplexing ,ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS ,MIMO ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,Jamming ,Data_CODINGANDINFORMATIONTHEORY ,Kalman filter ,Computer Science Applications ,Signal-to-noise ratio ,Single antenna interference cancellation ,Control theory ,Modeling and Simulation ,Electrical and Electronic Engineering ,Communication channel - Abstract
In this letter, a jamming resilient communication technique for unmanned aircraft vehicle (UAV) is proposed. Multiple-input multiple-output (MIMO) interference cancellation is exploited to combat reactive jamming. An improved channel estimation algorithm based on Kalman filter and basis expansion model (BEM) is used to track the quick changes of the combined channel components. Simulation results show that the performance of the proposed technique is significantly better than the existing jamming resilient communication technique.
- Published
- 2015
22. Robust voice activity detection using gammatone filtering and entropy
- Author
-
Alan W. C. Tan and W. Q. Ong
- Subjects
Engineering ,Voice activity detection ,business.industry ,Speech recognition ,Detector ,020206 networking & telecommunications ,Pattern recognition ,TIMIT ,02 engineering and technology ,030507 speech-language pathology & audiology ,03 medical and health sciences ,Computer Science::Sound ,0202 electrical engineering, electronic engineering, information engineering ,Entropy (information theory) ,Artificial intelligence ,0305 other medical science ,business ,Human voice ,Voice activity - Abstract
Voice activity detector (VAD) is used to detect the presence or absence of human voice in a signal. A robust VAD algorithm is essential to distinguish human voice in a noisy acoustic signal. There were many recent works in development of robust VAD which focus on unsupervised features extraction such as temporal variation, signal-to-noise ratio in [1] and etc. However, these methods are typically sensitive to nonstationary noise especially under low SNR. To overcome these problems, this paper presents a robust voice activity detection (VAD) method via a combination of gammatone filtering and entropy as an information-theoretic measure in the detection algorithm. The performance of the proposed algorithm is tested using speech signals from TIMIT test corpus with additive noise at varying degrees of signal-to-noise ratio. The results show that the proposed robust VAD outperforms other existing methods in terms of detection accuracy.
- Published
- 2016
23. Finger spelling recognition using neural network
- Author
-
Kok Seang Tan, Chin Poo Lee, Alan W. C. Tan, Kian Ming Lim, Shing Chiang Tan, and Siti Fatimah Abdul Razak
- Subjects
Artificial neural network ,Computer science ,business.industry ,Speech recognition ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Segmentation ,Computer vision ,Artificial intelligence ,business ,Backpropagation ,Spelling - Abstract
Finger spelling is a way of communication by expressing words using hand signs in order to ensure deaf and dumb community can communicate with others effectively. Therefore, a system that can understand finger spelling is needed. As a result of that, this work is conducted to primarily develop a tutoring system for finger spelling. To develop a robust real-time finger spelling tutoring system, it is necessary to ensure the accuracy of the finger spelling recognition. Even though there are existing solutions available for a decade, but most of them are just focusing on improving accuracy rate without implementing their solutions as a complete tutoring system for finger spelling. Consequently, it inspires this research project to develop a tutoring system for finger spelling. Microsoft Kinect sensor is used to acquire color images and depth images of the finger spells. Depth images are used to perform segmentation on the color images. After that, the segmented images are used as input and pass into a two hidden layers backpropagation neural network for classification.
- Published
- 2015
24. A novel and effective particle swarm optimization like algorithm with extrapolation technique
- Author
-
Alan W. C. Tan, M. V. C. Rao, and M. Senthil Arumugam
- Subjects
Mathematical optimization ,Optimization problem ,Computer Science::Neural and Evolutionary Computation ,MathematicsofComputing_NUMERICALANALYSIS ,Extrapolation ,Particle swarm optimization ,Optimal control ,Rate of convergence ,Position (vector) ,Genetic algorithm ,Benchmark (computing) ,Algorithm ,Software ,Mathematics - Abstract
A novel competitive approach to particle swarm optimization (PSO) algorithms is proposed in this paper. The proposed method uses extrapolation technique with PSO (ePSO) for solving optimization problems. By considering the basics of the PSO algorithm, the current particle position is updated by extrapolating the global best particle position and the current particle positions in the search space. The position equation is formulated with the global best (gbest) position, local best position (pbest) and the current position of the particle. The proposed method is tested with a set of 13 standard optimization benchmark problems and the results are compared with those obtained through two existing PSO algorithms, the canonical PSO (cPSO), the Global-Local best PSO (GLBest PSO). The cPSO includes a time-varying inertia weight (TVIW) and time-varying acceleration co-efficients (TVAC) while the GLBest PSO consists of Global-Local best inertia weight (GLBest IW) with Global-Local best acceleration co-efficient (GLBestAC). The simulation results clearly elucidate that the proposed method produces the near global optimal solution. It is also observed from the comparison of the proposed method with cPSO and GLBest PSO, the ePSO is capable of producing a quality of optimal solution with faster convergence rate. To strengthen the comparison and prove the efficacy of the proposed method a real time application of steel annealing processing (SAP) is also considered. The optimal control objectives of SAP are computed through the above said three PSO algorithms and also through two versions of genetic algorithms (GA), namely, real coded genetic algorithm (RCGA) and hybrid real coded genetic algorithm (HRCGA) and the results are analyzed with the proposed method. From the results obtained through benchmark problems and the real time application of SAP, it is clearly seen that the proposed ePSO method is competitive to the existing PSO algorithms and also to GAs.
- Published
- 2009
25. A composite signal subspace speech classifier
- Author
-
B. S. Daya Sagar, M. V. C. Rao, and Alan W. C. Tan
- Subjects
business.industry ,Covariance matrix ,Speech recognition ,Matrix norm ,Pattern recognition ,White noise ,Speech processing ,Random subspace method ,Matrix (mathematics) ,ComputingMethodologies_PATTERNRECOGNITION ,Control and Systems Engineering ,Signal Processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Software ,Subspace topology ,Mathematics ,Signal subspace - Abstract
Recently, a speech model inspired by signal subspace methods was proposed for a speech classifier. In using subspace information to characterize the speech signal, subspace trajectories in the form of the right singular vectors of the measurement matrices are obtained. Signal classification is thereafter accomplished by a minimum-distance rule with noteworthy results. This paper extends the foregoing approach by organizing the vector trajectories into matrices. The matrices so obtained are the reduced-rank approximation of the sample correlation matrices. A new dissimilarity measure in the Frobenius norm is correspondingly proposed for the matrix trajectories. Simulation results of the proposed composite signal subspace classifier in an isolated digit speech recognition problem reveal an improved performance over its predecessor. Additionally, the results also show the proposed classifier retaining the white noise robustness of the original design.
- Published
- 2007
26. Robust Signal Subspace Speech Classifier
- Author
-
M. V. C. Rao, B. S. Daya Sagar, and Alan W. C. Tan
- Subjects
business.industry ,Applied Mathematics ,Speech recognition ,Classification procedure ,Pattern recognition ,Speech processing ,Random subspace method ,Statistical classification ,ComputingMethodologies_PATTERNRECOGNITION ,Robustness (computer science) ,Signal Processing ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Classifier (UML) ,Subspace topology ,Mathematics ,Signal subspace - Abstract
A speech model inspired by the signal subspace approach was recently proposed as a speech classifier with modest results. The method entails, in general, the assemblage of a set of subspace trajectories that consist of the right singular vectors of measurement matrices of the signal under consideration. Given an unknown signal, a simple distortion measure then applies in the classification procedure to pick the best matched class prototype. This letter examines the issue of robustness in the subspace classification scheme. Borrowing an important result on noisy measurement matrices, this letter formally establishes the notion of robustness in subspace classification and proceeds to propose a class of robust distortion measures for signal subspace models. Simulation results of subspace classifiers implementing the new distortion measures in an isolated digit speech recognition problem reveal no degradation in recognition accuracy, even under low SNR conditions.
- Published
- 2007
27. A signal subspace approach for speech modelling and classification
- Author
-
B. S. Daya Sagar, M. V. C. Rao, and Alan W. C. Tan
- Subjects
business.industry ,Speech recognition ,Pattern recognition ,Speech processing ,Speech model ,Random subspace method ,Difference function ,ComputingMethodologies_PATTERNRECOGNITION ,Control and Systems Engineering ,Signal Processing ,Singular value decomposition ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Classifier (UML) ,Software ,Subspace topology ,Signal subspace ,Mathematics - Abstract
In this paper, a speech classifier inspired by the signal subspace approach is developed. A novel signal subspace speech model is initially obtained via a rank reducing subspace decomposition algorithm that is based on the SVD. Motivated by the assumption that the speech signal comprises of short term dynamics that are slowly changing, it follows that the signal subspace of the speech signal is likewise slowly changing. The proposed signal subspace model aims to characterize the subspace dynamics using a family of subspace trajectories. In particular, each subspace trajectory is a sequence of vectors that traces the dynamics of a rank-one subspace in time. An assembly of these trajectories, henceforth, specifies the progression of the embedded signal subspace. To construct the signal subspace classifier, prototype elements in the form of the signal subspace models are determined for every signal class. A minimum-distance rule with a distance measure that resembles an energy difference function is subsequently applied in the actual classification task. Simulation of the proposed signal subspace classifier in an isolated digit speech recognition problem reveals promising results.
- Published
- 2007
28. A Discriminative Signal Subspace Speech Classifier
- Author
-
M. V. C. Rao, B.S.D. Sagar, and Alan W. C. Tan
- Subjects
Voice activity detection ,Computer science ,business.industry ,Applied Mathematics ,Speech recognition ,Speech coding ,Acoustic model ,Pattern recognition ,Linear predictive coding ,Speech processing ,ComputingMethodologies_PATTERNRECOGNITION ,Discriminative model ,Signal Processing ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Classifier (UML) ,Signal subspace - Abstract
A speech model inspired by the signal subspace methods was recently proposed as a speech classifier with modest results. Fashioned along a "best representation" approach, the absence of valuable interclass information in the speech model, however, impairs the ability of the classifier to distinguish between phonetically alike classes. This letter proposes an improved classifier that implements interclass information. Specifically, a measure of the discriminative quality of individual class elements is defined and determined for all class elements. The discrimination measures thus obtained are subsequently applied in the classification procedure. Simulation results of the proposed signal subspace classifier in an isolated digit speech recognition problem reveal an improved performance over its predecessor
- Published
- 2007
29. Digital compensation of cross modulation distortion in RF subsampling receiver
- Author
-
Heng Siong Lim, Chian Hong Wong, Alan W. C. Tan, and Meng Chuan Mah
- Subjects
Radio receiver design ,business.industry ,Nonlinear distortion ,Modulation ,Computer science ,Distortion ,Electrical engineering ,Electronic engineering ,Software-defined radio ,business ,Signal ,Modulation error ratio ,Digital signal processing - Abstract
The RF subsampling receiver suffers from nonlinear distortion caused by the front end of the receiver, specifically the low noise amplifier. Strong interferers present in the receiver introduce cross modulation distortion to the desired signal. This paper introduces a compensation method for the cross modulation distortion using digital signal processing techniques. In the proposed method, two signals are down converted using the subsampling receiver and digitized. A joint channel and nonlinearity estimation is then performed for the mitigation of the cross modulation distortion. Simulation results show that the proposed method obtains more accurate estimates compared to existing work, and subsequent compensation is improved as shown by the recovery of the desired signal.
- Published
- 2015
30. Robust joint CFO and fast time-varying channel tracking for MIMO-OFDM systems
- Author
-
Heng Siong Lim, Alan W. C. Tan, and Meng Chuan Mah
- Subjects
Extended Kalman filter ,Noise ,symbols.namesake ,Control theory ,Computer science ,Gaussian noise ,Carrier frequency offset ,symbols ,Filter (signal processing) ,MIMO-OFDM ,Communication channel - Abstract
In this paper, a robust joint carrier frequency offset (CFO) and fast time-varying channel tracking algorithm for MIMOOFDM systems is proposed. The proposed method is robust to noise statistics and noise distribution uncertainties. The extended H filter (EHF) is employed for the joint tracking process as it does not require any knowledge of noise statistics or noise distribution, unlike the conventional extended Kalman filter (EKF). To reduce computational complexity, the tracking of fast time-varying channel gains is facilitated by the basis expansion model (BEM). Simulation results show that the performance of the proposed algorithm without knowledge of noise statistics matches the performance of the existing algorithm under Gaussian noise and is better than that of the existing algorithm under non-Gaussian noise.
- Published
- 2014
31. Singular point detection in fingerprint images: An investigation on quantization approach
- Author
-
Shing Chyi Chua, Alan W. C. Tan, and E. K. Wong
- Subjects
Quantization (physics) ,Image matching ,business.industry ,Computer Science::Computer Vision and Pattern Recognition ,Vector quantization ,Pattern recognition ,Computer vision ,Artificial intelligence ,Singular point of a curve ,Fingerprint recognition ,Orientation field ,business ,Mathematics - Abstract
This paper presents a quantization approach to the detection of singular points in fingerprint images. Singular points in fingerprint images referred to the core and delta points. They are the global features found in the orientation field image of a fingerprint. A singular point is normally detected at the intersection of three gray levels of a quantized orientation field image at quantization level-3 but no study has been carried out for higher quantization level. In this paper, the singular point detection has been investigated up to quantization level-8 by employing two performance measures: sensitivity and precision. The performance at level-5 has been found to be optimal when both measures are combined.
- Published
- 2014
32. Subsampling optimization for signal in the presence of interferer in nonlinear systems
- Author
-
Heng Siong Lim, Alan W. C. Tan, and Chian Hong Wong
- Subjects
Nonlinear system ,Multi band ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Electronic engineering ,Process (computing) ,Signal - Abstract
This paper presents a design approach to determine the subsampling frequency ranges for a multi band system, where only one signal is prioritized so that it doesn't overlap with the other signals during the subsampling process. The proposed technique will introduce the formulas derived from an existing method to relax the conditions on the other signals in the multi band case. The results show that by having this assumption, the subsampling frequency needed to extract the desired signal is lower than the previously derived methods.
- Published
- 2014
33. GENERALIZED PREDICTIVE CONTROL ON A ROBOTIC MANIPULATOR SYSTEM
- Author
-
Alan W. C. Tan, E. K. Wong, and Leong Chuan Kwek
- Subjects
Model predictive control ,Computer science ,Control theory ,Parallel manipulator ,Robot manipulator ,Control engineering - Published
- 2013
34. Multiuser detection for DS-CDMA systems using evolutionary programming
- Author
-
Alan W. C. Tan, Hean-Teik Chuah, M.V.C. Rao, and Heng Siong Lim
- Subjects
Theoretical computer science ,Computational complexity theory ,Code division multiple access ,Decision rule ,Multiuser detection ,Evolutionary computation ,Computer Science Applications ,Spread spectrum ,Modeling and Simulation ,Detection theory ,Electrical and Electronic Engineering ,Algorithm ,Evolutionary programming ,Mathematics - Abstract
This article introduces a new multiuser detection scheme which uses evolutionary programming (EP) to detect the user bits based on the maximum-likelihood decision rule. The major advantage of the proposed detector is that it has a lower computational complexity compared to other popular evolutionary-algorithm-based detectors. The simulation results show that the EP has always converged to the optimum solution with a small number of generations. The simulated average computational time performance demonstrates that this approach achieves practical ML performance with polynomial complexity in the number of users.
- Published
- 2003
35. Cost function of blind channel equalization
- Author
-
Khaled A. Alaghbari, Alan W. C. Tan, and Heng Siong Lim
- Subjects
symbols.namesake ,Intersymbol interference ,Noise ,Gaussian noise ,Gaussian ,symbols ,Electronic engineering ,Adaptive equalizer ,Absolute difference ,Function (mathematics) ,Impulse noise ,Algorithm ,Mathematics - Abstract
The use of constant modulus algorithm (CMA) and correntropy in blind adaptive equalization requires choosing a suitable cost function such that the FIR equalizer removes sufficient intersymbol interference (ISI). A correntropy cost function based on minimizing the absolute difference between the correntropy of equalizer output and source symbol is proposed and compared with conventional CMA and correntropy cost functions. Simulation study on both Gaussian and impulsive additive noise scenarios show improved overall performance.
- Published
- 2012
36. Joint carrier frequency offset and channel parameters estimation for MIMO-OFDM systems
- Author
-
Heng Siong Lim, Alan W. C. Tan, and Meng-Chuan Mah
- Subjects
State-transition matrix ,Engineering ,business.industry ,Data_CODINGANDINFORMATIONTHEORY ,MIMO-OFDM ,Noise (electronics) ,Delay spread ,Extended Kalman filter ,Control theory ,Carrier frequency offset ,Frequency offset ,business ,Computer Science::Information Theory ,Communication channel - Abstract
Joint channel and frequency offset tracking based on an autoregressive (AR) model are often performed by either assuming the knowledge of the state transition matrix and state noise or obtaining the state transition matrix and state noise via the Yule Walker equations. In this paper, we propose a joint carrier frequency offset (CFO) and channel parameters tracking algorithm for MIMO-OFDM system which is capable of tracking the channel parameters without requirements of additional information such as maximum Doppler frequency and maximum delay spread. The Extended Kalman Filter is proposed for the joint tracking process. Simulation results show that the performance of the proposed method is close to the case when channel parameters are known. (6 pages)
- Published
- 2012
37. Performance comparison of SNR estimators in Gaussian mixture noise
- Author
-
Alan W. C. Tan, Ying Siew Lo, and Heng Siong Lim
- Subjects
Signal processing ,Estimator ,Noise (electronics) ,Gaussian random field ,symbols.namesake ,Additive white Gaussian noise ,Signal-to-noise ratio ,Robustness (computer science) ,Gaussian noise ,Statistics ,symbols ,Algorithm ,Computer Science::Information Theory ,Mathematics - Abstract
Most of the signal-to-noise ratio (SNR) estimators published in literature are designed based on Gaussian noise assumption. These estimation schemes typically perform poorly when the additive noise has a non-Gaussian distribution. This paper investigates the robustness of several popular SNR estimators in two-term Gaussian mixture noise. The Cramer-Rao bound is derived and used as a benchmark against which the performance of the estimators is measured. Simulations results show that the SNR estimators suffer performance degradation in non-Gaussian noise channels.
- Published
- 2011
38. Warped linear predictive speech coding
- Author
-
Heng Siong Lim, Chian-Hong Wong, and Alan W. C. Tan
- Subjects
business.industry ,Computer science ,Speech coding ,Linear prediction ,Pattern recognition ,Speech enhancement ,symbols.namesake ,Signal-to-noise ratio ,Fourier transform ,Computer Science::Sound ,Distortion ,symbols ,Mel-frequency cepstrum ,Artificial intelligence ,business ,Energy (signal processing) - Abstract
This project aims to enhance human speech energy at low frequencies using linear prediction methods to achieve better performance for speech recognizers in noisy conditions. In order to achieve this, a recognition system based on Warped Linear Prediction (WLP) is proposed. WLP is based on Warped Fourier Transform and the consideration is to warp a signal to another frequency scale and perform Fourier Transform on the warped scale. This technique will transform speech signal so that the frequency resolution at the lower frequency region is higher, thus more detailed information on the signal can be obtained from the low frequencies. After the signal is transformed through warping, cepstral coefficients are obtained and it can be acknowledged as Warped Linear Prediction. Evaluation of the effectiveness of this method has been conducted in isolated word recognition tests. Experimental results show that the WLP performs better than linear prediction method for the set SNR range, based on two distortion measures that were tried. The new method shows no degradation in recognition accuracy under high SNR conditions, but performs significantly better under low SNR conditions. At SNR of 4dB, performance improvements of up to 70 percent can be seen.
- Published
- 2011
39. Robust adaptive frequency-domain equalization for DS-CDMA in non-Gaussian noise channels
- Author
-
Heng Siong Lim, Lillian Yee Kiaw Wang, and Alan W. C. Tan
- Subjects
Computer science ,business.industry ,Code division multiple access ,Gaussian ,Spread spectrum ,Adaptive filter ,symbols.namesake ,Additive white Gaussian noise ,Gaussian noise ,Robustness (computer science) ,Frequency domain ,symbols ,Telecommunications ,business ,Algorithm - Abstract
Adopting frequency-domain equalization (FDE) in DS-CDMA can partially restore the orthogonality of users and suppress multiple-access interference (MAI). The existing FDEs are designed with the assumption of additive white Gaussian noise. Although the assumption is quite proper for a variety of applications, it has been revealed that many background noises arising in the reality are with non-Gaussian statistics. In this paper, we show that the Gaussian-based FDEs fare poorly in non-Gaussian noise environment. To achieve reliable data transmission in frequency-selective multiple-access channel with unknown non-Gaussian noise, we propose a robust adaptive FDE which consists of a nonlinear preprocessing front-end followed by a linear adaptive FDE. The nonlinear front-end adapts itself to the unknown non-Gaussian noise to suppress the impulses while the FDE effectively removes the MAI. Results from the simulation reveal that the proposed robust FDE is able to perform successfully in both Gaussian and non-Gaussian noise environments.
- Published
- 2010
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.