Back to Search
Start Over
Speaker identification for security systems using reinforcement-trained pRAM neural network architectures
- Source :
- IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews, IEEE Trans Syst Man Cybern Pt C Appl Rev
- Publication Year :
- 2001
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2001.
-
Abstract
- Speaker identification may be employed as part of a security system requiring user authentication. In this case, the claimed identity of the user is known from a magnetic card and PIN number, for example, and an utterance is requested to confirm the identity of the user. A fast response is necessary in the confirmation phase and a fast registration process for new users is desirable. The time encoded signal processing and recognition (TESPAR) digital language is used to preprocess the speech signal. A speaker cannot be identified directly from the single TESPAR vector since there is a highly nonlinear relationship between the vector's components such that vectors are not linearly separable. Therefore the vector and its characteristics suggest that classification using a neural network will provide an effective solution. Good classification performance has been achieved using a probabilistic RAM (pRAM) neuron. Four probabilistic pRAM neural network architectures are presented. A performance of approximately 97% correct classifications has been obtained, which is similar to results obtained elsewhere (M. Sharma and R.J. Mammone, 1996), and slightly better than a MLP network. No speech recognition stage was used in obtaining these results, so the performance relates only to identifying a speaker's voice and is therefore independent of the spoken phrase. This has been achieved in a hardware-realizable system which may be incorporated into a smart-card or similar application.
- Subjects :
- Phrase
Computer science
Speech recognition
Multilayer neural networks
VLSI circuits
Security systems
Electrical and Electronic Engineering
Signal compression
Probability
Signal processing
Time encoded signal processing and recognition
Artificial neural network
Probabilistic logic
Process (computing)
Vectors
Speaker recognition
Probabilistic RAM
Reinforcement training
Computer Science Applications
Human-Computer Interaction
Speaker diarisation
Speech processing
Control and Systems Engineering
Speaker identification
Neural networks
Random access storage
Software
Utterance
Information Systems
Subjects
Details
- ISSN :
- 10946977
- Volume :
- 31
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)
- Accession number :
- edsair.doi.dedup.....bd21231e55adf0f966ea373d9be006df
- Full Text :
- https://doi.org/10.1109/5326.923269