Back to Search
Start Over
Semantic data mining of short utterances
- Source :
- ICASSP (1)
- Publication Year :
- 2005
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2005.
-
Abstract
- This paper introduces a methodology for speech data mining along with the tools that the methodology requires. We show how they increase the productivity of the analyst who seeks relationships among the contents of multiple utterances and ultimately must link some newly discovered context into testable hypotheses about new information. While, in its simplest form, one can extend text data mining to speech data mining by using text tools on the output of a speech recognizer, we have found that it is not optimal. We show how data mining techniques that are typically applied to text should be modified to enable an analyst to do effective semantic data mining on a large collection of short speech utterances. For the purposes of this paper, we examine semantic data mining in the context of semantic parsing and analysis in a specific situation involving the solution of a business problem that is known to the analyst. We are not attempting a generic semantic analysis of a set of speech. Our tools and methods allow the analyst to mine the speech data to discover the semantics that best cover the desired solution. The coverage, in this case, yields a set of Natural Language Understanding (NLU) classifiers that serve as testable hypotheses.
- Subjects :
- Audio mining
Acoustics and Ultrasonics
Computer science
Semantic analysis (machine learning)
Feature extraction
Natural language understanding
Relevance feedback
Context (language use)
Concept mining
computer.software_genre
Semantics
Text mining
Semantic computing
Electrical and Electronic Engineering
Parsing
Data stream mining
business.industry
Speech processing
Data pre-processing
Computer Vision and Pattern Recognition
Artificial intelligence
business
computer
Software
Natural language processing
Natural language
Subjects
Details
- ISSN :
- 10636676
- Volume :
- 13
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Speech and Audio Processing
- Accession number :
- edsair.doi.dedup.....1f2fd579f691f647d6a7e90b2c934a5f
- Full Text :
- https://doi.org/10.1109/tsa.2005.851875