Castsearch - Context Based Spoken Document Retrieval

Authors :
Mølgaard, Lasse Lohilahti
Jørgensen, Kasper Winther
Hansen, Lars Kai
Source :
Mølgaard, L L, Jørgensen, K W & Hansen, L K 2007, Castsearch - Context Based Spoken Document Retrieval. In IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. vol. 4, IEEE, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, United States, 15/04/2007.
Publication Year :
2007

Abstract

The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system combines audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, yielding text input for clustering using non-negative matrix factorization (NMF). The clustering finds semantic topics, which we use to evaluate topic detection performance. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.
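
The topic clustering and query expansion steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy transcripts, the function names (`term_document_matrix`, `nmf`, `expand_query`), the choice of standard multiplicative NMF updates, and the rule of expanding a query with the top terms of its dominant topic are all assumptions made for illustration.

```python
import random

def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    Bt = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def frobenius_error(V, W, H):
    """Squared Frobenius norm of V - WH."""
    WH = matmul(W, H)
    return sum((v - x) ** 2 for rv, rx in zip(V, WH) for v, x in zip(rv, rx))

def term_document_matrix(docs):
    """Return (vocab, V) where V[i][j] counts vocab[i] in docs[j]."""
    tokens = [d.lower().split() for d in docs]
    vocab = sorted({t for doc in tokens for t in doc})
    V = [[float(doc.count(term)) for doc in tokens] for term in vocab]
    return vocab, V

def nmf(V, k, iters=200, seed=0, eps=1e-9):
    """Factor V (terms x docs) into W (terms x k) and H (k x docs)
    using multiplicative updates for the squared-error objective."""
    rng = random.Random(seed)
    n, m = len(V), len(V[0])
    W = [[rng.random() + eps for _ in range(k)] for _ in range(n)]
    H = [[rng.random() + eps for _ in range(m)] for _ in range(k)]
    for _ in range(iters):
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
        H = [[h * a / (b + eps) for h, a, b in zip(rh, ra, rb)]
             for rh, ra, rb in zip(H, num, den)]
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
        W = [[w * a / (b + eps) for w, a, b in zip(rw, ra, rb)]
             for rw, ra, rb in zip(W, num, den)]
    return W, H

def expand_query(query, vocab, W, n_extra=3):
    """Append the strongest terms of the query's dominant topic (assumed rule)."""
    index = {t: i for i, t in enumerate(vocab)}
    terms = query.lower().split()
    known = [t for t in terms if t in index]
    if not known:
        return terms  # nothing to anchor the expansion on
    k = len(W[0])
    scores = [sum(W[index[t]][c] for t in known) for c in range(k)]
    topic = max(range(k), key=scores.__getitem__)
    ranked = sorted(vocab, key=lambda t: W[index[t]][topic], reverse=True)
    extra = [t for t in ranked if t not in terms][:n_extra]
    return terms + extra

# Hypothetical stand-ins for automatic transcripts of news segments.
docs = [
    "storm floods coastal town rain",
    "rain storm warning coastal flooding",
    "election vote senate candidate",
    "senate vote election campaign candidate",
]
vocab, V = term_document_matrix(docs)
W, H = nmf(V, k=2)
expanded = expand_query("storm", vocab, W)
print(expanded)
```

Because the expanded query contains several terms from the same topic, a segment can still be matched when the recognizer has mistranscribed the original query word, which is the effect the abstract attributes to query expansion.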

Details

Database :
OAIster
Notes :
application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1350031235
Document Type :
Electronic Resource