Back to Search
Start Over
Diarization of the Language Consulting Center Telephone Calls
- Source :
- Speech and Computer ISBN: 9783030260606, SPECOM
- Publication Year :
- 2019
- Publisher :
- Springer International Publishing, 2019.
-
Abstract
- V tomto článku popisujeme diarizaci archivu Jazykové poradny vznikajícím v rámci projektu "Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi". Jedna část tohoto archivu je nahraná pouze v mono kvalit, naším úkolem je proto rozdělit data pomocí diarizace. Náš přístup využívá informace o identitě jazykového poradce získané z přepisu jeho představení na začátku každého z hovorů. Protože naše data jsou jedinenčná, pro porovnání uvádíme také výsledky dostupného systému diarizace Kaldi. In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system.
- Subjects :
- Czech
Computer science
media_common.quotation_subject
02 engineering and technology
computer.software_genre
030507 speech-language pathology & audiology
03 medical and health sciences
Transcription (linguistics)
Channel (programming)
0202 electrical engineering, electronic engineering, information engineering
Conversation
Diarization, x-vector, Automatic speech recognition, GMM
Cluster analysis
media_common
business.industry
Diarizace, x-vektor, automatické rozpoznávání řeči, modelování pomocí gaussovských směsí
020206 networking & telecommunications
Mixture model
language.human_language
Speaker diarisation
language
Identity (object-oriented programming)
Artificial intelligence
0305 other medical science
business
computer
Natural language processing
Subjects
Details
- ISBN :
- 978-3-030-26060-6
- ISBNs :
- 9783030260606
- Database :
- OpenAIRE
- Journal :
- Speech and Computer ISBN: 9783030260606, SPECOM
- Accession number :
- edsair.doi.dedup.....471ee01e2b8471365745dfc1fc68afa5
- Full Text :
- https://doi.org/10.1007/978-3-030-26061-3_56