Incremental Unsupervised Training for University Lecture Recognition

Media type: Text; E-Article; Electronic Conference Proceeding

Title: Incremental Unsupervised Training for University Lecture Recognition

Contributor: Heck, Michael [Author]; Stüker, Sebastian [Author]; Sakti, Sakriani [Author]; Waibel, Alex [Author]; Nakamura, Satoshi [Author]

Published: Association for Computational Linguistics, 2024-01-03

Language: English

DOI: https://doi.org/10.5445/IR/1000166325

Keywords: DATA processing & computer science

Origination:

Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.

Description: In this paper we describe our work on unsupervised adaptation of the acoustic model of our simultaneous lecture translation system. We trained a speaker independent acoustic model, with which we produce automatic transcriptions of new lectures in order to improve the system for a specific lecturer. We compare our results against a model that was trained in a supervised way on an exact manual transcription. We examine four different ways of processing the decoder outputs of the automatic transcription with respect to the treatment of pronunciation variants and noise words. We will show that, instead of fixating the latter informations in the transcriptions, it is of advantage to let the Viterbi algorithm during training decide which pronunciations to use and where to insert which noise words. Further, we utilize word level posterior probabilities obtained during decoding by weighting and thresholding the words of a transcription.

Access State: Open Access

Search in field:

Recently searched for: