• Media type: Text; E-Article; Electronic Conference Proceeding
  • Title: Incremental Unsupervised Training for University Lecture Recognition
  • Contributor: Heck, Michael [Author]; Stüker, Sebastian [Author]; Sakti, Sakriani [Author]; Waibel, Alex [Author]; Nakamura, Satoshi [Author]
  • Published: Association for Computational Linguistics, 2024-01-03
  • Language: English
  • DOI: https://doi.org/10.5445/IR/1000166325
  • Keywords: DATA processing & computer science
  • Origination:
  • Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Description: In this paper we describe our work on unsupervised adaptation of the acoustic model of our simultaneous lecture translation system. We trained a speaker independent acoustic model, with which we produce automatic transcriptions of new lectures in order to improve the system for a specific lecturer. We compare our results against a model that was trained in a supervised way on an exact manual transcription. We examine four different ways of processing the decoder outputs of the automatic transcription with respect to the treatment of pronunciation variants and noise words. We will show that, instead of fixating the latter informations in the transcriptions, it is of advantage to let the Viterbi algorithm during training decide which pronunciations to use and where to insert which noise words. Further, we utilize word level posterior probabilities obtained during decoding by weighting and thresholding the words of a transcription.
  • Access State: Open Access