• Media type: Electronic Conference Proceeding
  • Title: Open source automatic lecture subtitling
  • Contributor: Milde, Benjamin [Author]; Geislinger, Robert [Author]; Lindt, Irina [Author]; Baumann, Timo [Author]
  • Published: Universität Hamburg; Fachbereich Informatik. Fachbereich Informatik, 2021
  • Published in: Proceedings of Elektronische Sprachsignalverarbeitung (ESSV), Berlin, Germany, 2021
  • Language: English
  • Keywords: Data processing ; automatische Spracherkennung ; subtitling ; ASR ; freie Software ; open source ; Untertitelung ; Computer science
  • Origination:
  • Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Description: We present a fully automatic solution for German video subtitling, with a focus on lecture videos. We rely entirely on open source models and scripts for German ASR, automatic punctuation reconstruction and subtitle segmentation. All training scripts, 1000h of German speech training data, pre-trained models and the final subtitling program are publicly available. It can readily be integrated into lecture video platforms such as Lecture2Go. The automatically generated subtitles can also serve as a basis to make the video material more accessible (e.g. via search, keyword clouds, and the like) or for further manual revision, potentially helping in significantly speeding up manual work. A particular challenge that we observe in lectures are technical terms that are frequent in a particular lecture, but infrequent in a typical language model and that might be out of vocabulary for a general purpose ASR. We approach this challenge by extracting texts from accompanying lecture slides to adapt the language model of our TDNN-HMM based ASR system. We demonstrate the usability of the full system and its generated subtitles and evaluate on a dataset of manually transcribed lectures with an average of 26.3% WER.
  • Access State: Open Access