• Medientyp: E-Artikel
  • Titel: Recent advances in machine translation using comparable corpora
  • Beteiligte: RAPP, REINHARD; SHAROFF, SERGE; ZWEIGENBAUM, PIERRE
  • Erschienen: Cambridge University Press (CUP), 2016
  • Erschienen in: Natural Language Engineering
  • Sprache: Englisch
  • DOI: 10.1017/s1351324916000115
  • ISSN: 1351-3249; 1469-8110
  • Schlagwörter: Artificial Intelligence ; Linguistics and Language ; Language and Linguistics ; Software
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: <jats:title>Abstract</jats:title><jats:p>This paper highlights some of the recent developments in the field of machine translation using comparable corpora. We start by updating previous definitions of comparable corpora and then look at bilingual versions of continuous vector space models. Recently, neural networks have been used to obtain latent context representations with only few dimensions which are often called word embeddings. These promising new techniques cannot only be applied to parallel but also to comparable corpora. Subsequent sections of the paper discuss work specifically targeting at machine translation using comparable corpora, as well as work dealing with the extraction of parallel segments from comparable corpora. Finally, we give an overview on the design and the results of a recent shared task on measuring document comparability across languages.</jats:p>