Mémoires partagées d’alignements sous-phrastiques bilingues

Medientyp: Sonstige Veröffentlichung; Elektronische Hochschulschrift; E-Book

Titel: Mémoires partagées d’alignements sous-phrastiques bilingues

Beteiligte: Segura, Johan [Verfasser:in]

Erschienen: theses.fr, 2012-11-16

Sprache: Französisch

Schlagwörter: Golden corpus ; Subsentential alignment ; Machine translation ; Alignement sous-phrastique ; Traduction automatique ; Mémoire d'exemples ; Examples memory

Entstehung:

Anmerkungen: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.

Beschreibung: Cette thèse s'inscrit dans le cadre du traitement automatique du langage naturel, et traite plus précisément de l'alignement sous-phrastique bilingue classiquement lié à la traduction automatique statistique. Les travaux exposés s'en distinguent en proposant une mécanique évolutive à base d'exemples initiée par des annotateurs non-experts via une interface adaptée. L'approche est principalement motivée par la recherche d'une expressivité comparable à celle observée dans les alignements manuels. Une partie importante de ce travail consiste à définir un cadre formel sous-tendant une architecture originale à base d'exemples alignés. Plusieurs mémoires d'alignements ont été constituées en tirant parti d'informations provenant d'analyseurs syntaxiques automatiques en plaçant les prérequis technologiques à un niveau raisonnablement peu élevé. Deux nouvelles méthodes d'alignement sont comparées à des références connues via des mesures d'accord classiques et trois distances transformationnelles sont introduites. ; This research belongs to the Natural Language Processing (NLP) field and more specifically focuses on topic Sub-sentential Alignment which is closely related to Machine Translation. The originality of this work consists in an example-based approach bootstrapped by the participation of non-expert annotators through an appropriate interface. Seeking for a greater expressivity, such as observed in manual alignments, mainly motivates the whole approach. An important effort has been made to define a formal environment for this original architecture based on aligned examples. Several memories have been created using syntactic informations from parsers' outputs with reasonnable low-tech requirements. A couple of new alignment methods were compared with state-of-the-art measures and three transformational metrics were introduced.

Zugangsstatus: Freier Zugang

Nur in Feld suchen:

Zuletzt gesuchte Begriffe: