• Media type: E-article; Other publication; Electronic conference proceedings
  • Titel: Validation Methodology for Expert-Annotated Datasets: Event Annotation Case Study
  • Contributors: Inel, Oana [author]; Aroyo, Lora [author]
  • Published: Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019
  • Language: English
  • DOI: https://doi.org/10.4230/OASIcs.LDK.2019.12
  • Keywords: Human-in-the-Loop; Time Extraction; Crowdsourcing; Event Extraction
  • Notes: This data source also contains holdings records that do not lead to a full text.
  • Description: Event detection remains a difficult task due to the complexity and ambiguity of such entities. On the one hand, we observe low inter-annotator agreement among experts when annotating events, despite the multitude of existing annotation guidelines and their numerous revisions. On the other hand, event extraction systems achieve lower F1-scores than systems for other entity types such as people or locations. In this paper we study the consistency and completeness of expert-annotated datasets for events and time expressions. We propose a data-agnostic methodology for validating such datasets in terms of consistency and completeness. Furthermore, we combine the power of crowds and machines to correct and extend expert-annotated event datasets. We show the benefit of using crowd-annotated events to train and evaluate a state-of-the-art event extraction system. Our results show that crowd-annotated events increase the system's performance by at least 5.3%. (A minimal illustrative sketch of the two metrics mentioned here follows this record.)
  • Access status: Open access
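The abstract appeals to two standard quantities: inter-annotator agreement among experts and the F1-score of an event extraction system. The following is a minimal, self-contained sketch of both, not taken from the paper: Cohen's kappa over hypothetical token-level event labels from two annotators, and exact-match span F1 against a hypothetical gold standard. All labels, spans, and names below are invented for illustration.

```python
# Illustrative sketch only: Cohen's kappa for inter-annotator agreement and
# exact-match span F1, the two metrics the abstract refers to. The data is
# hypothetical; the paper's actual annotation scheme and scorer may differ.
from collections import Counter

def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa between two annotators' label sequences."""
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, estimated from each annotator's label marginals.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[lab] * freq_b[lab] for lab in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)

def span_f1(predicted, gold):
    """F1 over exact-match event spans, given as (start, end) offsets."""
    tp = len(set(predicted) & set(gold))
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical token-level labels (EVENT vs. O) from two expert annotators.
annotator_1 = ["O", "EVENT", "O", "EVENT", "O", "O"]
annotator_2 = ["O", "EVENT", "O", "O", "O", "EVENT"]
print(f"kappa = {cohen_kappa(annotator_1, annotator_2):.2f}")  # kappa = 0.25

# Hypothetical system-predicted vs. gold event spans.
system_spans = [(1, 2), (3, 4)]
gold_spans = [(1, 2), (5, 6)]
print(f"F1 = {span_f1(system_spans, gold_spans):.2f}")  # F1 = 0.50
```

In this toy example the two annotators agree on only one of three marked events, yielding a low kappa of 0.25, which mirrors the low expert agreement the abstract reports; the reported "at least 5.3%" improvement would be measured as an increase in a span F1-score of this kind.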