• Media type: E-Article
  • Title: Data-Augmentation Method for BERT-based Legal Textual Entailment Systems in COLIEE Statute Law Task
  • Contributor: Aoki, Yasuhiro; Yoshioka, Masaharu; Suzuki, Youta
  • Published: Springer Science and Business Media LLC, 2022
  • Published in: The Review of Socionetwork Strategies, 16 (2022) 1, Seite 175-196
  • Language: English
  • DOI: 10.1007/s12626-022-00104-0
  • ISSN: 1867-3236; 2523-3173
  • Origination:
  • Footnote:
  • Description: AbstractA legal textual entailment task is a task to recognize entailment between a law article and its statements. In the Competition on Legal Information Extraction/Entailment (COLIEE), this task is designed as a task to confirm the entailment of a yes/no answer from the given civil code article(s). Based on the development of deep-learning-based natural language processing tools such as bidirectional encoder representations from transformers (BERT), many participants in the task used such tools, and the best performance system of COLIEE 2020 was a BERT-based system. However, because of the limitation of the size of training data provided by the task organizer, training such tools to adapt to the variability of the questions is difficult. In this paper, we propose a data-augmentation method to make training data using civil code articles for understanding the syntactic structure of the questions and articles for entailment. Our BERT-based ensemble system, which uses this augmentation method, achieves the best performance (accuracy = 0.7037) in Task 4 of COLIEE 2021. We also introduce the results of additional experiments to discuss the characteristics of the proposed method.