• Medientyp: E-Artikel
  • Titel: Sentence-CROBI: A Simple Cross-Bi-Encoder-Based Neural Network Architecture for Paraphrase Identification
  • Beteiligte: Ortiz-Barajas, Jesus-German; Bel-Enguix, Gemma; Gómez-Adorno, Helena
  • Erschienen: MDPI AG, 2022
  • Erschienen in: Mathematics
  • Sprache: Englisch
  • DOI: 10.3390/math10193578
  • ISSN: 2227-7390
  • Schlagwörter: General Mathematics ; Engineering (miscellaneous) ; Computer Science (miscellaneous)
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: <jats:p>Since the rise of Transformer networks and large language models, cross-encoders have become the dominant architecture for various Natural Language Processing tasks. When dealing with sentence pairs, they can exploit the relationships between those pairs. On the other hand, bi-encoders can obtain a vector given a single sentence and are used in tasks such as textual similarity or information retrieval due to their low computational cost; however, their performance is inferior to that of cross-encoders. In this paper, we present Sentence-CROBI, an architecture that combines cross-encoders and bi-encoders to obtain a global representation of sentence pairs. We evaluated the proposed architecture in the paraphrase identification task using the Microsoft Research Paraphrase Corpus, the Quora Question Pairs dataset, and the PAWS-Wiki dataset. Our model obtains competitive results compared with the state-of-the-art by using model ensembles and a simple model configuration. These results demonstrate that a simple architecture that combines sentence pair and single-sentence representations without using complex pre-training or fine-tuning algorithms is a viable alternative for sentence pair tasks.</jats:p>
  • Zugangsstatus: Freier Zugang