You can manage bookmarks using lists, please log in to your user account for this.
Media type:
E-Article
Title:
ON THE USEFULNESS OF HTML META ELEMENTS FOR WEB RETRIEVAL
Contributor:
ARSLAN, Ahmet
Published:
Anadolu Universitesi Bilim ve Teknoloji Dergisi-A: Uygulamali Bilimler ve Muhendislik, 2020
Published in:
Eskişehir Technical University Journal of Science and Technology A - Applied Sciences and Engineering, 21 (2020) 1, Seite 182-198
Language:
Not determined
DOI:
10.18038/estubtda.615103
ISSN:
2667-4211
Origination:
Footnote:
Description:
Web retrieval studies have mostly used URL, title, body, and anchor text fields to represent Web documents. On the other hand, HTML standards provide a rich set of elements to define different parts of a Web page. For example, meta elements are used to provide structured metadata about a Web page not to end users, but instead to browsers or crawlers. However, it is unclear whether meta tags are or are not useful for Web retrieval, as most of the previous studies leveraged URL, title, body, and anchor text fields. In this work, we examine the usefulness of two meta tags, namely keywords and description, based on ad-hoc tasks of previous TREC studies. Through experiments on the standard TREC Web datasets and several query sets, our results using the state-of-the-art term-weighting models show that the utilization of description field systematically increases the retrieval effectiveness, to a statistically significant degree most of the time. By contrast, the employment of keywords field may cause a significant deterioration in retrieval effectiveness for certain term-weighting models.