• Media type: E-Article
  • Title: Matrix-Based Method for Inferring Elements in Data Attributes Using a Vector Space Model
  • Contributor: Hayashi, Teruaki; Ohsawa, Yukio
  • Published: MDPI AG, 2019
  • Published in: Information, 10 (2019) 3, Seite 107
  • Language: English
  • DOI: 10.3390/info10030107
  • ISSN: 2078-2489
  • Origination:
  • Footnote:
  • Description: This article addresses the task of inferring elements in the attributes of data. Extracting data related to our interests is a challenging task. Although data on the web can be accessed through free text queries, it is difficult to obtain results that accurately correspond to user intentions because users might not express their objects of interest using exact terms (variables, outlines of data, etc.) found in the data. In other words, users do not always have sufficient knowledge of the data to formulate an effective query. Hence, we propose a method that enables the type, format, and variable elements to be inferred as attributes of data when a natural language summary of the data is provided as a free text query. To evaluate the proposed method, we used the Data Jacket’s datasets whose metadata is written in natural language. The experimental results indicate that our method outperforms those obtained from string matching and word embedding. Applications based on this study can support users who wish to retrieve or acquire new data.
  • Access State: Open Access