• Media type: E-Article
  • Title: Mining Social Science Publications for Survey Variables
  • Contributor: Zielinski, Andrea [Author]; Mutschke, Peter [Author]
  • Corporation:
  • Published: 2017
  • Published in: Proceedings of the Second Workshop on NLP and Computational Social Science
  • Language: English
  • Identifier:
  • Keywords: Datengewinnung ; künstliche Intelligenz ; Begriff ; Algorithmus ; Computerlinguistik ; Befragung ; Publikation ; Sozialwissenschaft ; Fachliteratur ; Indikatorenbildung ; Zeitschrift ; OpenMinTed
  • Origination:
  • Footnote: Postprint
    begutachtet (peer reviewed)
    In: Proceedings of the Second Workshop on NLP and Computational Social Science. 2017. S. 47-52
  • Description: Research in Social Science is usually based on survey data where individual research questions relate to observable concepts (variables). However, due to a lack of standards for data citations a reliable identification of the variables used is often difficult. In this paper, we present a work-in-progress study that seeks to provide a solution to the variable detection task based on supervised machine learning algorithms, using a linguistic analysis pipeline to extract a rich feature set, including terminological concepts and similarity metric scores. Further, we present preliminary results on a small dataset that has been specifically designed for this task, yielding modest improvements over the baseline.
  • Access State: Open Access
  • Rights information: Attribution - Non Commercial - Share Alike (CC BY-NC-SA)