• Medientyp: E-Artikel
  • Titel: Dataset bias: A case study for visual question answering
  • Beteiligte: Das, Anubrata; Anjum, Samreen; Gurari, Danna
  • Erschienen: Wiley, 2019
  • Erschienen in: Proceedings of the Association for Information Science and Technology, 56 (2019) 1, Seite 58-67
  • Sprache: Englisch
  • DOI: 10.1002/pra2.7
  • ISSN: 2373-9231
  • Schlagwörter: Library and Information Sciences ; General Computer Science
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: ABSTRACTWe examine the issue of bias in datasets designed to train visual question answering (VQA) algorithms. These datasets include a collection of natural language questions about images (aka ‐ visual questions). We consider three popular datasets that are captured by people with sight, people who are blind, and generated by computers. We first demonstrate that machine learning algorithms can be trained to recognize each dataset's bias, and so determine the source of a novel visual question. We then discuss potential risks and benefits of biased VQA datasets and corresponding machine learning algorithms that can identify the source of a visual question; e.g., whether it comes from a person with sight, a person who is blind, or bot (aka ‐ computer). Our ultimate aim is to inspire the development of more inclusive VQA systems.