• Media type: E-Article
  • Title: Grammar-based fuzzing of data integration parsers in computational materials science
  • Contributor: Müller, Sebastian [Author]; Sparka, Jan Arne [Author]; Kuban, Martin [Author]; Draxl, Claudia [Author]; Grunske, Lars [Author]
  • imprint: Humboldt-Universität zu Berlin, 2023-09-19
  • Language: English
  • DOI: https://doi.org/10.18452/28316; https://doi.org/10.1002/spe.3266
  • ISSN: 1097-024X
  • Keywords: program under test ; computational materials science ; novel materials discovery ; density of states ; density functional theory
  • Origination:
  • Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Description: Context Computational materials science (CMS) focuses on in silico experiments to compute the properties of known and novel materials, where many software packages are used in the community. The NOMAD Laboratory (Draxl C, Scheffler) offers to store the input and output files in its FAIR data repository. Since the file formats of these software packages are non-standardized, parsers are used to provide the results in a normalized format. Objective The main goal of this article is to report experience and findings of using grammar-based fuzzing on these parsers. Method We have constructed an input grammar for four common software packages in the CMS domain and performed an experimental evaluation on the capabilities of grammar-based fuzzing to detect failures in the Novel Materials Discovery (NOMAD) parsers. Results With our approach, we were able to identify three unique critical bugs concerning service availability, as well as several additional syntactic, semantic, logical, and downstream bugs in the investigated NOMAD parsers. We reported all issues to the developer team prior to publication. Conclusion Based on the experience gained, we can recommend grammar-based fuzzing also for other research software packages to improve the trust level in the correctness of the produced results. ; Peer Reviewed
  • Access State: Open Access
  • Rights information: Attribution (CC BY)