• Medientyp: E-Artikel
  • Titel: Comparing Price Indices of Clothing and Footwear for Scanner Data and Web Scraped Data
  • Beteiligte: Chessa, Antonio G. [Verfasser:in]; Griffioen, Robert [Verfasser:in]
  • Erschienen in: Economie et Statistique / Economics and Statistics ; Vol. 509, n° 1, pp. 49-68
  • Sprache: Englisch
  • DOI: 10.24187/ecostat.2019.509.1984
  • Identifikator:
  • Schlagwörter: CPI ; scanner data ; web scraping ; multilateral methods ; Geary-Khamis method ; JEL Classification C43 - E31 ; article
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: Statistical institutes are considering web scraping of online prices of consumer goods as a feasible alternative to scanner data. The lack of transaction data generates the question whether web scraped data are suited for price index calculation. This article investigates this question by comparing price indices based on web scraped and scanner data for clothing and footwear in the same webshop. Scanner data and web scraped prices are often equal, with the latter being slightly higher on average. Numbers of web scraped product prices and products sold show remarkably high correlations. Given the high churn rates of clothing products, a multilateral method (Geary-Khamis) was used to calculate price indices. For 16 product categories, the indices show small overall differences between the two data sources, with year on year indices differing only by 0.3 percentage point at COICOP level (men’s and women's clothing). It remains to be investigated whether such promising results for web scraped data will also be found for other retailers.
  • Zugangsstatus: Freier Zugang
  • Rechte-/Nutzungshinweise: Namensnennung - Nicht-kommerziell - Keine Bearbeitung (CC BY-NC-ND)