Comparing Price Indices of Clothing and Footwear for Scanner Data and Web Scraped Data

Fiche du document

Date

2019

Type de document
Périmètre
Langue
Identifiants
Collection

Persée

Organisation

MESR

Licence

Copyright PERSEE 2003-2023. Works reproduced on the PERSEE website are protected by the general rules of the Code of Intellectual Property. For strictly private, scientific or teaching purposes excluding all commercial use, reproduction and communication to the public of this document is permitted on condition that its origin and copyright are clearly mentionned.




Citer ce document

Antonio G. Chessa et al., « Comparing Price Indices of Clothing and Footwear for Scanner Data and Web Scraped Data », Economie et Statistique, ID : 10.24187/ecostat.2019.509.1984


Métriques


Partage / Export

Résumé En

Statistical institutes are considering web scraping of online prices of consumer goods as a feasible alternative to scanner data. The lack of transaction data generates the question whether web scraped data are suited for price index calculation. This article investigates this question by comparing price indices based on web scraped and scanner data for clothing and footwear in the same webshop. Scanner data and web scraped prices are often equal, with the latter being slightly higher on average. Numbers of web scraped product prices and products sold show remarkably high correlations. Given the high churn rates of clothing products, a multilateral method (Geary-Khamis) was used to calculate price indices. For 16 product categories, the indices show small overall differences between the two data sources, with year on year indices differing only by 0.3 percentage point at COICOP level (men’s and women's clothing). It remains to be investigated whether such promising results for web scraped data will also be found for other retailers.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en