A High-Dimensional Choice Model for Online Retailing

  • Zhaohui Jiang
  • , Jun Li
  • , Dennis Zhang

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Online retailers are facing an increasing variety of product choices and diversified consumer decision journeys. To improve many operations decisions for online retailers, such as demand forecasting and inventory management and pricing, an important first step is to obtain an accurate estimate of the substitution patterns among a large number of products offered in the complex online environment. Classic choice models either do not account for these substitution patterns beyond what is reflected through observed product features or do so in a simplified way by making a priori assumptions. These shortcomings become particularly restrictive when the underlying substitution patterns get complex as the number of options increases. We provide a solution by developing a high-dimensional choice model that allows for flexible substitution patterns and easily scales up. We leverage consumer clickstream data and combine econometric and machine learning (graphical lasso, in particular) methods to learn the substitution patterns among a large number of products. We show our method offers more accurate demand forecasts in a wide range of synthetic scenarios when compared with classical models (e.g., the independent and identically distributed Probit model), reducing out-of-sample mean absolute percentage error by 10%–30%. Such performance improvement is further supported by observations from a real-world empirical setting. More importantly, our method excels in precisely recovering substitution patterns across products. Compared with benchmark models, it reduces the percentage deviation from the underlying elasticity matrix by approximately half. This precision serves as a critical input for enhancing business decisions such as assortment planning, inventory management, and pricing strategies.

    Original languageEnglish
    Pages (from-to)3320-3339
    Number of pages20
    JournalManagement Science
    Volume71
    Issue number4
    DOIs
    StatePublished - Apr 2025

    Keywords

    • choice model
    • clickstream data
    • high-dimensional
    • machine learning
    • retail management

    Fingerprint

    Dive into the research topics of 'A High-Dimensional Choice Model for Online Retailing'. Together they form a unique fingerprint.

    Cite this