Product return is a common phenomenon in the online retailing industry and entails several inconveniences for both the seller, who incurs in high costs for restocking the returned goods, and the customer, who has to deal with product re-shipping. In this paper, we outline a data-driven approach, based on Natural Language Process- ing, in which a broad corpus of customer reviews of an online retailer is exploited with the aim of shaping the main causes of product returns. In particular, a variety of topic modeling techniques represented both by classic methods, given by LDA and variants, and more recent algorithms, i.e., BERTopic, were applied to identify the main return reasons across multiple product categories, and their outcomes were compared to select the best approach. The category-dependent sets of return causes inferred through topic modeling largely enrich the product-agnostic list of return reasons currently used on the e-commerce platform, and provide valuable infor- mation to the retailer who can devise ad-hoc strategies to mitigate the returns and, hence, the costs of the related logistic network.

Shaping the causes of product returns: topic modeling on online customer reviews / A. Mor, C. Orsenigo, M. Soto Gomez, C. Vercellis. - In: ELECTRONIC COMMERCE RESEARCH. - ISSN 1389-5753. - (2024), pp. 1-35. [10.1007/s10660-024-09901-x]

Shaping the causes of product returns: topic modeling on online customer reviews

M. Soto Gomez
Co-primo
;
2024

Abstract

Product return is a common phenomenon in the online retailing industry and entails several inconveniences for both the seller, who incurs in high costs for restocking the returned goods, and the customer, who has to deal with product re-shipping. In this paper, we outline a data-driven approach, based on Natural Language Process- ing, in which a broad corpus of customer reviews of an online retailer is exploited with the aim of shaping the main causes of product returns. In particular, a variety of topic modeling techniques represented both by classic methods, given by LDA and variants, and more recent algorithms, i.e., BERTopic, were applied to identify the main return reasons across multiple product categories, and their outcomes were compared to select the best approach. The category-dependent sets of return causes inferred through topic modeling largely enrich the product-agnostic list of return reasons currently used on the e-commerce platform, and provide valuable infor- mation to the retailer who can devise ad-hoc strategies to mitigate the returns and, hence, the costs of the related logistic network.
Natural language processing; Topic modeling; Latent Dirichlet allocation; Product return; Customer reviews
Settore INFO-01/A - Informatica
2024
22-set-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
s10660-024-09901-x.pdf

accesso aperto

Descrizione: Article
Tipologia: Publisher's version/PDF
Dimensione 3 MB
Formato Adobe PDF
3 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1101026
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact