Back to Search Start Over

Dataset of Natural Language Queries for E-Commerce

Authors :
Papenmeier, Andrea
Kern, Dagmar
Hienert, Daniel
Sliwa, Alfred
Aker, Ahmet
Fuhr, Norbert
Source :
In CHIIR '21: Proceedings of the 2021 Conference on Human Information Interaction and Retrieval
Publication Year :
2023

Abstract

Shopping online is more and more frequent in our everyday life. For e-commerce search systems, understanding natural language coming through voice assistants, chatbots or from conversational search is an essential ability to understand what the user really wants. However, evaluation datasets with natural and detailed information needs of product-seekers which could be used for research do not exist. Due to privacy issues and competitive consequences, only few datasets with real user search queries from logs are openly available. In this paper, we present a dataset of 3,540 natural language queries in two domains that describe what users want when searching for a laptop or a jacket of their choice. The dataset contains annotations of vague terms and key facts of 1,754 laptop queries. This dataset opens up a range of research opportunities in the fields of natural language processing and (interactive) information retrieval for product search.

Details

Database :
arXiv
Journal :
In CHIIR '21: Proceedings of the 2021 Conference on Human Information Interaction and Retrieval
Publication Type :
Report
Accession number :
edsarx.2302.06355
Document Type :
Working Paper
Full Text :
https://doi.org/10.1145/3406522.3446043