Back to Search
Start Over
QUEACO
- Source :
- CIKM
- Publication Year :
- 2021
- Publisher :
- ACM, 2021.
-
Abstract
- We study the problem of query attribute value extraction, which aims to identify named entities from user queries as diverse surface form attribute values and afterward transform them into formally canonical forms. Such a problem consists of two phases: {named entity recognition (NER)} and {attribute value normalization (AVN)}. However, existing works only focus on the NER phase but neglect equally important AVN. To bridge this gap, this paper proposes a unified query attribute value extraction system in e-commerce search named QUEACO, which involves both two phases. Moreover, by leveraging large-scale weakly-labeled behavior data, we further improve the extraction performance with less supervision cost. Specifically, for the NER phase, QUEACO adopts a novel teacher-student network, where a teacher network that is trained on the strongly-labeled data generates pseudo-labels to refine the weakly-labeled data for training a student network. Meanwhile, the teacher network can be dynamically adapted by the feedback of the student's performance on strongly-labeled data to maximally denoise the noisy supervisions from the weak labels. For the AVN phase, we also leverage the weakly-labeled query-to-attribute behavior data to normalize surface form attribute values from queries into canonical forms from products. Extensive experiments on a real-world large-scale E-commerce dataset demonstrate the effectiveness of QUEACO.<br />Comment: The 30th ACM International Conference on Information and Knowledge Management (CIKM 2021, Applied Research Track)
- Subjects :
- FOS: Computer and information sciences
Normalization (statistics)
Computer Science - Machine Learning
Focus (computing)
Computer Science - Computation and Language
Meta learning (computer science)
Computer Science - Artificial Intelligence
Computer science
Value (computer science)
computer.software_genre
Bridge (interpersonal)
Computer Science - Information Retrieval
Machine Learning (cs.LG)
Artificial Intelligence (cs.AI)
Named-entity recognition
Leverage (statistics)
Canonical form
Data mining
Computation and Language (cs.CL)
computer
Information Retrieval (cs.IR)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 30th ACM International Conference on Information & Knowledge Management
- Accession number :
- edsair.doi.dedup.....3b18db111ea11cfee315d87badd134e3
- Full Text :
- https://doi.org/10.1145/3459637.3481946