Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MAVE: A Product Dataset for Multi-source Attribute Value Extraction

About

Attribute value extraction refers to the task of identifying values of an attribute of interest from product information. Product attribute values are essential in many e-commerce scenarios, such as customer service robots, product ranking, retrieval and recommendations. While in the real world, the attribute values of a product are usually incomplete and vary over time, which greatly hinders the practical applications. In this paper, we introduce MAVE, a new dataset to better facilitate research on product attribute value extraction. MAVE is composed of a curated set of 2.2 million products from Amazon pages, with 3 million attribute-value annotations across 1257 unique categories. MAVE has four main and unique advantages: First, MAVE is the largest product attribute value extraction dataset by the number of attribute-value examples. Second, MAVE includes multi-source representations from the product, which captures the full product information with high attribute coverage. Third, MAVE represents a more diverse set of attributes and values relative to what previous datasets cover. Lastly, MAVE provides a very challenging zero-shot test set, as we empirically illustrate in the experiments. We further propose a novel approach that effectively extracts the attribute value from the multi-source product information. We conduct extensive experiments with several baselines and show that MAVE is an effective dataset for attribute value extraction task. It is also a very challenging task on zero-shot attribute extraction. Data is available at {\it \url{https://github.com/google-research-datasets/MAVE}}.

Li Yang, Qifan Wang, Zac Yu, Anand Kulkarni, Sumit Sanghai, Bin Shu, Jon Elsas, Bhargav Kanagal• 2021

Related benchmarks

TaskDatasetResultRank
Attribute Value ExtractionMAVE 1.0 (test)
Precision100
33
Attribute ExtractionWDC-PAVE (test)
F1 Score65.1
15
Attribute Value ExtractionMAVE zero-shot 1.0 (test)
Precision85.56
10
Attribute Value ExtractionOA-Mine All attribute-value pairs (test)
F1 Score65.7
8
Attribute Value ExtractionAE-110K All attribute-value pairs (test)
F1 Score76.8
8
Attribute Value ExtractionOA-Mine Unseen attribute-value pairs (test)
F1 Score42.6
8
Attribute Value ExtractionAE-110K Unseen attribute-value pairs (test)
F1 Score24.9
8
Showing 7 of 7 rows

Other info

Code

Follow for update