Context-Aware Visual Compatibility Prediction
About
How do we determine whether two or more clothing items are compatible or visually appealing? Part of the answer lies in understanding of visual aesthetics, and is biased by personal preferences shaped by social attitudes, time, and place. In this work we propose a method that predicts compatibility between two items based on their visual features, as well as their context. We define context as the products that are known to be compatible with each of these item. Our model is in contrast to other metric learning approaches that rely on pairwise comparisons between item features alone. We address the compatibility prediction problem using a graph neural network that learns to generate product embeddings conditioned on their context. We present results for two prediction tasks (fill in the blank and outfit compatibility) tested on two fashion datasets Polyvore and Fashion-Gen, and on a subset of the Amazon dataset; we achieve state of the art results when using context information and show how test performance improves as more context is used.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Fill-In-The-Blank | Polyvore Original | Accuracy96.9 | 9 | |
| Fill-In-The-Blank | Polyvore Resampled | Accuracy92.7 | 9 | |
| Outfit Compatibility Prediction | Polyvore Original | AUC99 | 9 | |
| Outfit Compatibility Prediction | Polyvore Resampled | AUC0.99 | 9 | |
| Link Prediction (Also bought) | Amazon Men v1 (test) | AUC97.1 | 8 | |
| Fill-In-The-Blank | Fashion-Gen (test) | FITB Accuracy77.1 | 5 | |
| Outfit Compatibility Prediction | Fashion-Gen (test) | Compatibility AUC91 | 5 | |
| Link Prediction (Also bought) | Amazon Women v1 (test) | AUC95.8 | 4 | |
| Link Prediction (Bought together) | Amazon Women v1 (test) | AUC0.948 | 4 |