SimpleX: A Simple and Strong Baseline for Collaborative Filtering

About

Collaborative filtering (CF) is a widely studied research topic in recommender systems. The learning of a CF model generally depends on three major components: the interaction encoder, the loss function, and negative sampling. While many existing studies focus on designing more powerful interaction encoders, the impacts of loss functions and negative sampling ratios have not yet been well explored. In this work, we show that the choice of loss function and the negative sampling ratio are equally important. More specifically, we propose the cosine contrastive loss (CCL) and incorporate it into a simple unified CF model, dubbed SimpleX. We conduct extensive experiments on 11 benchmark datasets and compare against 29 existing CF models in total. Surprisingly, the results show that, with our CCL loss and a large negative sampling ratio, SimpleX can surpass most sophisticated state-of-the-art models by a large margin (e.g., up to a 48.5% improvement in NDCG@20 over LightGCN). We believe that SimpleX can not only serve as a simple, strong baseline to foster future research on CF, but also shed light on improving loss functions and negative sampling as a promising research direction. Our source code will be available at https://reczoo.github.io/SimpleX.
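The abstract names the cosine contrastive loss but does not spell it out. As a rough illustration, here is a minimal sketch for a single (user, positive item) pair with a set of sampled negatives. It assumes the commonly cited form of CCL: one minus the cosine similarity of the positive pair, plus a weighted mean of the negatives' cosine similarities clipped below a margin. The function names, the default `margin` and `neg_weight` values, and the plain-list embeddings are illustrative assumptions, not the authors' implementation.

```python
import math


def cosine(a, b):
    """Cosine similarity of two non-zero vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cosine_contrastive_loss(user, pos_item, neg_items, margin=0.5, neg_weight=1.0):
    """Sketch of a CCL-style loss for one user and one positive item.

    margin and neg_weight are hypothetical defaults chosen for illustration.
    """
    if not neg_items:
        raise ValueError("at least one negative sample is required")
    # Positive term: pull the user and the positive item together.
    pos_term = 1.0 - cosine(user, pos_item)
    # Negative term: only negatives more similar than the margin contribute.
    neg_term = sum(max(0.0, cosine(user, neg) - margin) for neg in neg_items)
    neg_term /= len(neg_items)
    return pos_term + neg_weight * neg_term


# Usage: one easy negative (orthogonal) and one hard negative (identical to the user).
loss = cosine_contrastive_loss([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0], [1.0, 0.0]])
print(loss)
```

Under this form, the number of entries in `neg_items` plays the role of the negative sampling ratio the abstract refers to, and the margin filters out negatives that are already dissimilar enough to be uninformative.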

Kelong Mao, Jieming Zhu, Jinpeng Wang, Quanyu Dai, Zhenhua Dong, Xi Xiao, Xiuqiang He • 2021

Related benchmarks

Task                     Dataset             Metric     Result  Rank
Recommendation           Gowalla (test)      Recall@20  0.1872  266
Recommendation           Gowalla             Recall@20  0.1872  153
Recommendation           Amazon-Book (test)  Recall@20  0.0583  152
Recommendation           Yelp 2018 (test)    Recall@20  7.01    110
Recommendation           Amazon-Book         Recall@20  5.83    103
Recommendation           Yelp 2018           Recall@20  18.72   73
Recommendation           Sports              Recall@10  0.1375  62
Recommendation           Beauty              NDCG@5     14.41   48
Collaborative Filtering  Yelp 2018 (test)    Recall@20  7.01    45
Collaborative Filtering  Yelp 2018           NDCG@20    5.75    42
Showing 10 of 40 rows
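
For readers unfamiliar with the metrics in the listing, the following is a minimal sketch of Recall@K and NDCG@K for a single user with binary relevance. It assumes the common definitions (recall normalized by the number of relevant items, and a log2 rank discount for NDCG). Individual benchmarks may normalize differently, and the values listed above appear on both a 0 to 1 scale and a percentage scale. The function names and toy item IDs are illustrative only.

```python
import math


def recall_at_k(ranked_items, relevant_items, k):
    """Fraction of a user's relevant items that appear in the top-k ranking."""
    relevant = set(relevant_items)
    if not relevant:
        return 0.0
    hits = sum(1 for item in ranked_items[:k] if item in relevant)
    return hits / len(relevant)


def ndcg_at_k(ranked_items, relevant_items, k):
    """Binary-relevance NDCG@k: discounted gain divided by the ideal gain."""
    relevant = set(relevant_items)
    if not relevant:
        return 0.0
    # Position 0 gets discount 1 / log2(2) = 1, position 1 gets 1 / log2(3), ...
    dcg = sum(
        1.0 / math.log2(position + 2)
        for position, item in enumerate(ranked_items[:k])
        if item in relevant
    )
    # Ideal ranking places every relevant item (up to k) at the top.
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(position + 2) for position in range(ideal_hits))
    return dcg / idcg


# Usage with hypothetical item IDs: two of three relevant items are in the top 3.
ranking = ["a", "b", "c", "d"]
held_out = {"a", "c", "z"}
print(recall_at_k(ranking, held_out, k=3))
print(ndcg_at_k(ranking, held_out, k=3))
```

A reported figure such as Recall@20 would then be the average of this per-user value over all test users.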
