Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Model Size Reduction Using Frequency Based Double Hashing for Recommender Systems

About

Deep Neural Networks (DNNs) with sparse input features have been widely used in recommender systems in industry. These models have large memory requirements and need a huge amount of training data. The large model size usually entails a cost, in the range of millions of dollars, for storage and communication with the inference services. In this paper, we propose a hybrid hashing method to combine frequency hashing and double hashing techniques for model size reduction, without compromising performance. We evaluate the proposed models on two product surfaces. In both cases, experiment results demonstrated that we can reduce the model size by around 90 % while keeping the performance on par with the original baselines.

Caojin Zhang, Yicun Liu, Yuanpu Xie, Sofia Ira Ktena, Alykhan Tejani, Akshay Gupta, Pranay Kumar Myana, Deepak Dilipkumar, Suvadip Paul, Ikuhiro Ihara, Prasang Upadhyaya, Ferenc Huszar, Wenzhe Shi• 2020

Related benchmarks

TaskDatasetResultRank
RecommendationYelp 2018
Recall@206.09
73
RecommendationGowalla
Recall @ 2010.182
35
RecommendationYelp 2018
Recall@103.856
20
RecommendationGowalla
Recall@106.901
20
RecommendationBeauty
Recall@103.028
20
RecommendationBeauty
Recall@204.564
20
RecommendationAmazonBook
Recall@101.726
19
RecommendationAmazonBook
Recall@202.8
19
Showing 8 of 8 rows

Other info

Follow for update