A Better Variant of Self-Critical Sequence Training

About

In this work, we present a simple yet better variant of Self-Critical Sequence Training. We make a simple change to the choice of baseline function in the REINFORCE algorithm. Compared to the greedy decoding baseline, the new baseline brings better performance at no extra cost.
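
The summary above does not spell out the new baseline. As a rough sketch only, assume that K captions are sampled per image and that each sample's baseline is the mean reward of the other K-1 samples (a leave-one-out average). The function names and the reward values below are hypothetical and are not taken from the released code. The sketch contrasts this with the greedy decoding baseline, which needs one extra decoding pass to score the greedy caption.

```python
from typing import List


def greedy_baseline_advantages(sample_rewards: List[float],
                               greedy_reward: float) -> List[float]:
    """Self-critical style advantage: reward of each sampled caption
    minus the reward of the greedy-decoded caption (hypothetical helper)."""
    return [r - greedy_reward for r in sample_rewards]


def leave_one_out_advantages(sample_rewards: List[float]) -> List[float]:
    """Assumed variant: the baseline for sample k is the mean reward of
    the other K-1 samples drawn for the same image (hypothetical helper)."""
    k = len(sample_rewards)
    if k < 2:
        raise ValueError("need at least two samples per image")
    total = sum(sample_rewards)
    # (total - r) / (k - 1) is the mean reward of all samples except this one.
    return [r - (total - r) / (k - 1) for r in sample_rewards]


if __name__ == "__main__":
    rewards = [1.0, 2.0, 3.0]  # made-up per-caption rewards (e.g. CIDEr scores)
    print(greedy_baseline_advantages(rewards, greedy_reward=2.0))
    print(leave_one_out_advantages(rewards))
```

In a REINFORCE update, each advantage would weight the log-probability of its sampled caption. Under this assumption the baseline reuses rewards that are already computed for the samples, which is one way to read the "no extra cost" claim.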

Ruotian Luo • 2020

Related benchmarks

Task              Dataset          Result        Rank
Image Captioning  MS-COCO (test)   CIDEr 127.7   117