| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Image Question Answering | DAQUAR REDUCED (test) | Accuracy 60.3 | 33 |
| Visual Question Answering | DAQUAR-ALL full (test) | Accuracy 50.2 | 22 |
| Visual Question Answering | DAQUAR single-word answers portion | Accuracy 60.27 | 11 |
| Visual Question Answering | DAQUAR (reduced) | Accuracy 40.07 | 8 |
| Visual Question Answering | DAQUAR reduced Single answer | Accuracy 44.48 | 6 |
| Visual Question Answering | DAQUAR all Multiple answers | Accuracy 50.2 | 5 |
| Visual Question Answering | DAQUAR reduced Multiple answers | Accuracy 44.44 | 4 |
| Visual Question Answering | DAQUAR all Single answer | Accuracy 28.98 | 3 |
| Question Generation | DAQUAR (test) | CIDEr 0.512 | 2 |
| Self-talk Generation | DAQUAR | - | 0 |