TEMPLAMA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Knowledge Probing | TempLAMA Unchanged 22/19 | EM 19.5 | 14 |
| Knowledge Probing | TempLAMA Changed | Exact Match 20.1 | 14 |
| Knowledge Probing | TempLAMA Average 22/19 | EM 18.5 | 9 |
| Fact Retrieval | TEMPLAMA | Accuracy 16.3 | 6 |
| Knowledge Probing | TempLAMA Average | EM - | 0 |
| Knowledge Probing | TempLAMA Unchanged | Exact Match - | 0 |