SOTA Model Ranking on TruthfulQA LLM-Judge (test) and PapersWithCode

Kendall's Tau: 0.49

Adaptive Multi-Model Ranking

Updated 5mo ago

Evaluation Results

Method	Links	Kendall's Tau
Adaptive Multi-Model Ranking 2026.01		0.49	93	2.9
Baseline 2026.01		0.4	-	-
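
The headline metric above is Kendall's Tau, which measures how well one ordering agrees with another by counting concordant and discordant pairs. The following is a minimal sketch of the tau-a variant (no tie correction) in plain Python. The function name `kendall_tau_a` and the example scores are hypothetical and are not taken from the benchmark; the leaderboard may use a different variant (for example tau-b) or a library implementation.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a between two equal-length score lists.

    tau-a = (concordant - discordant) / (n * (n - 1) / 2)
    Tied pairs count as neither concordant nor discordant.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    n = len(x)
    if n < 2:
        raise ValueError("need at least two items to compare")

    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        # The sign of the product tells whether both lists order the pair the same way.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1

    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical example: judge scores for five models vs. a reference ranking score.
predicted = [0.9, 0.7, 0.8, 0.4, 0.1]
reference = [5, 4, 3, 2, 1]
tau = kendall_tau_a(predicted, reference)
print(f"Kendall's tau-a: {tau:.2f}")
```

In this made-up example only one of the ten pairs is ordered differently, so the two rankings agree closely. A value of 1 means identical orderings, -1 means fully reversed orderings, and values near 0 mean little agreement.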