Share your thoughts, 1 month free Claude Pro on usSee more

Context Adherence on Representative guardrail dataset

96F1-Score

ChainPoll

Updated 5mo ago

Evaluation Results

Method	Links
ChainPoll 2026.02		96
Luna-2 2026.02		95
Single token 2026.02		43