Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering Sufficiency Prediction on CouldAsk Benchmark

0.7878BBC Score

Identify-then-Verify

0.6329440.6731470.713350.753553Dec 6, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
0.78780.68480.69650.78460.83330.695
2025.12
0.66910.56360.58150.77030.81820.821
2025.12
0.63890.61810.59490.67930.80650.8287