
Question Answering Utility Evaluation on CAPID (test)

GPT-4 Score: 79

Llama-3.1-8B (FT)

[Chart: GPT-4 Score over time; Feb 10, 2026]
Updated 1mo ago

Evaluation Results

Method    Links
2026.02
7973
2026.02
5143