Ground Truth Alignment and Conversationality on Curated Agricultural Dataset (test)

58.7 Recall
GPT-4o FT
Feb 6, 2026
Updated 1mo ago

Evaluation Results

Date       Recall
2026.02    58.7    54.9    56.7    90.9    -
2026.02    51.1    52.2    51.7    88.8    4.58
2026.02    50.3    53.3    51.8    87.5    -
2026.02    43.8    66.2    52.7    91.6    4.62
2026.02    42.1    56.2    48.2    92.6    -
2026.02    35.7    38.4    37      93.2    -
2026.02    32.2    49.6    39.1    90.1    4.21
2026.02    28.5    64.9    39.6    95.8    3.95
2026.02    26.6    62.2    37.3    96.9    4.01
2026.02    26.2    64.3    37.2    94.4    3.98
2026.02    24.1    58.9    34.2    90      -
2026.02    19.2    31.3    23.8    55.9    -
2026.02    18.7    27.1    22.1    50.2    -
2026.02    15.1    84.3    25.6    91.8    3.89
2026.02    13.4    94.2    23.5    98.3    3.72
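
Only the Recall label is attached to the figures on this page; the labels for the other columns are not shown. As a rough consistency check, the sketch below tests whether the third figure in each row matches the harmonic mean of the first two. That is what an F1 score would be if the second figure were precision. This reading is an assumption, not something the page states, and the names `ROWS`, `harmonic_mean`, and `max_deviation` are illustrative only.

```python
# First three figures of each results row, copied from the table.
# Assumption (not stated on the page): column 1 is recall, column 2 may be
# precision, and column 3 may be F1.
ROWS = [
    (58.7, 54.9, 56.7),
    (51.1, 52.2, 51.7),
    (50.3, 53.3, 51.8),
    (43.8, 66.2, 52.7),
    (42.1, 56.2, 48.2),
    (35.7, 38.4, 37.0),
    (32.2, 49.6, 39.1),
    (28.5, 64.9, 39.6),
    (26.6, 62.2, 37.3),
    (26.2, 64.3, 37.2),
    (24.1, 58.9, 34.2),
    (19.2, 31.3, 23.8),
    (18.7, 27.1, 22.1),
    (15.1, 84.3, 25.6),
    (13.4, 94.2, 23.5),
]


def harmonic_mean(a: float, b: float) -> float:
    """Harmonic mean of two positive numbers (the F1 formula for P and R)."""
    return 2 * a * b / (a + b)


def max_deviation(rows) -> float:
    """Largest absolute gap between column 3 and the harmonic mean of columns 1 and 2."""
    return max(abs(harmonic_mean(a, b) - c) for a, b, c in rows)


if __name__ == "__main__":
    print(f"largest gap: {max_deviation(ROWS):.3f}")
```

Because the published figures are rounded to one decimal place, a gap under 0.1 in every row is about as close as such a check can get; it supports the reading but does not confirm the column names.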