Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pairwise Discrimination on Management and Economics Research Pitch Pairs shared pairwise subset (test)

78.67Distance 1 Accuracy

SFT GPT-4.1

68.2770.9773.6776.37Mar 17, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
78.67899284.33
2026.03
69.33859478.67
2026.03
69.33799076
2026.03
68.67868677.33