Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LLM Re-ranking on AlpacaEval (GPT4-1106-Preview Baseline)

32.44Win Rate (LC)

TRLM-Ba

23.943226.149128.35530.5609Dec 3, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.12
32.4424.3524.041.271926103
2024.12
31.1822.7221.991.241766272
2024.12
30.5522.8522.481.251806232
2024.12
29.1922.6821.31.241706323
2024.12
27.0517.6617.141.151366654
24.3818.1817.081.161356655
2024.12
24.2717.1315.781.121266772