Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-form Question Answering on Long-form QA (test)

61.7Win Rate vs. Holistic Reward

ALARM

43.427248.171152.91557.6589Mar 11, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.03
61.775.2---97.6
60.873.6---87.8
59.873.8---98.5
2024.03
59.0649.2353.96-54.08-
2024.03
55.88-53.5550.7753.4-
2024.03
55.8446.45-46.0349.44-
2024.03
55.5851.355.35-54.08-
53.875.2---116.8
2024.03
51.68-52.7648.751.05-
2024.03
50.9457.2751.45-53.22-
2024.03
49.6454.95-48.5551.05-
2024.03
49.1147.24-44.6547-
2024.03
44.13-45.0442.7343.97-
2024.03
-55.8650.3649.0651.76-
2024.03
-48.3150.8944.4247.87-
2024.03
-44.1144.1640.9443.07-