Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LLM Workflow Optimization on Big-Bench Hard (test)

78.6BBH Overall Accuracy

Trace

40.1250.1160.170.09Jun 23, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
78.675.880.6
2024.06
71.673.970
2024.06
70.473.768
2024.06
59.570.951.1
2024.06
55.36945.2
2024.06
41.653.832.6