Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
List Function Induction on List Function
Loading...
21.6
Average Execution Score
ItD
10.5032
13.3841
16.265
19.1459
Mar 9, 2024
Average Execution Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Execution Score
ItD
LLM Backbone=Mixtral-8x7B
2024.03
21.6
ItD-IO
LLM Backbone=Mixtral-8x7B
2024.03
20.05
HS&R
LLM Backbone=Mixtral-8x7B
2024.03
19.71
HS
LLM Backbone=Mixtral-8x7B
2024.03
19.5
IO
LLM Backbone=Mixtral-8x7B
2024.03
18.57
SC
LLM Backbone=Mixtral-8x7B
2024.03
10.93
Feedback
Search any
task
Search any
task