Few-shot example selection on Task #11
Top score: 0.87 (TRIPLE-CSAR)
[Chart: Score by date; date shown: Feb 15, 2024]
Updated 4d ago
Evaluation Results
Method       LLM      Date     Score
TRIPLE-CSAR  GPT-3.5  2024.02  0.87
Uniform      GPT-3.5  2024.02  0.82
TRIPLE-SAR   GPT-3.5  2024.02  0.68
Random       GPT-3.5  2024.02  0.41