Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Few-shot example selection on Task #1
Loading...
67
Score
TRIPLE-SAR
62.84
63.92
65
66.08
Feb 15, 2024
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
TRIPLE-SAR
LLM=GPT-3.5, candidate...
2024.02
67
TRIPLE-CSAR
LLM=GPT-3.5, candidate...
2024.02
66
Random
LLM=GPT-3.5, candidate...
2024.02
65
Uniform
LLM=GPT-3.5, candidate...
2024.02
63
Feedback
Search any
task
Search any
task