Flight Recommendation on Flight Recommendation Final 5th Round
Best result: ADAPTFUSE, 76.3 Accuracy
[Chart: Accuracy over time; data point dated Apr 5, 2026]
Updated 1mo ago
Evaluation Results
Method              Details                     Date      Accuracy
ADAPTFUSE           Base Model=Llama 3 8B       2026.04   76.3
ADAPTFUSE           Base Model=Gemma 2 9B       2026.04   76.2
Bayesian Teaching   Base Model=Llama 3 8B       2026.04   74.3
Bayesian Teaching   Base Model=Gemma 2 9B       2026.04   73.4
ADAPTFUSE           Base Model=Qwen 2.5 7B      2026.04   67.3
Bayesian Teaching   Base Model=Qwen 2.5 7B      2026.04   65.2
Oracle Learning     Base Model=Gemma 2 9B       2026.04   59.1
Oracle Learning     Base Model=Llama 3 8B       2026.04   58.3
Oracle Learning     Base Model=Qwen 2.5 7B      2026.04   51.7
Self-consistency    Base Model=Gemma 2 9B,...   2026.04   44.7
Self-consistency    Base Model=Llama 3 8B,...   2026.04   43.8
Self-consistency    Base Model=Qwen 2.5 7B...   2026.04   41.5
CoT                 Base Model=Llama 3 8B       2026.04   39.1
CoT                 Base Model=Qwen 2.5 7B      2026.04   38.6
CoT                 Base Model=Gemma 2 9B       2026.04   38.4
Direct Prompting    Base Model=Llama 3 8B       2026.04   34.6
Direct Prompting    Base Model=Qwen 2.5 7B      2026.04   34.1
Direct Prompting    Base Model=Gemma 2 9B       2026.04   33.5