Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-ended Question Answering on CrossAlpaca-Eval en 2.0
Loading...
8.58
GPT-4o Score
Qwen2.5-7B-Instruction
7.4672
7.7561
8.045
8.3339
Jan 27, 2025
GPT-4o Score
Updated 4d ago
Evaluation Results
Method
Method
Links
GPT-4o Score
Qwen2.5-7B-Instruction
Backbone=Qwen 2.5-7B,...
2025.01
8.58
Qwen2.5-7B-AdaMCOT
Backbone=Qwen 2.5-7B,...
2025.01
8.58
LLaMA3.1-8B-AdaMCOT
Backbone=LLaMA 3.1-8B,...
2025.01
8.35
LLaMA3.1-8B-Instruction
Backbone=LLaMA 3.1-8B,...
2025.01
8.33
Qwen2.5-7B-AdaMCOT
Backbone=Qwen 2.5-7B,...
2025.01
8.16
LLaMA3.1-8B-AdaMCOT
Backbone=LLaMA 3.1-8B,...
2025.01
8.13
Qwen2.5-7B-Instruction
Backbone=Qwen 2.5-7B,...
2025.01
7.85
LLaMA3.1-8B-Instruction
Backbone=LLaMA 3.1-8B,...
2025.01
7.51
Feedback
Search any
task
Search any
task