Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Document-grounded Dialogue on Doc2Dial
Loading...
41.5
F1 Score
Llama3-RankRAG 70B
19.244
25.022
30.8
36.578
Jul 2, 2024
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Llama3-RankRAG 70B
Retrieval-Augmented Ge...
2024.07
41.5
Llama3-ChatQA-1.5 70B
Retrieval-Augmented Ge...
2024.07
41.3
Llama3-RankRAG 8B
Retrieval-Augmented Ge...
2024.07
40.4
Llama3-ChatQA-1.5 8B
Retrieval-Augmented Ge...
2024.07
39.3
Llama3-Instruct 70B
Retrieval-Augmented Ge...
2024.07
37.9
GPT-4-turbo-2024-0409 RAG
Retrieval-Augmented Ge...
2024.07
35.4
GPT-3.5-turbo-1106 RAG
Retrieval-Augmented Ge...
2024.07
34.8
GPT-4-0613 RAG
Retrieval-Augmented Ge...
2024.07
34.2
Llama3-Instruct 8B
Retrieval-Augmented Ge...
2024.07
33.6
Atlas 11B
Retrieval-Augmented Ge...
2024.07
29.6
GPT-4-0613
Retrieval-Augmented Ge...
2024.07
27.6
GPT-4-turbo-2024-0409
Retrieval-Augmented Ge...
2024.07
27.6
GPT-3.5-turbo-1106
Retrieval-Augmented Ge...
2024.07
20.1
Feedback
Search any
task
Search any
task