Document-grounded Dialogue on Doc2Dial

41.5 F1 Score

Llama3-RankRAG 70B

Updated 5mo ago

Evaluation Results

Method	Date	F1 Score
Llama3-RankRAG 70B	2024.07	41.5
Llama3-ChatQA-1.5 70B	2024.07	41.3
Llama3-RankRAG 8B	2024.07	40.4
Llama3-ChatQA-1.5 8B	2024.07	39.3
Llama3-Instruct 70B	2024.07	37.9
GPT-4-turbo-2024-0409 RAG	2024.07	35.4
GPT-3.5-turbo-1106 RAG	2024.07	34.8
GPT-4-0613 RAG	2024.07	34.2
Llama3-Instruct 8B	2024.07	33.6
Atlas 11B	2024.07	29.6
GPT-4-0613	2024.07	27.6
GPT-4-turbo-2024-0409	2024.07	27.6
GPT-3.5-turbo-1106	2024.07	20.1