Context-aware Instruction Following on Academic Paper Abstracts (test)

8.64 Overlap (w)

GPT-4-turbo

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4-turbo 2024.03		8.64	8.47	8.68
Qwen-Chat(v1.5) + Qwen-Chat(v1.5) (Logit-based CoGenesis) 2024.03		8.04	7.7	8.34