Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following on MT-bench and AlpacaEval

1.55Aggregated P

NovelSelect

Updated 4mo ago

Evaluation Results

Method	Links
NovelSelect 2025.02		1.55
K-means 2025.02		1.32
K-Center-Greedy 2025.02		1.31
QDIT 2025.02		1.25
Random 2025.02		1.2
Repr Filter 2025.02		1.05