Reasoning on FLenQA 500 tokens

Top result: LIME+1, accuracy 74

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
LIME+1 2025.12		74
LIME+1 2025.12		73.8
LIME+1 2025.12		73.5
LIME+1 2025.12		73.5
LIME+1 2025.12		73.5
LIME 2025.12		65.3
LIME 2025.12		65.3
LIME 2025.12		65.3
Base 2025.12		49.5
Baseline 2025.12		49.5
Base (DCLM-BASELINE) 2025.12		49.5
LIME 2025.12		48.8
Baseline 2025.12		39
LIME 2025.12		32
Baseline 2025.12		22