Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning on FLenQA 250 tokens

80Accuracy

LIME+1

Updated 4mo ago

Evaluation Results

Method	Links
LIME+1 2025.12		80
LIME+1 2025.12		80
LIME+1 2025.12		80
LIME+1 2025.12		80
LIME+1 2025.12		70
LIME 2025.12		52
LIME 2025.12		52
LIME 2025.12		52
Base 2025.12		42
Baseline 2025.12		42
Base (DCLM-BASELINE) 2025.12		42
LIME 2025.12		40
Baseline 2025.12		36
LIME 2025.12		28
Baseline 2025.12		22