Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scenario-based Filter Generation Benchmark

18.98ROUGE-1

llama 3.2 3B

9.838412.211714.58516.9583Nov 17, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
18.987.613.9888.3861.033.49
2025.11
18.677.8713.5488.9766.634.07
2025.11
17.877.4612.7389.573.533.28
2025.11
10.193.677.1587.6361.531.33