Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Response Generation on TradePolicy
Loading...
39.11
ROUGE-L
RAGen
26.4948
29.7699
33.045
36.3201
Oct 13, 2025
ROUGE-L
BERT-F1
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L
BERT-F1
RAGen
Model Backbone=Qwen2.5...
2025.10
39.11
90.33
RAGen
Model Backbone=Qwen2.5...
2025.10
37.47
90.04
AutoRAG
Model Backbone=Qwen2.5...
2025.10
33.88
88.75
LlamaIndex
Model Backbone=Qwen2.5...
2025.10
33.46
88.61
AutoRAG
Model Backbone=Qwen2.5...
2025.10
27.75
87.26
LlamaIndex
Model Backbone=Qwen2.5...
2025.10
26.98
86.96
Feedback
Search any
task
Search any
task