Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
WSI Report Generation on In-house dataset Chinese private (test)
Loading...
0.0924
BLEU-1
QCAgent
0.019808
0.038654
0.0575
0.076346
Mar 2, 2026
BLEU-1
BLEU-4
ROUGE-L
METEOR
Field Recall
BERTScore
Updated 1mo ago
Evaluation Results
Method
Method
Links
BLEU-1
BLEU-4
ROUGE-L
METEOR
Field Recall
BERTScore
QCAgent
Setup=Ours, Iterative...
2026.03
0.0924
0.0186
0.249
0.2307
0.4863
0.237
PRISM
Setup=Baseline, Iterat...
2026.03
0.0226
0.0083
0.0484
0.0333
0.1908
-0.0831
Feedback
Search any
task
Search any
task