Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Response Generation with Provenance on ReFInE (test)
Loading...
-
ROUGE-L
No plottable results for ROUGE-L (PERCENT).
Metric
ROUGE-L (PERCENT)
BLEU (PERCENT)
METEOR (PERCENT)
MoverScore (PERCENT)
Precision (SCALAR)
Recall (SCALAR)
F1 Score (SCALAR)
Format Compliance (PERCENT)
LLM Judge Score (0-1) (SCALAR)
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L
BLEU
METEOR
MoverScore
Precision
Recall
F1 Score
Format Compliance
LLM Judge Score (0-1)
No evaluation results found.
Feedback
Search any
task
Search any
task