Consistency Evaluation on Consistency Evaluation Dataset (N=720) 1.0 (test)
Overall Score: no plottable results.
Metric
Overall Score (SCALAR)
Semantic Score (SCALAR)
Structural Score (SCALAR)
Updated 1mo ago
Evaluation Results
Method | Links | Overall Score | Semantic Score | Structural Score
No evaluation results found.