Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Timeline Summarisation on NarrativeReason 1.0 (test)
Loading...
3.83
Factual Consistency
L-Phi
2.8628
3.1139
3.365
3.6161
Dec 30, 2024
Factual Consistency
Usefulness (General)
Usefulness (Diagnosis)
Usefulness (Inter/Intrapersonal)
MOC
Updated 3mo ago
Evaluation Results
Method
Method
Links
Factual Consistency
Usefulness (General)
Usefulness (Diagnosis)
Usefulness (Inter/Intrapersonal)
MOC
L-Phi
2024.12
3.83
3.48
3.62
3.51
3.47
LLaMA
2024.12
3.58
3.17
3.45
3.4
3.42
P-Phi
2024.12
3.32
3.13
3.37
3
2.97
Phi
2024.12
2.9
2.6
2.9
2.95
2.97
Feedback
Search any
task
Search any
task