Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Natural Language Explanation Generation on ComVE
Loading...
70
Human Evaluation Score
SPARSEFIT
19.7368
32.7859
45.835
58.8841
May 22, 2023
Human Evaluation Score
Cohen's Kappa
Updated 1mo ago
Evaluation Results
Method
Method
Links
Human Evaluation Score
Cohen's Kappa
SPARSEFIT
Backbone=Llama-2-7B, S...
2023.05
70
-
AdaLora
Backbone=Llama-2-7B
2023.05
55.56
-
SPARSEFIT (Att.Q+LN)
Backbone=T5-large, Fin...
2023.05
40
0.25
Full Fine-tuning
Backbone=Llama-2-7B
2023.05
40
-
SPARSEFIT (Att.Q)
Backbone=T5-large, Fin...
2023.05
28.89
0.35
AdaLora
Backbone=T5-large, Fin...
2023.05
23.34
0.25
Full Fine-tuning
Backbone=T5-large, Fin...
2023.05
21.67
0.22
Feedback
Search any
task
Search any
task