Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Sentiment Editing on ConvSent (OOD)
Loading...
85.29
Edit Success Score
LTE
-3.4116
19.6167
42.645
65.6733
Feb 19, 2024
Edit Success Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Edit Success Score
LTE
Backbone=LLaMA2-Chat-7...
2024.02
85.29
LTE
Backbone=LLaMA2-Chat-7...
2024.02
84.25
LTE
Backbone=LLaMA2-Chat-7...
2024.02
81.98
LTE
Backbone=LLaMA2-Chat-7...
2024.02
79.66
SERAC
Backbone=LLaMA2-Chat-7...
2024.02
62.75
SERAC
Backbone=LLaMA2-Chat-7...
2024.02
60.72
SERAC
Backbone=LLaMA2-Chat-7...
2024.02
56.46
SERAC
Backbone=LLaMA2-Chat-7...
2024.02
50.06
FT-L
Backbone=LLaMA2-Chat-7...
2024.02
49.5
MEMIT
Backbone=LLaMA2-Chat-7...
2024.02
44.75
MEMIT
Backbone=LLaMA2-Chat-7...
2024.02
41.19
MEMIT
Backbone=LLaMA2-Chat-7...
2024.02
36.2
MEMIT
Backbone=LLaMA2-Chat-7...
2024.02
29.33
FT-L
Backbone=LLaMA2-Chat-7...
2024.02
15.54
FT-L
Backbone=LLaMA2-Chat-7...
2024.02
1.43
FT-L
Backbone=LLaMA2-Chat-7...
2024.02
0
Feedback
Search any
task
Search any
task