Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Paraphrasing on IIITD-BU-Combined Response level
Loading...
0.648
F1 Score
LLaMA-3.3-70B-Instruct
0.40152
0.46551
0.5295
0.59349
Nov 16, 2025
F1 Score
False Acceptance Rate (FAR)
False Rejection Rate (FRR)
Updated 3d ago
Evaluation Results
Method
Method
Links
F1 Score
False Acceptance Rate (FAR)
False Rejection Rate (FRR)
LLaMA-3.3-70B-Instruct
Preprocessing=Unproces...
2025.11
0.648
0.386
0.279
LLaMA-3.3-70B-Instruct
Preprocessing=Unproces...
2025.11
0.563
0.521
0.224
LLaMA-3.3-70B-Instruct
Preprocessing=Processe...
2025.11
0.555
0.556
0.161
LLaMA-3.3-70B-Instruct
Preprocessing=Processe...
2025.11
0.411
0.719
0.09
Feedback
Search any
task
Search any
task