Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Natural Language Inference on SciNLI
Loading...
87.1
F1 Score
Full FT
76.388
79.169
81.95
84.731
Apr 8, 2026
F1 Score
Updated 9d ago
Evaluation Results
Method
Method
Links
F1 Score
Full FT
Base model=gpt-oss 20B...
2026.04
87.1
MPT
Base model=gpt-oss 20B...
2026.04
86.5
LoRA
Base model=gpt-oss 20B...
2026.04
85.3
Full FT
Base model=Meditron3 8...
2026.04
84.9
MPT
Base model=Meditron3 8...
2026.04
84.3
Full FT
Base model=LLaMA 3.1 8...
2026.04
83.6
LoRA
Base model=Meditron3 8...
2026.04
83
MPT
Base model=LLaMA 3.1 8...
2026.04
82.9
LoRA
Base model=LLaMA 3.1 8...
2026.04
81.5
PT
Base model=gpt-oss 20B...
2026.04
81
PT
Base model=Meditron3 8...
2026.04
78.5
PT
Base model=LLaMA 3.1 8...
2026.04
76.8
Feedback
Search any
task
Search any
task