Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reading Comprehension on MultiRC
Loading...
88.2
F1 Score
PaLM 2-L
47.744
58.247
68.75
79.253
May 17, 2023
Oct 14, 2023
Mar 12, 2024
Aug 10, 2024
Jan 7, 2025
Jun 6, 2025
Nov 4, 2025
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
PaLM 2-L
prompting=1-shot
2023.05
88.2
PaLM
prompting=1-shot
2023.05
84.9
PaLM 2-M
prompting=1-shot
2023.05
84.1
PaLM 2-S
prompting=1-shot
2023.05
84
Random
Backbone=LLaMA-7B, Eva...
2025.05
60.4
BM25
Backbone=LLaMA-7B, Eva...
2025.05
58.7
ConMeZO
Model=OPT-13B
2025.11
58.3
MeZO
Model=OPT-13B
2025.11
57.53
Zero-shot
Backbone=LLaMA-7B, Eva...
2025.05
57
GENICL
Backbone=LLaMA-7B, Eva...
2025.05
56.9
MeZO
Model=OPT-1.3B
2025.11
55.9
E5base
Backbone=LLaMA-7B, Eva...
2025.05
54
ConMeZO
Model=OPT-1.3B
2025.11
53.5
SBERT
Backbone=LLaMA-7B, Eva...
2025.05
53.3
EPR
Backbone=LLaMA-7B, Eva...
2025.05
50.4
LLM-R
Backbone=LLaMA-7B, Eva...
2025.05
50.2
CBDS
Backbone=LLaMA-7B, Eva...
2025.05
49.3
Feedback
Search any
task
Search any
task