Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Fill-in-the-blank Question Answering on PBSBench Slide-level 1.0 (test)
Loading...
27
Exact Match (EMatch)
PBS-VL
-1.08
6.21
13.5
20.79
Apr 19, 2026
Exact Match (EMatch)
Partial Match (PMatch)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Exact Match (EMatch)
Partial Match (PMatch)
PBS-VL
Framework=Slide-level,...
2026.04
27
32
GPT-4o
Input=30 random patches
2026.04
5
5
Gemini-2.5-pro
Input=30 random patches
2026.04
0
9
Claude-4.5
Input=30 random patches
2026.04
0
18
HistoGPT
Model type=Pathology-s...
2026.04
0
0
SlideChat
Model type=Pathology-s...
2026.04
0
0
Feedback
Search any
task
Search any
task