Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Personalization on GOQA
Loading...
85.2
Accuracy
RPM
54.52
62.485
70.45
78.415
May 27, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
RPM
Backbone=GPT-4o-mini,...
2025.05
85.2
PAG
Backbone=GPT-4o-mini,...
2025.05
82
RPM
Backbone=GPT-4o-mini,...
2025.05
82
HYDRA
Backbone=GPT-4o-mini,...
2025.05
80.6
RAG
Backbone=GPT-4o-mini,...
2025.05
80
HYDRA
Backbone=GPT-4o-mini,...
2025.05
80
Fermi
Backbone=GPT-4o-mini,...
2025.05
80
PAG
Backbone=GPT-4o-mini,...
2025.05
79.5
RAG
Backbone=GPT-4o-mini,...
2025.05
77.3
ICL
Backbone=GPT-4o-mini,...
2025.05
69.5
ICL
Backbone=GPT-4o-mini,...
2025.05
68.1
Fermi
Backbone=GPT-4o-mini,...
2025.05
65.9
Zero-shot
Backbone=GPT-4o-mini,...
2025.05
56.2
Zero-shot
Backbone=GPT-4o-mini,...
2025.05
55.7
Feedback
Search any
task
Search any
task