Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Compositional Generalization on Integration
Loading...
49
P1 Score
PRECEPT
9.688
19.894
30.1
40.306
Mar 10, 2026
P1 Score
Pt Score
Delta P1 vs FR
Updated 1mo ago
Evaluation Results
Method
Method
Links
P1 Score
Pt Score
Delta P1 vs FR
PRECEPT
Config=3-way
2026.03
49
58.3
0.9
PRECEPT
Config=2-way
2026.03
41.7
53.3
0.84
ExpeL
Config=3-way
2026.03
25
38.7
-
FR
Config=2-way
2026.03
18.1
32.4
-
FR
Config=3-way
2026.03
17
42
-
ExpeL
Config=2-way
2026.03
11.2
42.4
-
Feedback
Search any
task
Search any
task