Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Relational linear probing on SUBJ
Loading...
63
F1 (GT)
KL-RP
25.56
35.28
45
54.72
May 21, 2026
F1 (GT)
F1 (LLM)
dKL Divergence
Updated 12d ago
Evaluation Results
Method
Method
Links
F1 (GT)
F1 (LLM)
dKL Divergence
KL-RP
Model=Llama-3.1, Layer=16
2026.05
63
88
0.1
Random
Model=Llama-3.1, Layer=16
2026.05
41
47
0.49
Random
Model=Gemma-2, Layer=13
2026.05
39
50
0.16
KL-RP
Model=Gemma-2, Layer=13
2026.05
39
50
0.26
LRE
Model=Llama-3.1, Layer=16
2026.05
28
50
0.02
LRE
Model=Gemma-2, Layer=13
2026.05
27
48
0.38
Feedback
Search any
task
Search any
task