Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Relational linear probing on TENSE
Loading...
89
F1 (GT)
KL-RP
12.04
32.02
52
71.98
May 21, 2026
F1 (GT)
F1 (LLM)
dKL Divergence
Updated 12d ago
Evaluation Results
Method
Method
Links
F1 (GT)
F1 (LLM)
dKL Divergence
KL-RP
Model=Llama-3.1, Layer=16
2026.05
89
95
0.02
KL-RP
Model=Gemma-2, Layer=13
2026.05
89
95
0.23
LRE
Model=Gemma-2, Layer=13
2026.05
47
48
0.88
Random
Model=Gemma-2, Layer=13
2026.05
29
30
1.7
Random
Model=Llama-3.1, Layer=16
2026.05
28
36
0.57
LRE
Model=Llama-3.1, Layer=16
2026.05
15
20
1.46
Feedback
Search any
task
Search any
task