Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Lineage Verification on TinyStories seed 1000 Continual (train)
Loading...
5.36
t-test p-value (logits)
SeedPrints
5.092
5.226
5.36
5.494
Sep 30, 2025
t-test p-value (logits)
U-test p-value (logits)
t-test p-value (hidden)
U-test p-value (hidden)
Intrinsic Score
REEF Score
PCS Score
ICS Score
Updated 4d ago
Evaluation Results
Method
Method
Links
t-test p-value (logits)
U-test p-value (logits)
t-test p-value (hidden)
U-test p-value (hidden)
Intrinsic Score
REEF Score
PCS Score
ICS Score
SeedPrints
Model Architecture=Qwe...
2025.09
5.36
1.92
8.49
5.09
-
-
-
-
Intrinsic
Model Architecture=Qwe...
2025.09
-
-
-
-
1
-
-
-
REEF
Model Architecture=Qwe...
2025.09
-
-
-
-
-
0.957
-
-
PCS
Model Architecture=Qwe...
2025.09
-
-
-
-
-
-
0.999
-
ICS
Model Architecture=Qwe...
2025.09
-
-
-
-
-
-
-
0.996
Feedback
Search any
task
Search any
task