RTL code functionality equivalence checking on DeepRTL2 Benchmark
Best result: 66.7 AP, DeepRTL2 (May 28, 2025)
Evaluation Results
| Method | Configuration | Date | AP |
|---|---|---|---|
| DeepRTL2 | backbone=Llama | 2025.05 | 66.7 |
| DeepRTL2 | backbone=DeepSeek | 2025.05 | 59.1 |
| text-embedding-3-small | size=small | 2025.05 | 56.5 |
| GritLM | parameters=7B | 2025.05 | 54.1 |
| DeepRTL2 | backbone=Llama, hard n... | 2025.05 | 51.8 |
| text-embedding-3-large | size=large | 2025.05 | 49.8 |
| DeepRTL2 | backbone=DeepSeek, har... | 2025.05 | 48.1 |