RTL Design Code Generation on RTLLM v1.1
[Chart: Syntax Accuracy and Functionality Accuracy. Labeled point: GPT-4o, Syntax Accuracy 92, May 15, 2026.]
Updated 16d ago
Evaluation Results
Method       Details                    Date     Syntax Accuracy  Functionality Accuracy
GPT-4o       Evaluation Protocol=Or...  2026.05  92               68
GPT-4o       Evaluation Protocol=RT...  2026.05  92               70
Claude-3.7   Evaluation Protocol=Or...  2026.05  90               70
Claude-3.7   Evaluation Protocol=RT...  2026.05  90               70
GPT-4o-mini  Evaluation Protocol=Or...  2026.05  88               56
GPT-3.5      Evaluation Protocol=Or...  2026.05  80               50
LlaMA-405B   Evaluation Protocol=Or...  2026.05  67.3             46.8
LlaMA-70B    Evaluation Protocol=Or...  2026.05  54               34
QwQ-32B      Evaluation Protocol=Or...  2026.05  28               26