Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
RTL Design Code Generation on VerilogHuman v2
Loading...
98.1
Syntax Accuracy
Claude-3.7
51.404
63.527
75.65
87.773
May 15, 2026
Syntax Accuracy
Function Accuracy
Updated 16d ago
Evaluation Results
Method
Method
Links
Syntax Accuracy
Function Accuracy
Claude-3.7
Evaluation Protocol=Or...
2026.05
98.1
76.3
GPT-4o
Evaluation Protocol=RT...
2026.05
98.1
71.4
Claude-3.7
Evaluation Protocol=RT...
2026.05
98.1
76.9
GPT-4o-mini
Evaluation Protocol=Or...
2026.05
96.2
51.9
GPT-4o
Evaluation Protocol=Or...
2026.05
96.2
68.8
LlaMA-405B
Evaluation Protocol=Or...
2026.05
91.7
60.9
LlaMA-70B
Evaluation Protocol=Or...
2026.05
80.1
42.9
QwQ-32B
Evaluation Protocol=Or...
2026.05
61.5
55.1
GPT-3.5
Evaluation Protocol=Or...
2026.05
53.2
26.3
Feedback
Search any
task
Search any
task