Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CVS

Benchmarks

Task NameDataset NameSOTA ResultTrend
Code Helpfulness EvaluationCVS (test)
C++ Success Rate0.66
8
Secure Code GenerationCVS (test)
C++ Success Rate98
8
Kernel QuadratureCVS
Metric-
0
Showing 3 of 3 rows