Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CUDA-LLM

Benchmarks

Task NameDataset NameSOTA ResultTrend
Dot ProductCUDA-LLM task suite
Execution Time (ms)3.99
9
ReductionCUDA-LLM task suite
Time5.75
9
Matrix CopyCUDA-LLM task suite
Execution Time5.12
9
ReLU Activation FunctionCUDA-LLM task suite
Time4.49
9
Reverse ArrayCUDA-LLM task suite
Execution Time4.07
9
Matrix TransposeCUDA-LLM task suite
Time5.2
9
Top-K SelectionCUDA-LLM task suite
Time5.9
5
HistogrammingCUDA-LLM task suite
Latency (ms)7.55
5
Monte Carlo IntegrationCUDA-LLM task suite
Latency (ms)6.95
5
Categorical Cross-Entropy LossCUDA-LLM task suite
Time (ms)6.4
5
Prefix SumCUDA-LLM task suite
Time6.1
5
Categorical Cross-Entropy LossCUDA-LLM kernels task suite (test)
Time (s)-
0
Multi-Head Self-AttentionCUDA-LLM kernels task suite (test)
Latency (s)-
0
Dot ProductCUDA-LLM task suite (test)
Latency-
0
ReductionCUDA-LLM task suite (test)
Time-
0
Matrix CopyCUDA-LLM task suite (test)
Latency (ms)-
0
Reverse ArrayCUDA-LLM task suite (test)
Time (ms)-
0
Matrix TransposeCUDA-LLM task suite (test)
Latency (ms)-
0
Ordinary Least Squares RegressionCUDA-LLM task suite
Time-
0
Showing 19 of 19 rows