Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Training and Verification System Efficiency on Llama 3.1 8B (2048 samples)

62.8Training Time (s)

Baseline

60.7274.7688.8102.84Mar 8, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
62.8----
2026.03
7418131.324.576.2
2026.03
76.121146.215.148.2
2026.03
85.7362017.321
2026.03
114.883333.3781