Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UltraMedical

Benchmarks

Task NameDataset NameSOTA ResultTrend
MT-BenchUltraMedical Preference
MT Score6.9
28
AlpacaEval 2.0UltraMedical Preference
LC17.7
28
Reward ModelingUltraMedical (test)
Easy Score95.8
5
Showing 3 of 3 rows