Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SFE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Scientific ReasoningSFE ZH Chinese
Score35.69
9
Scientific ReasoningSFE English
Score36.49
9
Scientific Multimodal TasksSFE
Score58.9
5
Showing 3 of 3 rows