Sydney Biology

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Scientific Reasoning | Sydney Biology per-architecture breakdown (full) | BLEU: 8.93 | 8
Natural Language Inference | Sydney Biology | NLI Score: 0.3562 | 7