
OpenAssistant

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
LLM Alignment | OpenAssistant | GPT-4o Win Rate: 87.08 | 45
LLM Alignment | OpenAssistant (test) | GPT-4o Win Rate: 11.85 | 7