Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MIT Interview

Benchmarks

Task NameDataset NameSOTA ResultTrend
Proactive Assistant EvaluationMIT Interview dataset (test)
Response Frequency5.8
1
Showing 1 of 1 rows