Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Novel

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long-form Question AnsweringNovel GraphRAG-Bench
LLM-Acc85.3
20
Retrieval-Augmented GenerationNovel
Indexing Time (mins)13
11
Retrieval EfficiencyNovel
Retrieved Tokens22,391
8
Surgical robot end-effector pose estimationNovel unseen configuration (test)
RMSE X (mm)1.37
2
Showing 4 of 4 rows