Novel

Benchmarks

Task Name	Dataset Name	SOTA Result
Long-form Question Answering	Novel GraphRAG-Bench	LLM-Acc85.3	20
Object Detection	Novel-114 Average	AP50:9555.1	11
Retrieval-Augmented Generation	Novel	Indexing Time (mins)13	11
Retrieval Efficiency	Novel	Retrieved Tokens22,391	8
Object Detection	Novel-114 Instruments-8	AP58.5	3
Object Detection	Novel-114 Cold Weapons-4	AP74.4	3
Surgical robot end-effector pose estimation	Novel unseen configuration (test)	RMSE X (mm)1.37	2

Showing 7 of 7 rows