Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Downstream retrievalRAR-B
ARC nDCG@516.2
24
Autoregressive Visual WatermarkingRAR-XL generation
Fidelity Score (Baseline)1
10
Medical ReasoningRaR Medicine
WR vs Base57.6
8
Medical Question AnsweringRaR-Medicine (test)
Length1,395
5
Pairwise Preference EvaluationRaR Medicine
Pairwise Win Rate60.6
4
Showing 5 of 5 rows