Text-to-Audio Retrieval on MSRVTT
[Chart: Accuracy by date. BrokenBind: 37.1 Accuracy (Feb 6, 2026). Updated 4d ago.]
Evaluation Results
Method         Configuration              Date     Accuracy
BrokenBind     Backbone=VCLIP+CLAP, E...  2026.02  37.1
VCLIP+CLAP+FT  Backbone=VCLIP+CLAP, E...  2026.02  15.4
VCLIP+CLAP     Backbone=VCLIP+CLAP, E...  2026.02  13.6