Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Word Sense Induction on SemEval 2010
Loading...
43.6
V-M
PolyLM large
-1.744
10.028
21.8
33.572
Mar 12, 2026
V-M
Paired F-S
NMI
F-B3
Updated 1mo ago
Evaluation Results
Method
Method
Links
V-M
Paired F-S
NMI
F-B3
PolyLM large
Size=Large
2026.03
43.6
67.5
6.2
49.2
PolyLM base
Size=Base
2026.03
41.8
66.4
6.2
49.1
LSDP
Substitutes=BERT-large
2026.03
38.9
70.7
4.6
52.8
GPT-4o
Methodology=LLM-based...
2026.03
36.3
63.9
7.1
47.7
1cpex
Strategy=One cluster p...
2026.03
31.7
0
19.5
8
Llama 3.3 70B
Methodology=LLM-based...
2026.03
29.4
49.7
8.1
49.6
Llama 3.1 8B
Methodology=LLM-based...
2026.03
16.5
49.3
7.3
49.6
1cpl
Strategy=One cluster p...
2026.03
0
63.5
0
64.1
Feedback
Search any
task
Search any
task