Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Event Classification on crowd-enVent
Loading...
54
F1-macro
iPOE-llm
33.2
38.6
44
49.4
May 18, 2026
F1-macro
Updated 15d ago
Evaluation Results
Method
Method
Links
F1-macro
iPOE-llm
Model=Qwen3-30B, Proto...
2026.05
54
iPOE-lb-h
Model=LLama3-8B, Proto...
2026.05
51
iPOE-lb-h
Model=Qwen3-30B, Proto...
2026.05
51
iPOE-h
Model=Qwen3-4B, Protoc...
2026.05
50
iPOE-llm
Model=Qwen3-4B, Protoc...
2026.05
50
iPOE-lb-llm
Model=Qwen3-4B, Protoc...
2026.05
50
iPOE-h
Model=Qwen3-30B, Proto...
2026.05
50
iPOE-lb-llm
Model=Qwen3-30B, Proto...
2026.05
50
iPOE-lb-h
Model=Qwen3-4B, Protoc...
2026.05
49
iPOE-llm
Model=LLama3-8B, Proto...
2026.05
49
iPOE-h
Model=LLama3-8B, Proto...
2026.05
48
iPOE-lb-llm
Model=LLama3-8B, Proto...
2026.05
48
Rand-llm
Model=Qwen3-30B, Proto...
2026.05
45
Rand-llm
Model=Qwen3-4B, Protoc...
2026.05
43
Rand-h
Model=LLama3-8B, Proto...
2026.05
43
Vanilla
Model=Qwen3-30B, Proto...
2026.05
40
Rand-h
Model=Qwen3-30B, Proto...
2026.05
40
Rand-llm
Model=LLama3-8B, Proto...
2026.05
37
Rand-h
Model=Qwen3-4B, Protoc...
2026.05
36
Vanilla
Model=Qwen3-4B, Protoc...
2026.05
35
Vanilla
Model=LLama3-8B, Proto...
2026.05
34
Feedback
Search any
task
Search any
task