Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Operator Induction on VL-ICL Bench OP_IND (test)
Loading...
47.7
Mean Accuracy
MAPD
7.66
18.055
28.45
38.845
Jun 7, 2025
Mean Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Mean Accuracy
MAPD
TTA (Test-Time Adaptat...
2025.06
47.7
Multi-TaskPD
TTA (Test-Time Adaptat...
2025.06
45.1
Model-AvgPD
TTA (Test-Time Adaptat...
2025.06
40
NoMeta-taskPD
TTA (Test-Time Adaptat...
2025.06
38.8
In-ContextPD
TTA (Test-Time Adaptat...
2025.06
30.9
LoRA ([0-15] LLM layers + ATT)
TTA (Test-Time Adaptat...
2025.06
30.5
LoRA ([0-15] LLM layers)
TTA (Test-Time Adaptat...
2025.06
25.5
In-ContextPD
TTA (Test-Time Adaptat...
2025.06
20.6
LoRA (All LLM layers)
TTA (Test-Time Adaptat...
2025.06
13.3
NoMeta-taskPD
TTA (Test-Time Adaptat...
2025.06
12.1
Multi-TaskPD
TTA (Test-Time Adaptat...
2025.06
10
MAPD
TTA (Test-Time Adaptat...
2025.06
9.6
Model-AvgPD
TTA (Test-Time Adaptat...
2025.06
9.2
Feedback
Search any
task
Search any
task