Share your thoughts, 1 month free Claude Pro on usSee more

Operator Induction on VL-ICL Bench OP_IND (test)

47.7Mean Accuracy

MAPD

Updated 4mo ago

Evaluation Results

Method	Links
MAPD 2025.06		47.7
Multi-TaskPD 2025.06		45.1
Model-AvgPD 2025.06		40
NoMeta-taskPD 2025.06		38.8
In-ContextPD 2025.06		30.9
LoRA ([0-15] LLM layers + ATT) 2025.06		30.5
LoRA ([0-15] LLM layers) 2025.06		25.5
In-ContextPD 2025.06		20.6
LoRA (All LLM layers) 2025.06		13.3
NoMeta-taskPD 2025.06		12.1
Multi-TaskPD 2025.06		10
MAPD 2025.06		9.6
Model-AvgPD 2025.06		9.2