
Logical Expression Evaluation on ListOps-O Argument Generalization (Arguments 15)

Top accuracy: 79 (EBT-GRC)

Updated 4mo ago

Evaluation Results

Method	Date	Accuracy
EBT-GRC	2023.11	79
OM	2023.11	75.05
BT-GRC OS	2023.11	67.9
RIR-EBT-GRC	2023.11	49.55
RIR-EBT-GRC (-S4D)	2023.11	47.73
CRVNN	2023.11	45.1
BBT GRC	2023.11	44.5
RIR-EBT-GRC (-Beam Align)	2023.11	43.65
RIR-GRC	2023.11	42.75
MEGA	2023.11	33.85
S4D	2023.11	22.76