Our new X account is live! Follow @wizwand_team for updates

First-Order Logic translation on FOLIO (test)

66BLEU

Qwen3-1.7B-SGRPO

Updated 4d ago

Evaluation Results

Method	Links
Qwen3-1.7B-SGRPO 2025.12		66	-	87.4
Qwen3-1.7B-SFT 2025.12		61.2	-	85
ChatGPT-4o 2025.12		38.4	82.6	80.9
LogicLLaMA-13B 2025.12		38.4	85.8	-
LogicLLaMA-7B 2025.12		37.8	84.1	-
DeepSeek-V3 2025.12		37.6	83	79.2
ChatGPT-3.5 2025.12		37	80.2	77.6