Share your thoughts, 1 month free Claude Pro on usSee more

Scientific Question Answering on GPQA Main (LLM Trust score)

98.39LLM Trust Score

DecepChain

Updated 2mo ago

Evaluation Results

Method	Links
DecepChain 2025.09		98.39
DecepChain 2025.09		92.77
DecepChain 2025.09		89.33
DecepChain 2025.09		88.84
BadNet 2025.09		73.79
BadNet 2025.09		71.07
BadChain 2025.09		69.46
DT-COT 2025.09		68.3
BadNet 2025.09		63.21
DT-COT 2025.09		46.79
BadChain 2025.09		43.71
BadNet 2025.09		16.74
DT-COT 2025.09		8.71
BadChain 2025.09		6.88
DT-COT 2025.09		6.34
BadChain 2025.09		3.89