Prompt Injection Robustness on Coding prompt injections
[Figure: Score chart. Highlighted entry: gpt-5-thinking, Score 97, Dec 19, 2025.]
Updated 1mo ago
Evaluation Results
Method          Date     Score
gpt-5-thinking  2025.12  97
OpenAI o3       2025.12  94