Hermes 3 Technical Report
About
Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.
Ryan Teknium, Jeffrey Quesnelle, Chen Guang• 2024
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Jailbreak attack success rate | Harmful prompts dataset | Attack Success Rate82.5 | 49 | |
| Rain Removal | Rain 0.5 | PSNR (dB)31.5262 | 20 | |
| Snow Removal | CityScape + Snow100K | PSNR (dB)28.8836 | 10 | |
| Rain Removal | Rain | PSNR (dB)30.7602 | 10 |
Showing 4 of 4 rows