EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

About

This technical report introduces the EXAONE 3.5 instruction-tuned language models, developed and released by LG AI Research. The EXAONE 3.5 language models are offered in three configurations: 32B, 7.8B, and 2.4B. These models feature several standout capabilities: 1) exceptional instruction following capabilities in real-world scenarios, achieving the highest scores across seven benchmarks, 2) outstanding long-context comprehension, attaining the top performance in four benchmarks, and 3) competitive results compared to state-of-the-art open models of similar sizes across nine general benchmarks. The EXAONE 3.5 language models are open to anyone for research purposes and can be downloaded from https://huggingface.co/LGAI-EXAONE. For commercial use, please reach out to the official contact point of LG AI Research: contact_us@lgresearch.ai.

Soyoung An, Kyunghoon Bae, Eunbi Choi, Kibong Choi, Stanley Jungkyu Choi, Seokhee Hong, Junwon Hwang, Hyojin Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung, Yountae Jung, Hyosang Kim, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Yongil Kim, Youchul Kim, Edward Hwayoung Lee, Haeju Lee, Honglak Lee, Jinsik Lee, Kyungmin Lee, Woohyung Lim, Sangha Park, Sooyoun Park, Yongmin Park, Sihoon Yang, Heuiyeen Yeen, Hyeongu Yun • 2024

Related benchmarks

Task | Dataset | Metric | Result | Rank
Instruction Following | IFEval | - | - | 292
Coding | MBPP+ | Pass@1 | 79.4 | 37
Mathematics | GSM8K | GSM8K Score | 82.5 | 21
LLM-generated text detection | KatFish Paper Abstract | AUC-ROC (Solar) | 70.8 | 12
LLM-generated text detection | KatFish Essay | AUC-ROC (Solar) | 92.08 | 12
LLM-generated text detection | KatFish Poetry | AUC-ROC (Solar) | 71.32 | 12
General Knowledge | Unified Korean Benchmark General Knowledge | KMMLU | 52.6 | 7
Society & Culture Understanding | Unified Korean Benchmark Society & Culture | K-Refer | 71.6 | 7
Comprehension | Unified Korean Benchmark Comprehension | K-Prag | 73.5 | 7
Reasoning | Unified Korean Benchmark Reasoning | Ko-Winogrande | 64.6 | 7
Showing 10 of 17 rows
