TeleChat Technical Report

About

In this technical report, we present TeleChat, a collection of large language models (LLMs) with 3 billion, 7 billion, and 12 billion parameters. It includes pretrained language models as well as fine-tuned chat models that are aligned with human preferences. TeleChat is first pretrained on an extensive corpus of trillions of tokens, containing a diverse collection of English and Chinese texts. Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe. We evaluate the performance of TeleChat on various tasks, including language understanding, mathematics, reasoning, code generation, and knowledge-based question answering. Our findings indicate that TeleChat achieves performance comparable to other open-source models of similar size across a wide range of public benchmarks. To support future research and applications utilizing LLMs, we release the fine-tuned model checkpoints of TeleChat's 7B and 12B variants, along with code and a portion of our pretraining data, to the public community.

Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei He, Yanhan Zhang, Zhuoru Jiang, Qiyi Xie, Zhongqiu Li, Lingling Shi, Weiwei Fu, Yin Zhang, Zilu Huang, Sishi Xiong, Yuxiang Zhang, Chao Wang, Shuangyong Song • 2024

Related benchmarks

Task	Dataset	Metric	Result	
Medical Question Answering	MedMCQA	Accuracy	57.08	591
Mathematical Reasoning	MATH 500	Top-1 Accuracy	70	452
Reasoning	MMLU-Pro	Accuracy	67.98	264
Reasoning	GPQA Diamond	Accuracy	33.33	185
Scientific Question Answering	GPQA Diamond	Accuracy	33.33	131
Instruction Following	IFEval	Accuracy (IFEval)	82	101
Mathematical Problem Solving	MATH500	Accuracy	70	96
Medical Reasoning	MedMCQA	Accuracy	57.08	72
Mathematical Problem Solving	AIME 2024	Top-1 Accuracy	10	54
Tabular Question Answering	ReasonTabQA 1.0 (Overall)	Overall Accuracy	51.13	33

Showing 10 of 13 rows
