Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy

About

Measuring the distance between machine-produced and human language is a critical open problem. Inspired by empirical findings from psycholinguistics on the periodicity of entropy in language, we propose FACE, a set of metrics based on Fourier Analysis of the estimated Cross-Entropy of language, for measuring the similarity between model-generated and human-written languages. Based on an open-ended generation task and the experimental data from previous studies, we find that FACE can effectively identify the human-model gap, scales with model size, reflects the outcomes of different sampling methods for decoding, correlates well with other evaluation metrics and with human judgment scores.

Zuhao Yang, Yingfang Yuan, Yang Xu, Shuo Zhan, Huajun Bai, Kefan Chen• 2023

Related benchmarks

TaskDatasetResultRank
Human Correlation AnalysisRefined human judgment dataset human vs model-generated
SO-S0.995
3
Human Correlation AnalysisOriginal human judgment dataset--
3
Showing 2 of 2 rows

Other info

Code

Follow for update