Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps

About

Humor is previously regarded as a gift exclusive to humans for the following reasons. Humor is a culturally nuanced aspect of human language, presenting challenges for its understanding and generation. Humor generation necessitates a multi-hop reasoning process, with each hop founded on proper rationales. Although many studies, such as those related to GPT-o1, focus on logical reasoning with reflection and correction, they still fall short in humor generation. Due to the sparsity of the knowledge graph in creative thinking, it is arduous to achieve multi-hop reasoning. Consequently, in this paper, we propose a more robust framework for addressing the humor reasoning task, named LoL. LoL aims to inject external information to mitigate the sparsity of the knowledge graph, thereby enabling multi-hop reasoning. In the first stage of LoL, we put forward an automatic instruction-evolution method to incorporate the deeper and broader thinking processes underlying humor. Judgment-oriented instructions are devised to enhance the model's judgment capability, dynamically supplementing and updating the sparse knowledge graph. Subsequently, through reinforcement learning, the reasoning logic for each online-generated response is extracted using GPT-4o. In this process, external knowledge is re-introduced to aid the model in logical reasoning and the learning of human preferences. Finally, experimental results indicate that the combination of these two processes can enhance both the model's judgment ability and its generative capacity. These findings deepen our comprehension of the creative capabilities of large language models (LLMs) and offer approaches to boost LLMs' creative abilities for cross-domain innovative applications.

Han Wang, Yilin Zhao, Dian Li, Xiaohan Wang, Gang Liu, Xuguang Lan, Hui Wang• 2024

Related benchmarks

TaskDatasetResultRank
Funny Caption GenerationElectric sheep High-Humor
Recall@161.26
32
Funny Caption GenerationHumor in AI (#200-209)
Recall@168.4
32
Funny Caption GenerationElectric sheep Low-Humor
Top-1 Accuracy66.23
32
Funny Caption GenerationHumor in AI (#1000-1009)
Top-1 Accuracy67.29
32
Funny Caption GenerationHumor in AI (#Top10)
Top-1 Accuracy58.06
32
Humorous Caption GenerationHumor in AI Dataset (test)
Visual Understanding Rank4.9
8
Meme Caption GenerationImgFlip
pass@171.67
8
Meme GenerationMeme ImgFlip (test)
Pass@171.67
8
Humor GenerationElectronic Sheep (test)
Visual Understanding Avg Rank4.5
8
Humor GenerationHumor in AI
Mean Score3.16
7
Showing 10 of 15 rows

Other info

Follow for update