Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation

About

Generative modeling of discrete variables is challenging yet crucial for applications in natural language processing and biological sequence design. We introduce the Shortlisting Model (SLM), a novel simplex-based diffusion model inspired by progressive candidate pruning. SLM operates on simplex centroids, reducing generation complexity and enhancing scalability. Additionally, SLM incorporates a flexible implementation of classifier-free guidance, enhancing unconditional generation performance. Extensive experiments on DNA promoter and enhancer design, protein design, character-level and large-vocabulary language modeling demonstrate the competitive performance and strong potential of SLM. Our code can be found at https://github.com/GenSI-THUAIR/SLM

Yuxuan Song, Zhe Zhang, Yu Pei, Jingjing Gong, Qiying Yu, Zheng Zhang, Mingxuan Wang, Hao Zhou, Jingjing Liu, Wei-Ying Ma• 2025

Related benchmarks

TaskDatasetResultRank
DNA Sequence Generationenhancer DNA sequence flybrain
FBD4.4
13
Showing 1 of 1 rows

Other info

Follow for update