Deep Extrapolation for Attribute-Enhanced Generation
About
Attribute extrapolation in sample generation is challenging for deep neural networks operating beyond the training distribution. We formulate a new task for extrapolation in sequence generation, focusing on natural language and proteins, and propose GENhance, a generative framework that enhances attributes through a learned latent space. Trained on movie reviews and a computed protein stability dataset, GENhance can generate strongly-positive text reviews and highly stable protein sequences without being exposed to similar data during training. We release our benchmark tasks and models to contribute to the study of generative modeling extrapolation and data-driven design in biology and chemistry.
Alvin Chan, Ali Madani, Ben Krause, Nikhil Naik• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Controlled Text Generation | SST-5 200-Pos | Positiveness Score91 | 8 | |
| Controlled Text Generation | SST-5 No-Pos | Positiveness Score70 | 8 | |
| Protein Stability Design | ACE2 | ddG Mean-7.34 | 7 | |
| Attribute-enhanced Sequence Generation | SST-5 No-Pos (test) | Positive Rate87.7 | 6 | |
| Attribute-enhanced text generation | SST-5 200-Pos | Positive Realization (%)98.7 | 6 |
Showing 5 of 5 rows