Conditionally Site-Independent Neural Evolution of Antibody Sequences
About
Common deep learning approaches for antibody engineering focus on modeling the marginal distribution of sequences. By treating sequences as independent samples, however, these methods overlook affinity maturation as a rich and largely untapped source of information about the evolutionary process by which antibodies explore the underlying fitness landscape. In contrast, classical phylogenetic models explicitly represent evolutionary dynamics but lack the expressivity to capture complex epistatic interactions. We bridge this gap with CoSiNE, a continuous-time Markov chain parameterized by a deep neural network. Mathematically, we prove that CoSiNE provides a first-order approximation to the intractable sequential point mutation process, capturing epistatic effects with an error bound that is quadratic in branch length. Empirically, CoSiNE outperforms state-of-the-art language models in zero-shot variant effect prediction by explicitly disentangling selection from context-dependent somatic hypermutation. Finally, we introduce Guided Gillespie, a classifier-guided sampling scheme that steers CoSiNE at inference time, enabling efficient optimization of antibody binding affinity toward specific antigens.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Expression Prediction | Koenig H | Pearson Correlation0.687 | 14 | |
| Binding affinity prediction | Koenig (Heavy chain mutation) | Spearman Correlation0.456 | 7 | |
| Binding affinity prediction | Koenig L (Light chain mutation) | Spearman Correlation0.371 | 7 | |
| Binding affinity prediction | Shaneh Sequence length 119 | Spearman Correlation0.498 | 7 | |
| Binding Prediction | Koenig L | Pearson Correlation0.345 | 7 | |
| Binding Prediction | Shaneh | Pearson Correlation Coefficient0.502 | 7 | |
| Binding Prediction | Shaneh 120 | Pearson Correlation0.521 | 7 | |
| Expression Prediction | Koenig L | Pearson Correlation0.696 | 7 | |
| Expression Prediction | Adams | Pearson Correlation0.409 | 7 | |
| Expression Prediction | Koenig (Heavy chain mutation) | Spearman Correlation0.613 | 7 |