Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets

About

Protein sequence optimization under tight oracle budgets requires methods that explore vast combinatorial spaces while making each evaluation informative. Existing reinforcement learning and off-policy generative approaches often degrade under surrogate noise, and position-agnostic mutation proposals risk disrupting functionally critical residues. We introduce SILO, a trajectory-level self-improvement imitation framework for oracle-budgeted protein design. SILO uses a hierarchical edit policy that decomposes each mutation into a position choice followed by a residue choice. In each active-learning round, the policy samples candidate trajectories via incremental stochastic beam search without replacement (SBS), and a UCB-based proxy ensemble, combined with an alanine-scan fitness score (AFS), selects candidates with functionally relevant edits for in silico oracle evaluation. The policy is then updated by next-action cross-entropy imitation on the round's best oracle-labeled trajectories, avoiding value-function estimation. Across eight reproduced protein fitness landscapes and five strong baselines from prior work, SILO achieves the highest maximum and top-100 mean fitness on 8 of 8 landscapes within our evaluations, often exhibiting faster early-stage improvement. In low-data and noisy-proxy stress tests on two landscapes per setting, SILO remains competitive or best when several baselines degrade. Ablations show that SBS with AFS account for much of the gains, with iterative imitation providing additional improvement. Code is available at: https://github.com/grimmlab/SILO.git

Ashima Khanna, Dominik Grimm• 2026

Related benchmarks

TaskDatasetResultRank
Protein DesignAMIE
Mean Fitness (Top 100 Sequences)0.259
12
Protein DesignE4B
Mean Fitness (Top 100)8.148
12
Protein DesignGFP
Mean Fitness (Top 100)3.617
12
Protein DesignLGK
Mean Fitness (Top 100)0.044
12
Protein DesignPab1
Mean Fitness (Top 100)1.856
12
Protein DesignTEM
Mean Fitness (top 100)1.232
12
Protein DesignUBE2I
Mean Fitness (Top 100)2.996
12
Protein DesignAAV
Maximum Fitness0.747
12
Protein DesignAMIE
Maximum Fitness0.262
12
Protein DesignE4B
Maximum Fitness8.169
12
Showing 10 of 43 rows

Other info

Follow for update