2021 Synthetic Biology: Engineering, Evolution & Design (SEED)
Precision Engineering of Biomolecular Function with Massively Multiplexed Genotype-Phenotype Measurements and Machine Learning
Authors
In our first demonstration of this approach, we created a library of nearly 100,000 variants of the LacI sensor in E. coli. We used laboratory automation and a combination of long- and short-read sequencing to measure the full dose-response curves and corresponding DNA sequences for every variant. With the resulting data, we identified LacI genotypes with precisely targeted dose-response. For example, we engineered sensors with sensitivities (i.e. EC50) spanning 3 orders of magnitude with a 1.25-fold accuracy. In addition, we used the data to train interpretable machine learning models that provide an intuitive route to engineer new LacI variants with quantitatively predictable dose-response.
Remarkably, we also found many LacI variants with phenotypes that differ qualitatively from the wild-type, including inverted dose-response variants and never-before-seen band-stop (on-off-on) variants. These qualitative phenotypic changes are particularly interesting because they can provide specific insight into the biophysics of sensor proteins and because they highlight the capability for large-scale genotype-phenotype measurements to discover rare and useful biomolecule variants.