by T. Vornholt, M. Mutný, G. W. Schmidt, C. Schellhaas, R. Tachibana, S. Panke, T. R. Ward, A. Krause, M. Jeschek
Abstract:
Tailored enzymes are crucial for the transition to a sustainable bioeconomy. However, enzyme engineering is laborious and failure-prone due to its reliance on serendipity. The efficiency and success rates of engineering campaigns may be improved by applying machine learning to map the sequence-activity landscape based on small experimental data sets. Yet, it often proves challenging to reliably model large sequence spaces while keeping the experimental effort tractable. To address this challenge, we present an integrated pipeline combining large-scale screening with active machine learning, which we applied to engineer an artificial metalloenzyme (ArM) catalyzing a new-to-nature hydroamination reaction. Combining lab automation and next-generation sequencing, we acquired sequence-activity data for several thousand ArM variants. We then used Gaussian process regression to model the activity landscape and guide further screening rounds. Critical characteristics of our pipeline include the cost-effective generation of information-rich data sets, the integration of an explorative round to improve the model's performance, and the inclusion of experimental noise. Our approach led to an order-of-magnitude boost in the hit rate while making efficient use of experimental resources. Search strategies like this should find broad utility in enzyme engineering and accelerate the development of novel biocatalysts.
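Illustrative sketch (not the authors' code): the abstract describes Gaussian process regression with experimental noise, used to decide which variants to screen next. The minimal Python example below shows one common way such a loop can be set up. Everything specific in it is an assumption made for illustration: the one-hot encoding, the squared-exponential kernel, the length scale, the noise variance, the upper-confidence-bound selection rule, and the toy sequences and activity values, none of which are taken from the paper.

```python
import math

AMINO = "ACDEFGHIKLMNPQRSTVWY"  # the 20 canonical amino acids

def one_hot(seq):
    """Encode a string of amino-acid letters as a flat 0/1 vector."""
    return [1.0 if aa == a else 0.0 for aa in seq for a in AMINO]

def rbf(x, y, length=2.0):
    """Squared-exponential kernel (assumed kernel; length scale is arbitrary)."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-d2 / (2.0 * length ** 2))

def cholesky(A):
    """Lower-triangular L with L L^T = A, for a symmetric positive-definite A."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def solve_lower(L, b):
    """Solve L z = b by forward substitution."""
    z = []
    for i in range(len(b)):
        z.append((b[i] - sum(L[i][k] * z[k] for k in range(i))) / L[i][i])
    return z

def solve_upper_t(L, z):
    """Solve L^T x = z by back substitution."""
    n = len(z)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (z[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_posterior(train_x, train_y, test_x, noise_var=0.05):
    """Zero-mean GP posterior (mean, variance) of the latent activity.

    noise_var is added to the kernel diagonal to model measurement noise.
    """
    n = len(train_x)
    K = [[rbf(train_x[i], train_x[j]) + (noise_var if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    L = cholesky(K)
    alpha = solve_upper_t(L, solve_lower(L, train_y))
    out = []
    for x in test_x:
        k = [rbf(x, t) for t in train_x]
        mean = sum(ki * ai for ki, ai in zip(k, alpha))
        v = solve_lower(L, k)
        var = max(rbf(x, x) - sum(vi * vi for vi in v), 0.0)
        out.append((mean, var))
    return out

def pick_next(measured, candidates, batch=2, beta=2.0):
    """Rank unmeasured variants by UCB = mean + beta * std, return the top batch."""
    xs = [one_hot(s) for s in measured]
    ys = list(measured.values())
    offset = sum(ys) / len(ys)          # center data for the zero-mean prior
    ys = [y - offset for y in ys]
    post = gp_posterior(xs, ys, [one_hot(s) for s in candidates])
    ranked = sorted(zip(candidates, post),
                    key=lambda t: t[1][0] + beta * math.sqrt(t[1][1]),
                    reverse=True)
    return [s for s, _ in ranked[:batch]]

# Toy data, invented for this example only.
measured = {"AAAAA": 0.1, "AAAAG": 0.9, "GAAAA": 0.2}
candidates = ["AAAGG", "GGAAA", "AAAAS"]
next_round = pick_next(measured, candidates)
```

A larger beta favors variants with high predictive uncertainty (exploration), a smaller one favors variants with high predicted activity (exploitation). In practice a maintained GP library would replace the hand-written linear algebra.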
Reference:
Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning. T. Vornholt, M. Mutný, G. W. Schmidt, C. Schellhaas, R. Tachibana, S. Panke, T. R. Ward, A. Krause, M. Jeschek. In ACS Central Science, volume 10, 2024
Bibtex Entry:
@article{vornholt24enhanced,
	author = {Tobias Vornholt and Mojm{\'\i}r Mutn{\'y} and Gregor W. Schmidt and Christian Schellhaas and Ryo Tachibana and Sven Panke and Thomas R. Ward and Andreas Krause and Markus Jeschek},
	doi = {https://doi.org/10.1021/acscentsci.4c00258},
	journal = {ACS Central Science},
	month = {May},
	number = {7},
	pages = {1357--1370},
	title = {Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning},
	volume = {10},
	year = {2024}}