Binan, Geng, Yalun, Wu, Xinyan, Wu, Yongfu, Yang, Peng, Zhou, Yunhaon, Chen, Xuan, Zhou, Chenguang, Liu, Fengwu, Bai, Ping, Xu, Qiaoning, He, and Shihui, Yang
A one-stop centralized database, ZymOmics, was constructed for Zymomonas mobilis. A pipeline was developed for searching restriction–modifications, CRISPR/Cas, and toxin–antitoxin systems within a given microorganism. A genome-wide iterative and continuous editing (GW-ICE) system was established for efficient genome editing without protospacer-adjacent motif (PAM) limitations. A genome-reduced strain, ZMNPΔhypo, was constructed for further genome minimization. Current biotechnology relies on a few well-studied model organisms, such as Escherichia coli and Saccharomyces cerevisiae , for which abundant information and efficient toolkits are available for genetic manipulation, but which lack industrially favorable characteristics. Non-model industrial microorganisms usually do not have effective and/or efficient genome-engineering toolkits, which hampers the development of microbial cell factories to meet the fast-growing bioeconomy. In this study, using the non-model ethanologenic bacterium Zymomonas mobilis as an example, we developed a workflow to mine and temper the elements of restriction–modification (R-M), CRISPR/Cas, toxin–antitoxin (T-A) systems, and native plasmids, which are hidden within industrial microorganisms themselves, as efficient genome-editing toolkits, and established a genome-wide iterative and continuous editing (GW-ICE) system for continuous genome editing with high efficiency. This research not only provides tools and pipelines for engineering the non-model polyploid industrial microorganism Z. mobilis efficiently, but also sets a paradigm to overcome biotechnological limitations in other genetically recalcitrant non-model industrial microorganisms. Graphical abstract [Display omitted] The genome-wide iterative and continuous editing (GW-ICE) system developed in this study for Zymomonas mobilis can be further improved by increasing the efficiencies of homologous recombination and genome editing for multiple and large-fragment deletions and insertions, which can also be quickly applied to other microorganisms by obtaining information about endogenous restriction-modification (R-M), CRISPR/Cas, toxin-antitoxin (T-A) systems, and replication systems by uploading nucleic acid or amino acid sequences to the website http://ZymOmics.cn/CRISPR%5fTA%5fRM. The existence of at least one component of GW-ICE among 51.9% of the 2492 microbial genera, especially industrial microorganisms, such as Escherichia , Bacillus , and Clostridium , suggests that most microorganisms contain the endogenous elements and devices required for the development of GW-ICE. The application of heterologous components to build the GW-ICE system also suggested that this system can be developed for industrial microorganisms lacking potential endogenous components, although the compatibility and adaptability of these components are the basis for realizing this possibility, and high-throughput mutation and screening are required to obtain the desired components. In addition, extensive efforts are still needed to characterize the components and function of GW-ICE system, such as endogenous R-M, CRISPR/Cas, T-A, and replication systems, and novel defense systems, which have not yet been characterized. Moreover, factors affecting the efficiency of GW-ICE, including the efficiencies of homologous recombination and genome editing, and differences in protospacer-adjacent motif (PAM) sequences and donor lengths, should be better characterized. A protein prediction model integrating artificial intelligence (AI) technology and big data analysis could also expand the components for the development of efficient GW-ICE. A 'Swiss Army knife'-like toolkit, genome-wide iterative and continuous editing (GW-ICE) system, for continuous genome editing in Zymomonas mobilis , can be generalized to other recalcitrant microorganisms by mining the genetic components of restriction–modification (R-M), CRISPR/Cas, and toxin–antitoxin (T-A) systems as well as expression systems through the website http://ZymOmics.cn/CRISPR_TA_RM. [ABSTRACT FROM AUTHOR]