Microbial strain improvement by a HTP genomic engineering platform

US10336998B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10336998-B2
Application numberUS-201815923527-A
CountryUS
Kind codeB2
Filing dateMar 16, 2018
Priority dateDec 7, 2015
Publication dateJul 2, 2019
Grant dateJul 2, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a HTP microbial genomic engineering platform that is computationally driven and integrates molecular biology, automation, and advanced machine learning protocols. This integrative platform utilizes a suite of HTP molecular tool sets to create HTP genetic design libraries, which are derived from, inter alia, scientific insight and iterative pattern recognition. The HTP genomic engineering platform described herein is microbial strain host agnostic and therefore can be implemented across taxa. Furthermore, the disclosed platform can be implemented to modulate or improve any microbial host parameter of interest.

First claim

Opening claim text (preview).

What is claimed is: 1. A high-throughput method for engineering a host cell to have improved phenotypic performance, comprising: a. accessing a training data set containing one or more genetic alteration input variables and one or more measured phenotypic performance output variables, i. wherein the one or more genetic alteration input variables represent one or more genetic alterations that have been introduced into a host cell, and ii. wherein the one or more measured phenotypic performance output variables represent one or more phenotypic performance measurements that are associated with the one or more introduced genetic alterations; b. developing a predictive machine learning model that is populated with the training data set; c. generating, in silico, a pool of design candidate host cells incorporating the one or more genetic alterations; d. utilizing the predictive machine learning model to predict the expected phenotypic performance of each member of the pool of design candidate host cells, i. wherein at least one design candidate host cell comprises a consolidated combination of genetic alterations from among the genetic alterations of step (a), in a genomic sequence, said combination being uncharacterized for improving the phenotypic performance at the time that step (d) is carried out; ii. wherein the expected phenotypic performance predicted by the machine learning model is based upon the introduced genetic alterations and their associated phenotypic performance measurements of step (a); e. selecting a subset of the design candidate host cells based upon their predicted phenotypic performance; f. manufacturing host cells from the subset of the design candidate host cells to thereby create engineered host cells; g. measuring, in an in vitro assay, the phenotypic performance of the engineered host cells; and h. adding to the training data set of (a): i. one or more genetic alteration input variables representing one or more genetic alterations that were introduced into the engineered host cells, and ii. one or more measured phenotypic performance output variables representing the phenotypic performance measurements of the engineered host cells. 2. The method of claim 1 , wherein (a)-(h) are repeated until an engineered host cell exhibits a desired level of improved phenotypic performance. 3. The method of claim 1 , wherein the predictive machine learning model incorporates epistatic effects. 4. The method of claim 1 , wherein the predictive machine learning model incorporates at least one of the following: linear regression, kernel ridge regression, logistic regression, neural networks, support vector machines (SVMs), decision trees, hidden Markov models, Bayesian networks, a Gram-Schmidt process, reinforcement-based learning, cluster-based learning, hierarchical clustering, genetic algorithms, and combinations thereof. 5. The method of claim 1 , wherein the predictive machine learning model is supervised, semi-supervised, or unsupervised. 6. The method of claim 1 , wherein the one or more genetic alterations comprise at least one genetic alteration selected from the group consisting of: a single nucleotide polymorphism, nucleotide sequence insertion, nucleotide sequence deletion, and nucleotide sequence replacements. 7. The method of claim 1 , wherein the one or more genetic alterations comprise one or more heterologous promoters from a promoter ladder operably linked to an endogenous target gene. 8. The method of claim 1 , wherein the improved phenotypic performance is increased or more efficient production of a product of interest, said product of interest selected from the group consisting of: a small molecule, enzyme, protein, peptide, amino acid, organic acid, synthetic compound, fuel, alcohol, primary extracellular metabolite, secondary extracellular metabolite, intracellular component molecule, and combinations thereof. 9. A high-throughput method for engineering a microbial strain to have improved phenotypic performance, comprising: a. accessing a training data set containing one or more genetic alteration input variables and one or more measured phenotypic performance output variables, i. wherein the one or more genetic alteration input variables represent one or more genetic alterations that have been introduced into a microbial strain, and ii. wherein the one or more measured phenotypic performance output variables represent one or more phenotypic performance measurements that are associated with the one or more introduced genetic alterations; b. developing a predictive machine learning model that is populated with the training data set; c. generating, in silico, a pool of design candidate microbial strains incorporating the one or more genetic alterations; d. utilizing the predictive machine learning model to predict the expected phenotypic performance of each member of the pool of design candidate microbial strains, i. wherein at least one design candidate microbial strain comprises a consolidated combination of genetic alterations from among the genetic alterations of step (a), in a genomic sequence, said combination being uncharacterized for improving the phenotypic performance at the time that step (d) is carried out; ii. wherein the expected phenotypic performance predicted by the machine learning model is based upon the introduced genetic alterations and their associated phenotypic performance measurements of step (a); e. selecting a subset of the design candidate microbial strains based upon their predicted phenotypic performance; f. manufacturing microbial strains from the subset of design candidate microbial strains to thereby create engineered microbial strains; g. measuring, in an in vitro assay, the phenotypic performance of the engineered microbial strains; h. selecting a subset of the engineered microbial strains based upon their measured phenotypic performance; i. adding to the training data set of (a): i. one or more genetic alteration input variables representing one or more genetic alterations that were introduced into the subset of the engineered microbial strains, and ii. one or more measured phenotypic performance output variables representing the phenotypic performance measurements of the subset of the engineered microbial strains; and j. repeating steps (a)-(i) until an engineered microbial strain exhibits a desired level of improved phenotypic performance. 10. The method of claim 9 , wherein the improved phenotypic performance is increased or more efficient production of a product of interest, said product of interest selected from the group consisting of: a small molecule, enzyme, protein, peptide, amino acid, organic acid, synthetic compound, fuel, alcohol, primary extracellular metabolite, secondary extracellular metabolite, intracellular component molecule, and combinations thereof. 11. The method of claim 9 , wherein the improved phenotypic performance is increased or more efficient production of lysine or citric acid. 12. A high-throughput method for engineering a host cell to have improved phenotypic performance, comprising: a. accessing a training data set containing one or more genetic alteration input variables and one or more measured phenotypic performance output variables, i. wherein the one or more genetic alteration input variables represent one or more genetic alterations that have been introduced into a host cell through application of one or more libraries, and ii. wherein the one or more measured phenotypic performance output variables represent one or more phenotypic performance measurements that are associated with the introduced genetic alterations; b. developing a pr

Assignees

Inventors

Classifications

  • Mutagenesis · CPC title

  • Screening of libraries · CPC title

  • Design of libraries · CPC title

  • Screening libraries by altering the phenotype or phenotypic trait of the host (reporter assays C12N15/1086) · CPC title

  • for fungi · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10336998B2 cover?
The present disclosure provides a HTP microbial genomic engineering platform that is computationally driven and integrates molecular biology, automation, and advanced machine learning protocols. This integrative platform utilizes a suite of HTP molecular tool sets to create HTP genetic design libraries, which are derived from, inter alia, scientific insight and iterative pattern recognition. Th…
Who is the assignee on this patent?
Zymergen Inc
What technology area does this patent fall under?
Primary CPC classification C12N15/1058. Mapped technology areas include Chemistry & Metallurgy.
When was this patent published?
Publication date Tue Jul 02 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).