Method of establishing cancer screening module, using method and platform thereof
US-2024402147-A1 · Dec 5, 2024 · US
US2021027862A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2021027862-A1 |
| Application number | US-201917043551-A |
| Country | US |
| Kind code | A1 |
| Filing date | Apr 1, 2019 |
| Priority date | Mar 30, 2018 |
| Publication date | Jan 28, 2021 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Characteristics of molecules and/or biomolecular complexes may be predicted using differential geometry based methods in combination with trained machine learning models. Element specific and element interactive manifolds may be constructed using element interactive number density and/or element interactive charge density to represent the atoms or the charges in selected element sets. Feature data may include element interactive curvatures of various types derived from element specific and element interactive manifolds at various scales. Element interactive curvatures computed from various element interactive manifolds may be input to trained machine learning models, which may be derived from corresponding machine learning algorithms. These machine learning models may be trained to predict characteristics such as protein-protein or protein-ligand/protein/nucleic acid binding affinity, toxicity endpoints, free energy changes upon mutation, protein flexibility/rigidity/allosterism, membrane/globular protein mutation impacts, plasma protein binding, partition coefficient, permeability, clearance, and/or aqueous solubility, among others.
Opening claim text (preview).
1 . A system comprising: a non-transitory computer-readable memory; and a processor configured to execute instructions stored on the non-transitory computer-readable memory which, when executed, cause the processor to: identify a set of compounds based on one or more of a defined target clinical application, a set of desired characteristics, and a defined class of compounds; pre-process each compound of the set of compounds to generate respective sets of feature data; process the sets of feature data with one or more trained machine learning models to produce predicted characteristic values for each compound of the set of compounds for each of the set of desired characteristics, wherein the one or more trained machine learning models are selected based on at least the set of desired characteristics, wherein the sets of feature data comprise a first set of feature data comprising one or more element interactive curvatures; identify a subset of the set of compounds based on the predicted characteristic values; and display an ordered list of the subset of the set of compounds via an electronic display. 2 . The system of claim 1 , wherein the instructions, when executed, further cause the processor to: assign rankings to each compound of the set of compounds for each characteristic of the set of desired characteristics, wherein assigning a ranking to a given compound of the set of compounds for a given characteristic of the set of desired characteristics comprises: comparing a first predicted characteristic value of the predicted characteristic values corresponding to the given compound to other predicted characteristic values of other compounds of the set of compounds, wherein the ordered list is ordered according to the assigned rankings. 3 . The system of claim 1 , wherein the set of compounds includes protein-ligand complexes, and wherein the instructions, when executed, further cause the processor to, for a first protein-ligand complex of the protein-ligand complexes: determine an element interactive density for the first protein-ligand complex; identify a family of interactive manifolds for the first protein-ligand complex; determine an element interactive curvature based on the element interactive density; and generate a set of feature vectors based on the element interactive curvature, wherein the first set of feature data includes the set of feature vectors, wherein the one or more element interactive curvatures comprise the element interactive curvature, wherein the set of desired characteristics comprises protein binding affinity, wherein the one or more trained machine learning models comprise a machine learning model that is trained to predict protein binding affinity values based on the set of feature vectors, and wherein the predicted characteristic values comprise the predicted protein binding affinity values. 4 . The system of claim 1 , wherein the instructions, when executed, further cause the processor to: determine an element interactive density for a first compound of the set of compounds; identify a family of interactive manifolds for the first compound; determine an element interactive curvature based on the element interactive density; and generate a set of feature vectors based on the element interactive curvature, wherein the first set of feature data includes the set of feature vectors, wherein the one or more element interactive curvatures comprise the element interactive curvature, wherein the set of desired characteristics comprises one or more toxicity endpoints, wherein the one or more trained machine learning models comprise a machine learning model that is trained to output predicted toxicity endpoints values corresponding to the one or more toxicity endpoints based on the set of feature vectors, and wherein the predicted characteristic values comprise the predicted toxicity endpoint values. 5 . The system of claim 1 , wherein the instructions, when executed, further cause the processor to: determine an element interactive density for a first compound of the set of compounds; identify a family of interactive manifolds for the first compound; determine an element interactive curvature based on the element interactive density; and generate a set of feature vectors based on the element interactive curvature, wherein the one or more element interactive curvatures comprise the element interactive curvature, wherein the first set of feature data includes the set of feature vectors, wherein the set of desired characteristics comprises solvation free energy, wherein the one or more trained machine learning models comprise a machine learning model that is trained to output predicted solvation free energy values corresponding to a solvation free energy of the first compound based on the set of feature vectors, and wherein the predicted characteristic values comprise the predicted solvation free energy values. 6 . The system of claim 1 , wherein the one or more trained machine learning models are selected from a database of trained machine learning models, and wherein the one or more trained machine learning models comprises at least one trained machine learning model corresponding to a machine learning algorithm selected from the group comprising: a gradient boosted regression trees algorithm, a deep neural network, and a convolutional neural network. 7 . The system of claim 1 , wherein the one or more element interactive curvatures comprise at least one element interactive curvature selected from the group comprising: a Gaussian curvature, a mean curvature, a minimum curvature, and a maximum curvature. 8 . A method comprising: with a processor, identifying a set of compounds based on one or more of a defined target clinical application, a set of desired characteristics, and a defined class of compounds; with the processor, pre-processing each compound of the set of compounds to generate respective sets of feature data; with the processor, processing the sets of feature data with one or more trained machine learning models to produce predicted characteristic values for each compound of the set of compounds for each of the set of desired characteristics, wherein the one or more trained machine learning models are selected from a database of trained machine learning models based on at least the set of desired characteristics, wherein the sets of feature data comprise a first set of feature data comprising one or more element interactive curvatures; with the processor, identifying a subset of the set of compounds based on the predicted characteristic values; and with the processor, causing an ordered list of the subset of the set of compounds to be displayed via an electronic display. 9 . The method of claim 8 , further comprising: with the processor, assigning rankings to each compound of the set of compounds for each characteristic of the set of desired characteristics, wherein assigning a ranking to a given compound of the set of compounds for a given characteristic of the set of desired characteristics comprises: with the processor, comparing a first predicted characteristic value of the predicted characteristic values corresponding to the given compound to other predicted characteristic values of other compounds of the set of compounds, wherein the ordered list is ordered according to the assigned rankings. 10 . The method of claim 8 , wherein the set of compounds includes protein-ligand complexes, and wherein pre-processing each compound of the set of compounds to generate respective sets of feature data comprises: with the processor, determining an element interactive density for a first protein-ligand complex of the protein-ligand complexes; with the proc
Probabilistic graphical models, e.g. probabilistic networks · CPC title
Combinations of networks · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Supervised learning · CPC title
Supervised data analysis · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.