High-throughput methodology for identifying rna-protein interactions transcriptome-wide
US-2015355173-A1 · Dec 10, 2015 · US
US9886545B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9886545-B2 |
| Application number | US-201213411460-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 2, 2012 |
| Priority date | Mar 2, 2011 |
| Publication date | Feb 6, 2018 |
| Grant date | Feb 6, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Described herein is a discovery Platform Technology for analyzing a biological system or process (e.g., a disease condition, such as cancer) via model building.
Opening claim text (preview).
We claim: 1. A method for identifying a modulator of a disease process, said method comprising: (1) obtaining a first data set representing measured expression levels of a plurality of genes in an in vitro culture of disease related cells; (2) obtaining a first control data set representing measured expression levels of a plurality of genes in an in vitro culture of control cells; (3) obtaining a second data set representing a measured functional activity or a measured cellular response of the in vitro culture of disease related cells; (4) obtaining a second control data set representing a measured functional activity or a measured cellular response of the in vitro culture of control cells; (5) generating a computer-implemented first causal relationship network model relating the expression levels of the plurality of genes and the functional activity or cellular response based solely on the first data set and the second data set using a programmed computing system including storage holding network model building code and a plurality of processors configured to execute the network model building code, wherein generating the computer-implemented first causal relationship network model comprises: (i) creating a list of network fragments, each network fragment including a plurality of variables connected by one or more relationships, and determining a probabilistic score associated with each network fragment based on the first data set and/or the second data set, wherein the variables correspond to the expression levels of the plurality of genes and the functional activity or cellular response in the in vitro culture of the disease related cells; (ii) creating an ensemble of trial networks, each trial network including a different subset of the list of network fragments; and (iii) globally optimizing the ensemble of trial networks by evolving the trial networks in parallel using the plurality of processors, wherein one or more first processors in the plurality of processors used to evolve a first trial network are different from one or more second processors in the plurality of processors used to evolve a second trial network, and wherein evolving a trial network includes adding a network fragment from the list to the trial network or replacing a network fragment in the trial network with a network fragment from the list and determining whether the addition or replacement improves a total probabilistic score for the trial network; wherein the generation of the first causal relationship network model is not based on any known biological relationships other than the first data set and the second data set; (6) generating a computer-implemented second causal relationship network model relating the expression levels of the plurality of genes and the functional activity or cellular response of the control cells based solely on the first control data set and the second control data set using the programmed computing system, wherein generating the computer-implemented second causal relationship network model comprises: (i) creating a second list of network fragments, each network fragment including a plurality of variables connected by one or more relationships, and determining a probabilistic score associated with each network fragment based on the first control data set and the second control data set, wherein the variables correspond to the expression levels of the plurality of genes and the functional activity or cellular response in the in vitro culture of control cells; (ii) creating a second ensemble of trial networks, each trial network constructed from a different subset of the second list of network fragments; and (iii) globally optimizing the second ensemble of trial networks by evolving the trial networks in parallel using the plurality of processors, wherein one or more first processors in the plurality of processors used to evolve a first trial network are different from one or more second processors in the plurality of processors used to evolve a second trial network, wherein evolving a trial network includes adding a network fragment from the second list to the trial network or replacing a network fragment in the trial network with a network fragment from the second list and determining whether the addition or replacement improves a total probabilistic score for the trial network; (7) generating a computer-implemented differential causal relationship network from the first causal relationship network model and the second causal relationship network model using a computing device by steps including: i) for each relationship between two nodes in a selected one of the first causal relationship network model and the second causal relationship network model, determining if the other causal relationship network model includes a relationship between the same two nodes, and, where the other causal relationship network model includes a relationship between the same two nodes, determining if the relationship between the same two nodes in the other causal relationship network model has at least one significantly different parameter than that of the relationship in the selected causal relationship network model; and ii) forming the differential causal relationship network by including the relationships in the selected causal relationship network model that are absent from the other causal relationship network model and including the relationships in the selected causal relationship network model that have at least one significantly different parameter in the other causal relationship network model; and (8) identifying a causal relationship unique in the disease process from the generated differential causal relationship network, wherein a gene associated with the unique causal relationship is identified as a modulator of the disease process. 2. The method of claim 1 , wherein the disease process is cancer, diabetes, obesity or cardiovascular disease. 3. The method of claim 2 , wherein the cancer is lung cancer, breast cancer, prostate cancer, melanoma, squamous cell carcinoma, colorectal cancer, pancreatic cancer, thyroid cancer, endometrial cancer, bladder cancer, kidney cancer, a solid tumor, leukemia, non-Hodgkin lymphoma, or a drug-resistant cancer. 4. The method of claim 1 , wherein the modulator stimulates or promotes the disease process. 5. The method of claim 1 , wherein the modulator inhibits the disease process. 6. The method of claim 5 , wherein the modulator shifts an energy metabolic pathway specifically in disease cells from a glycolytic pathway towards an oxidative phosphorylation pathway. 7. The method of claim 1 , wherein the disease related cells in the in vitro culture of disease related cells were subjected to an environmental perturbation prior to measurements of expression levels for the first data set, and the control cells in the in vitro culture of control cells were not subjected to the environmental perturbation prior to measurements of expression levels for the first control data set. 8. The method of claim 7 , wherein the environmental perturbation comprised one or more of a contact with an agent, a change in culture condition, an introduced genetic modification or mutation, and a vehicle that caused a genetic modification or mutation. 9. The method of claim 1 , wherein the characteristic aspect of the disease process comprises a hypoxia condition, a hyperglycemic condition, a lactic acid rich culture condition, or combinations thereof. 10. The method of claim 1 , wherein the first data set and the first control data set comprise protein and/or mRNA expression levels of the plurality of genes. 11. The method of claim 1 , wherein the first data set and the first control d
Antineoplastic agents · CPC title
Gene or protein expression profiling; Expression-ratio estimation or normalisation · CPC title
Probabilistic models · CPC title
Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.