Detecting and classifying copy number variation
US-10415089-B2 · Sep 17, 2019 · US
US10482993B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10482993-B2 |
| Application number | US-201615017475-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 5, 2016 |
| Priority date | Jan 19, 2010 |
| Publication date | Nov 19, 2019 |
| Grant date | Nov 19, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The invention provides a method for determining copy number variations (CNV) of a sequence of interest in a test sample that comprises a mixture of nucleic acids that are known or are suspected to differ in the amount of one or more sequence of interest. The method comprises a statistical approach that accounts for accrued variability stemming from process-related, interchromosomal and inter-sequencing variability. The method is applicable to determining CNV of any fetal aneuploidy, and CNVs known or suspected to be associated with a variety of medical conditions. CNV that can be determined according to the method include trisomies and monosomies of any one or more of chromosomes 1-22, X and Y, other chromosomal polysomies, and deletions and/or duplications of segments of any one or more of the chromosomes, which can be detected by sequencing only once the nucleic acids of a test sample.
Opening claim text (preview).
What is claimed is: 1. A method, implemented using a computer system comprising one or more processors and system memory, for identifying the presence of a cancer and/or an increased risk of a cancer in a mammal, said method comprising: (a) providing, to the computer system, at least 10,000 sequence reads of nucleic acids in a test sample from said mammal, wherein said test sample comprises both nucleic acids from cancerous or precancerous cells and nucleic acids from constitutive cells, wherein the sequence reads are provided in an electronic format; (b) aligning, by the one or more processors, the at least 10,000 sequence reads to one or more reference sequences using a computing apparatus and thereby providing sequence tags corresponding to the sequence reads; (c) identifying, by the one or more processors, a number of sequence tags from the nucleic acids for one or more sequences of interest amplifications of which or deletions of which are known to be associated with cancers, wherein said sequences of interest are selected from chromosomes 1-22, X, and Y and segments thereof and identifying, by the one or more processors, a number of sequence tags for at least one normalizing sequence for each of the one or more sequences of interest; (d) calculating, by the one or more processors and using said number of sequence tags identified for each of said one or more sequences of interest and said number of sequence tags identified for each said normalizing sequence, a single sequence dose for each of said one or more sequences of interest, wherein the at least one normalizing sequence is a sequence that yields sequence doses for the sequence of interest having: (a) the smallest variability among unaffected samples, (b) the greatest differentiability between affected and unaffected samples, (c) the smallest variability and the greatest differentiability, or (d) an optimal combination of small variability and large differentiability; (e) evaluating said single sequence dose for each of said one or more sequences of interest, and thereby detecting aneuploidies or copy number variations in said sample; and (f) determining the presence of a cancer and/or an increased risk of a cancer based on the detected aneuploidies or copy number variations. 2. The method of claim 1 , wherein (f) comprises determining the presence of the cancer and/or the increased risk of the cancer based on the presence, an increased number, and/or an increased severity of the aneuploidies or copy number variations. 3. The method of claim 1 , wherein evaluating said single sequence dose comprises comparing the single sequence dose to a threshold value for a sequence of interest. 4. The method of claim 1 , wherein the at least one normalizing sequence is a sequence that was selected for a sequence of interest by (i) identifying a plurality of unaffected samples; and (ii) selecting a normalizing sequence giving sequence doses of unaffected samples having: (a) the smallest variability, (b) the greatest differentiability, (c) the smallest variability and the greatest differentiability, or (d) an optimal combination of small variability and large differentiability. 5. The method of claim 1 , wherein said aneuploidies comprise a whole chromosome gain or a whole chromosome loss as shown in Table 1. 6. The method of claim 1 , wherein said sequence of interest are substantially arm-level segments comprising a p arm or a q arm of any one or more of chromosomes 1-22, X and Y. 7. The method of claim 1 , wherein said sequences of interest are segment that comprise a region and/or a gene shown in Table 3 and/or Table 5 and/or Table 4 and/or Table 6. 8. The method of claim 7 , wherein said aneuploidies comprise an amplification of a region and/or a gene shown in Table 3 and/or Table 5. 9. The method of claim 7 , wherein said aneuploidies comprise a deletion of a region and/or a gene shown in Table 4 and/or Table 6. 10. The method of claim 1 , wherein said sequences of interest contain one or more oncogenes and/or one or more tumor suppressor genes. 11. The method of claim 10 , wherein said aneuploidies comprise an amplification of one or more regions selected from the group consisting of 20Q13, 19q12, 1q21-1q23, 8p11-p12, and the ErbB2. 12. The method of claim 10 , wherein said aneuploidies comprise an amplification of one or more regions comprising a gene selected from the group consisting of MYC, ERBB2 (EFGR), CCND1 (Cyclin D1), FGFR1, FGFR2, HRAS, KRAS, MYB, MDM2, CCNE, KRAS, MET, ERBB1, CDK4, MYCB, ERBB2, AKT2, MDM2 and CDK4. 13. The method of claim 10 , wherein said cancer is a cancer selected from the group consisting of leukemia, ALL, brain cancer, breast cancer, colorectal cancer, dedifferentiated liposarcoma, esophageal adenocarcinoma, esophageal squamous cell cancer, GIST, glioma, HCC, hepatocellular cancer, lung cancer, lung NSC, lung SC, medulloblastoma, melanoma, MPD, myeloproliferative disorder, cervical cancer, ovarian cancer, prostate cancer, and renal cancer. 14. The method of claim 1 , wherein said test sample comprises a sample selected from the group consisting of whole blood, a blood fraction, saliva/oral fluid, urine, a tissue biopsy, pleural fluid, pericardial fluid, cerebral spinal fluid, and peritoneal fluid. 15. The method of claim 1 , wherein a sequence dose for a selected sequence of interest is calculated as a ratio of the number of sequence tags identified for the selected sequence of interest and the number of sequence tags identified for the at least one normalizing sequence for the sequence of interest. 16. The method of claim 1 , wherein at least one normalizing sequence is a sequence selected for an associated sequence of interest by (i) identifying a plurality of qualified samples for the sequence of interest; (ii) repeatedly calculating chromosome doses for the associated sequence of interest using multiple potential normalizing sequences; and (iii) selecting the normalizing sequence alone or in a combination giving the smallest variability and/or greatest differentiability in calculated chromosome doses. 17. The method of claim 1 , further comprising calculating a normalized sequence value (NSV), wherein said NSV relates said sequence dose to the mean of a corresponding sequence dose in a set of qualified samples as: NSV ij = x ij - μ ^ j σ ^ j where {circumflex over (μ)} j and {circumflex over (σ)} j are the estimated mean and standard deviation, respectively, for the j-th sequence dose in a set of qualified samples, and x ij is the sequence dose calculated in (c) for the sequence of interest. 18. The method of claim 1 , wherein the nucleic acids in the test sample comprise cell free DNA. 19. The method of claim 1 , wherein the detected aneuploidies or copy number variations comprises a component in a differential diagnosis for cancer. 20. The method of claim 1 , wherein the detection of aneuploidies or c
Antineoplastic agents · CPC title
ICT specially adapted for sequence analysis involving nucleotides or amino acids · CPC title
Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay (C12Q1/6804 takes precedence) · CPC title
for diseases caused by alterations of genetic material · CPC title
for computer-aided diagnosis, e.g. based on medical expert systems · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.