Genome sharing
US-2024406179-A1 · Dec 5, 2024 · US
US11515003B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11515003-B2 |
| Application number | US-201715661942-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 27, 2017 |
| Priority date | Jul 27, 2016 |
| Publication date | Nov 29, 2022 |
| Grant date | Nov 29, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Technology provided herein relates in part to methods, processes, machines and apparatuses for non-invasive assessment of genomic nucleic acid instability and genomic nucleic acid stability. The method comprises providing a set of genomic portions each coupled to a copy number alteration quantification for a test sample, wherein the genomic portions comprises portions of a reference genome to which sequence reads obtained for nucleic acid from a test sample obtained from the subject have been mapped, and the copy number alteration quantification coupled to each genomic portion has been determined from a quantification of sequence reads mapped to the genomic portion; and determining, by a computing device, presence or absence of genomic instability for the subject according to the copy number alteration quantifications coupled to the genomic portions.
Opening claim text (preview).
What is claimed is: 1. A method of determining presence or absence of genomic instability for a subject, comprising: i) mapping thousands to millions of sequence reads obtained from a nucleic acid sample obtained from the subject to a set of genomic portions of a reference genome, ii) determining a copy number alteration quantification based on a quantification of sequence reads mapped to each genomic portion of the set; iii) filtering, by a computing device, from the set of genomic portions: (a) a first subset of genomic portions associated with copy number alterations presented in a reference set of samples, (b) a second subset of genomic portions that are not representative portions derived from groups of portions identified by a clustering process applied to the reference set of samples, or both (a) and (b), thereby generating a set of filtered genomic portions coupled to copy number alteration quantifications, wherein the set of filtered genomic portions do not contain the first subset of portions, do not contain the second subset of portions, or contain neither the first subset nor the second subset of genomic portions; and iv) determining, by a computing device, the presence or absence of genomic instability for the subject according to the copy number alteration quantifications coupled to the filtered genomic portions from the nucleic acid sample from the subject. 2. The method of claim 1 , wherein step i) further comprises: obtaining the thousands to millions of sequence reads for the set of genomic portions of the nucleic acid sample from the subject, and generating the copy number alteration quantification for each genomic portion by quantifying the number of sequence reads mapped to the genomic portion in the nucleic acid sample. 3. The method of claim 1 , wherein the clustering process comprises: generating, by the computing device, a matrix comprising the genomic portions and identifying information of the reference set of samples as rows and columns of the matrix; generating, by the computing device, pair-wise correlation values from the matrix, wherein there is one correlation value for each possible pair of the portions; and clustering, by the computing device, the portions into groups according to the correlation values. 4. The method of claim 1 , further comprising adjusting the quantification of the sequence reads by a guanine-cytosine (GC) normalizing process that generates a GC normalized quantification of sequence reads for each of the genomic portions, thereby providing an adjusted quantification of sequence reads. 5. The method of claim 4 , wherein the method further comprises modifying the adjusted quantification of sequence reads according to a baseline adjusted quantification of sequence reads for each of the genomic portions wherein said modification produces a modified quantification of sequence reads for each of the genomic portions, or wherein the method further comprises subtracting a baseline adjusted quantification of sequence reads from the adjusted quantification of sequence reads for each of the genomic portions, wherein said subtracting produces a subtracted quantification of sequence reads for each of the genomic portions. 6. The method of claim 5 , further comprising determining, by the computing device, the absolute value of the modified quantification of sequence reads or the absolute value of the subtracted quantification of sequence reads-for each of the genomic portions. 7. The method of claim 5 , wherein the baseline is (i) an average of the adjusted quantification of sequence reads for each of the genomic portions, (ii) a centered value of the adjusted quantification of sequence reads for each of the genomic portions, or (iii) a representative value of the adjusted quantification of sequence reads for each of the genomic portions. 8. The method of claim 7 , further comprising scaling, by the computing device, the adjusted quantifications of sequence reads for the genomic portions to the average, centered value or representative value. 9. The method of claim 5 , wherein the method comprises summing the modified quantification of sequence reads, the absolute values of the modified quantification of sequence reads, subtracted quantification of sequence reads or absolute values of the subtracted quantification of sequence reads for the genomic portions, thereby providing a genomic instability number. 10. The method of claim 9 , wherein the genomic instability number is generated for the subject at two or more time points during a treatment, and wherein the subject is identified as a responder to the treatment if after a time point of about 6 weeks to about 8 weeks of treatment the genomic instability number is less than a threshold value, or wherein the subject is identified as a non-responder to the treatment if after the time point of treatment the genomic instability number is greater than the threshold value. 11. The method of claim 4 , further comprising generating, by the computing device, a genomic instability classification for the nucleic acid sample according to the adjusted quantification of sequence reads for the genomic portions. 12. The method of claim 1 , wherein the determining the presence or absence of genomic instability further comprises: transforming the set of filtered genomic portions each coupled with a copy number alteration quantification to a reduced set of dimensions, wherein the transforming comprises performing a principal component transformation of the set of genomic portions which transformation yields principal components for the nucleic acid sample in a principal component space, wherein the principal component space comprises a common principal component origin shared by the genomic portions of the reference set of samples, each coupled with a copy number alteration, and determining a distance between the principal components of the nucleic acid sample and the common principal component origin. 13. The method of claim 12 , wherein the distance is a normalized distance generated according to a variance of the principal components over a set of training samples, and wherein the method further comprises determining the presence or absence of genomic instability based on the normalized distance. 14. The method of claim 12 , wherein the nucleic acid sample is determined as comprising a cancer when the distance of the sample from the common principal component origin shared by a plurality of samples in the principal component space is greater than a predetermined threshold. 15. The method of claim 14 , wherein the distance is a Mahalanobis distance and the predetermined threshold is 300. 16. The method of claim 1 , wherein the copy number alteration quantification coupled to each of the genomic portions for the nucleic acid sample is obtained by a segmentation process. 17. The method of claim 16 , wherein the segmentation process comprises a circular binary segmentation (CBS) process. 18. The method of claim 16 , wherein each copy number alteration quantification is a z-score. 19. The method of claim 18 , wherein the z-score for each genomic portion is the z-score for a segment identified by the segmentation process, wherein the segment includes the genomic portion. 20. The method of claim 19 , wherein the z-score is determined according to: z-score=(Sscr−Smcr)/MAD, wherein: the Sscr is a total normalized counts in the segment divided by a total normalized autosome counts for the nucleic acid sample; the Smcr is a median value of coun
ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations · CPC title
Ploidy or copy number detection · CPC title
Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection · CPC title
ICT specially adapted for evolutionary bioinformatics, e.g. phylogenetic tree construction or analysis · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.