Non-invasive assessment of chromosome alterations using change in subsequence mappability
US-10438691-B2 · Oct 8, 2019 · US
US11929146B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11929146-B2 |
| Application number | US-201916544316-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 19, 2019 |
| Priority date | Oct 7, 2013 |
| Publication date | Mar 12, 2024 |
| Grant date | Mar 12, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Provided herein are methods, processes, systems, machines and apparatuses for non-invasive assessment of chromosome alterations.
Opening claim text (preview).
What is claimed is: 1. A system comprising memory and one or more microprocessors, which memory comprises instructions and which one or more microprocessors are configured to perform, according to the instructions, a process for determining a presence or absence of one or more chromosome alterations in a test sample of nucleic acids, which process comprises: (a) identifying discordant read pairs from paired-end sequence reads, wherein the paired-end sequence reads are reads of circulating, cell-free nucleic acid from a test subject sample, thereby identifying discordant read mates; (b) characterizing a mappability of a plurality of sequence read subsequences of each discordant read mate aligned to a reference genome, each of which sequence read subsequences of each discordant read mate is of a different length; (c) selecting a subset of the discordant read mates according to a change in the mappability, wherein the subset comprises reads comprising a candidate breakpoint; (d) comparing (i) the number of discordant read mates from the sample associated with the candidate breakpoints and optionally one or more substantially similar breakpoints, to (ii) the number of discordant read mates from a reference associated with the candidate breakpoint and optionally the one or more substantially similar breakpoints, for the discordant read mates in the subset selected in (c), thereby generating a comparison; and (e) determining the presence or absence of one or more chromosome alterations for the sample according to the comparison in (d). 2. The system of claim 1 , wherein the one or more chromosome alterations comprise a chromosome translocation, a chromosome deletion, a chromosome inversion, or a heterologous insertion. 3. The system of claim 1 , comprising determining the position of one or more candidate breakpoints. 4. The system of claim 1 , wherein the characterizing in (b) comprises generating a fitted relationship between the mappability and the length of each of the sequence read subsequences of each discordant read mate. 5. The system of claim 4 , wherein the change in mappability comprise a slope of the fitted relationship. 6. The system of claim 4 , wherein the presence of a balanced translocation and/or an unbalanced translocation is determined in (e). 7. The system of claim 1 , wherein each of the sequence read subsequences of each discordant read mate is shorter than the next largest fragment or the read mate by about 5 bases or less. 8. The system of claim 1 , wherein each of the sequence read subsequences of each discordant read mate is incrementally shorter than the next largest fragment or the read mate. 9. The system of claim 8 , wherein each of the sequence read subsequences of each discordant read mate is incrementally shorter than the next largest fragment or the read mate by about 1 base. 10. The system of claim 1 , wherein the selecting in (c) is according to a mappability threshold. 11. The system of claim 1 , comprising filtering the discordant read mates. 12. The system of claim 11 , wherein the filtering comprises removing one or both of the discordant read mates, or removing one or more singleton events, or both. 13. The system of claim 11 , wherein the filtering is chosen from one or more of (i) removing low quality reads, (ii) removing concordant reads, (iii) removing PCR duplicated reads, (iv) removing reads mapped to mitochondrial DNA, (v) removing reads mapped to repetitive elements, (vi) removing unmappable reads, (vi) removing reads comprising step-wise multiple alignments and (vii) removing reads mapped to a centromere. 14. The system of claim 11 , wherein the filtering comprises removing discordant read mates in instances where the substantially similar breakpoint is present in the reference. 15. The system of claim 1 , wherein the location of the breakpoint is identified at a single base resolution. 16. The system of claim 1 , wherein determining the presence of the one or more chromosome alteration in (e) comprises identifying a substantially greater number of sequence reads from the sample compared to the reference in the comparison of (d). 17. The system of claim 1 , wherein a first break point and a second breakpoint are identified according to the comparison in (d). 18. The system of claim 1 , wherein the presence of the one or more chromosome alteration is identified in (e) according to the first and second breakpoints. 19. The system of claim 1 , wherein the selecting in (c) or the comparing in (d), or the selecting in (c) and the comparing in (d), does not comprise performing a clustering analysis. 20. The system of claim 1 , wherein the comparison in (d) comprises determining a level of confidence. 21. The system of claim 20 , wherein determining the level of confidence comprises determining a p value, a Z-score, or both. 22. The system of claim 1 , which comprises a sequencing machine configured to generate the sequence reads. 23. The system of claim 1 , wherein the memory comprises the sequence reads, the discordant read pairs, the subset of discordant read mates, the change in mappability, the breakpoints, or a combination thereof. 24. The system of claim 1 , wherein the sample nucleic acid is circulating cell-free nucleic acid from a pregnant female bearing a fetus or circulating cell-free nucleic acid from a subject having or suspected of having a cell proliferative disorder. 25. The system of claim 24 , wherein the cell proliferative disorder is cancer. 26. The system of claim 1 , wherein the presence or absence of one or more chromosome alterations is determined for a minority nucleic acid species. 27. The system of claim 26 , wherein the minority nucleic acid species comprises fetal nucleic acid or nucleic acid from cancer cells. 28. A system comprising a sequencing apparatus and one or more computing apparatus, which sequencing apparatus is configured to produce signals corresponding to nucleotide bases of a nucleic acid loaded in the sequencing apparatus, which nucleic acid is circulating cell-free nucleic acid from a test subject sample, or which nucleic acid loaded in the sequencing apparatus is a modified variant of the circulating cell-free nucleic acid; and which one or more computing apparatus comprise memory and one or more processors, which memory comprises instructions executable by the one or more processors and which instructions executable by the one or more processors are configured to: (a) produce paired-end sequence reads from the signals and align the sequence reads to a reference genome; (b) identify discordant read pairs from the paired-end sequence reads, thereby identifying discordant read mates; (c) characterize a mappability of a plurality of sequence read subsequences of each discordant read mate aligned to a reference genome, each of which sequence read subsequences of each discordant read mate is of a different length; (d) select a subset of the discordant read mates according to a change in mappability, wherein the subset comprises reads comprising a candidate breakpoint; (e) compare (i) the number of discordant read mates from the sample associated with the candidate breakpoints and optionally one or more substantially similar breakpoints, to (ii) the number of discordant read mates from a reference associated with the candidate breakpoint and optionally the one or more substant
Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection · CPC title
ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations · CPC title
Ploidy or copy number detection · CPC title
ICT specially adapted for sequence analysis involving nucleotides or amino acids · CPC title
Sequence alignment; Homology search · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.