Compositions and methods for accurately identifying mutations
US-2024409996-A1 · Dec 12, 2024 · US
US11952622B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11952622-B2 |
| Application number | US-201414331714-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 15, 2014 |
| Priority date | Jul 18, 2013 |
| Publication date | Apr 9, 2024 |
| Grant date | Apr 9, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods for analyzing DNA-containing samples are provided. The methods can comprise isolating a single genomic equivalent of DNA from the DNA-containing sample to provide a single isolated DNA molecule. The single isolated DNA molecule can be subjected to amplification conditions in the presence of one or more sets of unique molecularly tagged primers to provide one or more amplicons. Any spurious allelic sequences generated during the amplification process are tagged with an identical molecular tag. The methods can also include a step of determining the sequence of the one or more amplicons, in which the majority sequence for each code is selected as the sequence of the single original encapsulated target. The DNA-containing sample can be a forensic sample (e.g., mixed contributor sample), a fetal genetic screening sample, or a biological cell.
Opening claim text (preview).
That which is claimed: 1. A method of analyzing a single mixed contributor sample containing DNA where the number of contributors is unknown prior to the analysis of the single mixed contributor sample, the single mixed contributor sample comprising loci within the DNA, the method comprising: (i) processing the single mixed contributor sample into a plurality of liquid droplets, wherein each of the plurality of liquid droplets contains no more than one locus-containing DNA molecule per targeted locus, wherein isolating the target DNA molecules with the targeted loci in the droplets prevents extended primers from switching between multiple target DNA molecules with different sequences during PCR, thereby preventing creation of chimeric alleles comprising error-containing content due to recombination of genomic content from a plurality of target DNA molecules; (ii) introducing a plurality of sets of DNA primers into the plurality of liquid droplets, wherein each of the plurality of sets is configured to amplify a different specific locus on a genome, and wherein each of the plurality of sets includes one or more unique molecular tags, and wherein the unique molecular tag(s) of a primer set in one droplet differs from the unique molecular tag(s) of the corresponding primer set in a different droplet; (iii) subjecting the target DNA molecules in the plurality of liquid droplets to an amplification process comprising a PCR amplification process in the presence of the plurality of sets of DNA primers to provide a plurality of amplicons of targeted DNA sequences at a plurality of pre-determined loci including a first locus, each of the sets of DNA primers being configured to incorporate into the respective plurality of amplicons and thereby append a tag to the plurality of respective amplicons to produce a plurality of sets of uniquely tagged amplicons derived from each of the plurality of targeted DNA sequences, wherein each set of the plurality of sets of uniquely tagged amplicons comprises a respective allelic profile including a first allelic profile comprising error-containing amplicons having DNA sequence errors generated by replication errors of the amplification process and error-free amplicons having no DNA sequence errors, the error-containing amplicons and the error-free amplicons having a same tag; (iv) sequencing each of the uniquely tagged amplicons to provide at least a first group of sequences having an identical first tag associated with the first locus and a second group of sequences having an identical second tag associated with the first locus; and (v) selecting a first representative DNA sequence from the first group of sequences as representing a first target DNA molecule from the one or more target DNA sequences and selecting a second representative DNA sequence from the second group of sequences as representing a second target DNA molecule; wherein the first representative DNA sequence is a majority sequence from the first group of sequences and the second representative DNA sequence is a majority sequence from the second group of sequences; and (vi) associating the first representative DNA sequence with a first contributor genotype at the first locus and the second representative DNA sequence with a second contributor genotype at the first locus via performing an Evidence Ratio (ER) analysis derived from Akaike's Information Criterion (AIC) for the first allelic profile that includes the first group of sequences having an identical first tag and the second group of sequences having the identical second tag, the number of DNA contributors at the locus, c, to be inferenced based on the weight of the evidence; wherein the ER analysis derived from the AIC computed for each possible model of the allelic data with ‘n’ contributors determines the strength of evidence for each possible number of contributors, n, and comprises the following: ER c n = ∑ m = 1 r ( n ) w m * ∑ i ≠ n n max ∑ m ′ = 1 r ( i ) w m ′ * wherein ‘n’ represents the number of contributors; w m′ * represents each model's Akaike weight as defined in terms of ratios of the model's likelihood (Lm) given the allelic data, where Lm is proportional to exp (−1/2Δm) , wherein Δm is the difference of the AIC value between model ‘m’ and a model ‘m min ’, in which m min has the lowest AIC value and is not zero; wherein the upper sum in ER c n is over a subset r(n) of all possible models with n contributors, and the lower sum is over all possible models with a different number of contributors n′, where n′ is not equal to n, and n max is the maximum number of contributors considered, with the number of contributors inferenced given by m such that the ER c n value is the maximum over all n max considered. 2. The method of claim 1 , wherein the processing the single sample into a plurality of liquid droplets comprises forming the plurality of liquid droplets via a droplet microfluidic device. 3. The method of claim 2 , wherein a plurality of liquid droplets are used, wherein the plurality of liquid droplets comprises a first group of liquid droplets and a second group of liquid droplets, the first group of liquid droplets being devoid of any DNA molecules and the second group of liquid droplets containing one or more locus-containing DNA molecules, wherein the majority of the second group of liquid droplets contain only one locus-containing DNA molecule per targeted locus. 4. The method of claim 2 , wherein one droplet in every 50 to 150 liquid droplets comprises only one locus-containing DNA molecule per targeted locus. 5. The method of claim 2 , wherein the average diameter of the plurality of liquid droplets comprises from 1 micron to 100 microns. 6. The method of claim 1 , wherein the step of processing the single sample into a plurality of liquid droplets comprises hyper-diluting the single sample and forming the plurality of liquid droplets. 7. The method of claim 6 , wherein hyper-diluting the single sample comprises diluting the s
Methods for sequencing · CPC title
Nucleic acid amplification reactions · CPC title
Microreactors, e.g. emulsion PCR or sequencing, droplet PCR, microcapsules, i.e. non-liquid containers with a range of different permeability's for different reaction components · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.