Methods of lowering the error rate of massively parallel DNA sequencing using duplex consensus sequencing
US-9752188-B2 · Sep 5, 2017 · US
US11608529B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11608529-B2 |
| Application number | US-202117361245-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 28, 2021 |
| Priority date | Mar 20, 2012 |
| Publication date | Mar 21, 2023 |
| Grant date | Mar 21, 2023 |
Official abstract text for this publication.
Next Generation DNA sequencing promises to revolutionize clinical medicine and basic research. However, while this technology has the capacity to generate hundreds of billions of nucleotides of DNA sequence in a single experiment, the error rate of approximately 1% results in hundreds of millions of sequencing mistakes. These scattered errors can be tolerated in some applications but become extremely problematic when “deep sequencing” genetically heterogeneous mixtures, such as tumors or mixed microbial populations. To overcome limitations in sequencing accuracy, a method termed Duplex Consensus Sequencing (DCS) is provided. This approach greatly reduces errors by independently tagging and sequencing each of the two strands of a DNA duplex. As the two strands are complementary, true mutations are found at the same position in both strands. In contrast, PCR or sequencing errors will result in errors in only one strand. This method uniquely capitalizes on the redundant information stored in double-stranded DNA, thus overcoming technical limitations of prior methods utilizing data from only one of the two strands.
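The strand-agreement idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the use of `N` to mask disagreeing positions, and the assumption that each strand has already been reduced to a single consensus string (with the bottom strand written 5′ to 3′) are all illustrative choices.

```python
# Hypothetical helper names; the patent text does not define this code.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def duplex_consensus(top_consensus, bottom_consensus):
    """Keep a base only where both strands of the duplex agree.

    Assumption: bottom_consensus is written 5'->3' on the opposite strand,
    so it is reverse-complemented before comparison. Positions where the
    strands disagree are masked with 'N'.
    """
    bottom_as_top = reverse_complement(bottom_consensus)
    if len(top_consensus) != len(bottom_as_top):
        raise ValueError("strand consensus sequences must have equal length")
    return "".join(
        a if a == b else "N" for a, b in zip(top_consensus, bottom_as_top)
    )


# A change present on both strands survives as a candidate true mutation.
both_strands = duplex_consensus("ACGATAGC", "GCTATCGT")

# A change seen on one strand only is masked as a likely PCR/sequencing error.
one_strand = duplex_consensus("ACGATAGC", "GCTAACGT")
```

In this toy input, `both_strands` keeps the `A` at index 3 because the complementary `T` appears on the other strand, while `one_strand` masks that position.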
Opening claim text (preview).
What is claimed is:
1. A method of sequencing DNA comprising:
a) attaching partially single-stranded adapters comprising barcodes selected from a plurality of distinct barcode sequences to double-stranded DNA fragments obtained from a bodily sample, wherein attachment of the adapters to double-stranded DNA fragments generates a library of tagged double-stranded adapter-DNA molecules;
b) amplifying strands from a plurality of the double-stranded adapter-DNA molecules in the library to produce strand copies;
c) sequencing a plurality of the strand copies to obtain strand sequence reads comprising one or more barcode sequences and DNA fragment-specific information;
d) for at least some of the double-stranded adapter-DNA molecules in the library:
grouping the strand sequence reads into families based on i) the barcode sequence, and ii) DNA fragment-specific information;
collapsing a plurality of strand sequence reads within the families to provide a consensus sequence for each of the at least some of the double-stranded DNA molecules in the library;
comparing the consensus sequence to a reference sequence; and
analyzing one or more correspondences between the consensus sequence and the reference sequence to identify a sequence variation,
wherein the bodily sample is derived from a human subject having a tumor cell population,
wherein following step (d), the method further comprises identifying a genetic mutation conferring drug resistance present in one or more of the consensus sequences derived from the double-stranded DNA fragments obtained from the tumor cell population present in the bodily sample,
wherein the library comprises at least a subset of non-uniquely tagged double-stranded adapter-DNA molecules, and
wherein non-uniquely tagged double-stranded adapter-DNA molecules are substantially identifiable with respect to other non-uniquely tagged double-stranded adapter-DNA molecules in the bodily sample using the one or more barcode sequences and DNA fragment-specific information.
2. The method of claim 1, further comprising selectively enriching double-stranded adapter-DNA molecules or copies thereof to enrich for a subset of DNA molecules that map to one or more genetic loci in the reference sequence.
3. The method of claim 1, wherein, prior to sequencing, double-stranded adapter-DNA molecules or copies thereof are selectively enriched using a hybridization capture method to provide target DNA molecules that map to one or more genetic loci in the reference sequence.
4. The method of claim 1, wherein the barcode sequences are 6 nucleotides in length.
5. The method of claim 1, wherein the barcode sequences are 3, 4, 5, 6, 7 or 8 nucleotides in length.
6. The method of claim 1, wherein the adapter comprises a hairpin loop with a uracil linker.
7. The method of claim 1, wherein the adapter-DNA molecule comprises a single-stranded 5′ arm and a single-stranded 3′ arm, wherein the single-stranded 5′ arm and the single-stranded 3′ arm of the adapter-DNA molecule are formed by an enzymatic cleavage of a hairpin loop of the adapter at a uracil linker.
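The grouping, collapsing, and comparing operations of step (d) in claim 1 can be sketched in Python as follows. This is an illustration under stated assumptions rather than the claimed method itself: the function names, the 0.6 majority threshold, the use of a mapping coordinate as a stand-in for "DNA fragment-specific information", and the toy reads are all hypothetical, and the reads are assumed to be pre-aligned and of equal length.

```python
from collections import Counter, defaultdict


def group_into_families(reads):
    """Group reads by (barcode, fragment-specific information).

    Each read is a (barcode, fragment_info, sequence) tuple. fragment_info
    is a hypothetical stand-in, here a mapping coordinate.
    """
    families = defaultdict(list)
    for barcode, fragment_info, sequence in reads:
        families[(barcode, fragment_info)].append(sequence)
    return dict(families)


def collapse(sequences, min_fraction=0.6):
    """Collapse equal-length reads into one consensus by majority vote.

    A position becomes 'N' when no base reaches min_fraction of the reads.
    The threshold value is an assumption, not taken from the claims.
    """
    consensus = []
    for column in zip(*sequences):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(column) >= min_fraction else "N")
    return "".join(consensus)


def find_variations(consensus, reference):
    """List (position, reference_base, consensus_base) mismatches, skipping 'N'."""
    return [
        (i, ref, obs)
        for i, (ref, obs) in enumerate(zip(reference, consensus))
        if obs != "N" and obs != ref
    ]


# Toy data: two families that share a coordinate but differ in barcode.
reads = [
    ("ACGTAC", 100, "ACGATAGC"),
    ("ACGTAC", 100, "ACGATAGC"),
    ("ACGTAC", 100, "ACGATCGC"),  # isolated error at index 5
    ("TTGCAA", 100, "ACGTTAGC"),
    ("TTGCAA", 100, "ACGTTAGC"),
]
reference = "ACGTTAGC"

families = group_into_families(reads)
variations = {
    key: find_variations(collapse(seqs), reference)
    for key, seqs in families.items()
}
```

Keying families on both the barcode and the fragment-specific value reflects the claim's point that non-uniquely tagged molecules can still be told apart: two molecules sharing a barcode would fall into different families if their fragment-specific information differs.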
Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay (C12Q1/6804 takes precedence) · CPC title
Methods for sequencing · CPC title
Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes · CPC title
characterised by the use of the arrayed oligonucleotides as identifier tags, e.g. universal addressable array, anti-tag or tag complement array · CPC title
Massive parallel sequencing · CPC title