Methods of lowering the error rate of massively parallel DNA sequencing using duplex consensus sequencing

US11608529B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11608529-B2
Application number: US-202117361245-A
Country: US
Kind code: B2
Filing date: Jun 28, 2021
Priority date: Mar 20, 2012
Publication date: Mar 21, 2023
Grant date: Mar 21, 2023

Abstract

Official abstract text for this publication.

Next-generation DNA sequencing promises to revolutionize clinical medicine and basic research. However, while this technology has the capacity to generate hundreds of billions of nucleotides of DNA sequence in a single experiment, the error rate of approximately 1% results in hundreds of millions of sequencing mistakes. These scattered errors can be tolerated in some applications but become extremely problematic when “deep sequencing” genetically heterogeneous mixtures, such as tumors or mixed microbial populations. To overcome limitations in sequencing accuracy, a method termed Duplex Consensus Sequencing (DCS) is provided. This approach greatly reduces errors by independently tagging and sequencing each of the two strands of a DNA duplex. As the two strands are complementary, true mutations are found at the same position in both strands. In contrast, PCR or sequencing errors will result in errors in only one strand. This method uniquely capitalizes on the redundant information stored in double-stranded DNA, thus overcoming technical limitations of prior methods utilizing data from only one of the two strands.
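
The strand-comparison idea in the abstract can be illustrated with a short Python sketch. It is not the patented procedure, only a toy under stated assumptions: each strand has already been reduced to a single consensus string, both strings cover the same region, and a disagreement is masked with `N`. The function names `reverse_complement` and `duplex_consensus` are hypothetical.

```python
# Toy illustration of duplex comparison (assumed names, not from the patent).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def duplex_consensus(top: str, bottom: str) -> str:
    """Combine the consensus of the top strand with that of the bottom strand.

    `bottom` is given 5'->3' on the bottom strand, so it is first converted
    to top-strand orientation. A base is kept only when both strands agree;
    a disagreement (an error seen on one strand only) is masked as 'N'.
    """
    bottom_as_top = reverse_complement(bottom)
    if len(top) != len(bottom_as_top):
        raise ValueError("strand consensus sequences must have equal length")
    return "".join(t if t == b else "N" for t, b in zip(top, bottom_as_top))
```

With a reference of `ACGTAC`, a G-to-T change present in both strands (`duplex_consensus("ACTTAC", "GTAAGT")`) survives as a candidate mutation, while the same change seen only in the top strand (`duplex_consensus("ACTTAC", "GTACGT")`) is masked at that position.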

First claim

Opening claim text (preview).

What is claimed is:

1. A method of sequencing DNA comprising:
  a) attaching partially single-stranded adapters comprising barcodes selected from a plurality of distinct barcode sequences to double-stranded DNA fragments obtained from a bodily sample, wherein attachment of the adapters to double-stranded DNA fragments generates a library of tagged double-stranded adapter-DNA molecules;
  b) amplifying strands from a plurality of the double-stranded adapter-DNA molecules in the library to produce strand copies;
  c) sequencing a plurality of the strand copies to obtain strand sequence reads comprising one or more barcode sequences and DNA fragment-specific information;
  d) for at least some of the double-stranded adapter-DNA molecules in the library:
    grouping the strand sequence reads into families based on i) the barcode sequence, and ii) DNA fragment-specific information;
    collapsing a plurality of strand sequence reads within the families to provide a consensus sequence for each of the at least some of the double-stranded DNA molecules in the library;
    comparing the consensus sequence to a reference sequence; and
    analyzing one or more correspondences between the consensus sequence and the reference sequence to identify a sequence variation,
  wherein the bodily sample is derived from a human subject having a tumor cell population,
  wherein following step (d), the method further comprises identifying a genetic mutation conferring drug resistance present in one or more of the consensus sequences derived from the double-stranded DNA fragments obtained from the tumor cell population present in the bodily sample,
  wherein the library comprises at least a subset of non-uniquely tagged double-stranded adapter-DNA molecules, and
  wherein non-uniquely tagged double-stranded adapter-DNA molecules are substantially identifiable with respect to other non-uniquely tagged double-stranded adapter-DNA molecules in the bodily sample using the one or more barcode sequences and DNA fragment-specific information.

2. The method of claim 1, further comprising selectively enriching double-stranded adapter-DNA molecules or copies thereof to enrich for a subset of DNA molecules that map to one or more genetic loci in the reference sequence.

3. The method of claim 1, wherein, prior to sequencing, double-stranded adapter-DNA molecules or copies thereof are selectively enriched using a hybridization capture method to provide target DNA molecules that map to one or more genetic loci in the reference sequence.

4. The method of claim 1, wherein the barcode sequences are 6 nucleotides in length.

5. The method of claim 1, wherein the barcode sequences are 3, 4, 5, 6, 7 or 8 nucleotides in length.

6. The method of claim 1, wherein the adapter comprises a hairpin loop with a uracil linker.

7. The method of claim 1, wherein the adapter-DNA molecule comprises a single-stranded 5′ arm and a single-stranded 3′ arm, wherein the single-stranded 5′ arm and the single-stranded 3′ arm of the adapter-DNA molecule are formed by an enzymatic cleavage of a hairpin loop of the adapter at a uracil linker.
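
Step (d) of claim 1 (grouping reads into families, collapsing each family to a consensus, and comparing against a reference) can be sketched in Python as follows. This is a minimal illustration under assumptions that are not taken from the patent: reads are modelled as `(barcode, fragment_key, sequence)` tuples, `fragment_key` stands in for the "DNA fragment-specific information" (for example a mapping position), all reads in a family have equal length, and the `min_fraction` agreement threshold is a hypothetical parameter.

```python
from collections import Counter, defaultdict


def group_into_families(reads):
    """Group read sequences by (barcode, fragment_key).

    Using both keys lets two molecules that share a barcode (non-unique
    tagging) still fall into separate families.
    """
    families = defaultdict(list)
    for barcode, fragment_key, sequence in reads:
        families[(barcode, fragment_key)].append(sequence)
    return dict(families)


def collapse_family(sequences, min_fraction=0.6):
    """Collapse equal-length reads into one consensus by per-position vote.

    A position becomes 'N' when the most common base falls below
    `min_fraction` of the reads (hypothetical threshold).
    """
    consensus = []
    for column in zip(*sequences):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(column) >= min_fraction else "N")
    return "".join(consensus)


def find_variations(consensus, reference):
    """List (position, reference_base, observed_base) for each mismatch."""
    return [
        (i, ref, obs)
        for i, (ref, obs) in enumerate(zip(reference, consensus))
        if obs != "N" and obs != ref
    ]
```

In this sketch, three reads tagged `("AACGTT", 100)` with sequences `ACGTAC`, `ACGTAC`, and `ACGAAC` collapse to `ACGTAC`, so the isolated error in the third read is voted out, whereas a family whose reads all carry `ACTTAC` would be reported by `find_variations` as a G-to-T change at position 2 relative to the reference `ACGTAC`.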

Assignees

Univ Washington Through Its Center For Commercialization

Inventors

Classifications

  • Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay (C12Q1/6804 takes precedence)

  • C12Q1/6869 (Primary): Methods for sequencing

  • C12Q1/6876 (Primary): Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes

  • characterised by the use of the arrayed oligonucleotides as identifier tags, e.g. universal addressable array, anti-tag or tag complement array

  • Massive parallel sequencing

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Univ Washington Through Its Center For Commercialization
What technology area does this patent fall under?
Primary CPC classification C12Q1/6869. Mapped technology areas include Chemistry & Metallurgy.
When was this patent published?
Publication date Mar 21, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).