Accurate molecular barcoding

US10822643B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10822643-B2
Application numberUS-201715581914-A
CountryUS
Kind codeB2
Filing dateApr 28, 2017
Priority dateMay 2, 2016
Publication dateNov 3, 2020
Grant dateNov 3, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In accordance with some embodiments herein, compositions and methods for accurate barcoding of nucleic acids are described. The compositions and methods can involve a plurality of unique oligonucleotide species comprising unique molecule barcodes. In some embodiments, the molecule barcodes can have a relatively low G content, and can exhibit reduced bias in amplification and analysis.

First claim

Opening claim text (preview).

What is claimed is: 1. A reverse transcription reaction composition comprising; mRNA; a reverse transcriptase; and at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more: and/or (b) the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%, and wherein the barcode region further comprises a sample barcode comprising at least 3 nucleotides. 2. The composition of claim 1 , wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more. 3. The composition of claim 1 , wherein the constraint on G content is the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%. 4. The composition of claim 1 , wherein the unique oligonucleotide species are disposed in at least two spatially isolated pools, each pool comprising at least 100 unique oligonucleotides of the unique oligonucleotide species, wherein unique oligonucleotides in the same pool comprise the same sample barcode sequence, and wherein different unique oligonucleotides of the same pool comprise a different molecule barcode sequences. 5. The composition of claim 4 , wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate. 6. The composition of claim 1 , wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G. 7. The composition of claim 1 , wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T. 8. The composition of claim 1 , wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T. 9. The composition of claim 1 , wherein each oligonucleotide species has a length of 24-140 nucleotides. 10. The composition of claim 1 , wherein the composition comprises at least two oligonucleotides of the same unique oligonucleotide species. 11. The composition of claim 1 , wherein the uniform region comprises a target-specific region comprising a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence. 12. The composition of claim 11 , wherein the immune cell receptor variable region coding sequence is selected from the group consisting of: a T cell receptor variable region coding sequence, a B cell receptor variable region coding sequence, and a combination thereof. 13. The composition of claim 1 , wherein the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of 2.5%-10%. 14. The composition of claim 1 , wherein the molecule barcode is 7-9 nucleotides. 15. The composition of claim 1 , comprising at least 6,500 unique oligonucleotide species. 16. A method of specifically barcoding cDNA from two or more samples, each sample comprising nucleic acids, the method comprising: contacting each sample with a pool comprising at least 100 unique oligonucleotide species, wherein each sample is contacted in spatial isolation from the other samples, each unique oligonucleotide species comprising: a barcode region comprising; a molecule barcode comprising at least 7 nucleotides and having a constraint on G content; and a sample barcode comprising at least 3 nucleotides; and a uniform region 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique polynucleotide species of each pool comprise the same sample barcode, and comprise different molecule barcodes, and wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species collectively have a G content of no more than 12.5%; hybridizing target-specific regions of at least some oligonucleotides of the unique oligonucleotide species to at least some of the nucleic acids of the sample, wherein the nucleic acids comprise mRNA; and extending the hybridized oligonucleotides by reverse transcription, thereby producing extended strands comprising an oligonucleotide of the unique oligonucleotide species and a sequence complementary to the target, wherein for each sample, the extended strands comprise the same sample barcode and different molecule barcodes, wherein for different samples, the molecule barcodes are different, and wherein a bias in representation of the unique oligonucleotide species in an amplification product of the extended strands is reduced in comparison to the use of control unique oligonucleotide species comprising control molecule barcodes without the constraint on G content. 17. The method of claim 16 , further comprising ascertaining nucleic acid sequences of the strands comprising the oligonucleotides of the unique oligonucleotide species and the sequence complementary to the target. 18. The method of claim 16 , wherein the constraint on G content is the molecule barcodes of the unique oligonucleotide species collectively having a G content of less than 12.5%. 19. The method of claim 16 , wherein the molecule barcode is 7-9 nucleotides. 20. The method of claim 16 , the pool comprises at least 1000 unique oligonucleotide species. 21. The method of claim 16 , the pool comprises at least 6,500 unique oligonucleotide species. 22. A kit for amplifying barcoded cDNA comprising an immune cell receptor or immunoglobulin variable region coding sequence, comprising; a composition comprising: at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% o

Assignees

Inventors

Classifications

  • C12Q1/6869Primary

    Methods for sequencing · CPC title

  • Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay (C12Q1/6804 takes precedence) · CPC title

  • C12Q1/6837Primary

    using probe arrays or probe chips (C12Q1/6874 takes precedence) · CPC title

  • involving nucleic acid arrays, e.g. sequencing by hybridisation · CPC title

  • the label being a nucleic acid · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10822643B2 cover?
In accordance with some embodiments herein, compositions and methods for accurate barcoding of nucleic acids are described. The compositions and methods can involve a plurality of unique oligonucleotide species comprising unique molecule barcodes. In some embodiments, the molecule barcodes can have a relatively low G content, and can exhibit reduced bias in amplification and analysis.
Who is the assignee on this patent?
Cellular Res Inc
What technology area does this patent fall under?
Primary CPC classification C12Q1/6869. Mapped technology areas include Chemistry & Metallurgy.
When was this patent published?
Publication date Tue Nov 03 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).