High throughput transcriptome analysis
US-2015307874-A1 · Oct 29, 2015 · US
US10822643B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10822643-B2 |
| Application number | US-201715581914-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 28, 2017 |
| Priority date | May 2, 2016 |
| Publication date | Nov 3, 2020 |
| Grant date | Nov 3, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In accordance with some embodiments herein, compositions and methods for accurate barcoding of nucleic acids are described. The compositions and methods can involve a plurality of unique oligonucleotide species comprising unique molecule barcodes. In some embodiments, the molecule barcodes can have a relatively low G content, and can exhibit reduced bias in amplification and analysis.
Opening claim text (preview).
What is claimed is: 1. A reverse transcription reaction composition comprising; mRNA; a reverse transcriptase; and at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more: and/or (b) the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%, and wherein the barcode region further comprises a sample barcode comprising at least 3 nucleotides. 2. The composition of claim 1 , wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more. 3. The composition of claim 1 , wherein the constraint on G content is the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%. 4. The composition of claim 1 , wherein the unique oligonucleotide species are disposed in at least two spatially isolated pools, each pool comprising at least 100 unique oligonucleotides of the unique oligonucleotide species, wherein unique oligonucleotides in the same pool comprise the same sample barcode sequence, and wherein different unique oligonucleotides of the same pool comprise a different molecule barcode sequences. 5. The composition of claim 4 , wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate. 6. The composition of claim 1 , wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G. 7. The composition of claim 1 , wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T. 8. The composition of claim 1 , wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T. 9. The composition of claim 1 , wherein each oligonucleotide species has a length of 24-140 nucleotides. 10. The composition of claim 1 , wherein the composition comprises at least two oligonucleotides of the same unique oligonucleotide species. 11. The composition of claim 1 , wherein the uniform region comprises a target-specific region comprising a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence. 12. The composition of claim 11 , wherein the immune cell receptor variable region coding sequence is selected from the group consisting of: a T cell receptor variable region coding sequence, a B cell receptor variable region coding sequence, and a combination thereof. 13. The composition of claim 1 , wherein the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of 2.5%-10%. 14. The composition of claim 1 , wherein the molecule barcode is 7-9 nucleotides. 15. The composition of claim 1 , comprising at least 6,500 unique oligonucleotide species. 16. A method of specifically barcoding cDNA from two or more samples, each sample comprising nucleic acids, the method comprising: contacting each sample with a pool comprising at least 100 unique oligonucleotide species, wherein each sample is contacted in spatial isolation from the other samples, each unique oligonucleotide species comprising: a barcode region comprising; a molecule barcode comprising at least 7 nucleotides and having a constraint on G content; and a sample barcode comprising at least 3 nucleotides; and a uniform region 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique polynucleotide species of each pool comprise the same sample barcode, and comprise different molecule barcodes, and wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode have a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species collectively have a G content of no more than 12.5%; hybridizing target-specific regions of at least some oligonucleotides of the unique oligonucleotide species to at least some of the nucleic acids of the sample, wherein the nucleic acids comprise mRNA; and extending the hybridized oligonucleotides by reverse transcription, thereby producing extended strands comprising an oligonucleotide of the unique oligonucleotide species and a sequence complementary to the target, wherein for each sample, the extended strands comprise the same sample barcode and different molecule barcodes, wherein for different samples, the molecule barcodes are different, and wherein a bias in representation of the unique oligonucleotide species in an amplification product of the extended strands is reduced in comparison to the use of control unique oligonucleotide species comprising control molecule barcodes without the constraint on G content. 17. The method of claim 16 , further comprising ascertaining nucleic acid sequences of the strands comprising the oligonucleotides of the unique oligonucleotide species and the sequence complementary to the target. 18. The method of claim 16 , wherein the constraint on G content is the molecule barcodes of the unique oligonucleotide species collectively having a G content of less than 12.5%. 19. The method of claim 16 , wherein the molecule barcode is 7-9 nucleotides. 20. The method of claim 16 , the pool comprises at least 1000 unique oligonucleotide species. 21. The method of claim 16 , the pool comprises at least 6,500 unique oligonucleotide species. 22. A kit for amplifying barcoded cDNA comprising an immune cell receptor or immunoglobulin variable region coding sequence, comprising; a composition comprising: at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% o
Methods for sequencing · CPC title
Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay (C12Q1/6804 takes precedence) · CPC title
using probe arrays or probe chips (C12Q1/6874 takes precedence) · CPC title
involving nucleic acid arrays, e.g. sequencing by hybridisation · CPC title
the label being a nucleic acid · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.