Universal short adapters for indexing of polynucleotide samples
US-2018334712-A1 · Nov 22, 2018 · US
US11028435B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11028435-B2 |
| Application number | US-201815968613-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 1, 2018 |
| Priority date | May 1, 2017 |
| Publication date | Jun 8, 2021 |
| Grant date | Jun 8, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The disclosed embodiments concern index oligonucleotides configured to identify sources of samples of nucleic acids and methods, apparatus, systems and computer program products for identifying and making the index oligonucleotides. In some implementations, the index oligonucleotides include a set of index sequences, a Hamming distance between any two index sequences of the set of index sequences meeting one or more criteria. System, apparatus, and computer program products are also provided for determining a sequence of interest using the index oligonucleotides.
Opening claim text (preview).
What is claimed is: 1. A set of oligonucleotides comprising a plurality of subsets of oligonucleotides, wherein: the set of oligonucleotides is configured to identify sources of nucleic acid samples in multiplex massively parallel sequencing, each of the nucleic acid samples comprising a plurality of nucleic acid molecules; the set of oligonucleotides comprises a set of index sequences comprising at least 6 different index sequences, each subset of the plurality of subsets of oligonucleotides comprising a plurality of index sequences of the set of index sequences; a Hamming distance between any two index sequences of the set of index sequences is not less than a first criterion value, wherein the first criterion value is at least 2; and the set of index sequences comprises a plurality of pairs of color-balanced index sequences, wherein any two bases at corresponding sequence positions of each pair of color-balanced index sequences include both (i) an adenine (A) base or a cytosine (C) base, and (ii) a guanine (G) base, a thymine (T) base, or a uracil (U) base. 2. The set of oligonucleotides of claim 1 , wherein each subset of the plurality of subsets of oligonucleotides comprises one or more pairs of color-balanced index sequences. 3. The set of oligonucleotides of claim 1 , wherein the set of oligonucleotides comprises (a) double-stranded or Y-shaped sequencing adapters, each strand of each double-stranded or Y-shaped sequencing adapter comprises an index sequence of the set of index sequences or a reverse complement thereof; or (b) pairs of single-stranded oligonucleotides, each pair being provided together in a reagent, each oligonucleotide of a pair comprising an index sequence of the set of index sequences or a reverse complement thereof. 4. The set of oligonucleotides of claim 3 , wherein (a) each Y-shaped or double-stranded sequencing adapter comprises a first strand comprising a first index sequence selected from a first subset of the set of index sequences and a second strand comprising a second index sequence selected from a second subset of the set of index sequences (or reverse complements of the second subset); or (b) each pair of oligonucleotides comprises a first oligonucleotide comprising a first index sequence selected from a first subset of the set of index sequences and a second oligonucleotide comprising a second index sequence selected from a second subset of the set of index sequences (or reverse complements of the second subset). 5. The set of oligonucleotides of claim 4 , wherein the first strand of each Y-shaped sequencing adapter comprises a P5 flow cell amplification primer binding site, and the second strand of each Y-shaped sequencing adapter comprises a P7′ flow cell amplification primer binding site. 6. The set of oligonucleotides of claim 4 , wherein the first and the second index sequences respectively are: the n th 10-mer in SEQ ID NO: 10 and n th 10-mer in SEQ ID NO: 11 or a reverse complement thereof; the n th 10-mer in SEQ ID NO: 12 and n th 10-mer in SEQ ID NO: 13 or a reverse complement thereof; the n th 10-mer in SEQ ID NO: 14 and n th 10-mer in SEQ ID NO: 15 or a reverse complement thereof; the n th 10-mer in SEQ ID NO: 16 and n th 10-mer in SEQ ID NO: 17 or a reverse complement thereof; or the n th 10-mer in SEQ ID NO: 18 and n th 10-mer in SEQ ID NO: 19 or a reverse complement thereof. 7. The set of oligonucleotides of claim 1 , wherein the set of index sequences comprises a plurality of non-overlapping subsets of index sequences, a Hamming distance between any two index sequences in any subset is not less than a second criterion value. 8. The set of oligonucleotides of claim 7 , wherein the second criterion value is larger than the first criterion value. 9. The set of oligonucleotides of claim 1 , wherein an oligonucleotide of the set of oligonucleotides comprises a primer having an index sequence on its 3′ end and an index sequence on its 5′ end. 10. The set of oligonucleotides of claim 1 , wherein the set of index sequences is comprised in Y-shaped sequencing adapters, each Y-shaped sequencing adapter comprising an index sequence on only one strand. 11. The set of oligonucleotides of claim 1 , wherein an edit distance between any two index sequences of the set of index sequences is not less than a third criterion value. 12. The set of oligonucleotides of claim 11 , wherein the edit distance is a modified Levenshtein distance where end gaps are assigned no penalty. 13. The set of oligonucleotides of claim 11 , wherein: each index sequence of the set of index sequences has 8 bases; the first criterion value is 3; and the third criterion is 2. 14. The set of oligonucleotides of claim 11 , wherein: each index sequence of the set of index sequences has 10 bases; the first criterion value is 4; and the third criterion is 3. 15. The set of oligonucleotides of claim 1 , wherein the set of index sequences comprising at least 96 different index sequences. 16. The set of oligonucleotides of claim 1 , wherein the set of oligonucleotides are provided in a container comprising multiple separate compartments, and wherein each compartment contains (a) a first plurality of oligonucleotides comprising a first index sequence of the set of index sequences and (b) a second plurality of oligonucleotides comprising a second index sequence of the set of index sequences, an ordered combination of (a) and (b) in the compartment being different from ordered combinations of (a) and (b) in any other compartments. 17. The set of oligonucleotides of claim 16 , wherein the first and the second index sequences in a compartment respectively are: the n th 10-mer in SEQ ID NO: 10 and the n t 10-mer in SEQ ID NO: 11 (or a reverse complement thereof); the n th 10-mer in SEQ ID NO: 12 and the n th 10-mer in SEQ ID NO: 13 (or a reverse complement thereof); the n th 10-mer in SEQ ID NO: 14 and the n th 10-mer in SEQ ID NO: 15 (or a reverse complement thereof); the n th 10-mer in SEQ ID NO: 16 and the n th 10-mer in SEQ ID NO: 17 (or a reverse complement thereof); or the n th 10-mer in SEQ ID NO: 18 and the n th 10-mer in SEQ ID NO: 19 (or a reverse complement thereof). 18. The set of oligonucleotides of claim 16 , wherein each index sequence comprised in the first plurality of oligonucleotides is selected from a first subset of the set of index sequences, and each index sequence comprised in the second plurality of oligonucleotides is selected from a second subset of the set of index sequences, the first subset not overlapping the second sub set. 19. The set of oligonucleotides of claim 18 , wherein a Hamming distance between any two index sequences in the first subset or between any two index sequences in the second subset is not less than a second criterion value. 20. The set of oligonucleotides of claim 18 , wherein the second criterion value is larger than the first criterion value. 21. A method for making a plurality of oligonucleotides for multiplex massively parallel sequencing, the method comprising: (a) selecting a set of index sequences from a pool of different index sequences, wherein the set of index sequences comprise at least 6 different sequences; a Hamming distance between any two index sequences in the set of index sequences is not less than a first criterion value, wherein the first criterion value is at least 2, and the set of index sequences comprises a plurality of pairs of color-balanced index sequences, wherein any two bases at corresponding sequen
Sequence alignment; Homology search · CPC title
ICT specially adapted for sequence analysis involving nucleotides or amino acids · CPC title
Libraries containing nucleotides or polynucleotides, or derivatives thereof · CPC title
Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags · CPC title
Selection methods for production or design of target specific oligonucleotides or binding molecules · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.