Methods and compositions for long-range haplotype phasing

US11326159B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11326159-B2
Application numberUS-201615564384-A
CountryUS
Kind codeB2
Filing dateApr 4, 2016
Priority dateApr 6, 2015
Publication dateMay 10, 2022
Grant dateMay 10, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various approaches for generating read-sets from nucleic acid molecules and segments and phasing are disclosed. Nucleic acids are assembled into complexes using binding moieties and exposed nucleic acid ends are tagged with nucleic acid tags. Read-sets can be generated from tagged nucleic acid molecules and segments. Physical linkage relationships between nucleic acid molecules and segments can be examined using the nucleic acid tags. Various approaches to generating read-sets and phasing are presented.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of generating a first read-set from a first nucleic acid molecule and second read-set from a second nucleic acid molecule, comprising: (a) binding at least a first association molecule to said first nucleic acid molecule and a second association molecule to said second nucleic acid molecule outside of a cell and thereby generating a first complex and a second complex, respectively, wherein said first nucleic acid molecule comprises a first nucleic acid segment and a second nucleic acid segment, and wherein said second nucleic acid molecule comprises a third nucleic acid segment and a fourth nucleic acid segment; (b) separating said first complex from said second complex; (c) labelling said first nucleic acid segment and said second nucleic acid segment using a first barcode nucleic acid, thereby creating a first labeled complex; (d) labeling said third nucleic acid segment and said fourth nucleic acid segment using a fourth barcode nucleic acid, thereby creating a second labeled complex; (e) pooling said first labeled complex and said second labeled complex; (f) separating said first labeled complex and said second labeled complex; (g) labelling said first nucleic acid segment and said second nucleic acid segment using a second barcode nucleic acid, thereby creating a first doubly labeled complex; (h) labeling said third nucleic acid segment and said fourth nucleic acid segment using a fifth barcode nucleic acid, thereby creating a second doubly labeled complex; (i) pooling said first doubly labeled complex and said second doubly labeled complex; (j) separating said first doubly labeled complex and said second doubly labeled complex; (k) labelling said first nucleic acid segment and said second nucleic acid segment using a third barcode nucleic acid, thereby creating a first triply labeled complex; (I) labeling said third nucleic acid segment and said fourth nucleic acid segment using a sixth barcode nucleic acid, thereby creating a second triply labeled complex; wherein said first barcode nucleic acid, said second barcode nucleic acid, said third barcode nucleic acid, said fourth barcode nucleic acid, said fifth barcode nucleic acid, and said sixth barcode nucleic acid segregate independently; (m) sequencing said first nucleic acid segment and said second nucleic acid segment of said first triply labeled complex, thereby obtaining said first read-set; (n) sequencing said third nucleic acid segment and said fourth nucleic acid segment of said second triply labeled complex, thereby obtaining said second read-set; and (o) assigning sequence from said third nucleic acid segment and said first nucleic acid segment to separate molecules because said sequence from said first nucleic acid segment and said third nucleic acid segment comprises non-identical barcode ends, wherein said first association molecule and said second association molecule comprise polypeptides, and wherein said first nucleic acid molecule and said second nucleic acid molecule comprise nucleic acids from a biological sample. 2. The method of claim 1 , wherein (i) said first nucleic acid segment and said second nucleic acid segment; and (ii) said third nucleic acid segment and said fourth nucleic acid segment are treated so as to not share a common phosphodiester backbone. 3. The method of claim 1 , wherein said first association molecule is bound to said first nucleic acid molecule by cross-linking. 4. The method of claim 1 , wherein said binding said first nucleic acid molecule to said first association molecule and said binding said second nucleic acid molecule to said second association molecule comprises contacting said first nucleic acid molecule and said first association molecule and said second nucleic acid molecule and said second association molecule to a fixative agent. 5. The method of claim 1 , comprising labelling said first nucleic acid segment and said second nucleic acid segment using at least said third barcode nucleic acid, and wherein said third barcode nucleic acid is non-identical to said first barcode nucleic acid and said second barcode nucleic acid. 6. The method of claim 1 , wherein said first read-set is generated by associating sequence from said first nucleic acid segment and said second nucleic acid segment, using said first barcode nucleic acid and said second barcode nucleic acid. 7. The method of claim 1 , comprising assembling a first contig having sequence from said first nucleic acid segment and a second contig having sequence from said second nucleic acid segment into a single scaffold. 8. The method of claim 1 , comprising assigning sequence from said first nucleic acid segment and said second nucleic acid segment to a common phase. 9. The method of claim 1 , comprising assigning a first sequence read having sequence from said first barcode nucleic acid and said second barcode nucleic acid to a common scaffold. 10. The method of claim 1 , wherein said binding said second nucleic acid molecule to said second association molecule comprises cross-linking. 11. The method of claim 1 , comprising assembling a plurality of contigs of sequence from said second nucleic acid molecule using said second read-set. 12. The method of claim 1 , wherein said first read-set and said second read-set are used to determine phase of said first nucleic acid molecule and said second nucleic acid molecule.

Assignees

Inventors

Classifications

  • incorporating an adaptor · CPC title

  • Methods for sequencing · CPC title

  • Polynucleotides, e.g. nucleic acids, oligoribonucleotides · CPC title

  • Methods of creating libraries, e.g. combinatorial synthesis · CPC title

  • ICT programming tools or database systems specially adapted for bioinformatics · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11326159B2 cover?
Various approaches for generating read-sets from nucleic acid molecules and segments and phasing are disclosed. Nucleic acids are assembled into complexes using binding moieties and exposed nucleic acid ends are tagged with nucleic acid tags. Read-sets can be generated from tagged nucleic acid molecules and segments. Physical linkage relationships between nucleic acid molecules and segments can…
Who is the assignee on this patent?
Univ California
What technology area does this patent fall under?
Primary CPC classification G16B50/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 10 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).