Use of DNA origami nanostructures for molecular information based data storage systems

US12441996B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12441996-B2
Application numberUS-202418972564-A
CountryUS
Kind codeB2
Filing dateDec 6, 2024
Priority dateDec 8, 2023
Publication dateOct 14, 2025
Grant dateOct 14, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure is directed to compositions and methods that use the principles of DNA origami to package and archive data stored in multiple indexed DNA oligonucleotides. These structures allow for selective physical data access and retrieval from a molecular pool of DNA origami (DNAO) nanostructures comprising the data bearing oligonucleotides.

First claim

Opening claim text (preview).

The invention claimed is: 1. A library comprising a plurality of origami folded DNA data storage files (DNAFiles), each of said DNAFiles comprising a single stranded DNA scaffold; and a plurality of single stranded DNA staple oligonucleotides that each bind through complementary base pairing with two non-contiguous nucleic acid sequences of the DNA scaffold, wherein said staple oligonucleotides cause the DNA scaffold to fold into a two or three dimensional shape having a first surface; a plurality of data oligonucleotides, said data oligonucleotides comprising a sequence complementary to a nucleic acid sequence of said single stranded DNA scaffold, a nucleic acid sequence that encodes digital information, a first primer binding sequence and a second primer binding sequence, wherein the first primer binding sequence is 5′ to the digital information encoding nucleic acid sequence, and the second primer binding sequence is 3′ to the digital information encoding nucleic acid sequence, wherein said plurality of data oligonucleotides are localized to said first surface, wherein the individual DNAFiles differ from one another based on the nucleic acid sequences of the plurality of data oligonucleotides bound to the DNA scaffold of each DNAFile. 2. The library of claim 1 wherein said staple oligonucleotides cause the DNA scaffold to fold into a multi-layered sheet conformation having a top surface and a bottom surface wherein said plurality of data oligonucleotides are only linked to, and project away from, the top surface. 3. The library of claim 2 wherein each DNAFile has a bilayer sheet conformation comprising two symmetrical layers of origami DNA, wherein the shape of each DNAFile is stabilized by a) adding a sequence of six or more thymidine resides (poly(T)) to one end of the the data oligonucleotides; b) decreasing the length of staple oligonucleotides located near sheet corners to less than 100 nucleotides, or less than 50 nucleotides, to allow for flexibility during the folding process; c) introducing intentional gaps or missing base pairs within the scaffold DNA strand/staple folded structure (i.e. “skips”) near the center-line of the folded multi-layered sheet; or d) any combination of a) through c). 4. The library of claim 3 wherein said data oligonucleotides have a length of about 30 to 200 nucleotides, and the first and second primer binding sequences, and the sequence complementary to a nucleic acid sequence of said single stranded DNA scaffold, are each independently 10 to 20 nucleotides in length. 5. The library of claim 1 wherein i) said first primer binding sequence is located at the 5′ terminus of said data oligonucleotides and said second primer binding sequence is located at the 3′ terminus of said data oligonucleotides; or ii) said nucleic acid sequence of the data oligonucleotide that is complementary to said single stranded DNA scaffold is 5′ to said first primer binding sequence, or 3′ to said second primer binding sequence. 6. The library of claim 1 wherein each member of said plurality of origami folded DNAFiles comprises a different single stranded DNA scaffold. 7. The library of claim 1 wherein i) each member of said plurality of origami folded DNAFiles has a unique shape; or ii) each origami folded DNAFile further comprises a linked unique nucleic acid barcode construct; or iii) both i) and ii). 8. The library of claim 1 wherein each origami folded DNAFile further comprises a unique nucleic acid barcode construct linked to the origami DNAFile via base-pairing, wherein said base-pairing that links the nucleic acid barcode construct with the origami DNAFile occurs between i) a single-stranded non-complementary nucleic acid sequence of one or more of said staple oligonucleotides and a complementary sequence linked to the nucleic acid barcode construct; or ii) a single-stranded non-complementary nucleic acid sequence extending from the 5′ or 3′ end of the single-stranded DNA scaffold and a complementary sequence linked to the nucleic acid barcode construct. 9. The library of claim 8 , wherein the nucleic acid barcode construct is linked to the DNAFile by a high affinity, non-covalent bond interaction between a biotin molecule linked to the 5′ and/or the 3′ end of the nucleic acid barcode construct and a molecule that binds to biotin, said molecule being linked to the DNAFile. 10. A library comprising a plurality of origami folded DNA data storage files (DNAFiles), each of said DNAFiles comprising a single stranded DNA scaffold; and a plurality of single stranded DNA staple oligonucleotides that each bind through complementary base pairing with two non-contiguous nucleic acid sequences of the DNA scaffold, wherein said staple oligonucleotides cause the DNA scaffold to fold into a two or three dimensional shape having a first surface; a plurality of data oligonucleotides, said data oligonucleotides comprising a sequence complementary to a nucleic acid sequence of said single stranded DNA scaffold, a nucleic acid sequence that encodes digital information, and a first primer binding sequence and a second primer binding sequence, wherein the first primer binding sequence is 5′ to the digital information encoding nucleic acid sequence, the second primer binding sequence is 3′ to the digital information encoding nucleic acid sequence, wherein said plurality of data oligonucleotides are localized to said first surface, wherein the individual DNAFiles differ from one another based on the nucleic acid sequences of the plurality of data oligonucleotides bound to the DNA scaffold of each DNAFile wherein the data oligonucleotides of each individual origami folded DNAFile of said library further comprise an identical set of PCR binding sequences for preselected PCR primers, where the PCR binding sequences differ between the data oligonucleotides of each respective DNAFile of the library. 11. A method of retrieving digital data stored in DNA, said method comprising providing the library of origami folded DNAFiles according to claim 1 ; denaturing a folded origami DNAFile of said library to at least partially disrupt the hybridized duplex between the single stranded staple oligonucleotides, data oligonucleotides and the DNA scaffold; conducting PCR amplification on select nucleic acid sequences of said denatured DNA scaffold and data oligonucleotides to produce amplicons; reannealing the staple oligonucleotides and data oligonucleotides with the DNA scaffold to reconstitute the folded origami DNAFile; separating the amplicons from the reconstituted folded origami DNAFile; returning the reconstituted folded origami DNAFile to the library; and sequencing the amplicons to retrieve digital data encoded by the DNAFile. 12. The method of claim 11 wherein said denaturing step completely releases all staple oligonucleotides and data oligonucleotides as free single stranded nucleic acids. 13. The method of claim 11 wherein the amplicons are separated from the reconstituted folded origami DNAFiles i) via gel electrophoresis; or ii) via size exclusion chromatography. 14. The method of claim 11 further comprising the step of confirming the correct size and shape of the reconstituted folded origami DNA scaffold prior to returning the reconstituted folded origami DNA scaffold to the library. 15. The method of claim 14 further comprising the step of selecting one or more individual origami folded DNAFiles from the other origami folded DNAFiles of said library and conducting the denaturing step only on the selected origami folded DNAFiles. 16. The method of claim 15

Assignees

Inventors

Classifications

  • Methods for sequencing · CPC title

  • Nucleic acid folding · CPC title

  • DNA computing · CPC title

  • Sequence assembly · CPC title

  • General methods of preparing gene libraries, not provided for in other subgroups · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12441996B2 cover?
The present disclosure is directed to compositions and methods that use the principles of DNA origami to package and archive data stored in multiple indexed DNA oligonucleotides. These structures allow for selective physical data access and retrieval from a molecular pool of DNA origami (DNAO) nanostructures comprising the data bearing oligonucleotides.
Who is the assignee on this patent?
Battelle Memorial Institute
What technology area does this patent fall under?
Primary CPC classification C12N15/1093. Mapped technology areas include Chemistry & Metallurgy.
When was this patent published?
Publication date Tue Oct 14 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).