Automatically generating labeled synthetic documents

US11238312B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11238312-B2
Application numberUS-201916690695-A
CountryUS
Kind codeB2
Filing dateNov 21, 2019
Priority dateNov 21, 2019
Publication dateFeb 1, 2022
Grant dateFeb 1, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to systems, methods, and non-transitory computer readable media for generating diverse and realistic synthetic documents using deep learning. In particular, the disclosed systems can utilize a trained neural network to generate realistic image layouts comprising page elements that comply with layout parameters. The disclosed systems can also generate synthetic content corresponding to the page elements within the image layouts. The disclosed systems insert the synthetic content into the corresponding page elements of documents based on the image layouts to generate synthetic documents.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer readable medium for generating synthetic documents, the non-transitory computer readable medium comprising instructions that, when executed by at least one processor, cause a computing device to: generate, utilizing a neural network, a new image layout comprising an image of a document indicating structure of page elements within the document; generate a digital document corresponding to the new image layout, the digital document comprising page elements corresponding to the structure of page elements in the new image layout; generate synthetic content corresponding to the page elements; and generate a synthetic document by inserting the synthetic content into the corresponding page elements in the digital document. 2. The non-transitory computer readable medium as recited in claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computing device to generate the new image layout utilizing a generative adversarial network. 3. The non-transitory computer readable medium as recited in claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computing device to generate the new image layout by: accessing a real document; identifying a page element of the real document; and replacing the page element of the real document with a new page element. 4. The non-transitory computer readable medium as recited in claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computing device to generate the synthetic content corresponding to the page elements by generating, by utilizing a machine learning model, synthetic content corresponding to the page elements. 5. The non-transitory computer readable medium as recited in claim 1 , further comprising generating the synthetic content corresponding to the page elements by determining page elements of real content that correspond to the page elements of the synthetic document. 6. The non-transitory computer readable medium as recited in claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computing device to generate the synthetic document by: determining styling parameters for the synthetic document; and applying the styling parameters to the synthetic content. 7. The non-transitory computer readable medium as recited in claim 6 , further comprising instructions that, when executed by the at least one processor, cause the computing device to determine the styling parameters based on style characteristics of the page elements. 8. The non-transitory computer readable medium as recited in claim 6 , further comprising instructions that, when executed by the at least one processor, cause the computing device to determine the styling parameters based on style characteristics of a distribution of real documents. 9. A system comprising: one or more memory devices storing a neural network; and at least one server configured to cause the system to: determine layout parameters; generate, utilizing the neural network, a plurality of new image layouts comprising images of documents indicating structures of page elements within the documents conforming to the layout parameters; generate a plurality of digital documents corresponding to the plurality of new image layouts, each digital document of the plurality of digital documents comprising page elements corresponding to the structures of page elements in a corresponding new image layout; determine content parameters for synthetic content corresponding to the page elements; generate the synthetic content corresponding to the page elements, wherein the synthetic content conforms to the content parameters; and generate a plurality of synthetic documents by inserting the synthetic content into the corresponding page elements of the plurality of digital documents. 10. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to generate element labels for the plurality of synthetic documents, wherein the element labels indicate the page elements within the plurality of synthetic documents. 11. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to train a machine learning model using the plurality of synthetic documents. 12. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to determine the layout parameters by analyzing layout characteristics of a plurality of real documents. 13. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to generate the plurality of new image layouts by: generating, by an image layout prediction neural network, predicted image layouts; feeding the predicted image layouts to an adversarial discrimination neural network to determine if the predicted image layouts resemble realistic image layouts; and training the image layout prediction neural network based on the determination of the adversarial discrimination neural network. 14. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to generate the synthetic content corresponding to the page elements by: training a machine learning model using real page content corresponding to real page elements; and generating, by the trained machine learning model, synthetic content corresponding to the page elements. 15. The system as recited in claim 9 , wherein the at least one server is further configured to cause the system to generate the plurality of synthetic documents by: determining styling parameters for the plurality of synthetic documents; and applying the styling parameters to the synthetic content within the plurality of synthetic documents. 16. The system as recited in claim 15 , wherein the at least one server is further configured to cause the system to generate a synthetic document based on style characteristics of the page elements. 17. The system as recited in claim 15 , wherein the at least one server is further configured to cause the system to determine the styling parameters based on style characteristics of a distribution of real documents. 18. In a digital medium environment for training machine learning models using training documents, a computer-implemented method for generating synthetic documents with element labels comprising: performing a step for generating a plurality of new image layouts conforming to layout parameters; performing a step for generating synthetic content conforming to content parameters; and generating a plurality of synthetic documents comprising the plurality of new image layouts comprising the synthetic content. 19. The computer-implemented method as recited in claim 18 , further comprising generating element labels for the plurality of synthetic documents, wherein the element labels indicate page elements within the plurality of synthetic documents. 20. The computer-implemented method as recited in claim 18 , wherein the layout parameters comprise user input.

Assignees

Inventors

Classifications

  • Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • using neural networks · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • G06V30/414Primary

    Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11238312B2 cover?
The present disclosure relates to systems, methods, and non-transitory computer readable media for generating diverse and realistic synthetic documents using deep learning. In particular, the disclosed systems can utilize a trained neural network to generate realistic image layouts comprising page elements that comply with layout parameters. The disclosed systems can also generate synthetic con…
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification G06V30/414. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).