Automatic generation of templates for parsing electronic documents
US-2017308517-A1 · Oct 26, 2017 · US
US12032863B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12032863-B2 |
| Application number | US-202318484246-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 10, 2023 |
| Priority date | Dec 2, 2015 |
| Publication date | Jul 9, 2024 |
| Grant date | Jul 9, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In one example of the disclosure, a set of electronic document templates is accessed and instances of duplicated document content are identified. Display of a user notice for first duplicated document content is caused. Responsive to receipt of data indicative of a user instruction to create a component template for the first duplicated content, the component template is created and stored.
Opening claim text (preview).
What is claimed is: 1. A system for customer communication management, comprising: a processor; a data store storing a library of component templates; and a non-transitory computer readable medium, comprising instructions for: analyzing the component templates stored in the component template library, to identify first duplicative content based on first content included in a first content template as similar to second content in a second content template; determining if there is a third component template in the library of component templates associated with second duplicative content, the second duplicative content associated with the first duplicative content; and generating the third content template based on the second duplicative content when it is determined there is no third component template in the library of component templates. 2. The system of claim 1 , wherein the first duplicative content is text, an image or a document structure. 3. The system of claim 1 , wherein the identification of the duplicative content is based on per-defined degree of similarity. 4. The system of claim 1 , wherein the instructions are further for: causing a user notice of the first duplicative content to be displayed to a user. 5. The system of claim 4 , wherein the third content template is generated based on a user instruction to create the third content template. 6. The system of claim 5 , wherein the second duplicative content is identified by the user. 7. The system of claim 6 , wherein the first duplicative content is the same as the second duplicative content. 8. A method, comprising: analyzing the component templates stored in a component template library, to identify first duplicative content based on first content included in a first content template as similar to second content in a second content template; determining if there is a third component template in the library of component templates associated with second duplicative content, the second duplicative content associated with the first duplicative content; and generating the third content template based on the second duplicative content when it is determined there is no third component template in the library of component templates. 9. The method of claim 8 , wherein the first duplicative content is text, an image or a document structure. 10. The method of claim 8 , wherein the identification of the duplicative content is based on per-defined degree of similarity. 11. The method of claim 8 , further comprising causing a user notice of the first duplicative content to be displayed to a user. 12. The method of claim 11 , wherein the third content template is generated based on a user instruction to create the third content template. 13. The method of claim 12 , wherein the second duplicative content is identified by the user. 14. The method of claim 13 , wherein the first duplicative content is the same as the second duplicative content. 15. A non-transitory computer readable medium, comprising instructions for: analyzing the component templates stored in a component template library, to identify first duplicative content based on first content included in a first content template as similar to second content in a second content template; determining if there is a third component template in the library of component templates associated with second duplicative content, the second duplicative content associated with the first duplicative content; and generating the third content template based on the second duplicative content when it is determined there is no third component template in the library of component templates. 16. The non-transitory computer readable medium of claim 15 , wherein the first duplicative content is text, an image or a document structure. 17. The non-transitory computer readable medium of claim 15 , wherein the identification of the duplicative content is based on per-defined degree of similarity. 18. The non-transitory computer readable medium of claim 15 , further comprising instructions for causing a user notice of the first duplicative content to be displayed to a user. 19. The non-transitory computer readable medium of claim 18 , wherein the third content template is generated based on a user instruction to create the third content template. 20. The non-transitory computer readable medium of claim 19 , wherein the second duplicative content is identified by the user. 21. The non-transitory computer readable medium of claim 20 , wherein the first duplicative content is the same as the second duplicative content.
according to characteristics of the data to be reproduced, e.g. number of lines · CPC title
resulting in reduced user or operator actions, e.g. presetting, automatic actions, using hardware token storing data · CPC title
Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text · CPC title
involving plural approaches, e.g. verification by template match; Resolving confusion among similar patterns, e.g. "O" versus "Q" (G06V30/242 takes precedence) · CPC title
Calculation of difference between files · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.