Verification of transformed content

US9898516B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9898516-B2
Application number: US-201615281364-A
Country: US
Kind code: B2
Filing date: Sep 30, 2016
Priority date: Dec 19, 2013
Publication date: Feb 20, 2018
Grant date: Feb 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer manages methods for determining accurate document transformation by rendering the source document into a non-rasterized format, where the non-rasterized format is a rendered source document. The computer rendering the target document into a non-rasterized format, where the non-rasterized format is a rendered target document. The computer comparing one or more aspects of the rendered source document to corresponding one or more aspects of the rendered target document. The computer determining, based, at least in part, on the compared one or more aspects, whether or not the source document was accurately transformed to the target document.
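The comparison step described in the abstract can be illustrated with a minimal sketch. It assumes that a non-rasterized rendering is available as a list of positioned text runs, which is a simplification chosen for illustration and not something the patent specifies. The names `TextRun`, `text_similarity`, `layout_similarity`, and `accurately_transformed`, along with the `tolerance` and `threshold` values, are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class TextRun:
    """One positioned text element in a hypothetical vector (non-rasterized) rendering."""
    text: str
    x: float
    y: float


def text_similarity(source, target):
    """Similarity ratio in [0, 1] between the concatenated text of both renderings."""
    a = " ".join(run.text for run in source)
    b = " ".join(run.text for run in target)
    return SequenceMatcher(None, a, b).ratio()


def layout_similarity(source, target, tolerance=2.0):
    """Fraction of source runs that have a same-text target run within `tolerance` units."""
    if not source:
        return 1.0 if not target else 0.0
    matched = 0
    for s in source:
        if any(
            t.text == s.text
            and abs(t.x - s.x) <= tolerance
            and abs(t.y - s.y) <= tolerance
            for t in target
        ):
            matched += 1
    return matched / len(source)


def accurately_transformed(source, target, threshold=0.9):
    """Compare each aspect and report whether every score meets the assumed threshold."""
    scores = {
        "text": text_similarity(source, target),
        "layout": layout_similarity(source, target),
    }
    return all(score >= threshold for score in scores.values()), scores
```

Only two of the aspects named in the claims (text content and layout) are scored here; embedded objects and overall visual appearance would need their own comparison functions.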

First claim

Opening claim text (preview).

What is claimed is:

1. A method for determining accurate document transformation, the method comprising: queuing, by one or more processors, a source document for transformation to a target document according to an assigned priority for the target document, wherein the assigned priority is based on a descending order of file sizes for target documents of queued source documents; responsive to determining the source document is next in the queue, transforming, by one or more processors, content of the source document into the target document, wherein transforming the content of the source document includes transforming the content from the first format to the second format; responsive to determining an increased likelihood the source document was not accurately transformed to the target document based on a file size for the target document, rendering, by one or more processors, the source document into a non-rasterized graphical format, wherein a rendered source document represents the source document in the non-rasterized graphical format; and determining, by one or more processors, based at least in part on a comparison of one or more aspects of the rendered source document and a rendered target document in a non-rasterized graphical format, the source document was accurately transformed to the target document.

2. The method of claim 1, further comprising: receiving, by one or more processors, the source document for transformation to the target document during an Extract Transform Load (ETL) process, wherein the source document is in a first format and the target document is in a second format.

3. The method of claim 2, further comprising: rendering, by one or more processors, the target document into the non-rasterized graphical format, wherein the rendered target document represents the target document in the non-rasterized graphical format; and comparing, by one or more processors, one or more aspects of the rendered source document to corresponding one or more aspects of a rendered target document.

4. The method of claim 1, wherein determining the source document was accurately transformed to the target document, comprises: identifying, by one or more processors, a similarity threshold level, where the similarity threshold level represents whether the source document was accurately transformed to the target document; and determining, by one or more processors, whether or not the rendered target document has met the similarity threshold level with relation to the rendered source document.

5. The method of claim 4, further comprising: displaying, by one or more processors, a notification on a user interface, wherein the notification includes a list including the source document, the target document, and the similarity threshold level.

6. The method of claim 1, wherein the one or more aspects include text content of a document, a layout of a document, an embedded object, or an overall visual appearance of a document.

7. The method of claim 1, wherein rendering the source document into a non-rasterized graphical format, comprises: identifying, by one or more processors, one or more attachments associated with the source document; and rendering, by one or more processors, each of the one or more attachments associated with the source document into a non-rasterized format.

8. A computer program product for determining accurate document transformation, the computer program product comprising: one or more computer readable storage media; program instructions stored on the one or more computer readable storage media, the program instructions comprising: program instructions to queue a source document for transformation to a target document according to an assigned priority for the target document, wherein the assigned priority is based on a descending order of file sizes for target documents of queued source documents; program instructions to, responsive to determining the source document is next in the queue, transform content of the source document into the target document, wherein transforming the content of the source document includes transforming the content from the first format to the second format; program instructions to, responsive to determining an increased likelihood the source document was not accurately transformed to the target document based on a file size for the target document, render the source document into a non-rasterized graphical format, wherein a rendered source document represents the source document in the non-rasterized graphical format; and program instructions to determine based at least in part on a comparison of one or more aspects of the rendered source document and a rendered target document in a non-rasterized graphical format, the source document was accurately transformed to the target document.

9. The computer program product of claim 8, further comprising program instructions, stored on the one or more computer readable storage media, which when executed by a processor, cause the processor to: receive the source document for transformation to the target document during an Extract Transform Load (ETL) process, wherein the source document is in a first format and the target document is in a second format.

10. The computer program product of claim 9, further comprising program instructions, stored on the one or more computer readable storage media, which when executed by a processor, cause the processor to: render the target document into the non-rasterized graphical format, wherein the rendered target document represents the target document in the non-rasterized graphical format; and compare one or more aspects of the rendered source document to corresponding one or more aspects of a rendered target document.

11. The computer program product of claim 8, wherein the program instructions to determine the source document was accurately transformed to the target document, comprise program instructions, stored on the one or more computer readable storage media, which when executed by a processor, cause the processor to: identify a similarity threshold level, where the similarity threshold level represents whether the source document was accurately transformed to the target document; and determine whether or not the rendered target document has met the similarity threshold level with relation to the rendered source document.

12. The computer program product of claim 11, further comprising program instructions, stored on the one or more computer readable storage media, which when executed by a processor, cause the processor to: display a notification on a user interface, wherein the notification includes a list including the source document, the target document, and the similarity threshold level.

13. The computer program product of claim 8, wherein the one or more aspects include text content of a document, a layout of a document, an embedded object, or an overall visual appearance of a document.

14. The computer program product of claim 8, wherein the program instructions to render the source document into a non-rasterized graphical format comprise program instructions, stored on the one or more computer readable storage media, which when executed by a processor, cause the processor to: identify one or more attachments associated with the source document; and render each of the one or more attachments associated with the source document into a rasterized format.

15. A computer system for managing methods for determining accurate document transformation, the computer program product comprising, the computer system comprising: one or more comp
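The queuing step in claims 1 and 8 orders pending transformations by descending target file size. The sketch below shows one way that ordering could look, assuming an estimated target size is known at enqueue time. The `TransformQueue` class, the `needs_verification` helper, and the `min_ratio` cutoff are all hypothetical. In particular, the claims say only that the likelihood of an inaccurate transformation is determined "based on a file size for the target document"; the ratio test here is an illustrative guess at such a check, not the patented rule.

```python
import heapq
from itertools import count


class TransformQueue:
    """Hypothetical queue that yields documents in descending order of target file size."""

    def __init__(self):
        self._heap = []
        self._order = count()  # tie-breaker: equal sizes come out in insertion order

    def enqueue(self, name, expected_target_size):
        # heapq is a min-heap, so the size is negated to pop the largest first.
        heapq.heappush(self._heap, (-expected_target_size, next(self._order), name))

    def next_document(self):
        neg_size, _, name = heapq.heappop(self._heap)
        return name, -neg_size

    def __len__(self):
        return len(self._heap)


def needs_verification(expected_size, actual_size, min_ratio=0.5):
    """Assumed heuristic: flag a target that is much smaller than expected."""
    if expected_size <= 0:
        return True
    return actual_size / expected_size < min_ratio
```

A document flagged by `needs_verification` would then go through the rendering and comparison steps; unflagged documents would skip them in this sketch.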

Assignees

Inventors

Classifications

  • G06F16/254 · Primary

    Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses · CPC title

  • Document management systems · CPC title

  • Ensuring data consistency and integrity · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9898516B2 cover?
A computer manages methods for determining accurate document transformation by rendering the source document into a non-rasterized format, where the non-rasterized format is a rendered source document. The computer rendering the target document into a non-rasterized format, where the non-rasterized format is a rendered target document. The computer comparing one or more aspects of the rendered …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/254. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 20, 2018 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).