Synchronizing annotations between printed documents and electronic documents

US9436665B2 · US · B2

Patent metadata
Publication number: US-9436665-B2
Application number: US-201313781446-A
Country: US
Kind code: B2
Filing date: Feb 28, 2013
Priority date: Feb 28, 2013
Publication date: Sep 6, 2016
Grant date: Sep 6, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image of a printed document portion is provided to a synchronizer. The synchronizer retrieves an electronic version of the printed document and identifies an electronic text portion that is textually similar to a printed text portion. The synchronizer detects an annotation in the printed document portion and inserts a corresponding digital annotation into the electronic document.
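
The matching step in the abstract (finding the electronic text portion that is textually similar to the printed one) is elaborated in claim 7 as OCR, removal of character sequences containing recognition errors, and a search using the pruned text. The following is a minimal sketch of that idea, not the patented implementation. It assumes the OCR output is already available as a string, and the error heuristic (any token containing non-letter characters), the sliding-window search, the use of Python's `difflib.SequenceMatcher` for scoring, and the names `tokenize`, `prune_ocr_text`, and `find_similar_passage` are all illustrative assumptions.

```python
import re
from difflib import SequenceMatcher

# Assumption: a token made only of letters, apostrophes, and hyphens is "clean";
# anything else (digits, pipes, stray symbols) is treated as a recognition error.
_CLEAN_TOKEN = re.compile(r"^[A-Za-z][A-Za-z'-]*$")
_PUNCTUATION = ".,;:!?()\"'"


def tokenize(text):
    """Split on whitespace and strip surrounding punctuation from each token."""
    tokens = (raw.strip(_PUNCTUATION) for raw in text.split())
    return [t for t in tokens if t]


def prune_ocr_text(ocr_text):
    """Return the lowercase OCR tokens that do not look like recognition errors."""
    return [t.lower() for t in tokenize(ocr_text) if _CLEAN_TOKEN.match(t)]


def find_similar_passage(ocr_text, document_text):
    """Slide a window over the document and score it against the pruned OCR text.

    Returns (start_word_index, similarity) for the best window, or None if
    either input has no usable words.
    """
    query = prune_ocr_text(ocr_text)
    doc_words = [t.lower() for t in tokenize(document_text)]
    if not query or not doc_words:
        return None
    # The window is as long as the unpruned OCR text, so the positions of
    # pruned tokens become gaps that the sequence matcher can skip over.
    window = min(len(tokenize(ocr_text)), len(doc_words))
    best_start, best_score = 0, 0.0
    for start in range(len(doc_words) - window + 1):
        candidate = doc_words[start:start + window]
        score = SequenceMatcher(None, query, candidate).ratio()
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score
```

Calling `find_similar_passage(ocr_text, document_text)` returns the word offset of the best-matching window together with its similarity score. A real system would likely use an index-backed search query rather than a linear scan, which is used here only to keep the sketch self-contained.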

First claim

Opening claim text (preview).

The following is claimed:

1. A computer-implemented method for synchronizing annotations between a printed document and an electronic document, the method comprising: receiving, at a server, an image of a printed document portion having a printed text portion, wherein the printed document portion includes an annotation in proximity to the printed text portion; accessing at least a portion of the electronic document from a memory device, wherein the electronic document comprises an electronic version of the printed document; identifying an electronic text portion within the electronic document that corresponds to the printed text portion; detecting the annotation in the printed document portion; facilitating insertion of a digital annotation into at least one of the electronic document and a copy of the electronic document, the digital annotation corresponding to the detected annotation, wherein the digital annotation is inserted in proximity to the identified electronic text portion at a location that at least substantially corresponds to a location of the detected annotation in the printed document portion; identifying an additional digital annotation in the electronic document; determining, by analyzing the image of the printed document portion, that the printed document portion does not include an additional annotation corresponding to the additional digital annotation; determining whether the additional digital annotation is a migrated annotation or a direct digital annotation, wherein a migrated annotation is an annotation inserted during a synchronization process to correspond to an annotation in a printed document, and wherein a direct digital annotation is an annotation added directly to an electronic document by a reader; and removing the additional digital annotation from the electronic document if the additional digital annotation is a migrated annotation, wherein the additional digital annotation is not removed if the additional digital annotation is a direct digital annotation.

2. The method of claim 1, wherein determining whether the additional digital annotation is a migrated annotation or a direct digital annotation comprises determining a value of a tag associated with the additional digital annotation.

3. The method of claim 1, further comprising: identifying an annotation type of the detected annotation, wherein the annotation type comprises at least one of a highlight, a handwritten text, an underline, and a flag, and wherein facilitating insertion of the digital annotation into the electronic document comprises creating a digital annotation of the identified annotation type.

4. The method of claim 1, wherein facilitating insertion of the digital annotation in the electronic document comprises inserting electronic text into a margin of the electronic document.

5. The method of claim 1, wherein the electronic document comprises a platform-independent document format.

6. The method of claim 1, wherein identifying the corresponding electronic text portion comprises identifying a textual similarity between the printed text portion and the corresponding electronic text portion.

7. The method of claim 6, wherein identifying the textual similarity comprises: recognizing a part of the printed text portion by performing an optical character recognition (OCR) procedure on the image of the printed document portion; converting the recognized part of the printed text portion to recognized electronic text using the OCR procedure; identifying at least one character sequence in the recognized electronic text that includes a recognition error; creating pruned recognized electronic text, wherein the pruned recognized electronic text comprises the recognized electronic text from which the at least one character sequence has been removed; and searching the electronic document using at least one search query that comprises the pruned recognized electronic text.

8. One or more non-transitory computer-readable media having computer-executable instructions embodied thereon for facilitating synchronization of annotations between a printed document and an electronic document, wherein the instructions include a synchronizer having a plurality of program components, the plurality of program components comprising: a matching component that (1) receives an image of a printed document portion having a printed text portion and (2) identifies an electronic text portion within the electronic document that corresponds to the printed text portion; and a digital annotation component that facilitates insertion of a digital annotation into the electronic document in proximity to the identified corresponding electronic text portion; wherein the synchronizer is further configured to: identify an additional digital annotation in the electronic document; determine, by analyzing the image of the printed document portion, that the printed document portion does not include an annotation corresponding to the additional digital annotation; determine whether the additional digital annotation is a migrated annotation or a direct digital annotation, wherein a migrated annotation is an annotation inserted during a synchronization process to correspond to an annotation in a printed document, and wherein a direct digital annotation is an annotation added directly to an electronic document by a reader; and remove the additional digital annotation from the electronic document if the additional digital annotation is a migrated annotation, wherein the additional digital annotation is not removed if the additional digital annotation is a direct digital annotation.

9. The media of claim 8, wherein the matching component utilizes an optical character recognition (OCR) procedure to recognize the printed text portion within the printed document portion, and wherein the matching component utilizes one or more search queries to identify the corresponding electronic text portion.

10. The media of claim 9, wherein the matching component utilizes a pruning procedure to remove recognition errors from the recognized printed text portion.

11. The media of claim 8, further comprising: a detection component that detects an annotation in the printed document portion by analyzing at least the image of the printed document portion, wherein the printed document portion includes the annotation in proximity to the printed text portion.

12. The media of claim 11, wherein the detection component partitions the image of the printed document portion into a plurality of zones, the plurality of zones comprising at least one text zone and at least one candidate zone.

13. The media of claim 12, wherein the detection component analyzes the at least one candidate zone using at least one of a handwriting recognition procedure, an OCR procedure, a statistical language model, a bitmap overlay comparison, and a statistical classifier.

14. The media of claim 11, wherein the detection component detects the annotation based, at least in part, on feedback received from at least one reviewer via a crowd-sourcing platform.

15. A system that facilitates synchronization of annotations between a printed document and an electronic document, the system comprising: a server configured to receive, from an imaging device, an image of a printed document portion having a printed text portion, the server comprising a processor configured to instantiate a synchronizer configured to: (a) identify a corresponding electronic text portion in the electronic document, wherein the corresponding electronic text portion is textually similar to the printed text portion, (b) identi
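
The reconciliation rule shared by claims 1 and 8 (a digital annotation with no printed counterpart is removed only if it is a migrated annotation) and the tag check in claim 2 can be illustrated with a short sketch. This is an illustration under stated assumptions rather than the claimed implementation: the `DigitalAnnotation` class, the `anchor` key, the `"migrated"`/`"direct"` tag values, and the `synchronize` function are hypothetical names, and annotation detection in the image is assumed to have already produced a mapping from anchor to annotation type.

```python
from dataclasses import dataclass

MIGRATED = "migrated"  # inserted by an earlier synchronization pass
DIRECT = "direct"      # added to the electronic document by a reader


@dataclass(frozen=True)
class DigitalAnnotation:
    anchor: str  # hypothetical key for the text portion the annotation sits next to
    kind: str    # e.g. "highlight", "handwritten text", "underline", "flag"
    origin: str  # tag value: MIGRATED or DIRECT


def synchronize(existing, detected):
    """Reconcile digital annotations with those detected in the printed image.

    existing: list of DigitalAnnotation already in the electronic document.
    detected: dict mapping anchor -> annotation kind found on the printed page.
    Returns (annotations_after_sync, removed_annotations).
    """
    result, removed = [], []
    for annotation in existing:
        missing_in_print = annotation.anchor not in detected
        # Only migrated annotations are removed when the print no longer shows them;
        # direct annotations are always kept.
        if missing_in_print and annotation.origin == MIGRATED:
            removed.append(annotation)
        else:
            result.append(annotation)
    # Insert a migrated annotation for each printed annotation not yet mirrored.
    present = {a.anchor for a in result}
    for anchor, kind in detected.items():
        if anchor not in present:
            result.append(DigitalAnnotation(anchor, kind, MIGRATED))
    return result, removed
```

For example, given a migrated highlight at `p1`, a migrated underline at `p2`, and a direct flag at `p3`, a printed page showing annotations only at `p1` and `p4` leaves `p1` and `p3` in place, removes `p2`, and adds a new migrated annotation at `p4`. Matching by a single anchor key is a simplification; the claims speak of a location that "at least substantially corresponds" to the printed one.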

Assignees

Inventors

Classifications

  • G06F40/169 · Primary

    Annotation, e.g. comment data or footnotes · CPC title

  • by use of digital ink · CPC title

  • G06F40/10 · Primary

    Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40) · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9436665B2 cover?
An image of a printed document portion is provided to a synchronizer. The synchronizer retrieves an electronic version of the printed document and identifies an electronic text portion that is textually similar to a printed text portion. The synchronizer detects an annotation in the printed document portion and inserts a corresponding digital annotation into the electronic document.
Who is the assignee on this patent?
Thomson Reuters Global Resources (Trgr), Thomson Reuters Global Resources
What technology area does this patent fall under?
Primary CPC classification G06F40/169. Mapped technology areas include Physics.
When was this patent published?
Publication date: Sep 6, 2016 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).