Techniques for providing user image capture feedback for improved machine language translation

US2016203124A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016203124-A1
Application numberUS-201514594238-A
CountryUS
Kind codeA1
Filing dateJan 12, 2015
Priority dateJan 12, 2015
Publication dateJul 14, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented technique includes techniques are presented for user image capture feedback for improved machine language translation. When machine language translation of OCR text obtained from an initial image has a low degree of likelihood of being an appropriate translation, these techniques provide for user image capture feedback to obtain additional images to obtain a modified OCR text, which can result in improved machine language translation results. Instead of user image capture feedback, the techniques may obtain the modified OCR text by selecting another possible OCR text from the initial OCR operation. In addition to additional image capturing, light source intensity and/or a quantity/number of light source flashes can be adjusted. After obtaining the modified OCR text, another machine language translation can be obtained and, if it has a high enough degree of likelihood, it can then be output to a user.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method, comprising: receiving, at a computing system having one or more processors, a first image of an object comprising a text in a source language, the first image being captured by a camera in communication with the computing system; performing, by the computing system, optical character recognition (OCR) on the first image to obtain an OCR text that is a machine-encoded text representation of the text; obtaining, by the computing system, a first translated OCR text and a translation score indicative of a degree of likelihood that the first translated OCR text is an appropriate translation of the OCR text to a target language; and when the translation score is less than a translation score threshold indicative of an acceptable degree of likelihood: outputting, by the computing system, a user instruction to capture a set of second images of at least a portion of the object using the camera; receiving, at the computing system, the set of second images; performing, by the computing system, OCR on at least one of the set of second images to obtain a modified OCR text corresponding to the text; in response to obtaining the modified OCR text, obtaining, by the computing system, a second translated OCR text representing a translation of the modified OCR text from the source language to the target language; and outputting, by the computing system, the second translated OCR text. 2 . The computer-implemented method of claim 1 , further comprising capturing, by the computing system, the first image and the set of second images using the camera. 3 . The computer-implemented method of claim 2 , further comprising: when the translation score is less than the translation score threshold, adjusting, by the computing system, an intensity of a light source associated with the camera while capturing the second set of images, the light source being in communication with the computing system. 4 . The computer-implemented method of claim 2 , further comprising: when the translation score is less than the translation score threshold, adjusting, by the computing system, a number of flashes of a light source associated with the camera while capturing the second set of images, the light source being in communication with the computing system. 5 . The computer-implemented method of claim 1 , further comprising: when the translation score is less than the translation score threshold, identifying, by the computing system, a portion of the first image causing the OCR text and the corresponding first translated OCR text to have the translation score less than the translation score threshold, wherein the user instruction is to capture the set of second images with respect to the identified portion of the first image. 6 . The computer-implemented method of claim 5 , wherein performing OCR on at least one of the set of second images to obtain a modified OCR text corresponding to the text comprises: combining portions of at least two of the first image and the set of second images to obtain a combined image; and performing OCR on the combined image to obtain the modified OCR text. 7 . The computer-implemented method of claim 1 , wherein performing OCR on the first image to obtain the OCR text includes: performing, by the computing system, OCR on the first image to obtain a set of possible OCR texts, wherein each possible OCR text has a corresponding OCR score indicative of a degree of likelihood that the possible OCR text is the text; and selecting, by the computing system, the possible OCR text having a highest corresponding OCR score relative to the OCR scores of the other possible OCR texts to obtain the OCR text. 8 . The computer-implemented method of claim 7 , further comprising when the translation score is less than the translation score threshold, selecting, by the computing system, another one of the possible OCR texts to obtain the modified OCR text. 9 . The computer-implemented method of claim 8 , wherein the other one of the possible OCR texts has a second highest corresponding OCR score relative to the OCR scores of the other possible OCR texts. 10 . A computer-implemented method, comprising: receiving, at a computing system having one or more processors, a first image of an object comprising a text in a source language, the first image being captured by a camera in communication with the computing system; performing, by the computing system, optical character recognition (OCR) on the first image to obtain a set of possible OCR texts, wherein each possible OCR text has a corresponding OCR score indicative of a degree of likelihood that the possible OCR text is the text; selecting, by the computing system, the possible OCR text having a highest corresponding OCR score relative to the OCR scores of the other possible OCR texts to obtain an OCR text; and when the OCR score of the OCR text less than an OCR score threshold indicative of an acceptable degree of likelihood: outputting, by the computing system, a user instruction to capture a set of second images of at least a portion of the object using the camera; receiving, at the computing system, the set of second images; performing, by the computing system, OCR on at least one of the set of second images to obtain a modified OCR text corresponding to the text; and outputting, by the computing system, the modified OCR text. 11 . The computer-implemented method of claim 10 , further comprising capturing, by the computing system, the first image and the set of second images using the camera. 12 . The computer-implemented method of claim 11 , further comprising: when the OCR score is less than the OCR score threshold, adjusting, by the computing system, an intensity of a light source associated with the camera while capturing the second set of images, the light source being in communication with the computing system. 13 . The computer-implemented method of claim 11 , further comprising: when the OCR score is less than the OCR score threshold, adjusting, by the computing system, a number of flashes of a light source associated with the camera while capturing the second set of images, the light source being in communication with the computing system. 14 . The computer-implemented method of claim 10 , further comprising: when the translation score is less than the translation score threshold, identifying, by the computing system, a portion of the first image causing the OCR score to be less than the OCR score threshold, wherein the user instruction is to capture the set of second images with respect to the identified portion of the first image. 15 . The computer-implemented method of claim 14 , wherein performing OCR on at least one of the set of second images to obtain a modified OCR text corresponding to the text comprises: combining portions of at least two of the first image and the set of second images to obtain a combined image; and performing OCR on the combined image to obtain the modified OCR text. 16 . The computer-implemented method of claim 10 , further comprising when the OCR score is less than the OCR score threshold, selecting, by the computing system, another one of the possible OCR texts to obtain the modified OCR text. 17 . The computer-implemented method of claim 16 , wherein the other one of the possible OCR texts has a second highest corresponding OCR score relative to the OCR scores of the other possible OCR texts. 18 . The computer-implemented method of claim 10 , further comprising: ob

Assignees

Inventors

Classifications

  • G06V30/127Primary

    with the intervention of an operator · CPC title

  • Processing or translation of natural language (natural language analysis G06F40/20; semantic analysis G06F40/30) · CPC title

  • Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • Translation evaluation · CPC title

  • Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016203124A1 cover?
A computer-implemented technique includes techniques are presented for user image capture feedback for improved machine language translation. When machine language translation of OCR text obtained from an initial image has a low degree of likelihood of being an appropriate translation, these techniques provide for user image capture feedback to obtain additional images to obtain a modified OCR …
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06V30/127. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jul 14 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).