General dictionary for all languages

US9411801B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9411801-B2
Application numberUS-201213725095-A
CountryUS
Kind codeB2
Filing dateDec 21, 2012
Priority dateDec 21, 2012
Publication dateAug 9, 2016
Grant dateAug 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are implementations of methods and systems for displaying definitions and translations of words by searching for a translation simultaneously in various languages according to a query in a general language dictionary. The invention removes the need to specify a source language for the word or word combination when translated into a target language. The target language may be preset. Translation is possible for word combinations in multiple sources languages. Source words may be entered manually or captured by an imaging component of an electronic device. When captured, a word combination is selected, and subjected to optical character recognition (OCR) and translation. Source language and OCR language may be suggested via geolocation of the electronic device.

First claim

Opening claim text (preview).

I claim: 1. An electronic apparatus configured to translate words into a target language, the electronic apparatus comprising: an electronic display; an electronic processor; and instructions to cause the electronic apparatus to: acquire an image of text; detect a selection of a word or word combination in the image to be translated; perform optical character recognition (OCR) on the selected word or word combination using a character alphabet of a plurality of languages; generate a set of recognition variants for each word of the selected word or word combination; transmit each set of recognition variants to a set of language specific processors; eliminate language inappropriate variants from the set of recognition variants for each language, wherein the language inappropriate variants are the recognition variants which do not contain characters or symbols of the language; match each of remaining variants to a source language, wherein the remaining variants are the recognition variants minus the language inappropriate variants; confirm that at least one of the remaining variants is in at least one language specific word list; translate a confirmed word variant using a translation dictionary; and provide a translation of the confirmed word variant. 2. The electronic apparatus of claim 1 , wherein the instructions further cause the apparatus to: display proposed words from the language specific word list based on an unconfirmed recognition variant; detect selection of a proposed word or correction of a variant; and provide a translation of the proposed word or correction of the variant. 3. The electronic apparatus of claim 1 , wherein said matching of each of the remaining variants to a source language includes: matching language specific characters to the source language. 4. The electronic apparatus of claim 1 , wherein said eliminating language inappropriate variants from the set of recognition variants includes: eliminating an inappropriate variant based on detection of a language inconsistent character. 5. The electronic apparatus of claim 1 , wherein the instructions further cause the electronic apparatus to: prior to detecting the selection of a word or word combination in the image, perform OCR on a portion of the image of text to generate recognized words; locate recognized words in monolingual dictionaries; generate hypotheses about one or more source languages of the text; rank hypotheses based on a frequency of each language's use in the text; and limit, based on the ranked hypotheses, the character alphabet of a plurality of languages used in the OCR of the selected word or word combination. 6. The electronic apparatus of claim 1 , wherein the instructions further cause the electronic apparatus to: prior to detecting the selection of a word or word combination in the image, establish a geo-location position of the electronic apparatus; identify a region or country based on the geo-location position; determine one or more preferred languages for the identified region or country; generate a ranked list of preferred languages for the region or country; and limit the character alphabet of a plurality of languages used in the OCR of the selected word or word combination. 7. The electronic apparatus of claim 1 , wherein the instructions further cause the electronic apparatus to: display a translation of a multi-lingual word string with text of a secondary language in its original form; and emphasize the word or words of the multi-lingual word string that are in the secondary language. 8. The electronic apparatus of claim 7 , wherein the instructions further cause the electronic apparatus to: detect a selection on the emphasized word or words in the secondary language; and translate the selected emphasized word or words into a target language. 9. The electronic apparatus of claim 1 , wherein the instructions further cause the electronic apparatus to: display proposed words from the language specific word list where the proposals are based on similarity of spelling with a desired word or word combination to be translated; detect a selection of a proposed word or correction of a variant; and provide a translation of the proposed word or correction of the variant. 10. A computer-implemented method for translating user selected words without preliminarily specifying a source language of translation, the method comprising: acquiring an electronic image of text; detecting a selection of a word or word combination in the image to be translated; performing optical character recognition (OCR) on the selected word or word combination using a character alphabet of a plurality of languages; generating a set of recognition variants for each word of the selected word or word combination; transmitting each set of recognition variants to a set of language specific processors; eliminating language inappropriate variants from the set of recognition variants for each language at least by matching characters in a variant with alphabetic characters of the language specific processors; matching each of the remaining variants to a dictionary; translating a word variant using a translation dictionary; and providing a translation of the word variant. 11. The computer-implemented method of claim 10 , wherein the method further comprises: displaying proposed words from the language specific word list where the proposals are based on similarity of spelling with a desired word or word combination to be translated; detecting selection of a proposed word or correction of a variant; and providing a translation of the proposed word or correction of the variant. 12. The computer-implemented method of claim 10 , wherein said matching of each of the remaining variants to a specific language dictionary includes: identifying remaining variants in the dictionary of the language based on a language specific processor. 13. The computer-implemented method of claim 10 , wherein said eliminating language inappropriate variants from the set of recognition variants includes: eliminating an inappropriate variant based on detection of a language inconsistent character. 14. The computer-implemented method of claim 10 , wherein the method further comprises: prior to detecting the selection of a word or word combination in the image, performing OCR on a portion of the image of text to generate recognized words; searching for the recognized words in monolingual dictionaries; generating hypotheses about one or more source languages of the text; ranking hypotheses based on a frequency of each language's use in the text; and using the ranked hypotheses to limit the character alphabet of a plurality of languages used in the OCR of the selected word or word combination. 15. The computer-implemented method of claim 10 , wherein the method further comprises: prior to detecting the selection of a word or word combination in the image, establishing a geo-location position of the electronic apparatus; identifying a region or country based on the geo-location position; determining one or more preferred languages for the identified region or country; generating a ranked list of preferred languages for the region or country; and limiting the character alphabet of a plurality of languages used in the OCR of the selected word or word combination. 16. The computer-implemented method of claim 10 , wherein acquiring an electronic image of text includes using a camera of the electronic device or accessing the electronic image of text from a memory storage of an

Assignees

Inventors

Classifications

  • Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • G06F40/274Primary

    Converting codes to words; Guess-ahead of partial word inputs · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

  • G06F17/276Primary

    Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9411801B2 cover?
Disclosed are implementations of methods and systems for displaying definitions and translations of words by searching for a translation simultaneously in various languages according to a query in a general language dictionary. The invention removes the need to specify a source language for the word or word combination when translated into a target language. The target language may be preset. T…
Who is the assignee on this patent?
Osipova Maria, Abbyy Dev Llc
What technology area does this patent fall under?
Primary CPC classification G06F40/274. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).