Language processing apparatus and language processing method

US2017262435A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2017262435-A1
Application number  US-201715419327-A
Country             US
Kind code           A1
Filing date         Jan 30, 2017
Priority date       Mar 11, 2016
Publication date    Sep 14, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to an embodiment, a language processing apparatus includes a recognizer and a generator. The recognizer recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time. The generator applies a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string.
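The overlap-based generation described in the abstract can be illustrated with a minimal sketch. It assumes the simplest possible production rule: the end of the first recognized string is matched against the start of the second, and the two are joined on the longest shared run. The function name `merge_by_overlap` and the `min_overlap` parameter are hypothetical and do not come from the patent, which does not specify how the overlap is located.

```python
from typing import Optional


def merge_by_overlap(first: str, second: str, min_overlap: int = 1) -> Optional[str]:
    """Join two recognized strings on their longest suffix/prefix overlap.

    Assumption (not from the patent): the overlapping character string is a
    suffix of `first` and a prefix of `second`. Returns None when no overlap
    of at least `min_overlap` characters exists.
    """
    longest_possible = min(len(first), len(second))
    # Try the longest candidate overlap first so the join is as tight as possible.
    for size in range(longest_possible, min_overlap - 1, -1):
        if first.endswith(second[:size]):
            # Keep the overlap once, then append the rest of the later string.
            return first + second[size:]
    return None


if __name__ == "__main__":
    earlier = "The quick brown"   # string recognized at the first time
    later = "brown fox jumps"     # string recognized at the second time
    print(merge_by_overlap(earlier, later))
```

A real recognizer would likely need fuzzy matching, since character recognition errors can make the two copies of the overlapping text differ slightly; this sketch only handles exact matches.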

First claim

Opening claim text (preview).

1. A language processing apparatus comprising: a recognizer that recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and a generator that applies a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string.

2. The apparatus according to claim 1, wherein the first data and the second data are either image data corresponding to two screens acquired by capturing an image of an object including character information at different times, or two frames of image data included in a moving picture acquired by movie-recording an object including character information.

3. The apparatus according to claim 1, further comprising a translator that machine-translates the first character string to obtain a translated character string of a second language different from the first language.

4. The apparatus according to claim 3, wherein: the first data is image data including a first character string area in which at least the first character string is displayed; the second data is image data, including a second character string area in which at least the second character string is displayed; and the recognizer further recognizes the first character string area from the first data and the second character string area from the second data, the apparatus further comprising an image controller that sequentially generates translated image data obtained by replacing the character string displayed in each of the first character string area and the second character string area in the first data and the second data with the translated character string.

5. The apparatus according to claim 1, wherein the production rule includes a concatenation rule using the first overlapping character string.

6. The apparatus according to claim 5, wherein the production rule further includes a division rule using a linguistic feature of the first character string and the second character string.

7. The apparatus according to claim 6, wherein the linguistic feature includes at least one of a period, a comma, a symbol, and an auxiliary verb.

8. The apparatus according to claim 1, wherein the generator further includes: a calculator that calculates a score indicating a likelihood of the first resultant character string; and a determiner that determines whether or not the score is equal to or higher than a threshold, and wherein the generator keeps the first resultant character string from being output, when the score is lower than the threshold.

9. The apparatus according to claim 8, wherein the calculator collates the first resultant character string with a language model to calculate the score.

10. The apparatus according to claim 1, further comprising a buffer that stores the first character string in association with the first time, and the second character string in association with the second time.

11. The apparatus according to claim 10, wherein: the recognizer further recognizes a third character string of the first language including a second overlapping character string, which overlaps with the first character string, from third data associated with a third time later than the second time; the buffer stores the third character string in association with the third time; and the generator applies the production rule to the first character string and the third character string to generate a second resultant character string of the first language including the second overlapping character string, instead of generating the first resultant character string, depending on a difference between the first character string and the second character string.

12. The apparatus according to claim 1, wherein the generator terminates generation of the first resultant character string when the generator detects that a head portion of a fourth character string of the first language recognized from fourth data associated with a fourth time prior to the second time coincides with an end portion of the second character string.

13. A language processing method comprising: recognizing a first character string of a first language from first data associated with a first time; recognizing a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and applying a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string.

14. A non-transitory computer readable storage medium storing instructions of a computer program which when executed by a computer results in performance of steps comprising: recognizing a first character string of a first language from first data associated with a first time; recognizing a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and applying a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string.
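The division rule (claims 6 and 7) and the score threshold (claims 8 and 9) can be sketched roughly as follows. Everything here is an illustrative assumption rather than the patented method: the division step splits only on sentence-ending punctuation (a small subset of the listed linguistic features), and the language model is a toy add-one-smoothed unigram model. The names `divide_on_punctuation`, `UnigramModel`, and `emit_if_likely` are hypothetical.

```python
import math
import re
from collections import Counter
from typing import List, Optional


def divide_on_punctuation(text: str) -> List[str]:
    """Split text after '.', '!' or '?' (an assumed, simplified division rule)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


class UnigramModel:
    """Toy stand-in for the language model; real systems would use something richer."""

    def __init__(self, corpus: str) -> None:
        self.counts = Counter(corpus.lower().split())
        self.total = sum(self.counts.values())
        self.vocab = len(self.counts)

    def score(self, text: str) -> float:
        """Average per-word log-probability with add-one smoothing."""
        words = text.lower().split()
        if not words:
            return float("-inf")
        denominator = self.total + self.vocab + 1  # +1 reserves mass for unseen words
        log_prob = sum(math.log((self.counts[w] + 1) / denominator) for w in words)
        return log_prob / len(words)


def emit_if_likely(candidate: str, model: UnigramModel, threshold: float) -> Optional[str]:
    """Return the candidate only when its score reaches the threshold, else None."""
    return candidate if model.score(candidate) >= threshold else None


if __name__ == "__main__":
    model = UnigramModel("the exit is on the left the exit is open")
    for candidate in ["the exit is open", "xq zz"]:
        print(candidate, "->", emit_if_likely(candidate, model, threshold=-2.0))
    print(divide_on_punctuation("Exit on the left. Mind the gap."))
```

The threshold value of -2.0 is arbitrary and chosen only so that the in-vocabulary candidate passes and the garbled one is withheld; in practice it would be tuned against the chosen language model.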

Assignees

Toshiba Kk

Inventors

Classifications

  • G06F40/268 (Primary)

    Morphological analysis · CPC title

  • Machine-assisted translation, e.g. using translation memory · CPC title

  • Processing of non-Latin text (kana-to-kanji conversion G06F40/129; vowelisation G06F40/232) · CPC title

  • Recognition of textual entities · CPC title

  • Natural language generation · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017262435A1 cover?
According to an embodiment, a language processing apparatus includes a recognizer and a generator. The recognizer recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associat…
Who is the assignee on this patent?
Toshiba Kk
What technology area does this patent fall under?
Primary CPC classification G06F40/268. Mapped technology areas include Physics.
When was this patent published?
Publication date: Sep 14, 2017 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).