What technology area does this patent fall under?

Primary CPC classification G06F40/268. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Sep 14 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Language processing apparatus and language processing method

US2017262435A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2017262435-A1
Application number	US-201715419327-A
Country	US
Kind code	A1
Filing date	Jan 30, 2017
Priority date	Mar 11, 2016
Publication date	Sep 14, 2017
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to an embodiment, a language processing apparatus includes a recognizer and a generator. The recognizer recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time. The generator applies a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string.

First claim

Opening claim text (preview).

1 . A language processing apparatus comprising: a recognizer that recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and a generator that applies a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string. 2 . The apparatus according to claim 1 , wherein the first data and the second data are either image data corresponding to two screens acquired by capturing an image of an object including character information at different times, or two frames of image data included in a moving picture acquired by movie-recording an object including character information. 3 . The apparatus according to claim 1 , further comprising a translator that machine-translates the first character string to obtain a translated character string of a second language different from the first language. 4 . The apparatus according to claim 3 , wherein: the first data is image data including a first character string area in which at least the first character string is displayed; the second data is image data, including a second character string area in which at least the second character string is displayed; and the recognizer further recognizes the first character string area from the first data and the second character string area from the second data, the apparatus further comprising an image controller that sequentially generates translated image data obtained by replacing the character string displayed in each of the first character string area and the second character string area in the first data and the second data with the translated character string. 5 . The apparatus according to claim 1 , wherein the production rule includes a concatenation rule using the first overlapping character string. 6 . The apparatus according to claim 5 , wherein the production rule further includes a division rule using a linguistic feature of the first character string and the second character string. 7 . The apparatus according to claim 6 , wherein the linguistic feature includes at least one of a period, a comma, a symbol, and an auxiliary verb. 8 . The apparatus according to claim 1 , wherein the generator further includes: a calculator that calculates a score indicating a likelihood of the first resultant character string; and a determiner that determines whether or not the score is equal to or higher than a threshold, and wherein the generator keeps the first resultant character string from being output, when the score is lower than the threshold. 9 . The apparatus according to claim 8 , wherein the calculator collates the first resultant character string with a language model to calculate the score. 10 . The apparatus according to claim 1 , further comprising a buffer that stores the first character string in association with the first time, and the second character string in association with the second time. 11 . The apparatus according to claim 10 , wherein: the recognizer further recognizes a third character string of the first language including a second overlapping character string, which overlaps with the first character string, from third data associated with a third time later than the second time; the buffer stores the third character string in association with the third time; and the generator applies the production rule to the first character string and the third character string to generate a second resultant character string of the first language including the second overlapping character string, instead of generating the first resultant character string, depending on a difference between the first character string and the second character string. 12 . The apparatus according to claim 1 , wherein the generator terminates generation of the first resultant character string when the generator detects that a head portion of a fourth character string of the first language recognized from fourth data associated with a fourth time prior to the second time coincides with an end portion of the second character string. 13 . A language processing method comprising: recognizing a first character string of a first language from first data associated with a first time; recognizing a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and applying a production rule to the first character string and the second character string to generate a first resultant character string of the first language including the first overlapping character string. 14 . A non-transitory computer readable storage medium storing instructions of a computer program which when executed by a computer results in performance of steps comprising: recognizing a first character string of a first language from first data associated with a first time; recognizing a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associated with a second time later than the first time; and applying a production rule to the first character string and the second character string to generate a first resultant character ring of the first language including the first overlapping character string.

Assignees

Toshiba Kk

Inventors

Sonoo Satoshi

Classifications

G06F40/268Primary
Morphological analysis · CPC title
G06F40/47
Machine-assisted translation, e.g. using translation memory · CPC title
G06F40/53
Processing of non-Latin text (kana-to-kanji conversion G06F40/129; vowelisation G06F40/232) · CPC title
G06F40/279
Recognition of textual entities · CPC title
G06F40/56
Natural language generation · CPC title

Patent family

Related publications grouped by family.

View patent family 59786611

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017262435A1 cover?: According to an embodiment, a language processing apparatus includes a recognizer and a generator. The recognizer recognizes a first character string of a first language from first data associated with a first time and recognizes a second character string of the first language including a first overlapping character string which overlaps with the first character string from second data associat…
Who is the assignee on this patent?: Toshiba Kk
What technology area does this patent fall under?: Primary CPC classification G06F40/268. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Sep 14 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Techniques for providing user image capture feedback for improved machine language translation

Automated recognition of text utilizing multiple images

Forming scanned composite document with optical character recognition function

Image-based character recognition

Frequently asked questions