Object recognition system and an object recognition method

US9508019B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9508019-B2
Application numberUS-201414190539-A
CountryUS
Kind codeB2
Filing dateFeb 26, 2014
Priority dateMar 1, 2013
Publication dateNov 29, 2016
Grant dateNov 29, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An object recognition system is applicable to practical use, and utilizes image information besides speech information to improve recognition accuracy. The object recognition system comprises a speech recognition unit to determine candidates for a result of speech recognition on input speech and their likelihoods, and an image model generation unit to generate image models of a predetermined number of the candidates having the highest likelihoods. The system further comprises an image likelihood calculation unit to calculate image likelihoods of input images based on the image models, and an object recognition unit to perform object recognition using the image likelihoods. At the time of generating the image model of the candidate, the image model generation unit first searches an image model database, and, when the image model of the candidate is not found in the database, the image model generation unit generates said image model from image information on the web.

First claim

Opening claim text (preview).

The invention claimed is: 1. An object recognition system comprising a processor and one or more memories, the processor configured to: determine candidates as a result of speech recognition on input speech and their speech likelihoods; get image models of a predetermined number of the candidates having the highest speech likelihoods; calculate image likelihoods of the image model that each image model corresponds to an input image; and perform object recognition using the image likelihoods, wherein, in the step of getting image models, the processor searches an image model database for the image model, and then, when the image model of the candidate is not found in the database, the processor gets said image model from image information on the web. 2. The object recognition system according to claim 1 , wherein the processor performs the object recognition based on the speech likelihoods and the image likelihoods. 3. The object recognition system according to claim 2 , wherein, at the time of getting the image models of the candidates from image information on the web, the processor performs clustering of feature amounts of images collected from the web, and gets an image model for each of clusters. 4. The object recognition system according to claim 1 , wherein, at the time of getting the image models of the candidates from image information on the web, the processor performs clustering of feature amounts of images collected from the web, and gets an image model for each of clusters. 5. An object recognition method comprising steps of: determining candidates as a result of speech recognition on input speech and their likelihoods; getting image models of a predetermined number of the candidates having the highest likelihoods; calculating image likelihoods of the image models that each image model corresponds to an input image; and performing object recognition using the image likelihoods, wherein, in the step of getting image models, an image model database is searched for the image model, and then, when the image model of the candidate is not found in the database, said image model is gotten from image information on the web.

Assignees

Inventors

Classifications

  • using a plurality of salient features, e.g. bag-of-words [BoW] representations · CPC title

  • G10L15/00Primary

    Speech recognition (G10L17/00 takes precedence) · CPC title

  • for retrieval · CPC title

  • G06K9/4676Primary

    Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9508019B2 cover?
An object recognition system is applicable to practical use, and utilizes image information besides speech information to improve recognition accuracy. The object recognition system comprises a speech recognition unit to determine candidates for a result of speech recognition on input speech and their likelihoods, and an image model generation unit to generate image models of a predetermined nu…
Who is the assignee on this patent?
Honda Motor Co Ltd, Nat Univ Corp Kobe Univ
What technology area does this patent fall under?
Primary CPC classification G10L15/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 29 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).