Voice detection, apparatus, voice detection method, and non-transitory computer-readable storage medium

US10424320B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10424320-B2
Application numberUS-201815885031-A
CountryUS
Kind codeB2
Filing dateJan 31, 2018
Priority dateMar 28, 2017
Publication dateSep 24, 2019
Grant dateSep 24, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A robot determines whether a voice is a voice emanated directly from an actual person or is a voice output from a speaker of an electronic device. A controller of a robot detects a voice by means of microphones, determines whether or not a voice generating source of the detected voice is a specific voice generating source, and controls, based on a result of the determination, the robot by means of a neck joint and a chassis.

First claim

Opening claim text (preview).

What is claimed is: 1. A voice detection apparatus mountable on a robot and comprising a processor, wherein the processor: stores, in advance, second voice generating source position information that includes information indicating a position of a second voice generating source, the second voice generating source being a voice generating source other than a first voice generating source, detects a voice that includes sound, determines whether or not a voice generating source of the detected voice is the first voice generating source, determines a direction where the detected voice is generated with respect to the voice detection apparatus, determines whether or not the second voice generating source exists in a direction where the determined detected voice is generated, based on the stored second voice generating source position information, and executes a control of at least one of a position and a posture of the voice detection apparatus, when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in the direction where the determined detected voice is generated, and wherein the processor further: images an image, and executes the control of said at least one of the position and the posture of the voice detection apparatus so as to direct an imaging direction in the imaging to a direction where the determined detected voice is generated, when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in a direction where the determined detected voice is generated; and wherein the processor appends, to the second voice generating source position information, while moving the voice detection apparatus, information indicating a position of a voice generating source that is recognized in the imaged image. 2. The voice detection apparatus according to claim 1 , wherein the processor stores information indicating probability that the second voice generating source exists at the position of the second voice generating source indicated in the information included in the second voice generating source position information. 3. The voice detection apparatus according to claim 1 , wherein the processor determines whether or not the detected voice is a voice directed toward the voice detection apparatus and, when determining that the detected voice is a voice directed toward the voice detection apparatus, determines whether or not the voice generating source of the detected voice is the first voice generating source. 4. The voice detection apparatus according to claim 1 , wherein the processor does not execute the control, when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source exists in a direction where the determined detected voice is generated. 5. A voice detection method by which a computer mounted on a robot detects a voice, the voice detection method comprising: storing, in advance, second voice generating source position information that includes information indicating a position of a second voice generating source, the second voice generating source being a voice generating source other than a first voice generating source; detecting a voice that includes sound; determining whether or not a voice generating source of the detected voice is a the first voice generating source; determining a direction where the detected voice is generated with respect to the robot; determining whether or not the second voice generating source exists in a direction where the determined detected voice is generated, based on the stored second voice generating source position information; and executing a control of at least one of a position and a posture of the robot when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in a direction where the determined detected voice is generated, wherein the voice detection method further comprises: imaging an image; executing the control of said at least one of the position and the posture of the voice detection apparatus so as to direct an imaging direction in the imaging to a direction where the determined detected voice is generated, when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in a direction where the determined detected voice is generated; and appending, to the second voice generating source position information, while moving the voice detection apparatus, information indicating a position of a voice generating source that is recognized in the imaged image. 6. A non-transitory computer-readable storage medium having stored thereon a program executable by a computer mounted on a robot, the program controlling the computer to perform functions comprising: storing, in advance, second voice generating source position information that includes information indicating a position of a second voice generating source, the second voice generating source being a voice generating source other than a first voice generating source; detecting a voice that includes sound; determining whether or not a voice generating source of the detected voice is a the first voice generating source; determining a direction where the detected voice is generated with respect to the robot; determining whether or not the second voice generating source exists in a direction where the determined detected voice is generated, based on the stored second voice generating source position information; and executing a control of at least one of a position and a posture of the robot when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in a direction where the determined detected voice is generated, wherein the program controls the computer to perform further functions comprising: imaging an image; executing the control of said at least one of the position and the posture of the voice detection apparatus so as to direct an imaging direction in the imaging to a direction where the determined detected voice is generated, when determining that the voice generating source of the detected voice is the first voice generating source, and that the second voice generating source does not exist in a direction where the determined detected voice is generated; and appending, to the second voice generating source position information, while moving the voice detection apparatus, information indicating a position of a voice generating source that is recognized in the imaged image.

Assignees

Inventors

Classifications

  • Speaker identification or verification techniques · CPC title

  • based on threshold decision · CPC title

  • G10L25/78Primary

    Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • Interactive procedures; Man-machine interfaces · CPC title

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10424320B2 cover?
A robot determines whether a voice is a voice emanated directly from an actual person or is a voice output from a speaker of an electronic device. A controller of a robot detects a voice by means of microphones, determines whether or not a voice generating source of the detected voice is a specific voice generating source, and controls, based on a result of the determination, the robot by…
Who is the assignee on this patent?
Casio Computer Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L25/78. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 24 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).