Voice dialog device and voice dialog method

US10395653B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10395653-B2
Application numberUS-201715598504-A
CountryUS
Kind codeB2
Filing dateMay 18, 2017
Priority dateMay 27, 2016
Publication dateAug 27, 2019
Grant dateAug 27, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A voice dialog device, comprises a sight line detection unit configured to detect a sight line of a user; a voice processing unit configured to obtain voice pronounced by the user and a result of recognizing the voice; a dialog determination unit configured to determine whether or not the voice dialog device has a dialog with the user; and an answer generation unit configured to generate an answer, based on a result of recognizing the voice, wherein the dialog determination unit determines whether or not the user has started the dialog, based on both the sight line of the user and the obtained voice.

First claim

Opening claim text (preview).

What is claimed is: 1. A voice dialog device, comprising: a sight line detection unit configured to detect a sight line of a user; a state determination unit configured to determine a state of the user; a voice processing unit configured to obtain voice pronounced by the user and a result of recognizing the voice; a dialog determination unit configured to determine whether or not the voice dialog device has a dialog with the user; and an answer generation unit configured to generate an answer, based on a result of recognizing the voice, wherein the dialog determination unit determines whether or not the user has started the dialog, based on both the sight line of the user and the obtained voice by: when the determined state of the user is driving a vehicle, the detected sight line of the user is in a vehicle forward direction, and a start keyword has been detected from the voice, the dialog determination unit determines that the user has started a dialog, when the determined state of the user is driving a vehicle and either the detected sight line of the user is not in the vehicle forward direction or the start keyword has not been detected from the voice, the dialog determination unit determines that the user has not started a dialog, when the determined state of the user is not driving a vehicle, the detected sight line of the user is in a direction toward the voice dialogue device, and a start keyword has been detected from the voice, the dialog determination unit determines that the user has started a dialog, and when the determined state of the user is not driving a vehicle and either the detected sight line of the user is not in the direction toward the voice dialogue device or the start keyword has not been detected from the voice, the dialog determination unit determines that the user has not started a dialog. 2. The voice dialog device according to claim 1 , wherein when the voice dialog device has the dialog with the user and a termination keyword has been detected from the voice, the dialog determination unit determines that the dialog has been terminated. 3. A voice dialog method performed by a voice dialog device, comprising: a sight line detecting step of detecting a sight line of a user; a state determining step of determining a state of the user; a voice processing step of obtaining voice pronounced by the user and a result of recognizing the voice; a dialog determining step of determining whether or not the voice dialog device has a dialog with the user; and an answer generating step of generating an answer, based on a result of recognizing the voice, wherein, the dialog determining step includes determining whether or not the user has started the dialog based on both the sight line of the user and the obtained voice by: when state of the user determined by the state determining step is driving a vehicle, the sight line of the user detected by the sight line detecting step is in a vehicle forward direction, and a start keyword has been detected from the voice by the voice processing step, the dialog determining step determines that the user has started a dialog, when the state of the user determined by the state determining step is driving a vehicle and either the sight line of the user detected by the sight line detecting step is not in the vehicle forward direction or the start keyword has not been detected from the voice by the voice processing step, the dialog determining step determines that the user has not started a dialog, when the state of the user determined by the state determining step is not driving a vehicle, the sight line of the user detected by the sight line detecting step is in a direction toward the voice dialogue device, and a start keyword has been detected from the voice by the voice processing step, the dialog determining step determines that the user has started a dialog, and when the state of the user determined by the state determining step is not driving a vehicle and either the sight line of the user detected by the sight line detecting step is not in the direction toward the voice dialogue device or the start keyword has not been detected from the voice, the start keyword has not been detected from the voice by the voice processing step, the dialog determining step determines that the user has not started a dialog. 4. A non-transitory computer readable storing medium recording a computer program for causing a computer to perform the voice dialog method according to claim 3 .

Assignees

Inventors

Classifications

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer · CPC title

  • Detection arrangements using opto-electronic means (constructional details of pointing devices not related to the detection arrangement using opto-electronic means G06F3/033; optical digitisers G06F3/042) · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Speech classification or search · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10395653B2 cover?
A voice dialog device, comprises a sight line detection unit configured to detect a sight line of a user; a voice processing unit configured to obtain voice pronounced by the user and a result of recognizing the voice; a dialog determination unit configured to determine whether or not the voice dialog device has a dialog with the user; and an answer generation unit configured to generate an ans…
Who is the assignee on this patent?
Toyota Motor Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 27 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).