Voice input apparatus, control method thereof, and storage medium for executing processing corresponding to voice instruction

US11735181B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11735181-B2
Application numberUS-202117156885-A
CountryUS
Kind codeB2
Filing dateJan 25, 2021
Priority dateFeb 5, 2020
Publication dateAug 22, 2023
Grant dateAug 22, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A voice input apparatus includes a voice input device configured to input voice and performs control to, in a case where a second voice instruction for operating the voice input apparatus is input in a predetermined period after a first voice instruction is input to the voice input apparatus, execute processing corresponding to the second voice instruction. The voice input apparatus changes a length of the predetermined period, according to the first voice instruction.

First claim

Opening claim text (preview).

What is claimed is: 1. A camera configured to set a length for a predetermined period for receiving a second voice instruction after receiving a first voice instruction, comprising: a microphone configured to input voice instructions; a memory configured to store program instructions; and one or more processors configured to execute the program instructions to function as: a control unit configured to perform control to: receive the first voice instruction through the microphone, the first voice instruction being a wake word, of a plurality of wake words, for enabling operation by voice on the camera, the plurality of wake words including a first wake word for setting the length for the predetermined period to a first length representing a normal length and a second wake word for setting the length for the predetermined period to a second length longer than the normal length; determine whether the received first voice instruction corresponds to the first wake word or the second wake word: set the length for the predetermined period for receiving the second voice instruction to the first length in a case where it is determined that the received first voice instruction is the first wake word and to the second length in a case where it is determined that the received first voice instruction is the second wake word; and in a case where the second voice instruction for operating the camera is input in the predetermined period after the first voice instruction is input to the camera, execute processing corresponding to the second voice instruction. 2. The voice input apparatus according to claim 1 , wherein the one or more processors further execute the instructions to function as a manual setting unit configured to enable a user to set the length for the predetermined period to different values for different voice instructions included in the first voice instruction. 3. The voice input apparatus according to claim 1 , wherein the one or more processors further execute the instructions to function as an automatic setting unit configured to set the length for the predetermined period to different values for different voice instructions included in the first voice instruction, based on a history of past voice instructions. 4. The voice input apparatus according to claim 3 , wherein the history of past voice instructions includes an input interval between a plurality of voice instructions. 5. The voice input apparatus according to claim 1 , wherein the control unit is further configured to execute processing corresponding to the second voice instruction in a case where, when the second voice instruction is input, the voice input apparatus is in a state corresponding to the input second voice instruction, and configured not to execute processing corresponding to the second voice instruction in a case where the voice input apparatus is not in the state corresponding to the input second voice instruction. 6. The voice input apparatus according to claim 5 , wherein the state corresponding to the second voice instruction includes an operating mode of the voice input apparatus. 7. The voice input apparatus according to claim 1 , wherein an operating mode of the voice input apparatus includes a mode for displaying content on a display unit of the voice input apparatus, and a mode for displaying a setting value of the voice input apparatus on the display unit of the voice input apparatus. 8. A control method of a camera configured to set a length for a predetermined period for receiving a second voice instruction after receiving a first voice instruction, the camera comprising a microphone configured to input voice instructions, the control method comprising: receiving the first voice instruction through the microphone, the first voice instruction being a wake word, of a plurality of wake words, for enabling operation by voice on the camera, the plurality of wake words including a first wake word for setting the length for the predetermined period to a first length representing a normal length and a second wake word for setting the length for the predetermined period to a second length longer than the normal length; determining whether the received first voice instruction corresponds to the first wake word or the second wake word: setting the length for the predetermined period for receiving the second voice instruction to the first length in a case where it is determined that the received first voice instruction is the first wake word and to the second length in a case where it is determined that the received first voice instruction is the second wake word; and in a case where the second voice instruction for operating the camera is input in the predetermined period after the first voice instruction is input to the camera, executing processing corresponding to the second voice instruction. 9. A non-transitory computer-readable storage medium comprising instructions for performing a control method of a camera configured to set a length for a predetermined period for receiving a second voice instruction after receiving a first voice instruction, the camera comprising a microphone configured to input voice instructions, the control method comprising: receiving the first voice instruction through the microphone, the first voice instruction being a wake word, of a plurality of wake words, for enabling operation by voice on the camera, the plurality of wake words including a first wake word for setting the length for the predetermined period to a first length representing a normal length and a second wake word for setting the length for the predetermined period to a second length longer than the normal length-determining whether the received first voice instruction corresponds to the first wake word or the second wake word: setting the length for the predetermined period for receiving the second voice instruction to the first length in a case where it is determined that the received first voice instruction is the first wake word and to the second length in a case where it is determined that the received first voice instruction is the second wake word; and in a case where the second voice instruction for operating the camera is input in the predetermined period after the first voice instruction is input to the camera, executing processing corresponding to the second voice instruction.

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G06F3/167Primary

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Speech classification or search · CPC title

  • Word spotting · CPC title

  • Execution procedure of a spoken command · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11735181B2 cover?
A voice input apparatus includes a voice input device configured to input voice and performs control to, in a case where a second voice instruction for operating the voice input apparatus is input in a predetermined period after a first voice instruction is input to the voice input apparatus, execute processing corresponding to the second voice instruction. The voice input apparatus changes a l…
Who is the assignee on this patent?
Canon Kk
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 22 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).