Systems and methods for selectively activating and interacting with a speech recognition service during application runtime without interrupting execution of the application

US10937425B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10937425-B2
Application number  US-201916244818-A
Country             US
Kind code           B2
Filing date         Jan 10, 2019
Priority date       Jan 10, 2019
Publication date    Mar 2, 2021
Grant date          Mar 2, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are provided that may be implemented to allow an information handling system user to selectively activate and interact with a speech recognition service for system control during application runtime on the same system, and without interrupting the separate application on the same information handling system. The activated speech recognition service may respond to verbal commands to control one or more operating characteristics of the information handling system while the separate application (e.g., such as a computer game) is simultaneously executing without interruption on the same system. A user may be allowed to activate the speech recognition service using one or more different activation modes, e.g., using eye tracking, call word recognition, and/or hardware input/output actuation.

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: executing an application on a first information handling system to display a graphics scene to a human user of the first information handling system on a display device; receiving analog audio sounds from the human user of the first information handling system that is viewing the displayed graphics scene, and transferring the received audio sounds from the first information handling system as outgoing voice chat to one or more other information handling systems communicatively coupled by a network to the first information handling system; detecting an activation action from the human user of the first information handling system while the voice chat is unmuted and while the graphics scene is displayed on the display device; and selectively activating at least one deactivated separate service on the first information handling system that is different from the executing application to respond to the detected activation action without interrupting the executing application, the response by the separate service to the detected activation action comprising: temporarily muting the outgoing voice chat and using speech recognition while the outgoing voice chat is muted to receive and analyze any analog audio signals received from the human user of the first information handling system while the outgoing voice chat is muted to recognize a predefined voice command spoken by the human user of the first information handling system while the outgoing voice chat is muted, then, while the outgoing voice chat remains muted, executing a command corresponding to the predefined voice command to take at least one predefined response action while the outgoing voice chat remains muted to modify one or more operating characteristics of the executing application and/or other components of the information handling system, and then unmuting the outgoing voice chat and deactivating the separate service after and in response to executing the command.

2. The method of claim 1, where the response by the separate service to the detected activation action further comprises temporarily muting the outgoing voice chat while using speech recognition while the outgoing voice chat is muted to receive and analyze any analog audio signals received from the human user of the first information handling system while the outgoing voice chat is muted to recognize a predefined voice command spoken by the human user of the first information handling system for a predetermined maximum threshold value of elapsed listening time that begins when the activation action is first detected; and then unmuting the outgoing voice chat and deactivating the separate service after expiration of the predetermined maximum threshold value of elapsed listening time if no analog audio signals are received before the expiration of the predetermined maximum threshold value of elapsed listening time that are recognized as a predefined voice command spoken by the human user of the first information handling system.

3. The method of claim 2, further comprising only using the speech recognition while the outgoing voice chat is muted during the predetermined maximum threshold value of elapsed listening time to receive and analyze any analog audio signals received from the human user of the first information handling system while the outgoing voice chat is muted to recognize a predefined voice command spoken by the human user of the first information handling system.

4. The method of claim 1, where the detected activation action from the human user of the first information handling system comprises detection of a tracked gaze of the human user of the first information handling system upon a predetermined activation area.

5. The method of claim 1, where the detected activation action from the human user of the first information handling system comprises detection of a predetermined call word or activation phrase spoken by the human user of the first information handling system.

6. The method of claim 1, where the detected activation action from the human user of the first information handling system comprises detection of the actuation of a predetermined hardware (input/output) I/O of the first information handling system by the human user of the first information handling system.

7. The method of claim 1, further comprising responding to the detected activation action by temporarily displaying a user interface (UI) over at least a portion of the graphics scene on the display device while muting the outgoing voice chat and using speech recognition to receive and analyze any analog audio signals received from the human user of the first information handling system while the outgoing voice chat is muted to recognize a predefined voice command spoken by the human user of the first information handling system; and then ceasing display of the UI over the at least a portion of the graphics scene on the display device when unmuting the outgoing voice chat.

8. The method of claim 1, further comprising responding to the detected activation action by: temporarily muting the outgoing voice chat and using speech recognition while the outgoing voice chat is muted for a predetermined maximum threshold value of elapsed listening time to listen for and analyze any analog audio signals received from the human user of the first information handling system while the outgoing voice chat is muted to recognize a predefined voice command spoken by the human user of the first information handling system while the outgoing voice chat is muted, then executing a command corresponding to any predefined voice command received before expiration of the predetermined maximum threshold value of elapsed listening time to take the at least one predefined response action while the outgoing voice chat remains muted to modify the one or more operating characteristics of the executing application and/or other components of the information handling system, and then unmuting the outgoing voice chat upon occurrence of either the expiration of the predetermined maximum value of elapsed listening time threshold or after and in response to the execution of the command corresponding to a predefined voice command to take the at least one predefined response action to modify the one or more operating characteristics of the executing application and/or other components of the information handling system.

9. The method of claim 1, where the at least one response action comprises at least one of opening a new application on the first information handling system, closing an existing application on the first information handling system, sending a text message or in-application message from the first information handling system to a user of another information handling system, recording a video on the first information handling system or taking a screenshot of the displayed graphics scene on the first information handling system, uploading or posting a video or screenshot of the displayed graphics scene across the network from the first information handling system to a server, changing a selected keyboard lighting theme on the first information handling system, changing keyboard haptics settings on the first information handling system, or changing a selected keyboard macro on the first information handling system.

10. A method, comprising: executing an application on a first information handling system to display a graphics scene to a human user of the first information handling system on a display device; tracking a gaze of the human user of the first information handling system that is viewing the displayed graphics scene; detecting a location of the tracked gaze of the human user of the first i
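
Illustrative sketch (not part of the patent text). The sequence recited in claims 1, 2 and 8 (activate a separate service, mute outgoing voice chat, listen for a predefined command for a bounded time, execute it while still muted, then unmute and deactivate) can be modeled roughly as below. Every name here (`VoiceChat`, `CommandService`, `on_activation`, `max_listen_seconds`) is hypothetical and chosen for illustration, and the list of timestamped phrases stands in for a real speech recognizer; the patent does not prescribe this structure.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Optional, Tuple


@dataclass
class VoiceChat:
    """Hypothetical stand-in for the outgoing voice chat channel."""
    muted: bool = False


@dataclass
class CommandService:
    """Hypothetical separate service that stays inactive until activated."""
    chat: VoiceChat
    commands: Dict[str, Callable[[], None]]   # predefined voice commands
    max_listen_seconds: float = 5.0           # assumed listening threshold
    active: bool = False

    def on_activation(self, heard: Iterable[Tuple[float, str]]) -> Optional[str]:
        """Handle one activation action (gaze, call word, or hardware I/O).

        `heard` yields (seconds since activation, recognized phrase) pairs.
        Returns the executed command phrase, or None if the window expired.
        """
        self.active = True
        self.chat.muted = True            # mute outgoing chat while listening
        executed = None
        try:
            for elapsed, phrase in heard:
                if elapsed > self.max_listen_seconds:
                    break                 # listening window expired
                action = self.commands.get(phrase)
                if action is not None:
                    action()              # run the command while still muted
                    executed = phrase
                    break
        finally:
            self.chat.muted = False       # unmute after command or timeout
            self.active = False           # deactivate the separate service
        return executed
```

In this sketch the game application is never touched by the service: only the chat mute state and the service's own `active` flag change, and both are restored whether a command ran or the listening window simply expired.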

Assignees

Inventors

Classifications

  • G10L15/22 (Primary)

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Communicating with other players during game play, e.g. by e-mail or chat · CPC title

  • Interoperability with other network applications or services · CPC title

  • Execution procedure of a spoken command · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10937425B2 cover?
Systems and methods are provided that may be implemented to allow an information handling system user to selectively activate and interact with a speech recognition service for system control during application runtime on the same system, and without interrupting the separate application on the same information handling system. The activated speech recognition service may respond to verbal comm…
Who is the assignee on this patent?
Dell Products LP
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 2, 2021 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).