Systems and methods for training a control system based on prior audio inputs
US-11804213-B2 · Oct 31, 2023 · US
US12293757B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12293757-B2 |
| Application number | US-202318374540-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 28, 2023 |
| Priority date | Nov 27, 2018 |
| Publication date | May 6, 2025 |
| Grant date | May 6, 2025 |
Official abstract text for this publication.
Systems and methods are disclosed herein for training a control system based on prior audio inputs. The disclosed systems and methods receive a non-lexical or interjectional audio input. State change indications are also received and stored by the system within a predefined period of time starting from the time the system received the audio input. The system then receives a subsequent audio input. If the audio inputs of both the audio input and the subsequent audio input match, and contextual information for the audio input and the subsequent audio input match, the system stores a match association, comprising a confidence factor, for the subsequent audio input to the audio input in the associative data structure. If the confidence factor is greater than a preconfigured confidence level, the system executes one or more functions based on stored state change indications.
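The matching step described in the abstract can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the `Entry` record, the string `signature` standing in for an audio fingerprint, the `handle_subsequent_input` helper, the 0.6 threshold, and the `n / (n + 1)` confidence formula are all assumptions introduced here. The abstract does not say how the confidence factor is computed.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    """One stored non-lexical audio input (hypothetical record layout)."""
    signature: str        # stand-in for an audio fingerprint
    context: dict         # e.g. {"room": "den"}
    state_changes: list   # state changes seen after the original input
    matches: int = 0      # number of later inputs that matched this entry


def confidence(entry: Entry) -> float:
    # Assumed formula: confidence rises toward 1 as matches accumulate.
    return entry.matches / (entry.matches + 1)


def handle_subsequent_input(store, signature, context, threshold=0.6):
    """Return the state changes to replay, or an empty list."""
    for entry in store:
        # Both the audio and the contextual information must match.
        if entry.signature == signature and entry.context == context:
            entry.matches += 1  # record the match association
            if confidence(entry) > threshold:
                return list(entry.state_changes)
            return []
    return []


store = [Entry("sigh", {"room": "den"}, ["lamp:on"])]
first = handle_subsequent_input(store, "sigh", {"room": "den"})
second = handle_subsequent_input(store, "sigh", {"room": "den"})
other_room = handle_subsequent_input(store, "sigh", {"room": "kitchen"})
```

Under these assumptions the first repeat only raises the confidence to 0.5, so nothing is executed; the second repeat raises it above the threshold and the stored state changes are returned for execution. A matching sound in a different context is ignored.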
Opening claim text (preview).
What is claimed is:
1. A method comprising: receiving, by control circuitry, an audio input; comparing, by the control circuitry, the audio input to a lexical sound data structure comprising a plurality of lexical sounds; determining, by the control circuitry, that the audio input does not match at least one of the plurality of lexical sounds; in response to determining that the audio input does not match at least one of the plurality of lexical sounds, automatically monitoring a defined environment for one or more state change indications; receiving, by the control circuitry, a state change indication; determining, by the control circuitry, contextual information for the audio based, at least in part, on receiving the state change indication; and storing, by the control circuitry, the audio input, the contextual information, and the state change indications in one or more data structures.
2. The method of claim 1, wherein the monitoring of the defined environment for the state change indication occurs for a predefined period of time.
3. The method of claim 2, further comprising executing a timer based on the predefined period of time.
4. The method of claim 2, wherein the predefined period of time is based, at least in part, on a first environmental factor.
5. The method of claim 2, wherein the predefined period of time is based, at least in part, on a piece of historical contextual information.
6. An apparatus comprising: control circuitry; and at least one memory including computer program code for one or more programs, the at least one memory and the computer program code configured to, with the control circuitry, cause the apparatus to perform at least the following: receive an audio input; compare the audio input to a lexical sound data structure comprising a plurality of lexical sounds; determine that the audio input does not match at least one of the plurality of lexical sounds; and in response to determining that the audio input does not match at least one of the plurality of lexical sounds, automatically monitor a defined environment for one or more state change indications; receive a state change indication; determine contextual information for the audio based, at least in part, on receiving the state change indication; and store the audio input, the contextual information, and the state change indications in one or more data structures.
7. The apparatus of claim 6, wherein the monitoring of the defined environment for the state change indication occurs for a predefined period of time.
8. The apparatus of claim 7, wherein the apparatus is further caused to execute a timer based on the predefined period of time.
9. The apparatus of claim 7, wherein the predefined period of time is based, at least in part, on a first environmental factor.
10. The apparatus of claim 7, wherein the predefined period of time is based, at least in part, on a piece of historical contextual information.
11. A non-transitory computer-readable medium having instructions encoded thereon that, when executed by control circuitry, cause the control circuitry to: receive an audio input; compare the audio input to a lexical sound data structure comprising a plurality of lexical sounds; determine that the audio input does not match at least one of the plurality of lexical sounds; in response to determining that the audio input does not match at least one of the plurality of lexical sounds, automatically monitor a defined environment for one or more state change indications; receive a state change indication; determine contextual information for the audio based, at least in part, on receiving the state change indication; and store the audio input, the contextual information, and the state change indications in one or more data structures.
12. The non-transitory computer-readable medium of claim 11, wherein the monitoring of the defined environment for the state change indication occurs for a predefined period of time.
13. The non-transitory computer-readable medium of claim 12, wherein the control circuitry is further caused to execute a timer based on the predefined period of time.
14. The non-transitory computer-readable medium of claim 12, wherein the predefined period of time is based, at least in part, on a first environmental factor.
15. The non-transitory computer-readable medium of claim 12, wherein the predefined period of time is based, at least in part, on a piece of historical contextual information.
16. The method of claim 1, wherein the one or more data structures comprise the lexical sound data structure.
17. The apparatus of claim 6, wherein the one or more data structures comprise the lexical sound data structure.
18. The non-transitory computer-readable medium of claim 11, wherein the one or more data structures comprise the lexical sound data structure.
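The steps recited in the independent claims (compare against lexical sounds, monitor for state changes within a predefined period, derive context, store) might look like the following in code. This is a minimal sketch under stated assumptions, not the claimed implementation: `LEXICAL_SOUNDS`, `StateChange`, `record_non_lexical_input`, the text labels standing in for audio, and the particular context fields are hypothetical. The timer of the dependent claims is simulated by comparing timestamps rather than by running a real clock.

```python
from dataclasses import dataclass

# Hypothetical lexical sound data structure: recognized spoken commands.
LEXICAL_SOUNDS = {"turn on the light", "volume up"}


@dataclass
class StateChange:
    """A state change indication from the monitored environment."""
    device: str
    new_state: str
    timestamp: float  # seconds


def record_non_lexical_input(audio_label, heard_at, events,
                             window_seconds, lexical_sounds, store):
    """Store a non-lexical input with the state changes that followed it."""
    # Lexical inputs are handled elsewhere; only non-matches are monitored.
    if audio_label in lexical_sounds:
        return None
    # Keep state changes that fall inside the predefined period.
    deadline = heard_at + window_seconds
    observed = sorted(
        (e for e in events if heard_at <= e.timestamp <= deadline),
        key=lambda e: e.timestamp,
    )
    if not observed:
        return None
    # Assumed context: which devices changed and how soon after the sound.
    context = {
        "devices": sorted({e.device for e in observed}),
        "first_change_delay": observed[0].timestamp - heard_at,
    }
    record = {"audio": audio_label, "context": context,
              "state_changes": observed}
    store.append(record)
    return record


store = []
events = [StateChange("lamp", "on", 102.0), StateChange("tv", "off", 130.0)]
record = record_non_lexical_input("hmm", 100.0, events, 10.0,
                                  LEXICAL_SOUNDS, store)
ignored = record_non_lexical_input("volume up", 100.0, events, 10.0,
                                   LEXICAL_SOUNDS, store)
```

With a 10-second window, only the lamp change at 102.0 is associated with the "hmm" input; the television change at 130.0 falls outside the period, and the lexical input "volume up" is not recorded at all.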
Feature extraction for speech recognition; Selection of recognition unit · CPC title
Execution procedure of a spoken command · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Parsing for meaning understanding · CPC title
Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title