Adaptive speech endpoint detector
US-2018090127-A1 · Mar 29, 2018 · US
| Field | Value |
|---|---|
| Publication number | US-12586576-B2 |
| Application number | US-202218047650-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 19, 2022 |
| Priority date | Oct 19, 2022 |
| Publication date | Mar 24, 2026 |
| Grant date | Mar 24, 2026 |
Official abstract text for this publication.
A method includes recognizing words comprised by a first utterance; interpreting the recognized words according to a grammar comprised by a domain; from the interpreting of the recognized words, determining a timeout period for the first utterance based on the domain of the first utterance; detecting end of voice activity in the first utterance; executing an instruction following an amount of time after detecting end of voice activity of the first utterance in response to the amount of time exceeding the timeout period, the executed instruction based at least in part on interpreting the recognized words.
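The endpointing decision in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the domain names, the timeout values, the `DEFAULT_TIMEOUT` fallback, and the function names `timeout_for` and `should_execute` are all assumptions made for the example.

```python
# Hypothetical per-domain timeouts in seconds; the values are illustrative only.
DOMAIN_TIMEOUTS = {"weather": 0.5, "navigation": 1.2}

# Assumed fallback for domains without their own timeout (not from the source).
DEFAULT_TIMEOUT = 0.8


def timeout_for(domain: str) -> float:
    """Return the timeout period associated with the interpreted domain."""
    return DOMAIN_TIMEOUTS.get(domain, DEFAULT_TIMEOUT)


def should_execute(domain: str, voice_end_time: float, now: float) -> bool:
    """True once the silence after end of voice activity exceeds the timeout."""
    return (now - voice_end_time) > timeout_for(domain)
```

With these example values, 0.6 seconds of silence would end a "weather" utterance but not a "navigation" utterance, which is the domain-dependent behaviour the abstract describes.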
Opening claim text (preview).
The invention claimed is:

1. A method comprising: recognizing words within an utterance of a user by a speech recognition engine to generate one or more transcription hypotheses; interpreting, by a natural language understanding processor, the recognized words by comparing the transcription hypotheses to a plurality of semantic grammars respectively associated with a plurality of domains, each semantic grammar producing a probability score representing a likelihood that the utterance corresponds to the respective domain; from the interpreting of the recognized words, determining by the processor, a weighted-average timeout period for the utterance as a weighted average of domain-specific timeout values, each domain-specific timeout value being multiplied by its corresponding probability score and divided by a sum of the probability scores; detecting an end of voice activity in the first utterance using a voice activity detector coupled to a microphone sensor; and executing, by the processor, an instruction following an amount of time after detecting the end of voice activity, the instruction being executed in response to the amount of time exceeding the determined weighted-average timeout period, and being based at least in part on the interpreted words.

2. The method of claim 1, wherein interpreting includes determining a probability of correctness, the method further comprising interpreting the recognized words according to a second grammar comprised by a second domain, wherein the timeout period is selected based on which of multiple interpretations has the highest probability of correctness.

3. The method of claim 1, wherein interpreting includes determining a probability of correctness, the method further comprising interpreting the recognized words according to a second grammar comprised by a second domain; and computing a weighted average of multiple probabilities of correctness, wherein the timeout period is selected based on the weighted average.

4. The method of claim 1, wherein the timeout period is specified as a parameter of the domain.

5. The method of claim 4, wherein the timeout period is specified as a multiple of a general timeout period.

6. The method of claim 1 further comprising, computing a user speech rate, wherein the timeout period is based at least in part on the user speech rate.

7. The method of claim 1 further comprising, computing whether the recognized words can be a prefix to another interpretation, wherein the timeout period is based at least in part on whether the recognized words can be a prefix to another interpretation.

8. The method of claim 1 further comprising: entering a modal dialog having a modal timeout period different from a default timeout period, wherein the timeout period is based, at least in part, on the default timeout period while in a default mode, and wherein the timeout period is based, at least in part, on the modal timeout period while in the modal dialog.
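The weighted-average timeout recited in claim 1 amounts to sum(timeout_i × score_i) / sum(score_i) over the candidate domains. The sketch below is one possible reading of that arithmetic, alongside the highest-probability selection of claim 2. The function names, the dictionary layout, and the example scores and timeouts are assumptions for illustration and do not come from the patent.

```python
from typing import Mapping


def weighted_average_timeout(scores: Mapping[str, float],
                             timeouts: Mapping[str, float]) -> float:
    """Claim 1 arithmetic: sum(timeout_i * score_i) / sum(score_i)."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("at least one domain needs a positive score")
    return sum(timeouts[d] * p for d, p in scores.items()) / total


def best_domain_timeout(scores: Mapping[str, float],
                        timeouts: Mapping[str, float]) -> float:
    """Claim 2 variant: use the timeout of the highest-scoring interpretation."""
    best = max(scores, key=lambda d: scores[d])
    return timeouts[best]


# Illustrative values only (hypothetical domains, scores, and seconds).
scores = {"weather": 0.6, "navigation": 0.2}
timeouts = {"weather": 0.5, "navigation": 1.5}
```

For these example inputs the weighted average is (0.5 × 0.6 + 1.5 × 0.2) / 0.8, roughly 0.75 seconds, while the highest-probability selection yields the 0.5 second "weather" timeout. Note that the scores need not sum to 1, since the formula normalizes by their sum.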
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title
Execution procedure of a spoken command · CPC title
Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title
Detection of discrete points within a voice signal · CPC title