Semantically conditioned voice activity detection

US12586576B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12586576-B2
Application numberUS-202218047650-A
CountryUS
Kind codeB2
Filing dateOct 19, 2022
Priority dateOct 19, 2022
Publication dateMar 24, 2026
Grant dateMar 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes recognizing words comprised by a first utterance; interpreting the recognized words according to a grammar comprised by a domain; from the interpreting of the recognized words, determining a timeout period for the first utterance based on the domain of the first utterance; detecting end of voice activity in the first utterance; executing an instruction following an amount of time after detecting end of voice activity of the first utterance in response to the amount of time exceeding the timeout period, the executed instruction based at least in part on interpreting the recognized words.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A method comprising: recognizing words within an utterance of a user by a speech recognition engine to generate one or more transcription hypotheses; interpreting, by a natural language understanding processor, the recognized words by comparing the transcription hypotheses to a plurality of semantic grammars respectively associated with a plurality of domains, each semantic grammar producing a probability score representing a likelihood that the utterance corresponds to the respective domain; from the interpreting of the recognized words, determining by the processor, a weighted-average timeout period for the utterance as a weighted average of domain-specific timeout values, each domain-specific timeout value being multiplied by its corresponding probability score and divided by a sum of the probability scores; detecting an end of voice activity in the first utterance using a voice activity detector coupled to a microphone sensor; and executing, by the processor, an instruction following an amount of time after detecting the end of voice activity, the instruction being executed in response to the amount of time exceeding the determined weighted-average timeout period, and being based at least in part on the interpreted words. 2 . The method of claim 1 , wherein interpreting includes determining a probability of correctness, the method further comprising interpreting the recognized words according to a second grammar comprised by a second domain, wherein the timeout period is selected based on which of multiple interpretations has the highest probability of correctness. 3 . The method of claim 1 , wherein interpreting includes determining a probability of correctness, the method further comprising interpreting the recognized words according to a second grammar comprised by a second domain; and computing a weighted average of multiple probabilities of correctness, wherein the timeout period is selected based on the weighted average. 4 . The method of claim 1 , wherein the timeout period is specified as a parameter of the domain. 5 . The method of claim 4 , wherein the timeout period is specified as a multiple of a general timeout period. 6 . The method of claim 1 further comprising, computing a user speech rate, wherein the timeout period is based at least in part on the user speech rate. 7 . The method of claim 1 further comprising, computing whether the recognized words can be a prefix to another interpretation, wherein the timeout period is based at least in part on whether the recognized words can be a prefix to another interpretation. 8 . The method of claim 1 further comprising: entering a modal dialog having a modal timeout period different from a default timeout period, wherein the timeout period is based, at least in part, on the default timeout period while in a default mode, and wherein the timeout period is based, at least in part, on the modal timeout period while in the modal dialog.

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • Execution procedure of a spoken command · CPC title

  • Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title

  • Detection of discrete points within a voice signal · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12586576B2 cover?
A method includes recognizing words comprised by a first utterance; interpreting the recognized words according to a grammar comprised by a domain; from the interpreting of the recognized words, determining a timeout period for the first utterance based on the domain of the first utterance; detecting end of voice activity in the first utterance; executing an instruction following an amount of t…
Who is the assignee on this patent?
Soundhound Ai Ip Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).