System and method for context sensitive inference in a speech processing system

US9626968B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9626968-B2
Application numberUS-201514594570-A
CountryUS
Kind codeB2
Filing dateJan 12, 2015
Priority dateJun 25, 2008
Publication dateApr 18, 2017
Grant dateApr 18, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of operating a speech processing system is provided. The method includes translating a portion of a speech record into a plurality of possible words associated with a plurality of contexts, and determining a plurality of correctness values based on a plurality of probabilities that each of the plurality of possible words is correct for each of the plurality of contexts. The method also includes determining which of the plurality of possible words is a correct translation of the portion of the speech record based on the plurality of correctness values.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of operating a speech processing system, the method comprising: receiving a portion of a speech record from an audio source, the speech record having been produced by the audio source and transferred by the audio source to the speech processing system; determining, by the speech processing system, from metadata accompanying the portion of the speech record, a first context and a second context for the portion of the speech record, wherein the first context has a first probability of correct context and the second context has a second probability of correct context; processing, by the speech processing system, the portion of the speech record to create a first text translation for the portion of the speech record in the first context, wherein the first text translation has a first probability of correct translation within the first context; processing, by the speech processing system, the same portion of the speech record to create a second text translation for the portion of the speech record in the second context, wherein the second text translation has a second probability of correct translation within the second context; processing, by the speech processing system, the first probability of correct translation and the first probability of correct context, to produce a first probability; processing, by the speech processing system, the second probability of correct translation within the second context and the second probability of correct context, to produce a second probability; selecting, by the speech processing system, the first translation as the correct translation when the first probability is greater than the second probability; and selecting, by the speech processing system, the second translation as the correct translation when the second probability is greater than the first probability. 2. The method of claim 1 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is an identity of a speaker. 3. The method of claim 1 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is a location of a speaker. 4. The method of claim 1 , wherein the portion of the speech record is a portion of a sentence and the context for the portion of the speech record is a position of the portion of the speech record within the sentence. 5. The method of claim 1 , wherein the portion of the speech record is a portion of a call and the context for the portion of the speech record is a position of the portion of the speech record within the call. 6. The method of claim 1 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is a position of the portion of the speech record within the conversation. 7. The method of claim 1 , wherein the first probability of correct translation within the first context is multiplied by the first probability of correct context, resulting in the first probability. 8. A non-transitory computer readable medium having stored thereon instructions that, when executed by processing circuitry, direct the processing circuitry to perform the steps comprising: receiving a portion of a speech record from an audio source, the speech record having been produced by the audio source and transferred to the processing circuitry; determining, from metadata accompanying the portion of the speech record, a first context and a second context for the portion of the speech record, wherein the first context has a first probability of correct context and the second context has a second probability of correct context; processing the portion of the speech record to create a first text translation for the portion of the speech record in the first context, wherein the first text translation has a first probability of correct translation within the first context; processing the same portion of the speech record to create a second text translation for the portion of the speech record in the second context, wherein the second text translation has a second probability of correct translation within the second context; processing the first probability of correct translation and the first probability of correct context, to produce a first probability; processing the second probability of correct translation within the second context and the second probability of correct context, to produce a second probability; selecting the first translation as the correct translation when the first probability is greater than the second probability; and selecting the second translation as the correct translation when the second probability is greater than the first probability. 9. The non-transitory computer readable medium of claim 8 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is an identity of a speaker. 10. The non-transitory computer readable medium of claim 8 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is a location of a speaker. 11. The non-transitory computer readable medium of claim 8 , wherein the portion of the speech record is a portion of a sentence and the context for the portion of the speech record is a position of the portion of the speech record within the sentence. 12. The non-transitory computer readable medium of claim 8 , wherein the portion of the speech record is a portion of a call and the context for the portion of the speech record is a position of the portion of the speech record within the call. 13. The non-transitory computer readable medium of claim 8 , wherein the portion of the speech record is a portion of a conversation and the context for the portion of the speech record is a position of the portion of the speech record within the conversation. 14. The non-transitory computer readable medium of claim 8 , wherein the first probability of correct translation within the first context is multiplied by the first probability of correct context, resulting in the first probability. 15. A processing system comprising: processing circuitry; and a memory device in communication with the processing circuitry, the memory device having computer-executable instructions stored thereon that, when executed by the processing circuitry, instruct the processing circuitry to: receive a portion of a speech record from an audio source, the speech record having been produced by the audio source and transferred to the processing circuitry; determine, from metadata accompanying the portion of the speech record, a first context and a second context for the portion of the speech record, wherein the first context has a first probability of correct context and the second context has a second probability of correct context; process the portion of the speech record to create a first text translation for the portion of the speech record in the first context, wherein the first text translation has a first probability of correct translation within the first context; process the same portion of the speech record to create a second text translation for the portion of the speech record in the second context, wherein the second text translation has a second probability of correct translation within the second context; process the first probability of correct translation and the first probability of correct context, to produce a first probability; process the second probability of correct translation within the second context and th

Assignees

Inventors

Classifications

  • G10L15/26Primary

    Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9626968B2 cover?
A method of operating a speech processing system is provided. The method includes translating a portion of a speech record into a plurality of possible words associated with a plurality of contexts, and determining a plurality of correctness values based on a plurality of probabilities that each of the plurality of possible words is correct for each of the plurality of contexts. The method also…
Who is the assignee on this patent?
Verint Systems Ltd
What technology area does this patent fall under?
Primary CPC classification G10L15/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 18 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).