Who is the assignee on this patent?

Microsoft Technology Licensing Llc

What technology area does this patent fall under?

Primary CPC classification G10L15/183. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 27 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Automated speech recognition confidence classifier

US10991365B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10991365-B2
Application number	US-201916377967-A
Country	US
Kind code	B2
Filing date	Apr 8, 2019
Priority date	Apr 8, 2019
Publication date	Apr 27, 2021
Grant date	Apr 27, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of enhancing an automated speech recognition confidence classifier includes receiving a set of baseline confidence features from one or more decoded words, deriving word embedding confidence features from the baseline confidence features, joining the baseline confidence features with word embedding confidence features to create a feature vector, and executing the confidence classifier to generate a confidence score, wherein the confidence classifier is trained with a set of training examples having labeled features corresponding to the feature vector.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of enhancing an automated speech recognition confidence classifier comprising: receiving a set of baseline confidence features from one or more decoded words; deriving word embedding confidence features from the baseline confidence features; joining the baseline confidence features with word embedding confidence features to create a feature vector; and executing the confidence classifier to generate a confidence score, wherein the confidence classifier is trained with a set of training examples having labeled features corresponding to the feature vector. 2. The method of claim 1 wherein the word embedding confidence features comprise character embeddings. 3. The method of claim 2 wherein the character embeddings comprise less than 26 embeddings comprising letters. 4. The method of claim 2 wherein the character embedding for a word comprises a vector having values for each letter consisting of the count of the number of each letter in the word. 5. The method of claim 1 wherein the word embedding confidence features comprise phone embeddings. 6. The method of claim 5 wherein the phone embeddings comprise monophones selected from a dictionary comprising 40 or fewer monophones. 7. The method of claim 1 wherein the word embedding confidence features comprise character embeddings and phone embeddings. 8. The method of claim 1 wherein the feature vector further comprises GLOVE embeddings. 9. The method of claim 1 wherein the confidence classifier is trained for word-level as well an utterance-level classification. 10. The method of claim 1 wherein the baseline features comprise two or more of acoustic-model scores, background-model scores, silence-model scores, noise-model scores, language model scores, and duration features. 11. A machine-readable storage device having instructions for execution by a processor of a machine to cause the processor to perform operations to generate a confidence score for a word or utterance, the operations comprising: receiving a set of baseline confidence features from one or more decoded words; deriving word embedding confidence features from the baseline confidence features; joining the baseline confidence features with word embedding confidence features to create a feature vector; and executing the confidence classifier to generate a confidence score, wherein the confidence classifier is trained with a set of training examples having labeled features corresponding to the feature vector. 12. The device of claim 11 wherein the word embedding confidence features comprise character embeddings. 13. The device of claim 12 wherein the character embeddings comprise 26 or fewer embeddings comprising letters in an alphabet. 14. The device of claim 12 wherein the character embedding for a word comprises a vector having values for each letter consisting of the count of the number of each letter in the word. 15. The device of claim 11 wherein the word embedding confidence features comprise phone embeddings including monophones selected from a dictionary comprising 40 or fewer monophones. 16. The device of claim 11 wherein the word embedding confidence features comprise character embeddings and phone embeddings. 17. The device of claim 11 wherein the confidence classifier is trained for word-level as well an utterance-level classification and wherein the baseline features comprise two or more of acoustic-model scores, background-model scores, silence-model scores, noise-model scores, language model scores, and duration features. 18. A device comprising: a processor; and a memory device coupled to the processor and having a program stored thereon for execution by the processor to perform operations comprising: receiving a set of baseline confidence features from one or more decoded words; deriving word embedding confidence features from the baseline confidence features; joining the baseline confidence features with word embedding confidence features to create a feature vector; and executing the confidence classifier to generate a confidence score, wherein the confidence classifier is trained with a set of training examples having labeled features corresponding to the feature vector. 19. The device of claim 18 wherein the word embedding confidence features comprise one or more of character embeddings and phone embeddings including monophones. 20. The device of claim 18 wherein the confidence classifier is trained for word-level as well an utterance-level classification and wherein the baseline features comprise two or more of acoustic-model scores, background-model scores, silence-model scores, noise-model scores, language model scores, and duration features.

Assignees

Microsoft Technology Licensing Llc

Inventors

Classifications

G10L15/183Primary
using context dependencies, e.g. language models · CPC title
G10L15/08Primary
Speech classification or search · CPC title
G06F18/24
Classification techniques · CPC title
G10L2015/228
of application context · CPC title
G06F40/30
Semantic analysis · CPC title

Patent family

Related publications grouped by family.

View patent family 70289837

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10991365B2 cover?: A method of enhancing an automated speech recognition confidence classifier includes receiving a set of baseline confidence features from one or more decoded words, deriving word embedding confidence features from the baseline confidence features, joining the baseline confidence features with word embedding confidence features to create a feature vector, and executing the confidence classifier …
Who is the assignee on this patent?: Microsoft Technology Licensing Llc
What technology area does this patent fall under?: Primary CPC classification G10L15/183. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 27 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).