System and method of performing automatic speech recognition using local private data

US9666188B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9666188-B2
Application numberUS-201314066079-A
CountryUS
Kind codeB2
Filing dateOct 29, 2013
Priority dateOct 29, 2013
Publication dateMay 30, 2017
Grant dateMay 30, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of speech by performing a first recognition of the first part of the speech with the embedded speech recognition system that accesses private user data, wherein the private user data is not available to the remote speech recognition system. The system recognizes the second part of the speech by performing a second recognition of the second part of the speech with the remote speech recognition system. The final recognition result is a combination of these two recognition processes. The private data can be such local information as a user location, a playlist, frequently dialed numbers or texted people, user contact list information, and so forth.

First claim

Opening claim text (preview).

I claim: 1. A method comprising: receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. 2. The method of claim 1 , wherein individual names from a user contact list are from the private user data. 3. The method of claim 1 , wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, data associated with a playlist, user history, and multiple hypothesis associated with private information. 4. A computer-readable storage device storing instructions which, when executed by a processor, cause the processor to perform operations comprising: receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. 5. The computer-readable storage device of claim 4 , wherein individual names from a user contact list are from the private user data. 6. The computer-readable storage device of claim 4 , wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, data associated with a playlist, user history, and multiple hypothesis associated with private information. 7. A system comprising: a processor; and computer-readable storage medium storing instructions which, when executed by the processor, cause the processor to perform operations comprising: receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. 8. The system of claim 7 , wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, user history, data associated with a playlist, and multiple hypothesis associated with private information.

Assignees

Inventors

Classifications

  • G10L15/30Primary

    Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • of application context · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9666188B2 cover?
A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of speech by performing a first recognition of the first part of the speech with the embedded speech recognition syst…
Who is the assignee on this patent?
Nuance Communications Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 30 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).