System and method for managing models for embedded speech and language processing

US9530416B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9530416-B2
Application numberUS-201314064579-A
CountryUS
Kind codeB2
Filing dateOct 28, 2013
Priority dateOct 28, 2013
Publication dateDec 27, 2016
Grant dateDec 27, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems, methods, and computer-readable storage devices for fetching speech processing models based on context changes in advance of speech requests using the speech processing models. An example local device configured to practice the method, having a local speech processor, and having access to remote speech models, detects a change in context. The change in context can be based on geographical location, language translation, speech in a different language, user language settings, installing or removing an app, and so forth. The local device can determine a speech processing model that is likely to be needed based on the change in context, and that is not stored on the local device. Independently of an explicit request to process speech, the local device can retrieve, from a remote server, the speech processing model for use on the mobile device.

First claim

Opening claim text (preview).

We claim: 1. A method comprising: detecting, at a mobile device, a change in geographic location of the mobile device to yield a new context of a speech dialog; identifying a plurality of speech models on a remote server; calculating a likelihood score for each speech model in the plurality of speech models, wherein the likelihood score indicates a compatibility of each speech model to recognize speech in the speech dialog in the new context to yield likelihood scores; selecting a speech model from the plurality of speech models according to the likelihood scores; determining that a local storage on the mobile device has insufficient space to store the speech model; calculating local likelihood scores of local speech models stored in the local storage, wherein the local likelihood scores indicate a probability of each local speech model to recognize the speech in the speech dialog with the new context to yield a respective scored local speech model; and removing the respective scored local speech model having a lowest likelihood score of the local likelihood scores until a sufficient space for the speech model is made available on the mobile device. 2. The method of claim 1 , wherein the likelihood scores are calculated according to a certainty of future use, an urgency of future use, and a reusability of each speech model in the plurality of speech models. 3. The method of claim 1 , wherein the likelihood scores are weighted according to historical data comprising contextual history and speech model usage. 4. The method of claim 1 , wherein the new context is further based on one of language translation, speech in a different language, user language settings, available local storage, installing an application on the mobile device, and removing the application on the mobile device. 5. The method of claim 1 , further comprising: comparing network characteristics to a priority for the speech model to determine when to retrieve the speech model. 6. The method of claim 1 , further comprising: determining that an existing speech model stored on the mobile device is has a lower likelihood score than the speech model; and removing the existing speech model from local storage on the mobile device. 7. A system comprising: a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising: detecting, at a mobile device, a change in geographic location of the mobile device to yield a new context of a speech dialog; identifying a plurality of speech models on a remote server; calculating a likelihood score for each speech model in the plurality of speech models, wherein the likelihood score indicates a compatibility of each speech model to recognize speech in the speech dialog in the new context to yield likelihood scores; selecting a speech model from the plurality of speech models according to the likelihood scores; determining that a local storage on the mobile device has insufficient space to store the speech model; calculating local likelihood scores of local speech models stored in the local storage, wherein the local likelihood scores indicate a probability of each local speech model to recognize the speech in the speech dialog with the new context to yield a respective scored local speech model; and removing the respective scored local speech model having a lowest likelihood score of the local likelihood scores until a sufficient space for the speech model is made available on the mobile device. 8. The system of claim 7 , wherein the new context is based on one of unused storage space on the mobile device crossing a threshold, language translation, speech in a different language, user language settings, installing an application on the mobile device, and removing the application on the mobile device. 9. The system of claim 7 , wherein the likelihood scores are calculated according to a certainty of future use, an urgency of future use, and a reusability of each speech model in the plurality of speech models. 10. The system of claim 7 , wherein the likelihood scores are weighted according to historical data comprising contextual history and speech model usage. 11. The system of claim 7 , the computer-readable storage medium having additional instructions stored which, when executed by the processor, cause the processor to perform operations comprising: determining that an existing speech model stored on the mobile device has a lower likelihood score than the speech model; and removing the existing speech model from local storage on the mobile device. 12. The system of claim 7 , the computer-readable storage medium having additional instructions stored which, when executed by the processor, cause the processor to perform operations comprising: determining that a plurality of previously stored speech models on the mobile device have likelihood scores below a threshold likelihood score, to yield below threshold speech models; and removing the below threshold speech models from the mobile device. 13. The system of claim 12 , wherein the removing of the below threshold speech models from the mobile device occurs only until a threshold amount of storage space on the mobile device is freed. 14. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising: detecting, at a mobile device, a change in geographic location of the mobile device to yield a new context of a speech dialog; identifying a plurality of speech models on a remote server; calculating a likelihood score for each speech model in the plurality of speech models, wherein the likelihood score indicates a compatibility of each speech model to recognize speech in the speech dialog in the new context to yield likelihood scores; selecting a speech model from the plurality of speech models according to the likelihood scores; determining that a local storage on the mobile device has insufficient space to store the speech model; calculating local likelihood scores of local speech models stored in the local storage, wherein the local likelihood scores indicate a probability of each local speech model to recognize the speech in the speech dialog with the new context to yield a respective scored local speech model; and removing the respective scored local speech model having a lowest likelihood score of the local likelihood scores until a sufficient space for the speech model is made available on the mobile device. 15. The computer-readable storage device of claim 14 , having additional instructions stored which, when executed by the computing device, cause the computing device to perform operations comprising: wherein the likelihood scores are calculated according to a certainty of future use, an urgency of future use, and a reusability of each speech model in the plurality of speech models. 16. The computer-readable storage device of claim 14 , storing additional instructions which, when executed by the computing device, cause the computing device to perform operations comprising: wherein the likelihood scores are weighted according to historical data comprising contextual history and speech model usage. 17. The computer-readable storage device of claim 14 , wherein the new context is based on one of language translation, speech in a different language, user language settings, available local storage, installing an application on the mobile device, and removing the application on the mob

Assignees

Inventors

Classifications

  • G10L15/30Primary

    Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • G10L15/183Primary

    using context dependencies, e.g. language models · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • of application context · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9530416B2 cover?
Disclosed herein are systems, methods, and computer-readable storage devices for fetching speech processing models based on context changes in advance of speech requests using the speech processing models. An example local device configured to practice the method, having a local speech processor, and having access to remote speech models, detects a change in context. The change in context can b…
Who is the assignee on this patent?
At & T Ip I Lp
What technology area does this patent fall under?
Primary CPC classification G10L15/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 27 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).