Multitask learning for spoken language understanding

US9406292B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9406292-B2
Application numberUS-201414287892-A
CountryUS
Kind codeB2
Filing dateMay 27, 2014
Priority dateJun 9, 2006
Publication dateAug 2, 2016
Grant dateAug 2, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems for improving or generating a spoken language understanding system using a multitask learning method for intent or call-type classification. The multitask learning method aims at training tasks in parallel while using a shared representation. A computing device automatically re-uses the existing labeled data from various applications, which are similar but may have different call-types, intents or intent distributions to improve the performance. An automated intent mapping algorithm operates across applications. In one aspect, active learning is employed to selectively sample the data to be re-used.

First claim

Opening claim text (preview).

I claim: 1. A method comprising: mapping call-types between a first spoken dialog system and a second spoken dialog system using individual training models for each spoken dialog system, to yield mapped call-types; and retraining a model of the individual training models using information based on the mapped call-types. 2. The method of claim 1 , wherein the mapping of the call-types comprises performing on of splitting the call-types, merging the call-types, and renaming the call-types. 3. The method of claim 2 , wherein the merging of the call-types comprises cross-labeling utterances from a dialog using the individual training models. 4. The method of claim 3 , wherein the utterances which are cross-labeled have a confidence score above a threshold. 5. The method of claim 1 , further comprising labeling, as a new call-type, a call-type of the first spoken dialog system when the call-type has more than a specified ratio among the call-types. 6. The method of claim 1 , wherein the retraining of the model comprises active learning to selectively sample data used for the retraining. 7. The method of claim 6 , wherein selectively sampled data is reused during retraining. 8. A system comprising: a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising: mapping call-types between a first spoken dialog system and a second spoken dialog system using individual training models for each spoken dialog system, to yield mapped call-types; and retraining a model of the individual training models using information based on the mapped call-types. 9. The system of claim 8 , wherein the mapping of the call-types comprises performing on of splitting the call-types, merging the call-types, and renaming the call-types. 10. The system of claim 9 , wherein the merging of the call-types comprises cross-labeling utterances from a dialog using the individual training models. 11. The system of claim 10 , wherein the utterances which are cross-labeled have a confidence score above a threshold. 12. The system of claim 8 , the computer-readable storage medium having additional instructions stored which result in operations comprising labeling, as a new call-type, a call-type of the first spoken dialog system when the call-type has more than a specified ratio among the call-types. 13. The system of claim 8 , wherein the retraining of the model comprises active learning to selectively sample data used for the retraining. 14. The system of claim 13 , wherein selectively sampled data is reused during retraining. 15. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising: mapping call-types between a first spoken dialog system and a second spoken dialog system using individual training models for each spoken dialog system, to yield mapped call-types; and retraining a model of the individual training models using information based on the mapped call-types. 16. The computer-readable storage device of claim 15 , wherein the mapping of the call-types comprises performing on of splitting the call-types, merging the call-types, and renaming the call-types. 17. The computer-readable storage device of claim 16 , wherein the merging of the call-types comprises cross-labeling utterances from a dialog using the individual training models. 18. The computer-readable storage device of claim 17 , wherein the utterances which are cross-labeled have a confidence score above a threshold. 19. The computer-readable storage device of claim 15 , having additional instructions stored which result in operations comprising labeling, as a new call-type, a call-type of the first spoken dialog system when the call-type has more than a specified ratio among the call-types. 20. The computer-readable storage device of claim 15 , wherein the retraining of the model comprises active learning to selectively sample data used for the retraining.

Assignees

Inventors

Classifications

  • using context dependencies, e.g. language models · CPC title

  • Parsing for meaning understanding · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Speech interaction details (speech recognition per se G10L15/00) · CPC title

  • Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9406292B2 cover?
Systems for improving or generating a spoken language understanding system using a multitask learning method for intent or call-type classification. The multitask learning method aims at training tasks in parallel while using a shared representation. A computing device automatically re-uses the existing labeled data from various applications, which are similar but may have different call-types,…
Who is the assignee on this patent?
At & T Ip Ii Lp
What technology area does this patent fall under?
Primary CPC classification G10L15/1822. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 02 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).