Flexible schema for language model customization

US10497367B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10497367-B2
Application numberUS-201615389088-A
CountryUS
Kind codeB2
Filing dateDec 22, 2016
Priority dateMar 27, 2014
Publication dateDec 3, 2019
Grant dateDec 3, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device. A hint may then be sent to a recognition service provider for combining the multiple language modeling components from the list. The hint may be based on a number of different domains. A customized combination of the language modeling components based on the hint may then be received from the recognition service provider.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, executed by a computing device, of customizing language modeling components, the method comprising: displaying, by the computing device, a list of language modeling components; sending, by the computing device to a service provider, a hint for combining a plurality of the language modeling components from the list, the hint being based on one or more of a plurality of domains, wherein sending the hint for combining a plurality of the language modeling components from the list comprises sending the hint prior to initiating an offline initialization process; and receiving, by the computing device from the service provider, a customized combination of the plurality of the language modeling components based on the hint; capturing speech input; and recognizing the captured speech input based on the received customized combination of the plurality of the language modeling components. 2. The method of claim 1 , further comprising maintaining a connection between the hint and the one or more of the plurality of the language modeling components by assigning an ID associated with the hint to the plurality of language modeling components. 3. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending a selection of a pre-compiled language model based on the one or more of the plurality of domains. 4. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending a selection of a fixed weight combination of the plurality of the language modeling components based on the one or more of the plurality of domains. 5. The method of claim 4 , further comprising applying one or more weights to the one or more of the plurality of language modeling components. 6. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending a selection of one or more recognition topics from a pre-complied list, the one or more recognition topics corresponding to one or more of the plurality of the language modeling components. 7. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending an in-domain text corpus. 8. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending an in-domain audio corpus. 9. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending an existing combination of language modeling components for re-use. 10. The method of claim 1 , wherein sending, by the computing device, the hint for combining a plurality of the language modeling components from the list comprises sending a recognition request comprising the hint. 11. A system for customizing language modeling components, comprising: a memory for storing executable program code; and a processor, functionally coupled to the memory, the processor being responsive to computer-executable instructions contained in the program code and operative to: display, by the system, a list of language modeling components; send, by the system to a service provider, a hint generated by the system for combining a plurality of language modeling components from the list, the hint being based on one or more of a plurality of domains; receive, by the system from the service provider, a customized combination of the plurality of the language modeling components based on the hint; maintain a connection between the hint and the one or more of the plurality of the language modeling components by assigning an ID associated with the hint to the plurality of language modeling components; receive speech input into the system; and recognize the speech input based on the customized combination of the plurality of the language modeling components. 12. The system of 11 , wherein the processor, in sending the hint for combining a plurality of the language modeling components from the list, is operative to send a selection of a pre-compiled language model based on the one or more of the plurality of domains. 13. The system of claim 11 , wherein the processor, in sending the hint for combining a plurality of the language modeling components from the list, is operative to send a selection of a fixed weight combination of the plurality of the language modeling components based on the one or more of the plurality of domains. 14. The system of claim 11 , wherein the processor, in sending the hint for combining a plurality of the language modeling components from the list, is operative to: send a selection of one or more recognition topics from a pre-complied list, the one or more recognition topics corresponding to one or more of the plurality of the language modeling components; and apply one or more weights to the one or more of the plurality of language modeling components. 15. A computer-implemented method of customizing language modeling components, comprising: receiving an identification of one or more language modeling components from a client device for use in recognizing speech; receiving a selection of a weight value for the selected one or more of the language modeling components; and generating a customized combination of language modeling components for speech recognition based on the identified one or more language modeling components and the weight value; sending the customized combination of language modeling components to the client device to recognize speech based on the customized combination of language modeling components; receiving speech input; and recognizing the speech input based on the customized combination of the plurality of the language modeling components. 16. The method of claim 15 , wherein the information received includes at least a portion of an in-domain text corpus. 17. The method of claim 15 , wherein the information received includes at least a portion of an in-domain audio corpus. 18. The method of claim 15 , wherein the information received identifies at least an existing combination of language modeling components for re-use. 19. The method of claim 15 , wherein the information is received as part of a recognition request for use in on-the-fly language model interpolation. 20. The method of claim 15 , further comprising recognizing speech based on the customized combination of language modeling components.

Assignees

Inventors

Classifications

  • Morphological analysis · CPC title

  • Semantic analysis · CPC title

  • G10L15/183Primary

    using context dependencies, e.g. language models · CPC title

  • G10L15/30Primary

    Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10497367B2 cover?
The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device. A hint may then be sent to a recognition service provider for combining the multiple language modeling components from the list. The hint may be based on a number of different domains. A customized combination of the language m…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/183. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 03 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).