Techniques for inferring the unknown intents of linguistic items

US9870356B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9870356-B2
Application numberUS-201414180335-A
CountryUS
Kind codeB2
Filing dateFeb 13, 2014
Priority dateFeb 13, 2014
Publication dateJan 16, 2018
Grant dateJan 16, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Functionality is described herein for determining the intents of linguistic items (such as queries), to produce intent output information. For some linguistic items, the functionality deterministically assigns intents to the linguistic items based on known intent labels, which, in turn, may be obtained or derived from a knowledge graph or other type of knowledge resource. For other linguistic items, the functionality infers the intents of the linguistic items based on selection log data (such as click log data provided by a search system). In some instances, the intent output information may reveal new intents that are not represented by the known intent labels. In one implementation, the functionality can use the intent output information to train a language understanding model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method implemented by one or more computing devices, the method comprising: receiving input data, the input data comprising linguistic items including: a first set of linguistic items with known intent labels, the known intent labels representing known relations between entities provided by a knowledge resource; and a second set of linguistic items without known intent labels provided by the knowledge resource; determining intents for the linguistic items in the input data to produce intent output information, the determining comprising: when a respective linguistic item corresponds to a member of the first set, deterministically assigning a respective known intent to the respective linguistic item based at least on a respective known intent label associated with the respective linguistic item; and when the respective linguistic item corresponds to a member of the second set, inferring the intent associated with the respective linguistic item based at least on selection log data; and storing the intent output information in a data store, the determining including discovering a new intent for an individual linguistic item of the second set that identifies an individual entity represented in the knowledge resource, the new intent identifying a new relation for the individual entity that is not included in the known relations provided by the knowledge resource, the selection log data reflecting actions of users associated with using various linguistic items with the known intents and with the new intent. 2. The method of claim 1 , wherein the linguistic items in the input data correspond to respective queries submitted to a search system, and the selection log data is made up of click selections made by users in response to submitting the queries. 3. The method of claim 1 , wherein the individual entity is a person. 4. The method of claim 3 , wherein the knowledge resource is a knowledge graph. 5. The method of claim 1 , wherein the known intent labels correspond to informational relations between entities specified in the knowledge resource. 6. The method of claim 1 , wherein the new intent discovered for the individual linguistic item corresponds to a navigational intent in which a request is made to navigate in a particular manner with respect to the individual entity identified by the individual linguistic item. 7. The method of claim 1 , wherein the new intent discovered for the individual linguistic item corresponds to a transactional intent in which a request is made to perform some action on the individual entity identified by the individual linguistic item. 8. The method of claim 1 , further comprising, prior to performing said determining, identifying, for at least some of the linguistic items in the input data, entity types associated with entities specified in the input data. 9. The method of claim 1 , further comprising, prior to performing said determining, identifying, for at least some of the linguistic items in the input data, components of click selections specified in the input data. 10. The method of claim 1 , wherein said inferring uses a Bayesian hierarchical graphical model, and wherein the model represents user actions based on a process which involves: drawing an intent based on a distribution of intents, to provide a specified intent; generating a linguistic item having one or more words, each word being drawn according to a distribution of words associated with the specified intent; and generating a click selection according to a distribution of click selections associated with the specified intent. 11. The method of claim 1 , wherein said inferring uses a Bayesian hierarchical graphical model, and where the model represents user actions based on a process which involves: drawing an intent based on a distribution of intents, to provide a specified intent; generating one or more entity types, each entity type being drawn according to a distribution of entity types associated with the specified intent; generating one or more context words, each context word being drawn according to a distribution of context words associated with the specified intent; and generating one or more click components, each click component being drawn according to a distribution of click components associated with the specified intent. 12. The method of claim 1 , wherein said inferring employs a Markov chain Monte Carlo technique to produce the intent output information. 13. The method of claim 1 , wherein said inferring employs a variational technique to produce the intent output information. 14. The method of claim 1 , further comprising training a language understanding model based at least on the intent output information. 15. A computer system comprising: a processing device; and a computer readable storage medium storing computer readable instructions which, when executed by the processing device, cause the processing device to: receive input data, the input data comprising a plurality of linguistic items, individual linguistic items corresponding to one or more words submitted by a user and being associated with a click selection made by the user in response to submitting the one or more words; and use a generative model to determine intents for at least some of the linguistic items in the input data, the intents including at least one previously-known intent and at least one new intent discovered by the generative model, the generative model representing user actions based on a process which involves drawing a specified intent for a specified linguistic item based at least on a distribution of intents, the distribution representing probabilities associated with the at least one previously-known intent and the at least one new intent discovered by the generative model. 16. The computer system of claim 15 , wherein the input data includes a first set of linguistic items with known intent labels, and a second set of linguistic items without known intent labels, and wherein the computer readable instructions, when executed by the processing device, cause the processing device to determine the intents by: when a respective linguistic item corresponds to a member of the first set, deterministically assigning an intent to the respective linguistic item based at least on a respective known intent label associated with the respective linguistic item; and when the respective linguistic item corresponds to a member of the second set, inferring the intent associated with the respective linguistic item using the generative model in conjunction with click selection data. 17. The computer system of claim 15 , wherein the computer readable instructions, when executed by the processing device, cause the processing device to: generate the specified linguistic item having one or more words, each word being drawn according to a distribution of words associated with the specified intent; and generate a click selection according to a distribution of click selections associated with the specified intent. 18. The computer system of claim 15 , wherein the computer readable instructions, when executed by the processing device, cause the processing device to: generate one or more entity types, each entity type being drawn according to a distribution of entity types associated with the specified intent; generate one or more context words, each context word being drawn according to a distribution of context words associated with the specified intent; and generate one or more click components, each click component being

Assignees

Inventors

Classifications

  • G06F40/30Primary

    Semantic analysis · CPC title

  • G06F40/35Primary

    Discourse or dialogue representation · CPC title

  • Named entity recognition · CPC title

  • Computing arrangements using knowledge-based models · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9870356B2 cover?
Functionality is described herein for determining the intents of linguistic items (such as queries), to produce intent output information. For some linguistic items, the functionality deterministically assigns intents to the linguistic items based on known intent labels, which, in turn, may be obtained or derived from a knowledge graph or other type of knowledge resource. For other linguistic i…
Who is the assignee on this patent?
Microsoft Corp, Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 16 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).