Neural models for key phrase detection and question generation

US2021134173A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021134173-A1
Application numberUS-202117143303-A
CountryUS
Kind codeA1
Filing dateJan 7, 2021
Priority dateAug 3, 2017
Publication dateMay 6, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, system, and storage device storing a computer program, for generating questions based on provided content, such as, for example, a document having words. The method comprises automatically estimating the probability of interesting phrases in the provided content, and generating a question in natural language based on the estimating. In one example embodiment herein, the estimating includes predicting the interesting phrases as answers, and the estimating is performed by a neural model. The method further comprises conditioning a question generation model based on the interesting phrases predicted in the predicting, the question generation model generating the question. The method also can include training the neural model. In one example, the method further comprises identifying start and end locations of the phrases in the provided content, and the identifying includes performing a dot product attention mechanism parameterizing a probability distribution.

First claim

Opening claim text (preview).

1 - 20 . (canceled) 21 . A method executed by a computer processor, for generating questions based on provided content, the method comprising: automatically extracting, by a computer processor, a key phrase from the provided content using a combination of: a first model associated with a semantic relevance between the key phrase and the provided content, and a second model associated with generating syntactic boundaries of key phrases in the provided content; and generating, by the computer processor, a question in natural language using the extracted key phrase as an answer to the question through one or more iterations of interpolating a sequence of words in the question, wherein the interpolation of the sequence of the words is based on a second probability that a word is next in the sequence of words in the question. 22 . The method of claim 21 , wherein the first model is based at least upon a first probability of an entity in the provided content being an answer conditioned on the provided content, wherein a second model is a sequence-to-sequence model for generating one or more boundaries of key phrases, and wherein the entity comprises a word in the provided content. 23 . The method of claim 22 , the method further comprising conditioning a question generation model based on the one or more key phrases, wherein the question generation model is used for generating the question. 24 . The method of claim 21 , the method further comprising: receiving a set of questions and answers using cloud sourcing, and conditioning the first model for training using the received set of questions and answers. 25 . The method of claim 24 , wherein the neural model has been trained on a dataset comprising human-selected key phrases. 26 . The method of claim 21 , further comprising identifying a start location and an end location of the key phrases in the provided content. 27 . The method of claim 26 , wherein the identifying includes performing a dot product attention mechanism parameterizing a probability distribution. 28 . The method of claim 21 , further comprising determining an attention distribution of word positions in the provided content, wherein the generating includes providing at least one word of the question based on the attention distribution. 29 . The method of claim 21 , wherein the provided content includes a document. 30 . A system for generating questions based on provided content, comprising: a pointer network to automatically extract, using a machine learning model, a key phrase from the provided content using a combination of: a first model associated with a semantic relevance between the key phrase and the provided content, and a second model associated with generating syntactic boundaries of key phrases in the provided content; and a question generator to generate a question in natural language using the extracted key phrase as an answer to the question through one or more iterations of interpolating a sequence of words in the question, wherein the interpolation of the sequence of the words is based on a second probability that a word is next in the sequence of words in the question. 31 . The system of claim 30 , wherein the first model is based at least upon a first probability of an entity in the provided content being an answer conditioned on the provided content, wherein a second model is a sequence-to-sequence model for generating one or more boundaries of key phrases, wherein the entity comprises a word in the provided content, and wherein the pointer network identifies start and end locations of the key phrases in the provided content. 32 . The system of claim 30 , wherein the pointer network comprises: an encoder for encoding the provided content; and a decoder to extract the key phrases. 33 . The system of claim 30 , wherein the question generator comprises: an encoder for encoding the provided content; and a decoder to generate the question. 34 . The system of claim 33 , wherein the decoder includes an attention mechanism. 35 . The system of claim 33 , wherein the decoder includes a Long Short Term Memory (LSTM). 36 . The system of claim 30 , wherein the pointer network receives a set of questions and answers using cloud sourcing and conditions the first model for training using the received set of questions and answers. 37 . A storage device storing a program having instructions which, when executed by a computer processor, cause the processor to execute a method for generating questions based on provided content, comprising: automatically extracting, by a computer processor, a key phrase from the provided content using a combination of: a first model associated with a semantic relevance between the key phrase and the provided content, and a second model associated with generating syntactic boundaries of key phrases in the provided content; and generating, by the computer processor, a question in natural language using the extracted key phrase as an answer to the question through one or more iterations of interpolating a sequence of words in the question, wherein the interpolation of the sequence of the words is based on a second probability that a word is next in the sequence of words in the question. 38 . The storage device of claim 37 , wherein the first model is based at least upon a first probability of an entity in the provided content being an answer conditioned on the provided content, wherein a second model is a sequence-to-sequence model for generating one or more boundaries of key phrases, and wherein the entity comprises a word in the provided content. 39 . The storage device of claim 37 , wherein the method further comprises conditioning a question generation model based on the one or more key phrases, wherein the question generation model is used for generating the question. 40 . The storage device of claim 37 , the method further comprising: receiving a set of questions and answers using cloud sourcing; and conditioning the first model for training using the received set of questions and answers.

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021134173A1 cover?
A method, system, and storage device storing a computer program, for generating questions based on provided content, such as, for example, a document having words. The method comprises automatically estimating the probability of interesting phrases in the provided content, and generating a question in natural language based on the estimating. In one example embodiment herein, the estimating inc…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G09B7/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 06 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).