Multi-user, multi-domain dialog system

US10181322B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10181322-B2
Application numberUS-201314137566-A
CountryUS
Kind codeB2
Filing dateDec 20, 2013
Priority dateDec 20, 2013
Publication dateJan 15, 2019
Grant dateJan 15, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A dialog system for use in a multi-user, multi-domain environment. The dialog system understands user requests when multiple users are interacting with each other as well as the dialog system. The dialog system uses multi-human conversational context to improve domain detection. Using interactions between multiple users allows the dialog system to better interpret machine directed conversational inputs in multi-user conversational systems. The dialog system employs topic segmentation to chunk conversations for determining context boundaries. Using general topic segmentation methods, as well as the specific domain detector trained with conversational inputs collected by a single user system, allows the dialog system to better determine the relevant context. The use of conversational context helps reduce the domain detection error rate, especially in certain domains, and allows for better interactions with users when the machine addressed turns are not recognized or are ambiguous.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for improving understanding of a computer-addressed conversational input in a multi-user conversation by a human-computer interaction system, the method comprising: receiving input from on an electronic device from the multi-user conversation; translating the input into conversational inputs using an input decoder, the conversational inputs including the computer-addressed conversational input and conversational inputs occurring prior to the computer-addressed conversational input, the computer-addressed conversational input instructing a computer to perform an action; parsing, using natural language processing, conversational inputs into topic segments; extracting, using an engine trained based on a statistical model, a conversational context of the conversational inputs within at least one of the topic segments, wherein the conversational context for the computer-addressed conversational input is extracted based on at least contextual features obtained from the conversational inputs occurring prior to the computer-addressed conversational input, wherein the conversational inputs occurring prior to the computer-addressed conversational input include at least one other computer-addressed conversational input and at least one human-addressed conversational input, and the contextual features include (a) a domain of a majority of the conversational inputs occurring prior to the computer-addressed conversational input and (b) a distribution score associated with at least one of the conversational inputs occurring prior to the computer-addressed conversational input. 2. The method of claim 1 , further comprising detecting at least one of a slot or an intent of the computer-addressed conversational input. 3. The method of claim 1 wherein the extracting determines a domain of the computer-addressed conversational input. 4. The method of claim 1 wherein the parsing comprises: collecting the conversational inputs into blocks using a selected window size; determining a domain distribution score for each block; evaluating similarity between the blocks using the domain distribution scores; creating a topic boundary between two blocks when the similarity between the two blocks falls outside of a threshold. 5. The method of claim 4 further comprising calculating an average of domain distribution scores of the conversational inputs contained within each block. 6. The method of claim 4 further comprising: calculating similarity of the domain distribution scores of two blocks; comparing the similarity to the threshold. 7. The method of claim 1 further comprising: collecting a selected number of conversational inputs into a feature extraction group; selecting a label associated with the conversational inputs in the feature extraction group; and establishing a feature vector from at least one topic distribution score. 8. The method of claim 7 further comprising selecting the topic distribution score of a prior conversational input within the feature extraction group that is closest in proximity to the computer-addressed conversational input and has a same label as the selected label. 9. The method of claim 7 wherein establishing a feature vector comprises calculating an average topic distribution score for a prior conversational input in the feature extraction group. 10. The method of claim 7 further comprising skipping dialog commands, interjections, and conversational inputs with low domain confidence scores. 11. The method of claim 7 further comprising skipping prior conversational inputs that do not contain nouns or noun phrases. 12. The method of claim 7 further comprising selecting a domain of the computer-addressed conversational input based on the selected label, the feature vector, and a contiguous sequence of n words from the computer-addressed conversational input. 13. A computer readable article of manufacture device containing computer executable instructions which, when executed by a computer, cause the computer to perform a method for detecting a domain of a computer-addressed conversational input in a human-computer interaction system, the method comprising: receiving input on an electronic device from the multi-user conversation; translating the input into conversational inputs using an input decoder, the conversational inputs including the computer-addressed conversational input and conversational inputs occurring prior to the computer-addressed conversational input, the computer-addressed conversational input instructing a computer to perform an action; parsing, using natural language processing, conversational inputs into topic segments; extracting, using an engine trained based on a statistical model, a conversational context of the conversational inputs within at least one of the topic segments, wherein the conversational context for the computer-addressed conversational input is extracted based on at least contextual features obtained from the conversational inputs occurring prior to the computer-addressed conversational input, wherein the conversational inputs occurring prior to the computer-addressed conversational input include at least one other computer-addressed conversational input and at least one human-addressed conversational input, and the contextual features include (a) a domain of a majority of the conversational inputs occurring prior to the computer-addressed conversational input and (b) a distribution score associated with at least one of the conversational inputs occurring prior to the computer-addressed conversational input. 14. The computer readable device of claim 13 , further comprising detecting at least one of a slot or an intent of the computer-addressed conversational input. 15. The computer readable device of claim 13 , wherein the extracting determines a domain of the computer-addressed conversational input. 16. The computer readable device of claim 13 , wherein the parsing includes: collecting the conversational inputs into blocks using a selected window size; determining a domain distribution score for each block; evaluating similarity between the blocks using the domain distribution scores; creating a topic boundary between two blocks when the similarity between the two blocks falls outside of a threshold. 17. A computer system comprising a processor electrically connected to computer readable storage media, the computer-readable storage media storing instructions that, when executed, perform a computer-implemented method for detecting a domain of a computer-addressed conversational input in a human-computer interaction system, the method comprising: receiving input on an electronic device from the multi-user conversation; translating the input into conversational inputs using an input decoder, the conversational inputs including the computer-addressed conversational input and conversational inputs occurring prior to the computer-addressed conversational input, the computer-addressed conversational input instructing a computer to perform an action; parsing, using natural language processing, conversational inputs into topic segments; extracting, using an engine trained based on a statistical model, a conversational context of the conversational inputs within at least one of the topic segments, wherein the conversational context for the computer-addressed conversational input is extracted based on at least contextual features obtained from the conversational inputs occurring prior to the computer-addressed conversational input, wherein the conversationa

Assignees

Inventors

Classifications

  • of application context · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10181322B2 cover?
A dialog system for use in a multi-user, multi-domain environment. The dialog system understands user requests when multiple users are interacting with each other as well as the dialog system. The dialog system uses multi-human conversational context to improve domain detection. Using interactions between multiple users allows the dialog system to better interpret machine directed conversationa…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 15 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).