System and method for dialog modeling

US9972307B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9972307-B2
Application numberUS-201514845634-A
CountryUS
Kind codeB2
Filing dateSep 4, 2015
Priority dateNov 26, 2008
Publication dateMay 15, 2018
Grant dateMay 15, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems, computer-implemented methods, and computer-readable media for dialog modeling. The method includes receiving spoken dialogs annotated to indicate dialog acts and task/subtask information, parsing the spoken dialogs with a hierarchical, parse-based dialog model which operates incrementally from left to right and which only analyzes a preceding dialog context to generate parsed spoken dialogs, and constructing a functional task structure of the parsed spoken dialogs. The method can further either interpret user utterances with the functional task structure of the parsed spoken dialogs or plan system responses to user utterances with the functional task structure of the parsed spoken dialogs. The parse-based dialog model can be a shift-reduce model, a start-complete model, or a connection path model.

First claim

Opening claim text (preview).

We claim: 1. A method comprising: training a plurality of hierarchical, parsed-based dialog models, wherein each of the plurality of hierarchical, parsed-based dialog models operates incrementally from left to right and only analyzes an immediately preceding dialog context and wherein the plurality of hierarchical, parsed-based dialog models comprises one of a shift-reduce model, a start-complete model or a connection path model, and wherein: when the plurality of hierarchical, parsed-based dialog models comprises a shift-reduce model, the shift-reduce model has a stack and a tree which (a) shifts each utterance onto the stack, (b) inspects the stack, and (c) based on a stack inspection, performs a reduce action that creates subtrees in the tree; when the plurality of hierarchical, parsed-based dialog models comprises a start-complete model, the start-complete model uses a stack to maintain a global parse state and produces a dialog task structure directly without producing an equivalent tree; and when the plurality of hierarchical, parsed-based dialog models comprises a connection path model, the connection path model does not use a stack to maintain a global parse state, and wherein the connection path model (a) directly predicts a connection path from a root to a terminal for each received spoken dialog, and (b) creates a parse tree representing the connection path for each received spoken dialog; parsing, via a processor, spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs; constructing a functional task structure of the parsed spoken dialogs; predicting a likely next dialog act in a spoken dialog using the functional task structure and the hierarchical, parsed-based dialog model, the likely next dialog act corresponding to a next utterance comprising a clause to be spoken by a speaker, wherein the predicting occurs prior to receiving the next utterance; and selecting a language model for the next utterance based on the likely next dialog act. 2. The method of claim 1 , further comprising measuring a dialog efficiency at different dialog stages based on the language model selected. 3. A system comprising: a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising: training a plurality of hierarchical, parsed-based dialog models wherein the plurality of hierarchical, parsed-based dialog models operate incrementally from left to right and only analyze an immediately preceding dialog context and wherein the plurality of hierarchical, parsed-based dialog models comprises one of a shift-reduce model, a start-complete model or a connection path model, and wherein: when the plurality of hierarchical, parsed-based dialog models comprises a shift-reduce model, the shift-reduce model has a stack and a tree which (a) shifts each utterance onto the stack, (b) inspects the stack, and (c) based on a stack inspection, performs a reduce action that creates subtrees in the tree; when the plurality of hierarchical, parsed-based dialog models comprises a start-complete model, the start-complete model uses a stack to maintain a global parse state and produces a dialog task structure directly without producing an equivalent tree; and when the plurality of hierarchical, parsed-based dialog models comprises a connection path model, the connection path model does not use a stack to maintain a global parse state, and wherein the connection path model (a) directly predicts a connection path from a root to a terminal for each received spoken dialog, and (b) creates a parse tree representing the connection path for each received spoken dialog; parsing spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs; constructing a functional task structure of the parsed spoken dialogs; predicting a likely next dialog act in a spoken dialog using the functional task structure and the hierarchical, parsed-based dialog model, the likely next dialog act corresponding to a next utterance comprising a clause to be spoken by a speaker, wherein the predicting occurs prior to receiving the next utterance; and selecting a language model for the next utterance based on the likely next dialog act. 4. The system of claim 3 , the computer-readable storage medium having additional instructions stored which, when executed by the processor, cause the processor to perform operations comprising measuring a dialog efficiency at different dialog stages based on the language model selected. 5. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising: training a plurality of hierarchical, parsed-based dialog models wherein the plurality of hierarchical, parsed-based dialog models operate incrementally from left to right and only analyze an immediately preceding dialog context and wherein the plurality of hierarchical, parsed-based dialog models comprises one of a shift-reduce model, a start-complete model or a connection path model, and wherein: when the plurality of hierarchical, parsed-based dialog models comprises a shift-reduce model, the shift-reduce model has a stack and a tree which (a) shifts each utterance onto the stack, (b) inspects the stack, and (c) based on a stack inspection, performs a reduce action that creates subtrees in the tree; when the plurality of hierarchical, parsed-based dialog models comprises a start-complete model, the start-complete model uses a stack to maintain a global parse state and produces a dialog task structure directly without producing an equivalent tree; and when the plurality of hierarchical, parsed-based dialog models comprises a connection path model, the connection path model does not use a stack to maintain a global parse state, and wherein the connection path model (a) directly predicts a connection path from a root to a terminal for each received spoken dialog, and (b) creates a parse tree representing the connection path for each received spoken dialog; parsing spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs; constructing a functional task structure of the parsed spoken dialogs; predicting a likely next dialog act in a spoken dialog using the functional task structure and the hierarchical, parsed-based dialog model, the likely next dialog act corresponding to a next utterance comprising a clause to be spoken by a speaker, wherein the predicting occurs prior to receiving the next utterance; and selecting a language model for the next utterance based on the likely next dialog act.

Assignees

Inventors

Classifications

  • Hierarchical processing, e.g. outlines · CPC title

  • Segmentation; Word boundary detection · CPC title

  • Interactive procedures · CPC title

  • the extracted parameters being prediction coefficients · CPC title

  • Use of codes for handling textual entities · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9972307B2 cover?
Disclosed herein are systems, computer-implemented methods, and computer-readable media for dialog modeling. The method includes receiving spoken dialogs annotated to indicate dialog acts and task/subtask information, parsing the spoken dialogs with a hierarchical, parse-based dialog model which operates incrementally from left to right and which only analyzes a preceding dialog context to gene…
Who is the assignee on this patent?
At & T Ip I Lp
What technology area does this patent fall under?
Primary CPC classification G10L15/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 15 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).