Log-linear dialog manager that determines expected rewards and uses hidden states and actions

US9311430B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9311430-B2
Application numberUS-201314106968-A
CountryUS
Kind codeB2
Filing dateDec 16, 2013
Priority dateDec 16, 2013
Publication dateApr 12, 2016
Grant dateApr 12, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A dialog manager receives previous user actions and previous observations and current observations. Previous and current user states, previous user actions, current user actions, future system actions, and future observations are hypothesized. The user states, the user actions, and the user observations are hidden. A feature vector is extracted based on the user states, the system actions, the user actions, and the observations. An expected reward of each current action is based on a log-linear model using the feature vectors. Then, the current action that has an optimal expected reward is outputted.

First claim

Opening claim text (preview).

We claim: 1. A dialog manager comprising the steps of: receiving previous user actions and previous observations and current observations; hypothesizing previous and current user states, previous user actions, current user actions, future system actions, and future observations, wherein the user states, the user actions, and the user observations are hidden; extracting a feature vector based on the user states, the system actions, the user actions, and the observations; determining an expected reward of each current action based on a log-linear model using the feature vectors; and outputting the current action that has an optimal expected reward, wherein the steps are performed in a processor. 2. The dialog manager of claim 1 , wherein a probabilistic model has four variables at each time step t, including two observable variables: the system action a t , the observation o t , and two latent variables: the user action u t and the user state s t . 3. The method of claim 2 , wherein a dialog session of duration T is represented by four variable sequences s 0:T ,a 0:T ,o 1:T ,u 1:T . 4. The method of claim 3 , wherein the dialog session is represented by a factor graph, which corresponds to a joint probability distribution p ⁡ ( s 0 : T , a 0 : T , u 1 : T , o 1 : T ) = 1 Z θ ⁢ exp ⁡ [ ∑ t = 0 T ⁢ θ f T ⁢ φ f ⁢ ( s t , a t , s t + 1 , u t + 1 ) + ∑ t = 1 T ⁢ θ g T ⁢ φ g ⁡ ( u t , o t ) ] , where Z θ is a normalizing constant, φ f and φ g are the feature vectors, and θ f and θ g vectors of corresponding model parameters, respectively. 5. The method of claim 1 , wherein the observations are spoken words or text. 6. The method of claim 3 , wherein S, U, A, and O represent the variable spaces that is a set of all possible values for the variables s t , u t , a t , and o t , respectively. 7. The method of claim 6 , further comprising: defining the variable spaces S, U, and A using a context-free grammar (CFG) including a set of production rules. 8. The method of claim 7 , wherein each variable space is defined as a set of all possible parse trees that can be generated by the CFG. 9. The method of claim 3 , wherein a planning part of the dialog manger determines an optimal system action a τ , given all previous system actions a 0:τ-1 and previous observations o 1:τ . 10. The method of claim 3 , further comprising: maximizing an objective function E s 0 : T , a

Assignees

Inventors

Classifications

  • Discourse or dialogue representation · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • Natural language query formulation or dialogue systems · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Parsing for meaning understanding · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9311430B2 cover?
A dialog manager receives previous user actions and previous observations and current observations. Previous and current user states, previous user actions, current user actions, future system actions, and future observations are hypothesized. The user states, the user actions, and the user observations are hidden. A feature vector is extracted based on the user states, the system actions, the …
Who is the assignee on this patent?
Mitsubishi Electric Res Lab
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 12 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).