Action-based models to identify learned tasks

US9384448B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9384448-B2
Application numberUS-201113339297-A
CountryUS
Kind codeB2
Filing dateDec 28, 2011
Priority dateDec 28, 2011
Publication dateJul 5, 2016
Grant dateJul 5, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems provided herein include a learning environment and an agent. The learning environment includes an avatar and an object. A state signal corresponding to a state of the learning environment includes a location and orientation of the avatar and the object. The agent is adapted to receive the state signal, to issue an action capable of generating at least one change in the state of the learning environment, to produce a set of observations relevant to a task, to hypothesize a set of action models configured to explain the observations, and to vet the set of action models to identify a learned model for the task.

First claim

Opening claim text (preview).

The invention claimed is: 1. A system, comprising: a three dimensional learning environment comprising an avatar and an object, and a state signal corresponding to a state of the learning environment comprising a location and orientation of the avatar and the object, wherein the object is a camera that produces a set of synthetic images of the three dimensional learning environment and wherein the avatar comprises articulated body parts, such that a processor computes and defines coordinates of each body part of the avatar; an agent that receives the state signal via the set of synthetic images and issues adjustment instructions that employ depth perception to generate at least one change in the state of the learning environment; an oracle that generates a reinforcement signal in response to the change in the state of the learning environment and communicates the reinforcement signal to the agent by considering a computer script that orchestrates changes in body-part to body-part and body-part to object spatial relationships with respect to gravity, inertia, and physical occupancy, wherein the agent utilizes the reinforcement signal to produce a set of observations relevant to a task, hypothesizes a set of Internal Action Models configured to explain the observations, and vets the set of Internal Action Models to identify a learned model for the task such that a sum of the reinforcement signals received from the oracle selects the Internal Action Model for the task; wherein the agent builds a set of input image sequences with increasing reinforcement signals that are indicative of the task; and wherein the agent executes the task autonomously to perform a variety of goal-oriented actions based on real-world imagery. 2. The system of claim 1 , wherein the avatar comprises a plurality of ellipsoids coupled together by a plurality of joint angles. 3. The system of claim 2 , wherein the action issued by the agent comprises a signal corresponding to one of the plurality of joint angles and a negative or positive increment for the one joint angle. 4. The system of claim 1 , wherein hypothesizing the set of Internal Action Models comprises implementing a sequence of springs model. 5. The system of claim 1 , wherein the agent utilizes the learned model to recognize the task when performed by an external agent. 6. The system of claim 1 , wherein vetting the set of Internal Action Models comprises initializing the learning environment to a starting state, attempting a plurality of possible actions, identifying an action of the plurality of actions corresponding to a positive response from an identified Internal Action Model of the set of Internal Action Models, and recording reinforcement signals from the oracle corresponding to each action. 7. The system of claim 1 , comprises one or more virtual camera generates one or more images of the avatar received by the agent as part of the state signal. 8. The system of claim 1 , wherein the agent reduces the set of hypothesized Internal Action Models by computing a consistency function and vets the reduced set of hypothesized Internal Action Models.

Assignees

Inventors

Classifications

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Physics · mapped topic

  • G06N99/005Primary

    Physics · mapped topic

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9384448B2 cover?
Systems provided herein include a learning environment and an agent. The learning environment includes an avatar and an object. A state signal corresponding to a state of the learning environment includes a location and orientation of the avatar and the object. The agent is adapted to receive the state signal, to issue an action capable of generating at least one change in the state of the lear…
Who is the assignee on this patent?
Tu Peter Henry, Yu Ting, Gao Dashan, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06N99/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 05 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).