Learning skills from video demonstrations

US11524401B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-11524401-B1
Application numberUS-202016829277-A
CountryUS
Kind codeB1
Filing dateMar 25, 2020
Priority dateMar 28, 2019
Publication dateDec 13, 2022
Grant dateDec 13, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes determining motion imitation information for causing a system to imitate a physical task using a first machine learning model that is trained using motion information that represents a performance of the physical task, determining a predicted correction based on the motion information and a current state from the system using a second machine learning model that is trained using the motion information, determining an action to be performed by the system based on the motion imitation information and the predicted correction; and controlling motion of the system in accordance with the action.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: determining motion imitation information for causing a system to imitate a physical task using a first machine learning model that is trained using motion information that represents a performance of the physical task; determining a predicted correction based on the motion information and a current state from the system using a second machine learning model that is trained using the motion information; determining an action to be performed by the system based on the motion imitation information and the predicted correction; and controlling motion of the system in accordance with the action. 2. The method of claim 1 , wherein the motion information is based on a video that shows a demonstration of the physical task by an actor. 3. The method of claim 2 , wherein the motion information describes positions of parts of the actor at each of multiple time steps. 4. The method of claim 1 , wherein determining the predicted correction based on the motion information and the current state from the system comprises determining an encoded representation based on the motion information. 5. The method of claim 4 , wherein the encoded representation is provided to the second machine learning model as an input. 6. The method of claim 1 , wherein determining the predicted correction based on the motion information and the current state from the system comprises determining a tracking error that represents a difference between the current state and the motion information, wherein the tracking error is provided to the second machine learning model as an input. 7. The method of claim 1 , wherein the system is a simulated system and an error value for the simulated system is used as a supervision signal for reinforcement learning. 8. A non-transitory computer-readable storage device including program instructions executable by one or more processors that, when executed, cause the one or more processors to perform operations, the operations comprising: determining motion imitation information for causing a system to imitate a physical task using a first machine learning model that is trained using motion information that represents a performance of the physical task; determining a predicted correction based on the motion information and a current state from the system using a second machine learning model that is trained using the motion information; determining an action to be performed by the system based on the motion imitation information and the predicted correction; and controlling motion of the system in accordance with the action. 9. The non-transitory computer-readable storage device of claim 8 , wherein the motion information is based on a video that shows a demonstration of the physical task by an actor. 10. The non-transitory computer-readable storage device of claim 9 , wherein the motion information describes positions of parts of the actor at each of multiple time steps. 11. The non-transitory computer-readable storage device of claim 8 , wherein determining the predicted correction based on the motion information and the current state from the system comprises determining an encoded representation based on the motion information. 12. The non-transitory computer-readable storage device of claim 8 , wherein determining the predicted correction based on the motion information and the current state from the system comprises determining a tracking error that represents a difference between the current state and the motion information, wherein the tracking error is provided to the second machine learning model as an input. 13. The non-transitory computer-readable storage device of claim 8 , wherein the system is a simulated system and an error value for the simulated system is used as a supervision signal for reinforcement learning. 14. An apparatus, comprising: a memory; and one or more processors that are configured to execute instructions that are stored in the memory, wherein the instructions, when executed, cause the one or more processors to: determine motion imitation information for causing a system to imitate a physical task using a first machine learning model that is trained using motion information that represents a performance of the physical task; determine a predicted correction based on the motion information and a current state from the system using a second machine learning model that is trained using the motion information; determine an action to be performed by the system based on the motion imitation information and the predicted correction; and control motion of the system in accordance with the action. 15. The apparatus of claim 14 , wherein the motion information is based on a video that shows a demonstration of the physical task by an actor. 16. The apparatus of claim 15 , wherein the motion information describes positions of parts of the actor at each of multiple time steps. 17. The apparatus of claim 14 , wherein the instructions that cause the one or more processors to determine the predicted correction based on the motion information and the current state from the system further cause the one or more processors to determine an encoded representation based on the motion information. 18. The apparatus of claim 14 , wherein the instructions that cause the one or more processors to determine the predicted correction based on the motion information and the current state from the system further cause the one or more processors to determine a tracking error that represents a difference between the current state and the motion information, wherein the tracking error is provided to the second machine learning model as an input. 19. The apparatus of claim 14 , wherein the system is a simulated system and an error value for the simulated system is used as a supervision signal for reinforcement learning. 20. The apparatus of claim 14 , wherein the system is a robotic system.

Assignees

Inventors

Classifications

  • Human to robot skill transfer · CPC title

  • Recording and playback systems, i.e. in which the program is recorded from a cycle of operations, e.g. the cycle of operations being manually controlled, after which this record is played back on the same machine · CPC title

  • B25J9/1664Primary

    characterised by motion, path, trajectory planning · CPC title

  • Inference or reasoning models · CPC title

  • Analysis of motion (motion estimation for coding, decoding, compressing or decompressing digital video signals H04N19/43, H04N19/51) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11524401B1 cover?
A method includes determining motion imitation information for causing a system to imitate a physical task using a first machine learning model that is trained using motion information that represents a performance of the physical task, determining a predicted correction based on the motion information and a current state from the system using a second machine learning model that is trained usi…
Who is the assignee on this patent?
Zhang Jian, Srouji Mario J, Salakhutdinov Ruslan, and 1 more
What technology area does this patent fall under?
Primary CPC classification B25J9/1664. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Tue Dec 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).