What technology area does this patent fall under?

Primary CPC classification B25J9/163. Mapped technology areas include Operations & Transport.

When was this patent published?

Publication date Tue Feb 03 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Robotic demonstration learning

US12539605B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12539605-B2
Application number	US-202318514574-A
Country	US
Kind code	B2
Filing date	Nov 20, 2023
Priority date	May 21, 2020
Publication date	Feb 3, 2026
Grant date	Feb 3, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using simulated local demonstration data for robotic demonstration learning. One of the methods includes receiving perceptual data of a workcell of a robot to be configured to execute a task according to a skill template, wherein the skill template specifies one or more subtasks required to perform the skill, wherein at least one of the subtasks is a demonstration subtask that relies on learning visual characteristics of the workcell. A virtual model is generated of a portion of the workcell. A training system generates simulated local demonstration data from the virtual model of the portion of the workcell and tunes a base control policy for the demonstration subtask using the simulated local demonstration data generated from the virtual model of the portion of the workcell.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method comprising: receiving perceptual data of a workcell of a robot configured to execute a task according to a skill template, wherein the skill template specifies a plurality of subtasks required to perform a skill, wherein the plurality of subtasks includes at least one demonstration subtask that relies on visual characteristics of the workcell; generating actual local demonstration data from a workcell demonstration of the demonstration subtask by operating the robot in the workcell while capturing sensor data generated by a plurality of sensors in the workcell of the robot, wherein the actual local demonstration data comprises task state representations, and each task state representation is generated by processing sensor output of each sensor of the plurality of sensors by a different respective neural network of a plurality of neural networks to generate, by each neural network, a different portion of the task state representation; obtaining a base control policy for the demonstration subtask; and tuning, using the actual local demonstration data from the workcell demonstration, the base control policy of the robot to follow an updated control policy for the demonstration subtask. 2 . The method of claim 1 , wherein generating the actual local demonstration data comprises combining the task state representations into a single task state representation. 3 . The method of claim 2 , wherein the sensor data has a first dimension, and the single task state representation has a second dimension, the second dimension being lower than the first dimension. 4 . The method of claim 1 , wherein tuning the base control policy comprises: generating, based on the actual local demonstration data, one or more modifications to adjust the robot from the base control policy to the updated control policy. 5 . The method of claim 1 , wherein tuning the base control policy to the updated control policy comprises: generating one or more commands for execution by the robot, wherein the one or more commands, when executed by the robot, causes the robot to follow the updated control policy. 6 . The method of claim 1 , further comprising: generating, by the updated control policy, a proposed command for execution by the robot; generating, using the proposed command and the actual local demonstration data of the workcell demonstration as an input to a training system, one or more parameter corrections for the updated control policy; and adjusting, using the one or more parameter corrections and the training system, one or more parameters of the updated control policy to obtain a new control policy. 7 . The method of claim 6 , wherein the training system is trained by one or more machine learning techniques to generate the one or more parameter corrections. 8 . The method of claim 1 , wherein tuning the base control policy at least partially overlaps with a user collecting the actual local demonstration data from the robot. 9 . The method of claim 1 , wherein the perceptual data comprises a camera image, depth camera data, lidar scan data, or laser scan data of the workcell. 10 . The method of claim 1 , wherein the actual local demonstration data includes joint data of the robot representing joint angles of the robot during the workcell demonstration. 11 . A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: receiving perceptual data of a workcell of a robot configured to execute a task according to a skill template, wherein the skill template specifies a plurality of subtasks required to perform a skill, wherein the plurality of subtasks includes at least one demonstration subtask that relies on visual characteristics of the workcell; generating actual local demonstration data from a workcell demonstration of the demonstration subtask by operating the robot in the workcell while capturing sensor data generated by a plurality of sensors in the workcell of the robot, wherein the actual local demonstration data comprises task state representations, and each task state representation is generated by processing sensor output of each sensor of the plurality of sensors by a different respective neural network of a plurality of neural networks to generate, by each neural network, a different portion of the task state representation; obtaining a base control policy for the demonstration subtask; and tuning, using the actual local demonstration data from the workcell demonstration, the base control policy of the robot to follow an updated control policy for the demonstration subtask. 12 . The system of claim 11 , wherein generating the actual local demonstration data comprises combining the task state representations into a single task state representation. 13 . The system of claim 12 , wherein the sensor data has a first dimension, and the single task state representation has a second dimension, the second dimension being lower than the first dimension. 14 . The system of claim 11 , wherein tuning the base control policy comprises: generating, based on the actual local demonstration data, one or more modifications to adjust the robot from the base control policy to the updated control policy. 15 . The system of claim 11 , wherein tuning the base control policy to the updated control policy comprises: generating one or more commands for execution by the robot, wherein the one or more commands, when executed by the robot, causes the robot to follow the updated control policy. 16 . The system of claim 11 , the operations further comprising: generating, by the updated control policy, a proposed command for execution by the robot; generating, using the proposed command and the actual local demonstration data of the workcell demonstration as an input to a training system, one or more parameter corrections for the updated control policy; and adjusting, using the one or more parameter corrections and the training system, one or more parameters of the updated control policy to obtain a new control policy. 17 . One or more non-transitory computer storage media encoded with computer program instructions that when executed by one or more computers cause the one or more computers to perform operations comprising: receiving perceptual data of a workcell of a robot configured to execute a task according to a skill template, wherein the skill template specifies a plurality of subtasks required to perform a skill, wherein the plurality of subtasks includes at least one demonstration subtask that relies on visual characteristics of the workcell; generating actual local demonstration data from a workcell demonstration of the demonstration subtask by operating the robot in the workcell while capturing sensor data generated by a plurality of sensors in the workcell of the robot, wherein the actual local demonstration data comprises task state representations, and each task state representation is generated by processing sensor output of each sensor of the plurality of sensors by a different respective neural network of a plurality of neural networks to generate, by each neural network, a different portion of the task state representation; obtaining a base control policy for the demonstration subtask; and tuning, using the actual local demonstration data from the workcell demonstration, the base control policy of the robot to follow an updated control po

Assignees

Intrinsic Innovation Llc

Inventors

Classifications

B25J9/161
Hardware, e.g. neural networks, fuzzy logic, interfaces, processor · CPC title
B25J9/1617
Cellular, reconfigurable manipulator, e.g. cebot · CPC title
B25J9/1605
Simulation of manipulator lay-out, design, modelling of manipulator · CPC title
B25J9/163Primary
learning, adaptive, model based, rule based expert control · CPC title
G05B2219/39244
Generic motion control operations, primitive skills each for special task · CPC title

Patent family

Related publications grouped by family.

View patent family 78607713

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12539605B2 cover?: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using simulated local demonstration data for robotic demonstration learning. One of the methods includes receiving perceptual data of a workcell of a robot to be configured to execute a task according to a skill template, wherein the skill template specifies one or more subtasks required to perfo…
Who is the assignee on this patent?: Intrinsic Innovation Llc
What technology area does this patent fall under?: Primary CPC classification B25J9/163. Mapped technology areas include Operations & Transport.
When was this patent published?: Publication date Tue Feb 03 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Machine learning device, robot controller, robot system, and machine learning method for learning action pattern of human

Controller and machine learning device

Method, apparatus and system for robotic programming

Predictive robotic controller apparatus and methods

Framework and methods of diverse exploration for fast and safe policy improvement

Robotic training apparatus and methods

Apparatus and methods for reinforcement-guided supervised learning

Frequently asked questions