Learning robotic tasks using one or more neural networks

US12430702B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12430702-B2
Application number: US-202117458221-A
Country: US
Kind code: B2
Filing date: Aug 26, 2021
Priority date: Jan 23, 2018
Publication date: Sep 30, 2025
Grant date: Sep 30, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various embodiments enable a robot, or other autonomous or semi-autonomous device or system, to receive data involving the performance of a task in the physical world. The data can be provided as input to a perception network to infer a set of percepts about the task, which can correspond to relationships between objects observed during the performance. The percepts can be provided as input to a plan generation network, which can infer a set of actions as part of a plan. Each action can correspond to one of the observed relationships. The plan can be reviewed and any corrections made, either manually or through another demonstration of the task. Once the plan is verified as correct, the plan (and any related data) can be provided as input to an execution network that can infer instructions to cause the robot, and/or another robot, to perform the task.
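The three-stage flow described in the abstract (perception, plan generation, review, execution) can be sketched as follows. This is a minimal illustration and not the patented implementation: every name here (`Percept`, `Action`, `perceive`, `generate_plan`, `review`, `execute`) is hypothetical, and each neural network is replaced by a trivial stand-in function so that the data flow is visible.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Percept:
    """An observed relationship between two objects (hypothetical type)."""
    subject: str
    relation: str
    reference: str


@dataclass(frozen=True)
class Action:
    """One plan step; each step corresponds to one observed relationship."""
    verb: str
    subject: str
    reference: str


def perceive(observations: List[dict]) -> List[Percept]:
    # Stand-in for the perception network. Assumption: the toy records
    # already name the relationship; a real system would infer it from data.
    return [Percept(o["subject"], o["relation"], o["reference"]) for o in observations]


def generate_plan(percepts: List[Percept]) -> List[Action]:
    # Stand-in for the plan generation network: one action per percept.
    return [Action("place_" + p.relation, p.subject, p.reference) for p in percepts]


def review(plan: List[Action], corrections: Optional[Dict[int, Action]] = None) -> List[Action]:
    # Manual review step: replace any step whose index appears in corrections.
    corrections = corrections or {}
    return [corrections.get(i, step) for i, step in enumerate(plan)]


def execute(plan: List[Action]) -> List[str]:
    # Stand-in for the execution network: emits instruction strings
    # instead of real robot commands.
    return [f"{a.verb}({a.subject}, {a.reference})" for a in plan]


observations = [
    {"subject": "red_cube", "relation": "on", "reference": "blue_cube"},
    {"subject": "green_cube", "relation": "on", "reference": "red_cube"},
]
plan = review(generate_plan(perceive(observations)))
instructions = execute(plan)
print(instructions)
```

Keeping the plan as an explicit intermediate value is what makes the review step possible: a person can inspect or correct it before anything is handed to the execution stage.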

First claim

Opening claim text (preview).

What is claimed is:

1. A processor comprising: processing circuitry to use a first portion of one or more neural networks to detect one or more objects in one or more images and a second portion of the one or more neural networks to identify one or more spatial relationships between the one or more objects; and cause one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships.

2. The processor of claim 1, wherein the processing circuitry is further to cause one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships, in substantially a same way in which the one or more tasks are observed being performed.

3. The processor of claim 1, wherein the processing circuitry is further to cause one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships, in substantially a different way than the one or more tasks are observed being performed.

4. The processor of claim 1, wherein one or more tasks are performed based, at least in part, on the identified one or more spatial relationships, at least in part, by a robot, a semi-automated machine, or a human.

5. The processor of claim 1, wherein the processing circuitry is further to generate a plan including one or more steps to perform one or more tasks based, at least in part, on the identified one or more spatial relationships.

6. The processor of claim 5, wherein the one or more steps of the plan are human-readable.

7. The processor of claim 1, wherein the one or more neural networks are to identify a task being performed based, at least in part, on the identified spatial relationships.

8. A method, comprising: using a first portion of one or more neural networks to detect one or more objects in one or more images and a second portion of the one or more neural networks to identify one or more spatial relationships between the one or more objects; and cause one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships.

9. The method of claim 8, further comprising: causing one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships, in substantially a same way in which the one or more tasks are observed being performed.

10. The method of claim 8, further comprising: causing one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships, in substantially a different way than the one or more tasks are observed being performed.

11. The method of claim 8, wherein one or more tasks are performed based, at least in part, on the identified one or more spatial relationships, at least in part, by a robot, a semi-automated machine, or a human.

12. The method of claim 8, further comprising: generating a plan including one or more steps to perform one or more tasks based, at least in part, on the identified one or more spatial relationships.

13. The method of claim 12, wherein the one or more steps of the plan are human-readable.

14. The method of claim 13, further comprising: providing the plan, including the human-readable steps, to one or more second neural networks to generate instructions for a robotic device to perform the one or more tasks.

15. The method of claim 8, wherein the one or more neural networks are to identify a task being performed based, at least in part, on the identified one or more spatial relationships.

16. A system, comprising: one or more cameras to capture image data of one or more motions of one or more objects; and one or more processors to use a first portion of one or more neural networks to detect the one or more objects in the captured image data and a second portion of the one or more neural networks to identify one or more one or more spatial relationships between the one or more objects, and cause one or more tasks to be performed based, at least in part, on the identified one or more spatial relationships.

17. The system of claim 16, wherein the one or more processors are further to cause one or more tasks to be performed based, at least in part, on the one or more identified one or more spatial relationships, in substantially a same way in which the one or more tasks are observed being performed.

18. The system of claim 16, wherein the one or more processors are further to cause one or more tasks to be performed based, at least in part, on the one or more identified one or more spatial relationships, in substantially a different way than the one or more tasks are observed being performed.

19. The system of claim 16, wherein one or more tasks are performed based, at least in part, on the one or more identified one or more spatial relationships, at least in part, by a robot, a semi-automated machine, or a human.

20. The system of claim 16, wherein the one or more processors are further to generate a plan including one or more steps to perform one or more tasks based, at least in part, on the one or more identified relationships, and wherein the one or more steps of the plan are human-readable.
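As a rough illustration of the two-part structure recited in the independent claims (detect objects, then identify spatial relationships between them, then derive human-readable plan steps), the sketch below may help. It rests on loud assumptions: the claims call for neural network portions, whereas this example substitutes hard-coded detections and a simple bounding-box geometry rule. The names `Detection`, `spatial_relations`, and `to_plan`, and the relation labels `above` and `left_of`, are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Detection:
    """Stand-in output of the first (object detection) stage."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    @property
    def center(self) -> Tuple[float, float]:
        return ((self.x_min + self.x_max) / 2, (self.y_min + self.y_max) / 2)


def spatial_relations(dets: List[Detection]) -> List[Tuple[str, str, str]]:
    # Stand-in for the second stage: a geometric rule, not a learned model.
    relations = []
    for a in dets:
        for b in dets:
            if a is b:
                continue
            (ax, ay), (bx, by) = a.center, b.center
            overlaps_horizontally = a.x_min < b.x_max and b.x_min < a.x_max
            # Image coordinates: y grows downward, so smaller y is higher up.
            if overlaps_horizontally and ay < by:
                relations.append((a.label, "above", b.label))
            elif not overlaps_horizontally and ax < bx:
                relations.append((a.label, "left_of", b.label))
    return relations


def to_plan(relations: List[Tuple[str, str, str]]) -> List[str]:
    # Human-readable steps, one per identified relationship.
    return [
        f"Step {i}: ensure {s} is {r.replace('_', ' ')} {o}"
        for i, (s, r, o) in enumerate(relations, 1)
    ]


detections = [
    Detection("red_cube", 10, 10, 30, 30),
    Detection("blue_cube", 10, 30, 30, 50),
    Detection("green_cube", 60, 30, 80, 50),
]
relations = spatial_relations(detections)
steps = to_plan(relations)
for step in steps:
    print(step)
```

Because the steps are plain text, they could be read and checked by a person before being passed to any downstream component.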

Assignees

Nvidia Corp

Inventors

Classifications

  • Backpropagation, e.g. using gradient descent · CPC title

  • Neural network for object trajectory prediction, fuzzy for robot path · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Combinations of networks · CPC title

  • Supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12430702B2 cover?
Various embodiments enable a robot, or other autonomous or semi-autonomous device or system, to receive data involving the performance of a task in the physical world. The data can be provided as input to a perception network to infer a set of percepts about the task, which can correspond to relationships between objects observed during the performance. The percepts can be provided as input to …
Who is the assignee on this patent?
Nvidia Corp
What technology area does this patent fall under?
Primary CPC classification B25J9/161. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Sep 30, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).