Dual hand detection in teaching from demonstration

US11712797B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11712797-B2
Application numberUS-202017018674-A
CountryUS
Kind codeB2
Filing dateSep 11, 2020
Priority dateSep 11, 2020
Publication dateAug 1, 2023
Grant dateAug 1, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for dual hand detection in robot teaching from human demonstration. A camera image of the demonstrator's hands and workpieces is provided to a first neural network which determines the identity of the left and right hand of the human demonstrator from the image, and also provides cropped sub-images of the identified hands. The first neural network is trained using images in which the left and right hands are pre-identified. The cropped sub-images are then provided to a second neural network which detects the pose of both the left and right hand from the images, where the sub-image for the left hand is horizontally flipped before and after the hand pose detection if second neural network is trained with right hand images. The hand pose data is converted to robot gripper pose data and used for teaching a robot to perform an operation through human demonstration.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for dual hand detection in images, said method comprising: providing an image including left and right hands of a human; analyzing the image, using a first neural network running on a computer having a processor and memory, to determine an identity and a location in the image of the left hand and the right hand; creating a left hand sub-image and a right hand sub-image, where each of the sub-images is cropped from the image; providing the sub-images to a second neural network running on the computer, including horizontally flipping either the left hand sub-image or the right hand sub-image; analyzing the sub-images by the second neural network to determine three-dimensional (3D) coordinates of a plurality of key points on the left and right hands; and using the 3D coordinates of the key points by a robot teaching program to define gripper poses, including horizontally flipping the coordinates of the key points on either the left hand or the right hand. 2. The method according to claim 1 wherein the image is provided by a two-dimensional (2D) digital camera. 3. The method according to claim 1 wherein the first neural network is trained to distinguish the left hand from the right hand in a training process where a plurality of training images are provided to the first neural network in which left and right hands are pre-identified. 4. The method according to claim 3 wherein the first neural network analyzes the training images to identify distinguishing characteristics of left hands and right hands, including curvature and relative locations of digits. 5. The method according to claim 1 wherein each of the sub-images is cropped to include the left or right hand within a predefined margin. 6. The method according to claim 1 wherein horizontally flipping either the left hand sub-image or the right hand sub-image includes horizontally flipping the left hand sub-image when the second neural network is trained using training images of right hands, and horizontally flipping the right hand sub-image when the second neural network is trained using training images of left hands. 7. The method according to claim 1 wherein the plurality of key points on the left and right hands include thumb tips, thumb knuckles, finger tips and finger knuckles. 8. The method according to claim 1 wherein horizontally flipping the coordinates of the key points on either the left hand or the right hand includes horizontally flipping the coordinates of the key points on the hand which had its sub-image flipped before analysis by the second neural network. 9. The method according to claim 8 wherein horizontally flipping the coordinates of the key points includes horizontally flipping the coordinates across a vertical plane to restore the coordinates to their position in the image. 10. The method according to claim 1 wherein the image also includes one or more workpieces, and the gripper poses and workpiece positions and poses are used by the robot teaching program to create workpiece pick-up and placement instructions for a robot. 11. The method according to claim 10 wherein the instructions are provided to a robot controller from the computer, and the robot controller provides control commands to the robot to perform workpiece operations. 12. A method for programming a robot to perform an operation by human demonstration, said method comprising: demonstrating the operation on workpieces by a human using both hands; analyzing camera images of the hands demonstrating the operation on the workpieces, by a computer, to create demonstration data including gripper poses computed from three-dimensional (3D) coordinates of key points of the hands, where the 3D coordinates of the key points are determined from the images by a first neural network used to identify left and right hands in the images and a second neural network used to compute the 3D coordinates in sub-images of the identified left and right hands; generating robot motion commands, based on the demonstration data, to cause the robot to perform the operation on the workpieces; and performing the operation on the workpiece by the robot. 13. The method according to claim 12 wherein the demonstration data includes, at a grasping step of the operation, position and orientation of a hand coordinate frame, a gripper coordinate frame corresponding to the hand coordinate frame, and a workpiece coordinate frame. 14. The method according to claim 12 wherein the first neural network is trained to distinguish the left hand from the right hand in a training process where a plurality of training images are provided to the first neural network in which left and right hands are pre-identified. 15. The method according to claim 12 wherein either the left hand sub-images or the right hand sub-images are horizontally flipped before being provided to the second neural network, and the 3D coordinates of the key points of the left hand or the right hand are horizontally flipped after being computed by the second neural network. 16. The method according to claim 15 wherein the left hand sub-images and the 3D coordinates of the key points of the left hand are horizontally flipped when the second neural network is trained using training images of right hands. 17. A system for dual hand detection in images used to program a robot to perform an operation by human demonstration, said system comprising: a camera; a computer having a processor and memory and in communication with the camera, said computer being configured to perform steps including; analyzing an image including left and right hands of a human, using a first neural network, to determine an identity and a location in the image of the left hand and the right hand; creating a left hand sub-image and a right hand sub-image, where each of the sub-images is cropped from the image; providing the sub-images to a second neural network running on the computer, including horizontally flipping either the left hand sub-image or the right hand sub-image; analyzing the sub-images by the second neural network to determine three-dimensional (3D) coordinates of a plurality of key points on the left and right hands; and using the 3D coordinates of the key points to define gripper poses used to program the robot, including horizontally flipping the coordinates of the key points on either the left hand or the right hand. 18. The system according to claim 17 wherein the first neural network is trained to distinguish the left hand from the right hand in a training process where a plurality of training images are provided to the first neural network in which left and right hands are pre-identified, and where the first neural network analyzes the training images to identify distinguishing characteristics of left hands and right hands, including curvature and relative locations of digits. 19. The system according to claim 17 wherein horizontally flipping either the left hand sub-image or the right hand sub-image includes horizontally flipping the left hand sub-image when the second neural network is trained using training images of right hands, and horizontally flipping the right hand sub-image when the second neural network is trained using training images of left hands. 20. The system according to claim 19 wherein horizontally flipping the coordinates of the key points on either the left hand or the right hand includes horizontally flipping the coordinates of the key points on the hand which had its sub-image flipped before an

Assignees

Inventors

Classifications

  • B25J9/0081Primary

    with leader teach-in means · CPC title

  • Hardware, e.g. neural networks, fuzzy logic, interfaces, processor · CPC title

  • B25J9/1612Primary

    characterised by the hand, wrist, grip control · CPC title

  • by means of sensing devices, e.g. viewing or touching devices · CPC title

  • including video camera means · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11712797B2 cover?
A method for dual hand detection in robot teaching from human demonstration. A camera image of the demonstrator's hands and workpieces is provided to a first neural network which determines the identity of the left and right hand of the human demonstrator from the image, and also provides cropped sub-images of the identified hands. The first neural network is trained using images in which the l…
Who is the assignee on this patent?
Fanuc Corp
What technology area does this patent fall under?
Primary CPC classification B25J9/0081. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Tue Aug 01 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).