Methods for an autonomous robotic device to identify locations captured in an image
US-10810427-B1 · Oct 20, 2020 · US
US11288883B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11288883-B2 |
| Application number | US-201916570618-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 13, 2019 |
| Priority date | Jul 23, 2019 |
| Publication date | Mar 29, 2022 |
| Grant date | Mar 29, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for controlling a robotic device is presented. The method includes capturing an image corresponding to a current view of the robotic device. The method also includes identifying a keyframe image comprising a first set of pixels matching a second set of pixels of the image. The method further includes performing, by the robotic device, a task corresponding to the keyframe image.
Opening claim text (preview).
What is claimed is: 1. A method for controlling a robotic device, comprising: capturing an image corresponding to a current view of the robotic device; identifying a keyframe image based on a red-green-blue (RGB) value of each pixel of a first set of pixels of the keyframe image matching an RGB value of a corresponding pixel of a second set of pixels of the image; determining one or both of: a first difference between a first pose of the robotic device in relation to a first object in the keyframe image and a second pose of the robotic device in relation to a second object in the image; or a second difference between a first distance of the robotic device to the first object and a second distance of the robotic device to the second object; adjusting one or more of a velocity or a position associated with a task to be performed by an effecter of the robotic device based on determining one or both of the first difference or the second difference, the task being associated with the keyframe image; and performing, via the effecter of the robotic device, the task based on adjusting one or both of the velocity or the position. 2. The method of claim 1 , further comprising capturing the keyframe image while the robotic device is trained to perform the task. 3. The method of claim 1 , in which each pixel of the first set of pixels and the second set of pixels is associated with a pixel descriptor. 4. The method of claim 3 , in which each pixel descriptor comprises a set of values corresponding to pixel level information and depth information, the pixel level information comprising an RGB value of the pixel associated with the pixel descriptor. 5. The method of claim 1 , in which the task comprises at least one of interacting with an object, navigating through an environment, or a combination thereof. 6. The method of claim 1 , in which an area corresponding to the first set of pixels is selected by a user. 7. The method of claim 1 , in which the robotic device is trained to perform the task based on a human demonstration. 8. A robotic device, comprising: a memory; and at least one processor, the at least one processor configured: to capture an image corresponding to a current view of the robotic device; to identify a keyframe image based on a red-green-blue (RGB) value of each pixel of a first set of pixels of the keyframe image matching an RGB value of a corresponding pixel of a second set of pixels of the image; to determine one or both of: a first difference between a first pose of the robotic device in relation to a first object in the keyframe image and a second pose of the robotic device in relation to a second object in the image; or a second difference between a first distance of the robotic device to the first object and a second distance of the robotic device to the second object; to adjust one or more of a velocity or a position associated with a task to be performed by an effecter of the robotic device based on determining one or both of the first difference or the second difference, the task being associated with the keyframe image; and to perform, via the effecter of the robotic device, the task based on adjusting one or both of the velocity or the position. 9. The robotic device of claim 8 , in which the at least one processor is further configured to capture the keyframe image while the robotic device is trained to perform the task. 10. The robotic device of claim 8 , in which each pixel of the first set of pixels and the second set of pixels is associated with a pixel descriptor. 11. The robotic device of claim 10 , in which each pixel descriptor comprises a set of values corresponding to pixel level information and depth information, the pixel level information comprising an RGB value of the pixel associated with the pixel descriptor. 12. The robotic device of claim 8 , in which the task comprises at least one of interacting with an object, navigating through an environment, or a combination thereof. 13. The robotic device of claim 8 , in which an area corresponding to the first set of pixels is selected by a user. 14. The robotic device of claim 8 , in which the robotic device is trained to perform the task based on a human demonstration. 15. A non-transitory computer-readable medium having program code recorded thereon for controlling a robotic device, the program code comprising: program code to capture an image corresponding to a current view of the robotic device; program code to identify a keyframe image based on a red-green-blue (RGB) value of each pixel of a first set of pixels of the keyframe image matching an RGB value of a corresponding pixel of a second set of pixels of the image; program code to determine one or both of: a first difference between a first pose of the robotic device in relation to a first object in the keyframe image and a second pose of the robotic device in relation to a second object in the image; or a second difference between a first distance of the robotic device to the first object and a second distance of the robotic device to the second object; program code to adjust one or more of a velocity or a position associated with a task to be performed by an effecter of the robotic device based on determining one or both of the first difference or the second difference, the task being associated with the keyframe image; and program code to perform, via the effecter of the robotic device, the task based on adjusting one or both of the velocity or the position. 16. The non-transitory computer-readable medium of claim 15 , in which the program code further comprises program code to capture the keyframe image while the robotic device is trained to perform the task. 17. The non-transitory computer-readable medium of claim 15 , in which each pixel of the first set of pixels and the second set of pixels is associated with a pixel descriptor. 18. The non-transitory computer-readable medium of claim 17 , in which each pixel descriptor comprises a set of values corresponding to pixel level information and depth information, the pixel level information comprising an RGB value of the pixel associated with the pixel descriptor. 19. The non-transitory computer-readable medium of claim 15 , in which the task comprises at least one of interacting with an object, navigating through an environment, or a combination thereof. 20. The non-transitory computer-readable medium of claim 15 , in which the robotic device is trained to perform the task based on a human demonstration.
Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
using neural networks · CPC title
using classification, e.g. of video objects · CPC title
Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries · CPC title
Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.