Machine learning device and machine learning method for learning optimal object grasp route

US10692018B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10692018-B2
Application number  US-201715708130-A
Country             US
Kind code           B2
Filing date         Sep 19, 2017
Priority date       Sep 27, 2016
Publication date    Jun 23, 2020
Grant date          Jun 23, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A machine learning device according to the present invention learns an operation condition of a robot that stores a plurality of objects disposed on a carrier device in a container using a hand for grasping the objects. The machine learning device includes a state observation unit for observing the positions and postures of the objects and a state variable including at least one of cycle time to store the objects in the container and torque and vibration occurring when the robot grasps the objects during operation of the robot; a determination data obtaining unit for obtaining determination data for determining a margin of each of the cycle time, the torque, and the vibration against an allowance value; and a learning unit for learning the operation condition of the robot in accordance with a training data set constituted of a combination of the state variable and the determination data.
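The margin check described in the abstract can be illustrated with a minimal Python sketch. It assumes a margin is simply the allowance value minus the observed value, so a non-negative margin means the quantity is within its allowance. The class names, field names, and units are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StateVariable:
    """Observed quantities for one storing cycle (hypothetical field names and units)."""
    cycle_time_s: float
    torque_nm: float
    vibration_m_s2: float


@dataclass(frozen=True)
class Allowance:
    """Allowance value for each observed quantity (hypothetical field names and units)."""
    cycle_time_s: float
    torque_nm: float
    vibration_m_s2: float


def margins(state: StateVariable, allowance: Allowance) -> dict:
    """Return the margin of each quantity, assumed here to be allowance minus observation."""
    return {
        "cycle_time": allowance.cycle_time_s - state.cycle_time_s,
        "torque": allowance.torque_nm - state.torque_nm,
        "vibration": allowance.vibration_m_s2 - state.vibration_m_s2,
    }


def within_allowance(margin_by_name: dict) -> bool:
    """True when every quantity is equal to or less than its allowance value."""
    return all(value >= 0 for value in margin_by_name.values())


# Example: the torque exceeds its allowance, so the cycle fails the check.
example = margins(StateVariable(10.0, 3.0, 0.5), Allowance(12.0, 2.5, 1.0))
print(example, within_allowance(example))
```

In this sketch the dictionary of margins plays the role of the determination data, and pairing it with the observed state gives one entry of a training data set.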

First claim

Opening claim text (preview).

What is claimed is:

1. A machine learning device for learning an operation condition of a robot that stores a plurality of objects disposed on a carrier device in a container using a hand for grasping the plurality of objects, the machine learning device comprising: a hardware processor configured to observe positions and postures of the plurality of objects and a state variable including cycle time to store the plurality of objects in the container and torque and vibration occurring when the robot grasps the plurality of objects, during operation of the robot, obtain determination data for determining a margin of each of the cycle time, the torque, and the vibration against a respective allowance value, learn the operation condition of the robot in accordance with a training data set constituted of a combination of the state variable and the determination data, and in response to the cycle time, the torque, and the vibration being equal to or less than the allowance values, cause the hand to grasp the plurality of objects in a grasp order to minimize the cycle time.

2. The machine learning device according to claim 1, wherein the cycle time is a time from when the robot begins storing the plurality of objects in the container until the robot completes storing the plurality of objects in the container.

3. The machine learning device according to claim 1, wherein the torque is calculated based on a current flowing through a motor for driving the robot.

4. The machine learning device according to claim 1, wherein the vibration is calculated based on an acceleration detected by an acceleration sensor provided in the hand.

5. The machine learning device according to claim 1, the hardware processor is configured to determine the grasp order of the plurality of objects based on a learning result in accordance with the training data set.

6. The machine learning device according to claim 1, wherein the machine learning device is connected to the robot through a network, and the hardware processor is configured to obtain a present state variable through the network.

7. The machine learning device according to claim 1, wherein the machine learning device is installed in a cloud server.

8. A machine learning device, for learning an operation condition of a robot that stores a plurality of objects disposed on a carrier device in a container using a hand for grasping the plurality of objects, the machine learning device comprising: a hardware processor configured to observe positions and postures of the plurality of objects and a state variable including cycle time to store the plurality of objects in the container and torque and vibration occurring when the robot grasps the plurality of objects, during operation of the robot, obtain determination data for determining a margin of each of the cycle time, the torque, and the vibration against a respective allowance value, learn the operation condition of the robot in accordance with a training data set constituted of a combination of the state variable and the determination data, calculate a reward based on the determination data, update a value function used for estimating a grasp order of the plurality of objects so as to reduce at least one of the cycle time, the torque, or the vibration, based on the reward, update an action-value table that corresponds to the grasp order of the plurality of objects, based on the state variable of the cycle time, the torque, and the vibration and the reward, and update an action-value table that corresponds to at least one of the cycle time, the torque, or the vibration when another robot having a same configuration as the robot stores other objects in a container, based on a state variable of the another robot and the reward.

9. The machine learning device according to claim 8, wherein the hardware processor is configured to calculate the reward based on at least one of the cycle time, the torque, or the vibration.

10. A machine learning method for learning an operation condition of a robot that stores a plurality of objects disposed on a carrier device in a container using a hand for grasping the plurality of objects, the machine learning method comprising the steps of: observing positions and postures of the plurality of objects and a state variable including cycle time to store the plurality of objects in the container and torque and vibration occurring when the robot grasps the plurality of objects, during operation of the robot; obtaining determination data for determining a margin of each of the cycle time, the torque, and the vibration against a respective allowance value; learning the operation condition of the robot in accordance with a training data set constituted of a combination of the state variable and the determination data; and in response to the cycle time, the torque, and the vibration being equal to or less than the allowance values, causing the hand to grasp the plurality of objects in a grasp order to minimize the cycle time.
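The reward calculation and action-value table update recited in claim 8 can be sketched as follows. The claims do not fix an update rule, so this example assumes standard tabular Q-learning. The reward scheme (+1 when every margin is non-negative, otherwise -1), the learning rate, the discount factor, the state labels, and all names are illustrative assumptions rather than details taken from the patent.

```python
from collections import defaultdict
from itertools import permutations


def reward_from_margins(margin_by_name: dict) -> float:
    """Hypothetical reward: +1 if every margin is non-negative, otherwise -1."""
    return 1.0 if all(m >= 0 for m in margin_by_name.values()) else -1.0


class GraspOrderLearner:
    """Tabular Q-learning over grasp orders (assumed update rule, not from the claims)."""

    def __init__(self, alpha: float = 0.5, gamma: float = 0.9):
        self.alpha = alpha  # learning rate (assumed value)
        self.gamma = gamma  # discount factor (assumed value)
        # Action-value table: (state, grasp_order) -> estimated value, default 0.0.
        self.q = defaultdict(float)

    def update(self, state, order, reward, next_state, next_orders):
        """Apply Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
        best_next = max((self.q[(next_state, o)] for o in next_orders), default=0.0)
        key = (state, order)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])

    def best_order(self, state, orders):
        """Return the grasp order with the highest learned value in this state."""
        return max(orders, key=lambda o: self.q[(state, o)])


# Example: three objects on the carrier device, one rewarded trial.
orders = list(permutations(("p1", "p2", "p3")))
learner = GraspOrderLearner()
r = reward_from_margins({"cycle_time": 2.0, "torque": 0.1, "vibration": 0.5})
learner.update("s0", ("p1", "p2", "p3"), r, "s1", orders)
print(learner.best_order("s0", orders))
```

Because the table is keyed by state and grasp order, trials reported by another robot with the same configuration could be fed through the same `update` call, which is one possible reading of the shared-table element of claim 8.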

Assignees

Fanuc Corp

Inventors

Classifications

  • B25J9/163 (Primary)

    learning, adaptive, model based, rule based expert control · CPC title

  • G06N20/00 (Primary)

    Machine learning · CPC title

  • Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS] · CPC title

  • Optimize sequence of pick and place operations upon arrival of workpiece on conveyor · CPC title

  • Conveyor, pick up article, object from conveyor, bring to test unit, place it · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10692018B2 cover?
A machine learning device according to the present invention learns an operation condition of a robot that stores a plurality of objects disposed on a carrier device in a container using a hand for grasping the objects. The machine learning device includes a state observation unit for observing the positions and postures of the objects and a state variable including at least one of cycle time t…
Who is the assignee on this patent?
Fanuc Corp
What technology area does this patent fall under?
Primary CPC classification B25J9/163. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Jun 23, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).