Machine learning apparatus, servo control apparatus, servo control system, and machine learning method

US10747193B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10747193-B2
Application numberUS-201815997043-A
CountryUS
Kind codeB2
Filing dateJun 4, 2018
Priority dateJun 22, 2017
Publication dateAug 18, 2020
Grant dateAug 18, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

To perform reinforcement learning enabling to prevent complicated adjustment of coefficients of backlash compensation and backlash acceleration compensation. A machine learning apparatus includes a state information acquiring part for acquiring, from a servo control apparatus, state information including at least position deviation and a set of coefficients to be used by a backlash acceleration compensating part, by making the servo control apparatus execute a predetermined machining program, an action information output part for outputting action information including adjustment information on the set of coefficients included in the state information to the servo control apparatus, a reward output part for outputting a reward value in the reinforcement learning on the basis of the position deviation included in the state information, and a value function updating part for updating an action-value function on the basis of the reward value output by the reward output part, the state information and the action information.

First claim

Opening claim text (preview).

What is claimed is: 1. A machine learning apparatus for performing reinforcement learning to a servo control apparatus using a backlash compensation parameter and a backlash acceleration compensation parameter, the servo control apparatus creating a backlash compensation value compensating a position command or a position deviation, the servo control apparatus creating a backlash acceleration compensation value compensating a speed command, the machine learning apparatus comprising: a processor; and a non-transitory memory having stored thereon executable instructions, which when executed, cause the processor to perform: outputting, to the servo control apparatus, action information including adjustment information on the backlash compensation parameter and the backlash acceleration compensation parameter; acquiring, from the servo control apparatus, state information including position deviation and the backlash compensation parameter and the backlash acceleration compensation parameter, the position deviation being obtained from the position command and a fed-back position, at a time of making the servo control apparatus execute a predetermined machining program on the basis of the action information; outputting a reward value in the reinforcement learning on the basis of the position deviation included in the state information; and updating an action-value function on the basis of the reward value, the state information, and the action information, wherein the reinforcement learning is performed for the backlash compensation parameter, and then the reinforcement learning is performed for the backlash acceleration compensation parameter. 2. The machine learning apparatus according to claim 1 , wherein the processor outputs the reward value on the basis of an absolute value of the position deviation. 3. The machine learning apparatus according to claim 1 , wherein the executable instructions further cause the processor to perform generating and outputting the backlash compensation parameter and the backlash acceleration compensation parameter on the basis of the updated action-value function. 4. The servo control apparatus including the machine learning apparatus according to claim 1 . 5. A servo control system including the machine learning apparatus and the servo control apparatus according to claim 1 . 6. A machine learning method for a machine learning apparatus to perform reinforcement learning to a servo control apparatus using a backlash compensation parameter and a backlash acceleration compensation parameter, the servo control apparatus creating a backlash compensation value compensating a position command or a position deviation, the servo control apparatus creating a backlash acceleration compensation value compensating a speed command, the machine learning method comprising: outputting, to the servo control apparatus, action information including adjustment information on the backlash compensation parameter and the backlash acceleration compensation parameter; acquiring, from the servo control apparatus, state information including position deviation and the backlash compensation parameter and the backlash acceleration compensation parameter, the position deviation being obtained from the position command and a fed-back position, at a time of making the servo control apparatus execute a predetermined machining program on the basis of the action information; outputting a reward value in the reinforcement learning on the basis of the position deviation included in the state information; and updating an action-value function on the basis of the reward value, the state information, and the action information, wherein the reinforcement learning is performed for the backlash compensation parameter, and then the reinforcement learning is performed for the backlash acceleration compensation parameter.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Reinforcement learning · CPC title

  • Feedforward networks · CPC title

  • based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10747193B2 cover?
To perform reinforcement learning enabling to prevent complicated adjustment of coefficients of backlash compensation and backlash acceleration compensation. A machine learning apparatus includes a state information acquiring part for acquiring, from a servo control apparatus, state information including at least position deviation and a set of coefficients to be used by a backlash acceleration…
Who is the assignee on this patent?
Fanuc Corp
What technology area does this patent fall under?
Primary CPC classification G05B19/404. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 18 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).