Learning data selection method, learning data selection device, and computer-readable recording medium

US11501203B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11501203-B2
Application numberUS-201816135446-A
CountryUS
Kind codeB2
Filing dateSep 19, 2018
Priority dateSep 19, 2017
Publication dateNov 15, 2022
Grant dateNov 15, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A non-transitory computer-readable recording medium stores therein a learning data selection program that causes a computer to execute a process including: extracting a first input data group relating to first input data in correspondence with designation of the first input data included in an input data group input to a machine learning model, the machine learning model classifying or determining transformed data that is transformed from input data; acquiring a first transformed data group of the machine learning model and a first output data group of the machine learning model, respectively, the first transformed data group being input to the machine learning model and corresponding to the first input data group, the first output data group corresponding to the first transformed data group; and selecting learning target data of an estimation model from the first input data group.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer-readable recording medium storing therein a learning data selection program that causes a computer to execute a process comprising: extracting a first input data group relating to first input data in response to designation of the first input data included in an input data group input to a machine learning model, in accordance with a reference in which a collection timing of each data of the first input data group has a predetermined relationship with the first input data; acquiring a first transformed data group generated by transforming the first input data group by the machine learning model, the first transformed data group including first transformed data based on the first input data; calculating a first set of distances between the first input data and each data of the first input data group other than the first input data and a second set of distances between the first transformed data and each data of the first transformed data group other than the first transformed data; selecting learning target data from the first input data group based on the first set of distances and the second set of distances; and performing, based on the selected learning target data, machine learning of another machine learning model configured to estimate a factor which causes the machine learning model to output a classification result. 2. The non-transitory computer-readable recording medium according to claim 1 , wherein the learning target data is extracted based on data contents of each piece of output data included in the first output data group. 3. The non-transitory computer-readable recording medium according to claim 2 , wherein the learning target data is extracted based on a ratio of the data contents included in the first output data group. 4. The non-transitory computer-readable recording medium according to claim 3 , wherein the learning target data is extracted based on a ratio between a positive example and a negative example of the data contents included in the first output data group. 5. The non-transitory computer-readable recording medium according to claim 1 , wherein input data in which a data acquisition timing has a predetermined relationship with the first input data is specified as the first input data group. 6. The non-transitory computer-readable recording medium according to claim 1 , wherein input data in which a data generation source has a predetermined relationship with the first input data is specified as the first input data group. 7. The non-transitory computer-readable recording medium according to claim 1 , wherein the distance between the first input data and each piece of data of the first input data group is calculated by individually transforming the first input data and the data of the first input data group. 8. A learning data selection method comprising: extracting a first input data group relating to first input data in response to designation of the first input data included in an input data group input to a machine learning model, in accordance with a reference in which a collection timing of each data of the first input data group has a predetermined relationship with the first input data; acquiring a first transformed data group generated by transforming the first input data group by the machine learning model, the first transformed data group including first transformed data based on the first input data; calculating a first set of distances between the first input data and each data of the first input data group other than the first input data and a second set of distances between the first transformed data and each data of the first transformed data group other than the first transformed data; selecting learning target data from the first input data group based on the first set of distances and the second set of distances; and performing, based on the selected learning target data, machine learning of another machine learning model configured to estimate a factor which causes the machine learning model to output a classification result, by a processor. 9. A learning data selection device comprising: a processor configured to: extract a first input data group relating to first input data in response to designation of the first input data included in an input data group input to a machine learning model, in accordance with a reference in which a collection timing of each data of the first input data group has a predetermined relationship with the first input data; acquire a first transformed data group generated by transforming the first input data group by the machine learning model, the first transformed data group including first transformed data based on the first input data; calculate a first set of distances between the first input data and each data of the first input data group other than the first input data and a second set of distances between the first transformed data and each data of the first transformed data group other than the first transformed data; select learning target data from the first input data group based on the first set of distances and the second set of distances; and perform, based on the selected learning target data, machine learning of another machine learning model configured to estimate a factor which causes the machine learning model to output a classification result.

Assignees

Inventors

Classifications

  • based on naturality criteria, e.g. with non-negative factorisation or negative correlation · CPC title

  • Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system · CPC title

  • Classification techniques · CPC title

  • G06F18/22Primary

    Matching criteria, e.g. proximity measures · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11501203B2 cover?
A non-transitory computer-readable recording medium stores therein a learning data selection program that causes a computer to execute a process including: extracting a first input data group relating to first input data in correspondence with designation of the first input data included in an input data group input to a machine learning model, the machine learning model classifying or determin…
Who is the assignee on this patent?
Fujitsu Ltd
What technology area does this patent fall under?
Primary CPC classification G06F18/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 15 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).