What technology area does this patent fall under?

Primary CPC classification G06N3/045. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 07 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).

Method for reproducibility of deep learning classifiers using ensembles

US11574166B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11574166-B2
Application number	US-202016886344-A
Country	US
Kind code	B2
Filing date	May 28, 2020
Priority date	May 28, 2020
Publication date	Feb 7, 2023
Grant date	Feb 7, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Example implementations described herein involve systems and methods for generating an ensemble of deep learning or neural network models, which can involve, for a training set of data, generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of generating an ensemble of deep learning or neural network models, the method comprising: for a training set of data: generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models, wherein the generating the plurality of model samples for the training set of data comprises: executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 2. The method of claim 1 , wherein the generating the plurality of model samples for the training set of data further comprises executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 3. The method of claim 1 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation. 4. A non-transitory computer readable medium, storing instructions of generating an ensemble of deep learning or neural network models, the instructions comprising: for a training set of data: generating a plurality of model samples for the training set of data, the plurality of model samples generated using sampling of trained learners generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models, wherein generating the plurality of model samples for the training set of data comprises: executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 5. The non-transitory computer readable medium of claim 4 , wherein the generating the plurality of model samples for the training set of data further comprises executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 6. The non-transitory computer readable medium of claim 4 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation. 7. An apparatus configured to generate an ensemble of deep learning or neural network models, the apparatus comprising: a processor, configured to: for a training set of data: generate a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregate output of the model samples to generate an output of the ensemble models, wherein the processor is configured to generate the plurality of model samples for the training set of data by, executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 8. The apparatus of claim 7 , wherein the processor is configured to generate the plurality of model samples for the training set of data by further executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 9. The apparatus of claim 7 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation.

Assignees

Hitachi Ltd

Inventors

Classifications

G06N3/098
Distributed learning, e.g. federated learning · CPC title
G06N3/09
Supervised learning · CPC title
G06N3/0464
Convolutional networks [CNN, ConvNet] · CPC title
G06N3/0442
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
G06N3/082
modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

View patent family 78706374

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11574166B2 cover?: Example implementations described herein involve systems and methods for generating an ensemble of deep learning or neural network models, which can involve, for a training set of data, generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generat…
Who is the assignee on this patent?: Hitachi Ltd
What technology area does this patent fall under?: Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 07 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).