Landmark-based classification model updating

US11620518B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11620518-B2
Application numberUS-202016866885-A
CountryUS
Kind codeB2
Filing dateMay 5, 2020
Priority dateMay 13, 2019
Publication dateApr 4, 2023
Grant dateApr 4, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for updating a classification model of a neural network. The methods include selecting, as a set of landmarks, a limited number of data from a set of historical data used to train a classification model. Additionally, the methods generate new training data from recently collected data. Further, the methods update the classification model with the new training data and the set of landmarks to obtain an updated classification model having a loss function configured to capture similarities in the new training data and remember similarities in the historical data represented by the set of landmarks within a predefined tolerance.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for updating a classification model of a neural network, comprising: selecting, as a set of landmarks, a limited number of data from a set of historical data used to train a classification model; generating new training data from recently collected data; and updating the classification model with the new training data and the set of landmarks to obtain an updated classification model having a loss function configured to capture similarities in the new training data and remember similarities in the historical data represented by the set of landmarks within a predefined tolerance, wherein a landmark component of the loss function (landmark loss ) for a data point (X) is determined as follows: landmark loss ( X )= d ( f S ( X ), f T ( X )), where d represents a Euclidean similarity distance d between a representation of the data point (X) in a historical mapping f s (X) and in a new mapping f T (X). 2. The method as in claim 1 , wherein the landmark component of the loss function includes Euclidean distances between representations of each landmark of the set of landmarks in the classification model and the updated classification model. 3. The method as in claim 1 , wherein selecting landmarks includes iteratively selecting landmarks. 4. The method as in claim 3 , wherein at each iterative step a probability of selecting a particular data segment as a landmark of the set of landmarks is proportional to its minimum squared distance to landmarks selected in previous iterative steps. 5. The method as in claim 1 , wherein the set of landmarks are divided into multiple subsets of landmarks, a different subset of landmarks being used for each epoch of the updating. 6. The method as in claim 1 , wherein the set of landmarks is used for each epoch of the updating. 7. The method as in claim 1 , wherein the classification model receives time series data from sensors deployed to monitor operations at a powerplant. 8. The method as in claim 1 , wherein the classification model receives time series data from one or more microphones coupled to a speech recognition system. 9. A neural network system comprising: a non-transitory computer readable storage medium embodying computer readable instructions; and a processor device configured to implement a classification model based on the computer readable instructions, the processor further configured to update the classification model by implementing: a selection module configured to select, as a set of landmarks, a limited number of data from a set of historical data used to train the classification model; and a model updating module configured to update the classification model with new training data and the set of landmarks to obtain an updated classification model having a loss function configured to capture similarities in the new training data and remember similarities in the historical data represented by the set of landmarks within a predefined tolerance, wherein a landmark component of the loss function (landmark loss ) for a data point (X) is determined as follows: landmark loss ( X )= d ( f S ( X ), f T ( X )), where d represents a Euclidean similarity distance d between a representation of the data point (X) in a historical mapping f x (X) and in a new f T (X). 10. The neural network system as in claim 9 , wherein the landmark component of the loss function includes Euclidean distances between representations of each landmark of the set of landmarks in the classification model and the updated classification model. 11. The neural network system as in claim 9 , wherein the selection module includes iteratively selecting landmarks. 12. The neural network system as in claim 11 , wherein at each iterative step a probability of selecting a particular data segment as a landmark of the set of landmarks is proportional to its minimum squared distance to landmarks selected in previous iterative steps. 13. The neural network system as in claim 9 , wherein the set of landmarks are divided into multiple subsets of landmarks, a different subset of landmarks being used for each epoch of the update. 14. The neural network system as in claim 9 , wherein the set of landmarks is used for each epoch of the update. 15. A non-transitory computer readable storage medium comprising a computer readable program for updating a classification model of a neural network, wherein the computer readable program when executed on a computer causes the computer to perform the method comprising: selecting, as a set of landmarks, a limited number of data from a set of historical data used to train a classification model; generating new training data from recently collected data; and updating the classification model with the new training data to obtain an updated classification model having a loss function configured to capture similarities in the new training data and remember similarities in the historical data represented by the set of landmarks within a predefined tolerance, wherein a landmark component of the loss function (landmark loss ) for a data point (X) is determined as follows: landmark loss ( X )= d ( f S ( X ), f T ( X )), where d represents a :Euclidean similarity distance d between a representation of the data point (X) in a historical mapping f s (X) and in a new mapning f T (X). 16. The non-transitory computer readable storage medium as in claim 15 , wherein the landmark component of the loss function includes Euclidean distances between representations of each landmark of the set of landmarks in the classification model and the updated classification model. 17. The non-transitory computer readable storage medium as in claim 15 , wherein selecting landmarks includes iteratively selecting landmarks. 18. The non-transitory computer readable storage medium as in claim 17 , wherein at each iterative step a probability of selecting a particular data segment as a landmark of the set of landmarks is proportional to its minimum squared distance to landmarks selected in previous iterative steps. 19. The non-transitory computer readable storage medium as in claim 15 , wherein the set of landmarks are divided into multiple subsets of landmarks, a different subset of landmarks being used for each epoch of the updating. 20. The non-transitory computer readable storage medium as in claim 15 , wherein the set of landmarks is used for each epoch of the updating.

Assignees

Inventors

Classifications

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Supervised learning · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11620518B2 cover?
Systems and methods for updating a classification model of a neural network. The methods include selecting, as a set of landmarks, a limited number of data from a set of historical data used to train a classification model. Additionally, the methods generate new training data from recently collected data. Further, the methods update the classification model with the new training data and the se…
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 04 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).