Long short-term memory anomaly detection for multi-sensor equipment monitoring

US12067485B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12067485-B2
Application numberUS-201916580761-A
CountryUS
Kind codeB2
Filing dateSep 24, 2019
Priority dateSep 28, 2018
Publication dateAug 20, 2024
Grant dateAug 20, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and non-transitory computer readable medium are provided for long short-term memory (LSTM) anomaly detection for multi-sensor equipment monitoring. A method includes training a LSTM recurrent neural network (RNN) model for semiconductor processing fault detection. The training includes generating training data for the LSTM RNN model and providing the training data to train the LSTM RNN model on first training input and first target output to generate a trained LSTM RNN model for the semiconductor processing fault detection. The training data includes the first training input and the first target output based on normal runs of manufacturing processes of semiconductor processing equipment. Another method includes providing input based on runs of manufacturing processes of semiconductor processing equipment to a trained LSTM RNN model; obtaining one or more outputs from the trained LSTM RNN model; and using the one or more outputs for semiconductor processing fault detection.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: training a long short-term memory (LSTM) recurrent neural network (RNN) model for semiconductor processing fault detection, the training of the LSTM RNN model comprising: generating training data for the LSTM RNN model, wherein the training data comprises first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first training input by one or more windows of time, and wherein the first sensor data is associated with normal runs of semiconductor or display manufacturing processes of semiconductor processing equipment; and providing the training data to train the LSTM RNN model on the first training input and the first target output to generate a trained LSTM RNN model, wherein an anomaly response action for the semiconductor processing equipment is to occur responsive to one or more outputs of the trained LSTM RNN model. 2. The method of claim 1 further comprising: receiving, from a plurality of sensors, trace data corresponding to the normal runs of the semiconductor or display manufacturing processes of the semiconductor processing equipment; and time windowing the trace data to generate a plurality of sequenced data sets, wherein each of the plurality of sequenced data sets corresponds to a respective time window, wherein the first training input and the first target output are based on at least a subset of the plurality of sequenced data sets, wherein the semiconductor processing fault detection is associated with one or more of semiconductor manufacturing for wafers or display manufacturing. 3. The method of claim 2 , wherein the first training input comprises a first subset of the plurality of sequenced data sets at a first set of windows of time and a second subset of the plurality of sequenced data sets at a second set of windows of time, wherein each window of time of the second set of windows of time is offset from a corresponding window of time of the first set of windows of time by one or more windows of time. 4. The method of claim 2 , wherein the first target output is the same as the first training input, wherein the first training input comprises the plurality of sequenced data sets. 5. The method of claim 1 , wherein the LSTM RNN model comprises a plurality of layers of LSTM cells, wherein output of a first layer of the plurality of layers is input to a second layer of the plurality of layers. 6. The method of claim 1 , wherein the LSTM RNN model comprises an encoder and a decoder, wherein the encoder determines a compressed representation of the first training input, wherein the decoder uses the compressed representation to predict the first target output. 7. A method comprising: Providing input to a trained long short-term memory (LSTM) recurrent neural network (RNN) model, wherein the input is based on runs of semiconductor or display manufacturing processes of semiconductor processing equipment, the trained LSTM RNN model being trained using training data comprising first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first training input by one or more windows of time, and wherein the first sensor data is associated with normal runs of the semiconductor or display manufacturing processes of the semiconductor processing equipment; obtaining one or more outputs from the trained LSTM RNN model, the one or more outputs comprising reconstruction data; and causing an anomaly response action for the semiconductor processing equipment to occur responsive to the one or more outputs. 8. The method of claim 7 further comprising: receiving, from a plurality of sensors, trace data corresponding to the semiconductor or display manufacturing processes of the semiconductor processing equipment; and time windowing the trace data to generate a plurality of sequenced data sets, wherein each of the plurality of sequenced data sets corresponds to a respective time window, wherein the input comprises the plurality of sequenced data sets, wherein semiconductor processing fault detection is associated with one or more of semiconductor manufacturing for wafers or display manufacturing. 9. The method of claim 8 , wherein the input comprises the plurality of sequenced data sets at a first set of windows of time, wherein the reconstruction data comprises predicted sequenced data sets at a second set of windows of time, wherein each window of time of the second set of windows of time is offset from a corresponding window of time of the first set of windows of time by one or more windows of time. 10. The method of claim 7 , wherein the LSTM RNN model comprises a plurality of layers of LSTM cells, wherein output of a first layer of the plurality of layers is input to a second layer of the plurality of layers. 11. The method of claim 7 , wherein the LSTM RNN model comprises an encoder and a decoder, wherein the input comprises a current plurality of sequenced data sets, wherein the encoder determines a compressed representation of the input, wherein the decoder uses the compressed representation to predict a future plurality of sequenced data sets. 12. The method of claim 7 , wherein the causing of the anomaly response action comprises: comparing the input to the reconstruction data to generate model reconstruction error; and identifying an anomaly responsive to determining that the model reconstruction error is greater than a threshold error. 13. The method of claim 12 further comprising: generating a plurality of anomaly scores from the one or more outputs, wherein each of the plurality of anomaly scores corresponds to a respective sensor of a plurality of sensors; and ranking contribution to the model reconstruction error by each of the plurality of sensors based on the plurality of anomaly scores. 14. A non-transitory computer readable storage medium having instructions stored thereon, which, when executed by a processing device, cause the processing device to perform operations comprising: Providing input to a trained long short-term memory (LSTM) recurrent neural network (RNN) model, wherein the input is based on runs of semiconductor or display manufacturing processes of semiconductor processing equipment, the trained LSTM RNN model being trained using training data comprising first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first tr

Assignees

Inventors

Classifications

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Supervised learning · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12067485B2 cover?
Methods, systems, and non-transitory computer readable medium are provided for long short-term memory (LSTM) anomaly detection for multi-sensor equipment monitoring. A method includes training a LSTM recurrent neural network (RNN) model for semiconductor processing fault detection. The training includes generating training data for the LSTM RNN model and providing the training data to train the…
Who is the assignee on this patent?
Applied Materials Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 20 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).