Automatic Defect Classification Without Sampling and Feature Selection
US-2016163035-A1 · Jun 9, 2016 · US
US12067485B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12067485-B2 |
| Application number | US-201916580761-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 24, 2019 |
| Priority date | Sep 28, 2018 |
| Publication date | Aug 20, 2024 |
| Grant date | Aug 20, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, systems, and non-transitory computer readable medium are provided for long short-term memory (LSTM) anomaly detection for multi-sensor equipment monitoring. A method includes training a LSTM recurrent neural network (RNN) model for semiconductor processing fault detection. The training includes generating training data for the LSTM RNN model and providing the training data to train the LSTM RNN model on first training input and first target output to generate a trained LSTM RNN model for the semiconductor processing fault detection. The training data includes the first training input and the first target output based on normal runs of manufacturing processes of semiconductor processing equipment. Another method includes providing input based on runs of manufacturing processes of semiconductor processing equipment to a trained LSTM RNN model; obtaining one or more outputs from the trained LSTM RNN model; and using the one or more outputs for semiconductor processing fault detection.
Opening claim text (preview).
What is claimed is: 1. A method comprising: training a long short-term memory (LSTM) recurrent neural network (RNN) model for semiconductor processing fault detection, the training of the LSTM RNN model comprising: generating training data for the LSTM RNN model, wherein the training data comprises first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first training input by one or more windows of time, and wherein the first sensor data is associated with normal runs of semiconductor or display manufacturing processes of semiconductor processing equipment; and providing the training data to train the LSTM RNN model on the first training input and the first target output to generate a trained LSTM RNN model, wherein an anomaly response action for the semiconductor processing equipment is to occur responsive to one or more outputs of the trained LSTM RNN model. 2. The method of claim 1 further comprising: receiving, from a plurality of sensors, trace data corresponding to the normal runs of the semiconductor or display manufacturing processes of the semiconductor processing equipment; and time windowing the trace data to generate a plurality of sequenced data sets, wherein each of the plurality of sequenced data sets corresponds to a respective time window, wherein the first training input and the first target output are based on at least a subset of the plurality of sequenced data sets, wherein the semiconductor processing fault detection is associated with one or more of semiconductor manufacturing for wafers or display manufacturing. 3. The method of claim 2 , wherein the first training input comprises a first subset of the plurality of sequenced data sets at a first set of windows of time and a second subset of the plurality of sequenced data sets at a second set of windows of time, wherein each window of time of the second set of windows of time is offset from a corresponding window of time of the first set of windows of time by one or more windows of time. 4. The method of claim 2 , wherein the first target output is the same as the first training input, wherein the first training input comprises the plurality of sequenced data sets. 5. The method of claim 1 , wherein the LSTM RNN model comprises a plurality of layers of LSTM cells, wherein output of a first layer of the plurality of layers is input to a second layer of the plurality of layers. 6. The method of claim 1 , wherein the LSTM RNN model comprises an encoder and a decoder, wherein the encoder determines a compressed representation of the first training input, wherein the decoder uses the compressed representation to predict the first target output. 7. A method comprising: Providing input to a trained long short-term memory (LSTM) recurrent neural network (RNN) model, wherein the input is based on runs of semiconductor or display manufacturing processes of semiconductor processing equipment, the trained LSTM RNN model being trained using training data comprising first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first training input by one or more windows of time, and wherein the first sensor data is associated with normal runs of the semiconductor or display manufacturing processes of the semiconductor processing equipment; obtaining one or more outputs from the trained LSTM RNN model, the one or more outputs comprising reconstruction data; and causing an anomaly response action for the semiconductor processing equipment to occur responsive to the one or more outputs. 8. The method of claim 7 further comprising: receiving, from a plurality of sensors, trace data corresponding to the semiconductor or display manufacturing processes of the semiconductor processing equipment; and time windowing the trace data to generate a plurality of sequenced data sets, wherein each of the plurality of sequenced data sets corresponds to a respective time window, wherein the input comprises the plurality of sequenced data sets, wherein semiconductor processing fault detection is associated with one or more of semiconductor manufacturing for wafers or display manufacturing. 9. The method of claim 8 , wherein the input comprises the plurality of sequenced data sets at a first set of windows of time, wherein the reconstruction data comprises predicted sequenced data sets at a second set of windows of time, wherein each window of time of the second set of windows of time is offset from a corresponding window of time of the first set of windows of time by one or more windows of time. 10. The method of claim 7 , wherein the LSTM RNN model comprises a plurality of layers of LSTM cells, wherein output of a first layer of the plurality of layers is input to a second layer of the plurality of layers. 11. The method of claim 7 , wherein the LSTM RNN model comprises an encoder and a decoder, wherein the input comprises a current plurality of sequenced data sets, wherein the encoder determines a compressed representation of the input, wherein the decoder uses the compressed representation to predict a future plurality of sequenced data sets. 12. The method of claim 7 , wherein the causing of the anomaly response action comprises: comparing the input to the reconstruction data to generate model reconstruction error; and identifying an anomaly responsive to determining that the model reconstruction error is greater than a threshold error. 13. The method of claim 12 further comprising: generating a plurality of anomaly scores from the one or more outputs, wherein each of the plurality of anomaly scores corresponds to a respective sensor of a plurality of sensors; and ranking contribution to the model reconstruction error by each of the plurality of sensors based on the plurality of anomaly scores. 14. A non-transitory computer readable storage medium having instructions stored thereon, which, when executed by a processing device, cause the processing device to perform operations comprising: Providing input to a trained long short-term memory (LSTM) recurrent neural network (RNN) model, wherein the input is based on runs of semiconductor or display manufacturing processes of semiconductor processing equipment, the trained LSTM RNN model being trained using training data comprising first training input and first target output, wherein the first training input comprises a first window of time of first sensor data from a first plurality of sensors, wherein the first target output comprises at least one of the first window of time of the first sensor data from the first plurality of sensors or a second window of time of the first sensor data from the first plurality of sensors, wherein the first target output is same as the first training input or the first target output is offset from the first tr
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title
Auto-encoder networks; Encoder-decoder networks · CPC title
Supervised learning · CPC title
characterised by the process organisation or structure, e.g. boosting cascade · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.