Unsupervised image-based anomaly detection using multi-scale context-dependent deep autoencoding gaussian mixture model

US10853937B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10853937-B2
Application numberUS-201916248955-A
CountryUS
Kind codeB2
Filing dateJan 16, 2019
Priority dateFeb 5, 2018
Publication dateDec 1, 2020
Grant dateDec 1, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A false alarm reduction system is provided that includes a processor cropping each input image at randomly chosen positions to form cropped images of a same size at different scales in different contexts. The system further includes a CONDA-GMM, having a first and a second conditional deep autoencoder for respectively (i) taking each cropped image without a respective center block as input for measuring a discrepancy between a reconstructed and a target center block, and (ii) taking an entirety of cropped images with the target center block. The CONDA-GMM constructs density estimates based on reconstruction error features and low-dimensional embedding representations derived from image encodings. The processor determines an anomaly existence based on a prediction of a likelihood of the anomaly existing in a framework of a CGMM, given the context being a representation of the cropped image with the center block removed and having a discrepancy above a threshold.

First claim

Opening claim text (preview).

What is claimed is: 1. A false alarm reduction system for reducing false alarms in an automatic defect detection system, the false alarm reduction system comprising: a processor for performing a cropping operation on each of input images at randomly chosen positions to form a set of cropped images of a same size at different scales in different contexts; and a CONtext-conditional Deep Autoencoding Gaussian Mixture Model (CONDA-GMM), having a first and a second conditional deep autoencoder for respectively (i) taking each of the cropped images without a respective center block as input for measuring a discrepancy between a reconstructed center block and the target center block, and (ii) taking an entirety of the cropped images with the target center block, the CONDA-GMM constructing density estimates based on both reconstruction error features and low-dimensional embedding representations derived from encodings of the cropped images, wherein the processor determines an existence of an anomaly based on a prediction of a likelihood of the anomaly existing in a framework of a context-dependent Conditional Gaussian Mixture Model (CGMM), given the context being a representation of the cropped image with the center block removed and having a discrepancy above a threshold amount. 2. The false alarm reduction system of claim 1 , wherein the CONDA-GMM comprises a context-dependent deep autoencoding compression network and a context-dependent density estimation network. 3. The false alarm reduction system of claim 1 , wherein each of the input images is cropped at multiple different randomly chosen positions. 4. The false alarm reduction system of claim 1 , wherein a cropping-specific contextual vector is formed using image features of the cropped images. 5. The false alarm reduction system of claim 4 , wherein the image features comprise Scale Invariant Feature Transformation (SIFT) features. 6. The false alarm reduction system of claim 4 , wherein the image features forming the cropping-specific contextual vector comprise edges of the cropped images, wherein the cropped edges have been cropped to be without a center region. 7. The false alarm reduction system of claim 1 , wherein the CONDA-GMM includes a first and a second conditional deep autoencoder for respectively taking each of the cropped images without a respective center block as input for measuring a discrepancy between a reconstructed center block and the target center block, and taking an entirety of the cropped images with the target center block. 8. The false alarm reduction system of claim 1 , wherein the likelihood is based on an energy level. 9. The false alarm reduction system of claim 1 , wherein both reconstruction errors and log-likelihood are used for the anomaly detection. 10. The false alarm reduction system of claim 1 , further comprising a Multi-Layer Perceptron, predicting an input-specific mixture coefficient conditioned on the representation. 11. A false alarm reduction method for reducing false alarms in an automatic defect detection system, the false alarm reduction method comprising: performing, by a processor, a cropping operation on each of input images at randomly chosen positions to form a set of cropped images of a same size at different scales in different contexts; encoding, by a first conditional deep autoencoder of a CONtext-conditional Deep Autoencoding Gaussian Mixture Model (CONDA-GMM), each of the cropped images without a respective center block as input for measuring a discrepancy between a reconstructed center block and the target center block, and encoding, by a second conditional deep autoencoder of the CONDA-GMM, an entirety of the cropped images with the target center block; constructing, by the CONDA-GMM, density estimates based on both reconstruction error features and low-dimensional embedding representations derived from encodings of the cropped images; and determining, by the processor, an existence of an anomaly based on a prediction of a likelihood of the anomaly existing in a framework of a context-dependent Conditional Gaussian Mixture Model (CGMM), given the context being a representation of the cropped image with the center block removed and having a discrepancy above a threshold amount. 12. The false alarm reduction method of claim 11 , wherein the CONDA-GMM comprises a context-dependent deep autoencoding compression network and a context-dependent density estimation network. 13. The false alarm reduction method of claim 11 , wherein each of the input images is cropped at multiple different randomly chosen positions. 14. The false alarm reduction method of claim 11 , further comprising forming a cropping-specific contextual vector using image features of the cropped images. 15. The false alarm reduction method of claim 14 , wherein the image features comprise Scale Invariant Feature Transformation (SIFT) features. 16. The false alarm reduction method of claim 14 , wherein the image features forming the cropping-specific contextual vector comprise edges of the cropped images, wherein the cropped edges have been cropped to be without a center region. 17. The false alarm reduction method of claim 11 , wherein the CONDA-GMM includes a first and a second conditional deep autoencoder for respectively taking each of the cropped images without a respective center block as input for measuring a discrepancy between a reconstructed center block and the target center block, and taking an entirety of the cropped images with the target center block. 18. The false alarm reduction method of claim 11 , wherein the likelihood is based on an energy level. 19. The false alarm reduction method of claim 11 , wherein both reconstruction errors and log-likelihood are used for the anomaly detection. 20. The false alarm reduction method of claim 11 , further comprising predicting, by a Multi-Layer Perceptron, an input-specific mixture coefficient conditioned on the representation.

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • G06T7/001Primary

    using an image reference approach · CPC title

  • G06T7/0004Primary

    Industrial image inspection · CPC title

  • Single-class perspective, e.g. one-against-all classification; Novelty detection; Outlier detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10853937B2 cover?
A false alarm reduction system is provided that includes a processor cropping each input image at randomly chosen positions to form cropped images of a same size at different scales in different contexts. The system further includes a CONDA-GMM, having a first and a second conditional deep autoencoder for respectively (i) taking each cropped image without a respective center block as input for …
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification G06T7/001. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 01 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).