System and method for generating training cases for image classification

US9251437B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9251437-B2
Application numberUS-201313970869-A
CountryUS
Kind codeB2
Filing dateAug 20, 2013
Priority dateDec 24, 2012
Publication dateFeb 2, 2016
Grant dateFeb 2, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for generating training images. An existing training image is associated with a classification. The system includes an image processing module that performs color-space deformation on each pixel of the existing training image and then associates the classification to the color-space deformed training image. The technique may be applied to increase the size of a training set for training a neural network.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method performed by one or more computers, the method comprising: obtaining training data for a neural network, wherein the training data comprises a plurality of base training images and respective classification data for each of the base training images, and wherein the neural network is configured to receive an input image and predict classification data for the input image, wherein each image comprises data representing pixels having a respective color; generating one or more color-deformed images from the base training images of the training data, the generating comprising, for each of the plurality of base training images: performing a principal component analysis (PCA) on pixels in a first region of the base training image to obtain a plurality of eigenvector-eigenvalue pairs of a covariance matrix of red green blue (RGB) pixel values from the pixels in the first region of the base training image; and applying an intensity transformation of pixel colors of the pixels in the first region of the base training image, comprising: randomly selecting a respective value for each eigenvector-eigenvalue pair of the covariance matrix; and for each pixel in the first region of the base training image, applying a transformation to the pixel colors of the pixel based on the eigenvector-eigenvalue pairs and the randomly-selected values; and adding the one or more color-deformed images to the training data for the neural network. 2. The method of claim 1 , wherein the classification data for each of the base training images comprises data that labels one more objects in the base training image. 3. The method of claim 1 , wherein each color-deformed image is generated from a respective base training image, and generating each color-deformed image comprises applying one or more color-space deformations to pixel colors of the respective base image. 4. The method of claim 1 , wherein generating the one or more color-deformed images from the plurality of base training images of the training data further comprises: generating a respective color-deformed image from each of the plurality of base training images, comprising applying the intensity transformation to pixel colors of the pixels in the set of pixels in the first region of the respective base training image, and wherein the method further comprises: associating each color-deformed image with the classification data for the base training image from which the color-deformed image was generated; and adding each color-deformed image and the associated classification data to the set of training data. 5. The method of claim 4 , wherein applying the one or more respective color-space deformations to pixel colors of the first base training image comprises: applying a second color intensity transformation to pixel colors of pixels in a second, different region of the base training image. 6. The method of claim 1 , wherein the pixel of the first base training image is an RGB image pixel represented by I xy =[I xy R ,I xy G ,I xy B ] T , wherein applying the transformation comprises adding [p 1 ,p 2 ,p 3 ][α 1 √λ 1 ,α 2 √λ 2 ,α 3 √λ 3 ] T to I xy , and wherein p i is an i-th eigenvector of the covariance matrix, λ i is an i-th eigenvalue of the covariance matrix, and α i is a randomly-selected value. 7. A system comprising one or more computers and one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising: obtaining training data for a neural network, wherein the training data comprises a plurality of base training images and respective classification data for each of the base training images, and wherein the neural network is configured to receive an input image and predict classification data for the input image, wherein each image comprises data representing pixels having a respective color; generating one or more color-deformed images from the base training images of the training data, the generating comprising, for each of the plurality of base training images: performing a principal component analysis (PCA) on pixels in a first region of the base training image to obtain a plurality of eigenvector-eigenvalue pairs of a covariance matrix of red green blue (RGB) pixel values from the pixels in the first region of the base training image; and applying an intensity transformation to pixel colors of the pixels in the first region of the base training image, comprising: randomly selecting a respective value for each eigenvector-eigenvalue pair of the covariance matrix; and for each pixel in the first region of the base training image, applying a transformation to the pixel colors of the pixel bsed on the eigenvector-eigenvalue pairs and the randomly-selected values; and adding the one or more color-deformed images to the training data for the neural network. 8. The system of claim 7 , wherein the classification data for each of the base training images comprises data that labels one more objects in the base training image. 9. The system of claim 7 , wherein each color-deformed image is generated from a respective base training image, and generating each color-deformed image comprises applying one or more color-space deformations to pixel colors of the respective base image. 10. The system of claim 7 , wherein generating the one or more color-deformed images from the plurality of base training images of the training data further comprises: generating a respective color-deformed image from each of the plurality of base training images, comprising applying the intensity transformation to pixel colors of the pixels in the set of pixels in the first region of the respective base training image, and wherein the method further comprises: associating each color-deformed image with the classification data for the base training image from which the color-deformed image was generated; and adding each color-deformed image and the associated classification data to the set of training data. 11. The system of claim 10 , wherein generating the respective color-deformed image from each of the plurality of base training images further comprises: applying a different, second color intensity transformation to pixel colors of pixels in a second, different region of the base training image. 12. The system of claim 7 , wherein the pixel of the base training image is an RGB image pixel represented by I xy =[I xy R ,I xy G ,I xy B ] T , wherein applying the transformation comprises adding [p 1 ,p 2 ,p 3 ][α 1 √λ 1 ,α 2 ,√λ 2 ,α 3 29 λ 3 ] T to I xy , and wherein p i is an i-th eigenvector of the covariance matrix, λ i is an i-th eigenvalue of the covariance matrix, and α i is a randomly-selected value. 13. A non-transitory computer-readable storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising: obtaining training data for a neural network, wherein the training data comprises a plurality of base training images and respective classification data for each of the base training images, wherein the neural network is configured to receive an input image and predict classification data for the input image, and wherein each image comprises data representing pixels having a respective color; generating one or more color-deformed images from the base training images of the training data, the generating comprising, for each of the plurality of base training images: performing a principal component analysis (PCA) on pixels in a first region

Assignees

Inventors

Classifications

  • G06F18/28Primary

    Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries · CPC title

  • G06K9/6255Primary

    Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9251437B2 cover?
A system and method for generating training images. An existing training image is associated with a classification. The system includes an image processing module that performs color-space deformation on each pixel of the existing training image and then associates the classification to the color-space deformed training image. The technique may be applied to increase the size of a training set …
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06F18/28. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 02 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).