Visual object recognition

US2017286809A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017286809-A1
Application numberUS-201615089707-A
CountryUS
Kind codeA1
Filing dateApr 4, 2016
Priority dateApr 4, 2016
Publication dateOct 5, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Technical solutions are described for training an object-recognition neural network that identifies an object in a computer-readable image. An example method includes assigning a first neural network for determining a visual alignment model of the images for determining a normalized alignment of the object. The method further includes assigning a second neural network for determining a visual representation model of the images for recognizing the object. The method further includes determining the visual alignment model by training the first neural network and determining the visual representation model by training the second neural network independent of the first. The method further includes determining a combined object recognition model by training a combination of the first neural network and the second neural network. The method further includes recognizing the object in the image based on the combined object recognition model by passing the image through each of the neural networks.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for training an object-recognition neural network to identify an object in a computer-readable image, the method comprising: assigning, using a processor system, a first neural network for determining a visual alignment model of the images, wherein the visual alignment model is used to determine a normalized alignment of an object in input images; assigning, using the processor system, a second neural network for determining a visual representation model of the images, wherein the visual representation model is used to recognize the object in the input images; determining the visual alignment model by training the first neural network; determining the visual representation model by training the second neural network; determining a combined object recognition model by training a combination of the first neural network and the second neural network; and recognizing the object in the computer-readable image based on the combined object recognition model by passing the computer-readable image through each of the combined neural networks. 2 . The computer-implemented method of claim 1 , further comprising: splitting an object-recognition neural network into a plurality of independent networks including the first neural network and the second neural network, wherein a first subset of the independent networks operates on content-based attributes of the images and a second subset of the independent networks operates on content-unaware attributes of the images. 3 . The computer-implemented method of claim 1 , wherein the visual alignment model is determined by training the first neural network using a first set of images and the visual representation model is determined by training the second neural network using a second set of images. 4 . The computer-implemented method of claim 3 , wherein the combined object recognition model is determined by training a combination of the first neural network and the second neural network using a third set of images, distinct from the first set of images and the second set of images. 5 . The computer-implemented method of claim 1 , wherein the first neural network is trained in parallel to the second neural network. 6 . The computer-implemented method of claim 5 , wherein the first neural network is trained by a first computer system and the second neural network is trained by a second computer system, which is distinct from the first computer system. 7 . The computer-implemented method of claim 1 , wherein the first neural network is trained by a first processor in parallel to the second neural network being trained by a second processor. 8 . The computer-implemented method of claim 1 , further comprising: assigning a third neural network for determining a contrast normalization model of the images, wherein the contrast normalization model is used to determine a normalized contrast value of the input images; and determining the combined object recognition model by further training a combination of the first neural network, the second neural network, and the third neural network. 9 . The computer-implemented method of claim 8 , wherein recognizing the object in the computer-readable image based on the combined object recognition model comprises: passing the computer-readable image the first neural network and the third neural network in parallel; and passing outputs of the first neural network and the third neural network to the second neural network, wherein the second neural network recognizes the object in the computer-readable image based on the respective outputs. 10 . The computer-implemented method of claim 1 , wherein determining the normalized alignment by training the first neural network comprises determining transformations in 3-dimensional space to align the object in the computer-readable image in a normalized position. 11 . A system for training an object-recognition neural network to identify an object in a computer-readable image, the system comprising: a memory; and a processor communicatively coupled to the memory, wherein the processor is configured to: assign a first neural network for determining a visual alignment model of the images, wherein the visual alignment model is used to determine a normalized alignment of an object in input images; assign a second neural network for determining a visual representation model of the images, wherein the visual representation model is used to recognize the object in the input images; determine the visual alignment model by training the first neural network; determine the visual representation model by training the second neural network; determine a combined object recognition model by training a combination of the first neural network and the second neural network; and recognize the object in the computer-readable image based on the combined object recognition model by passing the computer-readable image through each of the combined neural networks. 12 . The system of claim 11 , wherein the processor is further configured to split an object-recognition neural network into a plurality of independent networks including the first neural network and the second neural network, wherein a first subset of the independent networks operates on content-based attributes of the images and a second subset of the independent networks operates on content-unaware attributes of the images. 13 . The system of claim 11 , wherein the visual alignment model is determined by training the first neural network using a first set of images and the visual representation model is determined by training the second neural network using a second set of images. 14 . The system of claim 13 , wherein the combined object recognition model is determined by training a combination of the first neural network and the second neural network using a third set of images, distinct from the first set of images and the second set of images. 15 . The system of claim 11 , wherein the first neural network is trained in parallel to the second neural network. 16 . The system of claim 11 , wherein the first neural network is trained by a first computer system and the second neural network is trained by a second computer system, which is distinct from the first computer system. 17 . A computer program product for training an object-recognition neural network to identify an object in a computer-readable image, the computer program product comprising a non-transitory computer readable storage medium, the computer readable storage medium comprising computer executable instructions, wherein the computer readable storage medium comprises instructions to: assign a first neural network for determining a visual alignment model of the images, wherein the visual alignment model is used to determine a normalized alignment of an object in input images; assign a second neural network for determining a visual representation model of the images, wherein the visual representation model is used to recognize the object in the input images; determine the visual alignment model by training the first neural network; determine the visual representation model by training the second neural network; determine a combined object recognition model by training a combination of the first neural network and the second neural network; and recognize the object in the computer-readable image based on the combined object recognition model by passing the computer-readable image through each of the combined neural networks. 18 . The computer

Assignees

Inventors

Classifications

  • Classification techniques · CPC title

  • G06V10/82Primary

    using neural networks · CPC title

  • Combinations of networks · CPC title

  • Activation functions · CPC title

  • Smoothing the distance, e.g. radial basis function networks [RBFN] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017286809A1 cover?
Technical solutions are described for training an object-recognition neural network that identifies an object in a computer-readable image. An example method includes assigning a first neural network for determining a visual alignment model of the images for determining a normalized alignment of the object. The method further includes assigning a second neural network for determining a visual r…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).