Image processing device, image processing method, and image processing program
US-2020380721-A1 · Dec 3, 2020 · US
US11604940B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11604940-B2 |
| Application number | US-202017123051-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 15, 2020 |
| Priority date | Dec 15, 2020 |
| Publication date | Mar 14, 2023 |
| Grant date | Mar 14, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for object identification using multiple images. The method includes training an object identification model. Training the model includes collecting a training images for each of a plurality of objects, labeling each of the plurality of training images with a corresponding one of a plurality of object identifiers, and training a neural network with the plurality of labeled training images. At least two target images of a target object are receive and fed into the trained object identification model. The method further includes receiving, from the trained object identification model, for each of the at least two target images, an object identifier corresponding to the target object and a probability that the object identifier corresponds to the target object. A similarity value between the at least two target images is computed and the probabilities for the at least two target images are combined in proportion to the similarity value.
Opening claim text (preview).
What is claimed is: 1. A method for object identification using multiple images, comprising: training an object identification model, including: collecting a plurality of training images for each of a plurality of objects; labeling each of the plurality of training images with a corresponding one of a plurality of object identifiers; and training a neural network with the plurality of labeled training images; receiving at least two target images of a target object; feeding each of the at least two target images into the trained object identification model; receiving, from the trained object identification model, for each of the at least two target images, a target object identifier corresponding to the target object and a probability that the target object identifier corresponds to the target object; computing a similarity value between the at least two target images; combining the probabilities for the at least two target images in proportion to the similarity value; and identifying the target object as the target object identifier based on the combined probabilities. 2. The method of claim 1 , wherein collecting the plurality of training images comprises receiving a plurality of photographs for each of the plurality of objects. 3. The method of claim 1 , wherein collecting the plurality of training images comprises rendering a plurality of training images for each of the plurality of objects. 4. The method of claim 3 , wherein rendering the plurality of training images comprises rendering images for each of the plurality of objects as viewed from different angles. 5. The method of claim 1 , further comprising displaying information, including an image of a part corresponding to the target object identifier located on an associated machine. 6. The method of claim 5 , further comprising displaying a set of suitable substitute parts for the part corresponding to the target object identifier. 7. An object identification system, comprising: one or more processors; and one or more memory devices having stored thereon instructions that when executed by the one or more processors cause the one or more processors to: train an object identification model, including: collecting a plurality of training images for each of a plurality of objects; labeling each of the plurality of training images with a corresponding one of a plurality of object identifiers; and training a neural network with the plurality of labeled training images; receive at least two target images of a target object; feed each of the at least two target images into the trained object identification model; receive, from the trained object identification model, for each of the at least two target images, a probability that the target object corresponds to each of the plurality of object identifiers; compute a similarity value between the at least two target images; combine, for each of the plurality of object identifiers, the corresponding probabilities for the at least two target images in proportion to the similarity value; and identify the target object as the object identifier having a greatest combined probability. 8. The system of claim 7 , wherein collecting the plurality of training images comprises receiving a plurality of photographs for each of the plurality of objects. 9. The system of claim 7 , wherein collecting the plurality of training images comprises rendering a plurality of training images for each of the plurality of objects. 10. The system of claim 9 , wherein rendering the plurality of training images comprises rendering images for each of the plurality of objects as viewed from different angles. 11. The system of claim 7 , further comprising normalizing the combined probabilities for each object identifier. 12. The system of claim 7 , wherein the target object is a part associated with a machine and further comprising: receiving machine information identifying the machine; and based on the machine information, removing selected object identifiers from the plurality of object identifiers. 13. The system of claim 7 , further comprising displaying information, including an image of a part corresponding to the object identifier having the greatest combined probability located on an associated machine. 14. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: training an object identification model, including: collecting a plurality of training images for each of a plurality of objects; labeling each of the plurality of training images with a corresponding one of a plurality of object identifiers; and training a neural network with the plurality of labeled training images; receiving at least two target images of a target object; feeding each of the at least two target images into the trained object identification model; receiving, from the trained object identification model, for each of the at least two target images, a target object identifier corresponding to the target object and a probability that the target object identifier corresponds to the target object; computing a similarity value between the at least two target images; combining the probabilities for the at least two target images in proportion to the similarity value; and identifying the target object as the target object identifier based on the combined probabilities. 15. The one or more non-transitory computer-readable media of claim 14 , wherein collecting the plurality of training images comprises receiving a plurality of photographs for each of the plurality of objects. 16. The one or more non-transitory computer-readable media of claim 14 , wherein collecting the plurality of training images comprises rendering a plurality of training images for each of the plurality of objects. 17. The one or more non-transitory computer-readable media of claim 16 , wherein rendering the plurality of training images comprises rendering images for each of the plurality of objects as viewed from different angles. 18. The one or more non-transitory computer-readable media of claim 14 , further comprising displaying information, including an image of a part corresponding to the target object identifier located on an associated machine. 19. The one or more non-transitory computer-readable media of claim 18 , further comprising displaying a set of suitable substitute parts for the part corresponding to the target object identifier. 20. The one or more non-transitory computer-readable media of claim 18 , further comprising displaying ordering information for the part corresponding to the target object identifier.
Supervised learning · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Type of objects · CPC title
based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate · CPC title
Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.