Apparatus and method for automatically generating visual annotation based on visual language
US-9606975-B2 · Mar 28, 2017 · US
US10152644B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10152644-B2 |
| Application number | US-201615350813-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 14, 2016 |
| Priority date | Aug 31, 2016 |
| Publication date | Dec 11, 2018 |
| Grant date | Dec 11, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present application discloses a vehicle searching method and device, which can perform the steps of: calculating an appearance similarity distance between a first image of a target vehicle and several second images containing the searched vehicle; selecting several images from the several second images as several third images; obtaining corresponding license plate features of license plate areas in the first image and each of the third images with a preset Siamese neural network model; calculating a license plate feature similarity distance between the first image and each of the third images according to license plate feature; calculating a visual similarity distance between the first image and each of the third images according to the appearance similarity distance and the license plate feature similarity distance; obtaining a the first search result of the target vehicle by arranging the several third images in an ascending order of the visual similarity distances. The solution provided by the present application is not limited by application scenes, and it also improves vehicle searching speed and accuracy while reducing requirements of hardware such as cameras that collect images of a vehicle and auxiliary devices.
Opening claim text (preview).
What is claimed is: 1. A vehicle searching method, characterized in that it comprises: obtaining a first image of a target vehicle; extracting a first appearance visual feature of the target vehicle from the first image; extracting a second appearance visual feature of the searched vehicle respectively from several second images; wherein, the second images are the images stored in a vehicle monitoring image database; calculating an appearance similarity distance between the first image and each of the second images according to the first appearance visual feature and each of the second appearance visual features; selecting several images from the several second images as several third images; determining a first license plate area in the first image and a second license plate area in each of the third images; obtaining a first license plate feature corresponding to the first license plate area and a second license plate feature corresponding to each of the second license plate areas by inputting the first license plate area and each of the second license plate areas respectively into a preset Siamese neural network model; calculating a license plate feature similarity distance between the first image and each of the third images according to the first license plate feature and each of the second license plate features; calculating a visual similarity distance between the first image and each of the third images according to the appearance similarity distance and the license plate feature similarity distance; obtaining a first search result of the target vehicle by arranging the several third images in an ascending order of the visual similarity distances; wherein, the first appearance visual feature comprises a first texture feature, a first color feature and a first semantic attribute feature; the second appearance visual feature comprises a second texture feature, a second color feature and a second semantic attribute feature; the step of calculating an appearance similarity distance between the first image and each of the second images according to the first appearance visual feature and each of the second appearance visual features, comprises: performing the following steps for the first image and each of the second images respectively: calculating a texture similarity distance according to the first texture feature and the second texture feature; calculating a color similarity distance according to the first color feature and the second color feature; calculating a semantic attribute similarity distance according to the first semantic attribute feature and the second semantic attribute feature; calculating the appearance similarity distance between the first image and the second image according to the texture similarity distance, the color similarity distance, the semantic attribute similarity distance and a third preset model; wherein, the third preset model is: D appearance =α×d texture +β×d color +(1−α−β)×d attribute , wherein, D appearance is the appearance similarity distance, d texture , is the texture similarity distance, d color is the color similarity distance, d attribute is the semantic attribute similarity distance, and are empirical weights. 2. The method according to claim 1 , characterized in that, the step of selecting several images from the several second images as several third images, comprises: obtaining a second search result of the target vehicle by arranging the several second images in an ascending order of the appearance similarity distances; determining several images that rank before a first threshold value in the second search result as the several third images. 3. The method according to claim 1 , characterized in that, after the step of calculating a visual similarity distance between the first image and each of the third images, the method further comprises: calculating time-space similarity between the first image and each of the third images according to time-space meta data contained in the first image and each of the third images; calculating a final similarity distance between the first image and each of the third images according to the visual similarity distance and the time-space similarity; obtaining a third search result of the target vehicle by arranging the several third images in an ascending order of the final similarity distances. 4. The method according to claim 1 , characterized in that it further comprises: determining several images that rank before a second threshold value in the first search result as several fourth images; calculating time-space similarity between the first image and each of the fourth images according to time-space meta data contained in the first image and each of the fourth images; calculating a final similarity distance between the first image and each of the fourth images according to the visual similarity distance and the time-space similarity; obtaining a fourth search result of the target vehicle by arranging the several fourth images in an ascending order of the final similarity distances. 5. The method according to claim 2 , characterized in that it further comprises: determining several images that rank before a second threshold value in the first search result as several fourth images; calculating time-space similarity between the first image and each of the fourth images according to time-space meta data contained in the first image and each of the fourth images; calculating a final similarity distance between the first image and each of the fourth images according to the visual similarity distance and the time-space similarity; obtaining a fourth search result of the target vehicle by arranging the several fourth images in an ascending order of the final similarity distances. 6. The method according to claim 3 , characterized in that, the step of calculating time-space similarity between the first image and each of the third images comprises: calculating the time-space similarity between the first image and each of the third images with a first preset model, wherein, the first preset model is: ST ( i , j ) = T i - T j T max × δ ( C i , C j ) D max , wh
using neural networks · CPC title
relating to texture · CPC title
using classification, e.g. of video objects · CPC title
Text, e.g. of license plates, overlay texts or captions on TV images · CPC title
Matching criteria, e.g. proximity measures · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.