Adaptable image search with computer vision assistance

US9619488B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9619488-B2
Application numberUS-201414163999-A
CountryUS
Kind codeB2
Filing dateJan 24, 2014
Priority dateJan 24, 2014
Publication dateApr 11, 2017
Grant dateApr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computing device having adaptable image search and methods for operating an image recognition program on the computing device are disclosed herein. An image recognition program may receive a query from a user and a target image within which a search based on the query is to be performed using one or more of a plurality of locally stored image recognition models, which are determined to be able to perform the search with sufficiently high confidence. The query may comprise text that is typed or converted from speech. The image recognition program performs the search within the target image for a target region of the target image using at least one selected image recognition model stored locally, and returns a search result to the user.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computing device having adaptable image search, the computing device comprising: non-volatile memory configured to store a plurality of image recognition models; an image recognition program executed by a processor of the computing device, the computing device being a user computing device, and the image recognition program configured to: receive a query from a user, the query comprising text that is typed or converted from speech; receive a target image within which a search based on the query is to be performed; rank the image recognition models by confidence level for performing the search based on at least a comparison between the query and respective text descriptions of the image recognition models; determine whether the confidence level of any of the image recognition models is above a confidence threshold; and upon determining that at least one confidence level of the image recognition models is above the confidence threshold, select at least one of the image recognition models whose confidence level is above the confidence threshold; perform the search within the target image for a target region of the target image using at least one selected image recognition model locally on the processor; and return a search result to the user. 2. The computing device of claim 1 , wherein the target image is a single image or one or a plurality of image frames that constitute a portion of video. 3. The computing device of claim 1 , wherein each image recognition model includes at least one of the following: an image recognition algorithm, an optical character recognition (OCR) algorithm, and a keyword matching algorithm. 4. The computing device of claim 1 , wherein the image recognition program uses location information about the user. 5. The computing device of claim 1 , wherein the target image is an image or video of the user's current surroundings and the query indicates a desired product or destination. 6. The computing device of claim 1 , wherein the computing device comprises a camera and wherein the target image is captured by the camera. 7. The computing device of claim 1 , wherein the computing device is a smartphone or tablet. 8. The computing device of claim 1 , wherein the computing device is a watch or other wearable device. 9. The computing device of claim 1 , wherein the image recognition program displays the search result on a head-up display. 10. The computing device of claim 1 , wherein the query includes a directive to search for target regions in the target image or in other images that are similar to the target region in the target image. 11. A method for operating an image recognition program on a computing device having adaptable image search, the method comprising: executing the image recognition program on a processor of the computing device, the computing device being a user computing device; receiving a query from a user, the query comprising text that is typed or converted from speech; receiving a target image within which a search based on the query is to be performed; ranking a plurality of image recognition models by confidence level for performing the search based on at least a comparison between the query and respective text descriptions of the image recognition models, wherein the image recognition models are stored in non-volatile memory of the computing device; determining whether the confidence level of any of the image recognition models is above a confidence threshold; and upon determining that at least one confidence level of the image recognition models is above the confidence threshold, selecting at least one of the image recognition models whose confidence level is above the confidence threshold; performing the search within the target image for a target region of the target image using at least one selected image recognition model locally on the processor; and returning a search result to the user. 12. The method of claim 11 , wherein each image recognition model includes at least one of the following: an image recognition algorithm, an optical character recognition (OCR) algorithm, and a keyword matching algorithm. 13. The method of claim 11 , wherein the image recognition program uses location information about the user. 14. The method of claim 11 , wherein the target image is an image or video of the user's current surroundings and the query indicates a desired product or destination. 15. The method of claim 11 , wherein the computing device comprises a camera and wherein the target image is captured by the camera. 16. The method of claim 11 , wherein the computing device is a smartphone or tablet. 17. The method of claim 11 , wherein the image recognition program displays the search result on a head-up display. 18. The method of claim 11 , further comprising, upon determining that none of the confidence levels of the image recognition models is above the confidence threshold: connecting to a web service including a plurality of image recognition models; receiving a ranking of image recognition models of the web service by confidence level for performing the search based on the query within the target image; determining whether any of the image recognition models of the web service is above the confidence threshold for performing the search; and upon determining that at least one confidence level of the image recognition models of the web service is above the confidence threshold, selecting at least one of the image recognition models of the web service whose confidence level is above the confidence threshold; downloading at least one selected image recognition model to the computing device; performing the search within the target image for a target region of the target image using at least one downloaded image recognition model; and returning the search result to the user. 19. The method of claim 11 , further comprising, upon determining that none of the confidence levels of the image recognition models is above the confidence threshold: performing a web image search in an image search engine, wherein the web image search is based on the query; returning a predetermined number of example images; selecting at least one example image from the predetermined number of example images; creating an image recognition model based on at least one selected example image; storing the created image recognition model in the non-volatile memory of the computing device; performing the search within the target image for a target region of the target image using the created image recognition model; and returning the search result to the user. 20. A computing device having adaptable image search, the computing device comprising: non-volatile memory configured to store a plurality of image recognition models; a camera, wherein a target image is captured by the camera and wherein the target image is a single image or one or a plurality of image frames that constitute a portion of video; a head-up display; and an image recognition program executed by a processor of the computing device, the computing device being a user computing device, and the image recognition program configured to: receive a query from a user, the query comprising text that is typed or converted from speech; receive the target image within which a search based on the query is to be performed; rank the image recognition models by confidence level for performing the search based on at least a comparison between the query and respective te

Assignees

Inventors

Classifications

  • using selection of the recognition techniques, e.g. of a classifier in a multiple classifier system · CPC title

  • Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system · CPC title

  • using metadata automatically derived from the content · CPC title

  • Details of monitoring file system events, e.g. by the use of hooks, filter drivers, logs · CPC title

  • using ranking · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9619488B2 cover?
A computing device having adaptable image search and methods for operating an image recognition program on the computing device are disclosed herein. An image recognition program may receive a query from a user and a target image within which a search based on the query is to be performed using one or more of a plurality of locally stored image recognition models, which are determined to be abl…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/5838. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).