Multi-angle object recognition

US11961271B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11961271-B2
Application number	US-202318182737-A
Country	US
Kind code	B2
Filing date	Mar 13, 2023
Priority date	Aug 8, 2018
Publication date	Apr 16, 2024
Grant date	Apr 16, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus for controlling smart devices are described. In one aspect a method includes capturing, by a camera on a user device, a plurality of successive images for display in an application environment of an application executing on the user device, performing an object recognition process on the images, the object recognition process including determining that a plurality of images, each depicting a particular object, are required to perform object recognition on the particular object, and in response to the determination, generating a user interface element that indicates a camera operation to be performed, the camera option capturing two or more images, determining that a user, in response to the user interface element, has caused the indicated camera operation to be performed to capture the two or more images, and in response, determining whether a particular object is positively identified from the plurality of images.

First claim

Opening claim text (preview).

What is claimed is:
1. A computer-implemented method, comprising: capturing, by a camera, a first plurality of images at a first zoom level; performing a first object recognition process on the first plurality of images to detect a first object depicted in the first plurality of images; in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for output at a computing device, instructions requesting an image capture operation to be performed, wherein the image capture operation includes capturing, by the camera, a second plurality of images at a second zoom level, different from the first zoom level, the second plurality of images being greater than the first plurality of images; and in response to the image capture operation being performed, performing a second object recognition process on the second plurality of images to detect the first object depicted in the second plurality of images.
2. The computer-implemented method of claim 1, wherein: the first plurality of images are captured at a first frame rate; and the second plurality of images are captured at a second frame rate greater than the first frame rate.
3. The computer-implemented method of claim 1, wherein the first object comprises a machine-readable code.
4. The computer-implemented method of claim 1, further comprising: in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for display at the computing device, a first user interface element descriptive of one or more actions to perform.
5. The computer-implemented method of claim 4, further comprising: in response to the second object recognition process successfully detecting the first object in the second plurality of images, providing, for display at the computing device, a second user interface element indicative of a successful recognition of the first object.
6. The computer-implemented method of claim 4, wherein the first user interface element comprises one or more of: an icon; an animation; or a descriptive text.
7. The computer-implemented method of claim 4, wherein: the first user interface element is displayed adjacent to the first object being recognized; or the first user interface element is displayed at a fixed location.
8. A user computing device, the user computing device comprising: a camera; one or more processors; and one or more non-transitory computer-readable media configured to store instructions that, when executed by the one or more processors, cause the user computing device to perform operations, the operations comprising: capturing, by the camera, a first plurality of images at a first zoom level, performing a first object recognition process on the first plurality of images to detect a first object depicted in the first plurality of images, in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for output at the user computing device, instructions requesting an image capture operation to be performed, wherein the image capture operation includes capturing, by the camera, a second plurality of images at a second zoom level, different from the first zoom level, the second plurality of images being greater than the first plurality of images, and in response to the image capture operation being performed, performing a second object recognition process on the second plurality of images to detect the first object depicted in the second plurality of images.
9. The user computing device of claim 8, wherein: the first plurality of images are captured at a first frame rate; and the second plurality of images are captured at a second frame rate greater than the first frame rate.
10. The user computing device of claim 8, wherein the first object comprises a machine-readable code.
11. The user computing device of claim 8, further comprising a display device, wherein the operations further comprise: in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for presentation on the display device, a first user interface element descriptive of one or more actions to perform.
12. The user computing device of claim 11, wherein the operations further comprise: in response to the second object recognition process successfully detecting the first object in the second plurality of images, providing, for presentation on the display device, a second user interface element indicative of a successful recognition of the first object.
13. The user computing device of claim 11, wherein the first user interface element comprises one or more of: an icon; an animation; or a descriptive text.
14. The user computing device of claim 11, wherein: the first user interface element is displayed adjacent to the first object being recognized; or the first user interface element is displayed at a fixed location.
15. One or more non-transitory computer-readable media configured to store instructions that, when executed by one or more processors, cause the one or more processors to perform operations, the operations comprising: capturing, by a camera, a first plurality of images at a first zoom level; performing a first object recognition process on the first plurality of images to detect a first object depicted in the first plurality of images; in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for output at a computing device, instructions requesting an image capture operation to be performed, wherein the image capture operation includes capturing, by the camera, a second plurality of images at a second zoom level, different from the first zoom level, the second plurality of images being greater than the first plurality of images; and in response to the image capture operation being performed, performing a second object recognition process on the second plurality of images to detect the first object depicted in the second plurality of images.
16. The one or more non-transitory computer-readable media of claim 15, wherein: the first plurality of images are captured at a first frame rate; and the second plurality of images are captured at a second frame rate greater than the first frame rate.
17. The one or more non-transitory computer-readable media of claim 15, wherein the operations further comprise: in response to the first object recognition process failing to detect the first object in the first plurality of images, providing, for display at the computing device, a first user interface element descriptive of one or more actions to perform, and in response to the second object recognition process successfully detecting the first object in the second plurality of images, providing, for display at the computing device, a second user interface element indicative of a successful recognition of the first object.
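
The control flow recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `Frame`, `recognize_with_retry`, `capture`, `recognize`, and `prompt_user`, the default zoom levels, and the image counts are all hypothetical. The sketch reads "the second plurality of images being greater than the first" as meaning the second batch contains more images, and it simplifies the user-driven capture step into a direct call made after the prompt.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Frame:
    """Hypothetical stand-in for one captured image."""
    zoom: float
    pixels: bytes = b""


# Hypothetical interfaces: a camera returning `count` frames at a zoom level,
# and a recognizer returning a label, or None when detection fails.
Capture = Callable[[float, int], List[Frame]]
Recognize = Callable[[List[Frame]], Optional[str]]


def recognize_with_retry(
    capture: Capture,
    recognize: Recognize,
    prompt_user: Callable[[str], None],
    first_zoom: float = 1.0,
    second_zoom: float = 2.0,
    first_count: int = 3,
    second_count: int = 6,
) -> Optional[str]:
    """Try recognition once; on failure, request a larger batch at another zoom."""
    if second_zoom == first_zoom:
        raise ValueError("second zoom level must differ from the first")
    if second_count <= first_count:
        raise ValueError("second batch must contain more images than the first")

    # First pass: capture at the first zoom level and attempt recognition.
    label = recognize(capture(first_zoom, first_count))
    if label is not None:
        return label

    # Failure path: output instructions requesting the second capture operation.
    prompt_user(
        f"Object not recognized. Capture {second_count} images at {second_zoom}x zoom."
    )

    # Second pass: more images, different zoom level, second recognition attempt.
    return recognize(capture(second_zoom, second_count))


# Usage with fake components (assumed behavior, for illustration only).
def fake_capture(zoom: float, count: int) -> List[Frame]:
    return [Frame(zoom=zoom) for _ in range(count)]


def fake_recognize(frames: List[Frame]) -> Optional[str]:
    # Pretend detection needs at least 5 frames taken at 2x zoom or more.
    if len(frames) >= 5 and frames[0].zoom >= 2.0:
        return "machine-readable code"
    return None


messages: List[str] = []
result = recognize_with_retry(fake_capture, fake_recognize, messages.append)
```

With these fakes, the first pass fails, one prompt is recorded in `messages`, and the second pass returns a label. A real system would wait for the user to perform the capture rather than calling the camera immediately.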

Assignees

Google LLC

Inventors

Badr Ibrahim

Classifications

G06V10/235 · Primary
based on user input or interaction · CPC title
G06V10/16
using multiple overlapping images; Image stitching · CPC title
G06V10/24
Aligning, centring, orientation detection or correction of the image · CPC title
G06V10/776
Validation; Performance evaluation · CPC title
G06V10/98
Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns · CPC title
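
As an illustration of how a CPC symbol such as G06V10/235 decomposes, the following Python sketch splits a symbol into section, class, subclass, main group, and subgroup. The function name `parse_cpc`, the `CpcSymbol` type, and the regular expression are assumptions made for this example, not part of the page's data model. Only the section G title ("Physics") is filled in, since that is the one mapping this page states.

```python
import re
from typing import NamedTuple


class CpcSymbol(NamedTuple):
    section: str      # one letter, e.g. "G"
    cpc_class: str    # two digits, e.g. "06"
    subclass: str     # one letter, e.g. "V"
    main_group: str   # digits before the slash, e.g. "10"
    subgroup: str     # digits after the slash, e.g. "235"


# Assumed pattern for symbols written without spaces, as on this page.
_CPC_PATTERN = re.compile(r"^([A-HY])(\d{2})([A-Z])(\d{1,4})/(\d{2,6})$")

# Only the section shown on this page is listed; other sections are omitted.
SECTION_TITLES = {"G": "Physics"}


def parse_cpc(symbol: str) -> CpcSymbol:
    """Split a CPC symbol like 'G06V10/235' into its parts."""
    match = _CPC_PATTERN.match(symbol.strip())
    if match is None:
        raise ValueError(f"not a recognizable CPC symbol: {symbol!r}")
    return CpcSymbol(*match.groups())


primary = parse_cpc("G06V10/235")
area = SECTION_TITLES.get(primary.section, "unknown")
```

Here `primary.section` is "G", which is why the page maps this patent to the Physics technology area.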

Patent family

Related publications grouped by family.

View patent family 67660007

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11961271B2 cover?: Methods, systems, and apparatus for controlling smart devices are described. In one aspect a method includes capturing, by a camera on a user device, a plurality of successive images for display in an application environment of an application executing on the user device, performing an object recognition process on the images, the object recognition process including determining that a pluralit…
Who is the assignee on this patent?: Google LLC
What technology area does this patent fall under?: Primary CPC classification G06V10/235. Mapped technology areas include Physics.
When was this patent published?: Publication date Apr 16, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Related patents

Image-based item identification

Image artifact masking

Using a mobile device for coarse shape matching against cloud-based 3D model database

Information processing apparatus, information processing method, program, and information processing system

Image processing apparatus, image processing method, and image processing system
