Contextual visual and voice search from electronic eyewear device

US12450837B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12450837-B2
Application numberUS-202217742900-A
CountryUS
Kind codeB2
Filing dateMay 12, 2022
Priority dateMay 19, 2021
Publication dateOct 21, 2025
Grant dateOct 21, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Augmented reality features are selected for presentation to a display of an electronic eyewear device by using a camera of the electronic eyewear device to capture a scan image and processing the scan image to extract contextual signals. Simultaneously, voice data from the user is captured by a microphone of the electronic eyewear device and voice-to-text conversion of the captured voice data is performed to identify keywords in the voice data. The extracted contextual signals and the identified keywords are then used to select at least one augmented reality feature that matches the extracted contextual signals and the identified keywords, and the selected augmented reality feature is presented to the display for user selection. The contextual information thus refines the search results to provide the augmented reality feature best suited for the context of the scan image captured by the electronic eyewear device.

First claim

Opening claim text (preview).

What is claimed is: 1. An electronic eyewear device adapted to be worn on a head of a user, comprising: a display; at least one camera adapted to scan a scene in a viewing area around the user and to capture a scan image; a microphone adapted to capture voice data from the user; a memory that stores instructions; and a processor that executes the instructions to perform operations including: initiating a scan by the at least one camera to capture the scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from the user and the viewing area around the user; capturing, via the microphone, voice data from the user; performing voice-to-text conversion of the captured voice data or sending the captured voice data to a voice data processing device to identify at least one keyword in the voice data; processing the at least one extracted contextual signal and the at least one identified keyword or forwarding the at least one extracted contextual signal and the at least one identified keyword to an augmented reality feature storage to guide a search for an augmented reality feature having metadata that matches the user's search intent as determined from the at least one identified keyword and the at least one contextual signal and selecting at least one augmented reality feature from the augmented reality feature storage having metadata that matches the at least one extracted contextual signal and the at least one identified keyword; presenting the selected at least one selected augmented reality feature to the display for user selection; and applying an augmented reality feature selected by the user to the scene for display on the electronic eyewear device. 2. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including, for each contextual signal extracted from the scan image, presenting contextual signal descriptor text to the display. 3. The electronic eyewear device of claim 1 , wherein the at least one extracted contextual signal identifies at least one of a type of place or an object that is included in the scan image. 4. The electronic eyewear device of claim 1 , wherein the at least one extracted contextual signal identifies whether any tracking objects or markers are located in the scan image. 5. The electronic eyewear device of claim 1 , wherein initiating the scan is in response to a tap of a scan button or a press and hold of the scan button by the user. 6. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including presenting scan notifications to the display to indicate that a background scan has been initiated. 7. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to present the at least one selected augmented reality feature to the display in a carousel of augmented reality features for user selection. 8. The electronic eyewear device of claim 7 , wherein the processor executes the instructions to badge the at least one selected augmented reality feature in the carousel with a scan icon that differentiates the at least one selected augmented reality feature from any other augmented reality feature in the carousel. 9. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including presenting the at least one identified keyword to the display. 10. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including determining whether the user has spoken after the initiation of a scan of the scene and, when the user has spoken after the initiation of a scan of the scene, capturing the voice data from the user and initiating a voice scan animation on the display indicating that the user will see scan results based on the user's voice data. 11. A method of selecting augmented reality features for presentation to a display of an electronic eyewear device, comprising: initiating a scan of a scene in a viewing area around a user by at least one camera of the electronic eyewear device to capture a scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from a user and a viewing area around the user; capturing voice data from the user; performing voice-to-text conversion of the captured voice data or sending the captured voice data to a voice data processing device to identify at least one keyword in the voice data; processing the at least one extracted contextual signal and the at least one identified keyword or forwarding the at least one extracted contextual signal and the at least one identified keyword to an augmented reality feature storage to guide a search for an augmented reality feature having metadata that matches the user's search intent as determined from the at least one identified keyword and the at least one contextual signal and selecting at least one augmented reality feature from the augmented reality feature storage having metadata that matches the at least one extracted contextual signal and the at least one identified keyword; presenting the selected at least one selected augmented reality feature to the display for user selection; and applying an augmented reality feature selected by the user to the scene for display on the electronic eyewear device. 12. The method of claim 11 , further comprising presenting at least one of contextual signal descriptor text for each contextual signal extracted from the scan image or the at least one identified keyword to the display of the electronic eyewear device. 13. The method of claim 11 , wherein presenting the selected at least one selected augmented reality feature to the display comprises presenting the at least one selected augmented reality feature in a carousel of augmented reality features for user selection. 14. The method of claim 13 , further comprising badging the at least one selected augmented reality feature in the carousel with a scan icon that differentiates the at least one selected augmented reality feature from any other augmented reality feature in the carousel. 15. The method of claim 11 , further comprising determining whether the user has spoken after the scan of the scene has been initiated and, when the user has spoken after the scan of the scene has been initiated, capturing the voice data from the user, and initiating a voice scan animation on the display indicating that the user will see scan results based on the user's voice data. 16. A non-transitory computer-readable storage medium that stores instructions that when executed by at least one processor cause the processor to select augmented reality features for presentation to a display of an electronic eyewear device by performing operations including: initiating a scan of a scene in a viewing area around a user by at least one camera of the electronic eyewear device to capture a scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from a user and a viewing area around the user; capturing voice

Assignees

Inventors

Classifications

  • Execution procedure of a spoken command · CPC title

  • Word spotting · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Speech classification or search · CPC title

  • involving graphical user interfaces [GUIs] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12450837B2 cover?
Augmented reality features are selected for presentation to a display of an electronic eyewear device by using a camera of the electronic eyewear device to capture a scan image and processing the scan image to extract contextual signals. Simultaneously, voice data from the user is captured by a microphone of the electronic eyewear device and voice-to-text conversion of the captured voice data i…
Who is the assignee on this patent?
Snap Inc
What technology area does this patent fall under?
Primary CPC classification G06T19/006. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).