Gesture-based content sharing in artificial reality environments
US-10712901-B2 · Jul 14, 2020 · US
US12450837B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12450837-B2 |
| Application number | US-202217742900-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 12, 2022 |
| Priority date | May 19, 2021 |
| Publication date | Oct 21, 2025 |
| Grant date | Oct 21, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Augmented reality features are selected for presentation to a display of an electronic eyewear device by using a camera of the electronic eyewear device to capture a scan image and processing the scan image to extract contextual signals. Simultaneously, voice data from the user is captured by a microphone of the electronic eyewear device and voice-to-text conversion of the captured voice data is performed to identify keywords in the voice data. The extracted contextual signals and the identified keywords are then used to select at least one augmented reality feature that matches the extracted contextual signals and the identified keywords, and the selected augmented reality feature is presented to the display for user selection. The contextual information thus refines the search results to provide the augmented reality feature best suited for the context of the scan image captured by the electronic eyewear device.
Opening claim text (preview).
What is claimed is: 1. An electronic eyewear device adapted to be worn on a head of a user, comprising: a display; at least one camera adapted to scan a scene in a viewing area around the user and to capture a scan image; a microphone adapted to capture voice data from the user; a memory that stores instructions; and a processor that executes the instructions to perform operations including: initiating a scan by the at least one camera to capture the scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from the user and the viewing area around the user; capturing, via the microphone, voice data from the user; performing voice-to-text conversion of the captured voice data or sending the captured voice data to a voice data processing device to identify at least one keyword in the voice data; processing the at least one extracted contextual signal and the at least one identified keyword or forwarding the at least one extracted contextual signal and the at least one identified keyword to an augmented reality feature storage to guide a search for an augmented reality feature having metadata that matches the user's search intent as determined from the at least one identified keyword and the at least one contextual signal and selecting at least one augmented reality feature from the augmented reality feature storage having metadata that matches the at least one extracted contextual signal and the at least one identified keyword; presenting the selected at least one selected augmented reality feature to the display for user selection; and applying an augmented reality feature selected by the user to the scene for display on the electronic eyewear device. 2. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including, for each contextual signal extracted from the scan image, presenting contextual signal descriptor text to the display. 3. The electronic eyewear device of claim 1 , wherein the at least one extracted contextual signal identifies at least one of a type of place or an object that is included in the scan image. 4. The electronic eyewear device of claim 1 , wherein the at least one extracted contextual signal identifies whether any tracking objects or markers are located in the scan image. 5. The electronic eyewear device of claim 1 , wherein initiating the scan is in response to a tap of a scan button or a press and hold of the scan button by the user. 6. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including presenting scan notifications to the display to indicate that a background scan has been initiated. 7. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to present the at least one selected augmented reality feature to the display in a carousel of augmented reality features for user selection. 8. The electronic eyewear device of claim 7 , wherein the processor executes the instructions to badge the at least one selected augmented reality feature in the carousel with a scan icon that differentiates the at least one selected augmented reality feature from any other augmented reality feature in the carousel. 9. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including presenting the at least one identified keyword to the display. 10. The electronic eyewear device of claim 1 , wherein the processor executes the instructions to perform additional operations including determining whether the user has spoken after the initiation of a scan of the scene and, when the user has spoken after the initiation of a scan of the scene, capturing the voice data from the user and initiating a voice scan animation on the display indicating that the user will see scan results based on the user's voice data. 11. A method of selecting augmented reality features for presentation to a display of an electronic eyewear device, comprising: initiating a scan of a scene in a viewing area around a user by at least one camera of the electronic eyewear device to capture a scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from a user and a viewing area around the user; capturing voice data from the user; performing voice-to-text conversion of the captured voice data or sending the captured voice data to a voice data processing device to identify at least one keyword in the voice data; processing the at least one extracted contextual signal and the at least one identified keyword or forwarding the at least one extracted contextual signal and the at least one identified keyword to an augmented reality feature storage to guide a search for an augmented reality feature having metadata that matches the user's search intent as determined from the at least one identified keyword and the at least one contextual signal and selecting at least one augmented reality feature from the augmented reality feature storage having metadata that matches the at least one extracted contextual signal and the at least one identified keyword; presenting the selected at least one selected augmented reality feature to the display for user selection; and applying an augmented reality feature selected by the user to the scene for display on the electronic eyewear device. 12. The method of claim 11 , further comprising presenting at least one of contextual signal descriptor text for each contextual signal extracted from the scan image or the at least one identified keyword to the display of the electronic eyewear device. 13. The method of claim 11 , wherein presenting the selected at least one selected augmented reality feature to the display comprises presenting the at least one selected augmented reality feature in a carousel of augmented reality features for user selection. 14. The method of claim 13 , further comprising badging the at least one selected augmented reality feature in the carousel with a scan icon that differentiates the at least one selected augmented reality feature from any other augmented reality feature in the carousel. 15. The method of claim 11 , further comprising determining whether the user has spoken after the scan of the scene has been initiated and, when the user has spoken after the scan of the scene has been initiated, capturing the voice data from the user, and initiating a voice scan animation on the display indicating that the user will see scan results based on the user's voice data. 16. A non-transitory computer-readable storage medium that stores instructions that when executed by at least one processor cause the processor to select augmented reality features for presentation to a display of an electronic eyewear device by performing operations including: initiating a scan of a scene in a viewing area around a user by at least one camera of the electronic eyewear device to capture a scan image; processing the scan image or sending the scan image to an image processing device to extract at least one contextual signal from the scan image, wherein the at least one extracted contextual signal includes contextual data from a user and a viewing area around the user; capturing voice
Execution procedure of a spoken command · CPC title
Word spotting · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Speech classification or search · CPC title
involving graphical user interfaces [GUIs] · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.