Auto-completion for gesture-input in assistant systems

US11869231B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11869231-B2
Application numberUS-202318150737-A
CountryUS
Kind codeB2
Filing dateJan 5, 2023
Priority dateApr 20, 2018
Publication dateJan 9, 2024
Grant dateJan 9, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes detecting a user input comprising an incomplete three-dimensional (3D) gesture performed by one or more hands of a first user by a virtual-reality (VR) headset, selecting candidate 3D gestures from pre-defined 3D gestures based on a personalized gesture-recognition model, wherein each of the candidate 3D gestures is associated with a confidence score representing a likelihood the first user intended to input the respective candidate 3D gesture, and presenting one or more suggested inputs corresponding to one or more of the candidate 3D gestures at the VR headset.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising, by a virtual-reality (VR) headset: detecting a user input comprising an incomplete three-dimensional (3D) gesture in the air performed by one or more hands of a first user; selecting, based on a personalized gesture-recognition model, one or more candidate 3D gestures from a plurality of pre-defined 3D gestures, wherein each of the candidate 3D gestures is associated with a confidence score representing a likelihood the first user intended to input the respective candidate 3D gesture by performing the incomplete 3D gesture in the air; and presenting, at the VR headset, one or more suggested inputs corresponding to one or more of the candidate 3D gestures. 2. The method of claim 1 , further comprising: calculating, by the VR headset for each of the one or more candidate 3D gestures, a similarity level of the candidate 3D gesture with respect to the incomplete 3D gesture. 3. The method of claim 2 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on a trajectory of the incomplete 3D gesture with respect to the VR headset. 4. The method of claim 2 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on an orientation of the incomplete 3D gesture with respect to the VR headset. 5. The method of claim 2 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on an object associated with the incomplete 3D gesture. 6. The method of claim 2 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on contextual information associated with the incomplete 3D gesture. 7. The method of claim 2 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on a position of the incomplete 3D gesture with respect to the VR headset. 8. The method of claim 1 , further comprising: calculating, by the VR headset, one or more confidence scores for one or more intents corresponding to the incomplete 3D gesture; and determining, by the VR headset, that each of the one or more confidence scores is below a threshold score. 9. The method of claim 8 , wherein the threshold score is based on a 3D wake-up gesture performed by the first user. 10. The method of claim 8 , wherein calculating the one or more confidence scores for the one or more intents corresponding to the incomplete 3D gesture is based on a velocity associated with the incomplete 3D gesture. 11. The method of claim 8 , wherein calculating the one or more confidence scores for the one or more intents corresponding to the incomplete 3D gesture is based on temporal information associated with the incomplete 3D gesture, and wherein the temporal information comprises a pause in the user input. 12. The method of claim 8 , wherein selecting the one or more candidate 3D gestures is further based on the one or more intents. 13. The method of claim 1 , further comprising: receiving, at the VR headset, a user-selected input from the first user, wherein the user-selected input comprises one of the suggested inputs; and executing, by the VR headset, one or more tasks based on the user-selected input. 14. The method of claim 1 , wherein each pre-defined 3D gesture comprises one or more of pointing, poking, tapping, waving, or swiping. 15. The method of claim 1 , further comprising: receiving, at the VR headset, a first user-selected input from the first user, wherein the first user-selected input comprises one of the suggested inputs, and wherein the first user-selected input is associated with a first intent; generating, by the VR headset based on the first user-selected input, one or more additional candidate 3D gestures, wherein each of the one or more additional candidate 3D gestures is associated with the first intent; presenting, at the VR headset, one or more additional suggested inputs corresponding to one or more of the additional candidate 3D gestures; receiving, at the VR headset, a second user-selected input from the first user, wherein the second user-selected input comprises one of the additional suggested inputs; and executing, by the VR headset, one or more tasks based on the second user-selected input. 16. One or more computer-readable non-transitory storage media embodying software that is operable when executed to: detect, by a virtual-reality (VR) headset, a user input comprising an incomplete three-dimensional (3D) gesture in the air performed by one or more hands of a first user; select, based on a personalized gesture-recognition model by the VR headset, one or more candidate 3D gestures from a plurality of pre-defined 3D gestures, wherein each of the candidate 3D gestures is associated with a confidence score representing a likelihood the first user intended to input the respective candidate 3D gesture by performing the incomplete 3D gesture in the air; and present, at the VR headset, one or more suggested inputs corresponding to one or more of the candidate 3D gestures. 17. The media of claim 16 , wherein the software is further operable when executed to: calculate, by the VR headset for each of the one or more candidate 3D gestures, a similarity level of the candidate 3D gesture with respect to the incomplete 3D gesture. 18. The media of claim 17 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on a trajectory of the incomplete 3D gesture with respect to the VR headset. 19. The media of claim 17 , wherein the similarly level of each candidate 3D gesture with respect to the incomplete 3D gesture is based on an orientation of the incomplete 3D gesture with respect to the VR headset. 20. A system comprising: one or more processors; and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to: detect, by a virtual-reality (VR) headset, a user input comprising an incomplete three-dimensional (3D) gesture in the air performed by one or more hands of a first user; select, based on a personalized gesture-recognition model by the VR headset, one or more candidate 3D gestures from a plurality of pre-defined 3D gestures, wherein each of the candidate 3D gestures is associated with a confidence score representing a likelihood the first user intended to input the respective candidate 3D gesture by performing the incomplete 3D gesture in the air; and present, at the VR headset, one or more suggested inputs corresponding to one or more of the candidate 3D gestures.

Assignees

Inventors

Classifications

  • Business processes related to social networking or social networking services · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Supervised learning · CPC title

  • Determination of affinities or common interests between users · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11869231B2 cover?
A method includes detecting a user input comprising an incomplete three-dimensional (3D) gesture performed by one or more hands of a first user by a virtual-reality (VR) headset, selecting candidate 3D gestures from pre-defined 3D gestures based on a personalized gesture-recognition model, wherein each of the candidate 3D gestures is associated with a confidence score representing a likelihood …
Who is the assignee on this patent?
Meta Platforms Tech Llc
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 09 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).