Method for predicting intention of user and apparatus for performing same

US2021256250A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021256250-A1
Application numberUS-202117246299-A
CountryUS
Kind codeA1
Filing dateApr 30, 2021
Priority dateNov 2, 2018
Publication dateAug 19, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for predicting the intention of a user through an image acquired by capturing the user includes: receiving an image acquired by capturing a user; and predicting the intention of the user for the next motion by using spatial information and temporal information about the user and a target object included in the image.

First claim

Opening claim text (preview).

1 . A method for predicting an intention of a user through an image acquired by capturing the user, the method comprising: receiving an image acquired by capturing a user; and predicting an intention of the user for a next motion by using spatial information and temporal information about the user and a target object included in the image. 2 . The method of claim 1 , wherein the spatial information comprises a pose of a body part of the user and an interaction between the body part of the user and the target object. 3 . The method of claim 2 , wherein the interaction comprises at least one of a distance between the body part and the target object, and a position and direction of the body part based on the target object. 4 . The method of claim 2 , wherein the temporal information comprises changes in the pose of the body part of the user and the interaction over time. 5 . The method of claim 4 , wherein the temporal information comprises at least one of a trajectory along which the body part moves and a speed at which the body part moves toward the target object. 6 . The method of claim 1 , wherein predicting the intention of the user comprises: applying the image to a deep learning network as an input; extracting features of spatial information for each of a plurality of frames constituting the image through a convolution neural network (CNN) included in the deep learning network; extracting features of temporal information, included in consecutive frames, from the extracted features of the spatial information through a recurrent neural network (RNN) included in the deep learning network; and outputting an intention for a next motion as a result value based on the extracted features of the spatial information and the temporal information. 7 . The method of claim 6 , further comprising applying a driving signal to a device for assisting the user in performing motions according to the predicted intention. 8 . The method of claim 7 , wherein applying the driving signal comprises: selecting a result value occupying a predetermined percentage or more among a plurality of result values output from the deep learning network for a preset predetermined period of time; and applying a driving signal corresponding to the selected result value. 9 . A computer-readable storage medium having stored thereon a program that performs the method set forth in claim 1 . 10 . An apparatus for prediction an intention, the apparatus comprising: an input/output unit configured to receive an image acquired by capturing a user from an outside and to output an intention of the user for a next motion predicted by analyzing the image; a storage unit configured to store a program for predicting an intention of the user for a next motion by analyzing the image; and a control unit including at least one processor; wherein the control unit predicts an intention of the user for a next motion using spatial information and temporal information about the user and a target object, included in the image, by executing the program. 11 . The apparatus of claim 10 , wherein the spatial information comprises a pose of a body part of the user and an interaction between the body part of the user and the target object. 12 . The apparatus of claim 11 , wherein the spatial information further comprises at least one of a size, shape, texture, stiffness, and color of the target object. 13 . The apparatus of claim 11 , wherein the interaction comprises at least one of a distance between the body part and the target object, and a position and direction of the body part based on the target object. 14 . The apparatus of claim 11 , wherein the temporal information comprises changes in the pose of the body part of the user and the interaction over time. 15 . The apparatus of claim 14 , wherein the temporal information comprises at least one of a trajectory along which the body part moves and a speed at which the body part moves toward the target object. 16 . The apparatus of claim 10 , wherein a deep learning network that is implemented by the control unit by executing the program and receives the image as an input comprises: a spatial information extraction unit configured to extract features of spatial information for each of a plurality of frames constituting the image; a temporal information extraction unit configured to extract features of temporal information, included in consecutive frames, from the extracted features of the spatial information; and an intention output unit configured to output the intention for the next motion as a result value based on outputs of the spatial information extraction unit and the temporal information extraction unit. 17 . The apparatus of claim 16 , wherein the control unit applies a driving signal to a device for assisting the user in performing motions through the input/output unit according to the predicted intention. 18 . The apparatus of claim 17 , wherein the control unit, when applying the driving signal, selects a result value occupying a predetermined percentage or more among a plurality of result values output from the deep learning network for a preset predetermined period of time, and applies a driving signal corresponding to the selected result value. 19 . The apparatus of claim 17 , wherein: motions that can be performed by the user are classified into at least two types; and the control unit, when applying the driving signal, checks a type of motion currently performed by the user, selects only an intention for a motion different from the identified type of motion from intentions output as result values from the deep learning network for a predetermined period of time, and applies a driving signal corresponding to the selected intention. 20 . The apparatus of claim 10 , wherein the image acquired by capturing the user is an image that is captured from a first-person point of view of the user such that at least a body part of the user appears in the image.

Assignees

Inventors

Classifications

  • A61B5/0077Primary

    Devices for viewing the surface of the body, e.g. camera, magnifying lens · CPC title

  • Recognition of hand or arm movements, e.g. recognition of deaf sign language (static hand signs G06V40/113) · CPC title

  • Movements or behaviour, e.g. gesture recognition (recognition of facial expressions G06V40/16) · CPC title

  • Combinations of networks · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021256250A1 cover?
A method for predicting the intention of a user through an image acquired by capturing the user includes: receiving an image acquired by capturing a user; and predicting the intention of the user for the next motion by using spatial information and temporal information about the user and a target object included in the image.
Who is the assignee on this patent?
Seoul Nat Univ R&Db Foundation, Korea Advanced Inst Sci & Tech
What technology area does this patent fall under?
Primary CPC classification A61B5/0077. Mapped technology areas include Human Necessities.
When was this patent published?
Publication date Thu Aug 19 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).