Visual perception assistant

US10860847B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10860847-B2
Application numberUS-201715785644-A
CountryUS
Kind codeB2
Filing dateOct 17, 2017
Priority dateOct 17, 2017
Publication dateDec 8, 2020
Grant dateDec 8, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, a system, and a computer program product for visually identifying the at least one target subject within a real-time view of a current scene. The method includes capturing, by at least one image sensor, a real-time view of a current scene. The method further includes performing a visual analysis of the real-time view to identify at least one subject. The method further includes: receiving, in real-time, a natural language input which includes verbal utterances of at least one speaker; and identifying, within the natural language input, a description of at least one particular subject. The method further includes: analyzing the current scene to identify at least one target subject that matches the description of the at least one particular subject; and in response to identifying the at least one target subject, applying at least one visual identifier to the real-time view of the current scene.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: capturing, by at least one image sensor, a real-time view of a current scene; receiving, a natural language input in real-time with the capturing of the real-time view; identifying, within the natural language input, a description of at least one particular subject within the current scene; analyzing the captured real time view of the current scene to identify, within the current scene, at least one target subject from among the at least one subject that matches the description of the at least one particular subject, the analyzing comprising differentiating among multiple identified subjects from other objects to identify the at least one target subject, based on a respective matching score determined between the description from the natural language input and each of the multiple identified subjects; and in response to identifying the at least one target subject within the current scene: applying, to the real-time view of the current scene, at least one visual identifier that increases a visibility of the at least one target subject within the real-time view of the current scene; and providing, to at least one output device, the real-time view of the current scene including the at least one visual identifier. 2. The method of claim 1 , wherein: the current scene includes a plurality of objects; performing the visual analysis of the current scene further comprises, identifying: the at least one subject from among the plurality of objects, wherein the at least one subject is a focus of attention in the current scene; and a background of the current scene; and the analyzing further comprises differentiating among the multiple identified subjects within the current scene, based on at least one prepositional phrase or relationship terms and phrases included in the description. 3. The method of claim 1 , further comprising: monitoring a movement of the at least one subject within the current scene; and dynamically updating a position of the at least one visual identifier within the real-time view based on the monitored movement. 4. The method of claim 1 , wherein applying the at least one visual identifier further comprises at least one of: dynamically rendering an identifying object adjacent to the at least one target subject within the real-time view of the current scene; and dynamically applying, to the real-time view of the current scene, a color adjustment to at least one region that includes the at least one target subject. 5. A data processing system comprising: at least one image sensor that captures a real-time view of a current scene; at least one input device that captures a natural language input including verbal utterances of at least one speaker, the natural language input captured in real-time with the capturing of the real-time view; and at least one processor that: identifies, within the natural language input, a description of at least one particular subject within the current scene; analyzes the captured real time view of the current scene to identify at least one target subject within the current scene from among the at least one subject that matches the description of the at least one particular subject within the current scene, wherein to identify the at least one target subject the processor differentiates among multiple identified subjects from other objects, based on a respective matching score determined between the description from the natural language input and each of the multiple identified subjects; and in response to identifying the at least one target subject within the current scene: applies, to the real-time view of the current scene, at least one visual identifier that increases a visibility of the at least one target subject within the real-time view of the current scene; and provides, to at least one output device, the real-time view of the current scene including the at least one visual identifier. 6. The data processing system of claim 5 , wherein: the current scene includes a plurality of objects; the at least one processor, in performing the visual analysis of the current scene, identifies: the at least one subject from among the plurality of objects, wherein the at least one subject is a focus of attention in the current scene; and a background of the current scene; and in analyzing the captured real time view of the current scene, the at least one processor differentiates among the multiple identified subjects within the current scene, based on at least one prepositional phrase or relationship terms and phrases included in the description. 7. The data processing system of claim 5 , wherein the at least one processor: monitors a movement of the at least one subject within the real-time view of the current scene; and dynamically updates a position of the at least one visual identifier within the real-time view based on the monitored movement. 8. The data processing system of claim 5 , wherein in applying the at least one visual identifier, the at least one processor performs at least one of: dynamically renders an identifying object adjacent to the at least one target subject within the real-time view of the current scene; and dynamically applies, to the real-time view of the current scene, a color adjustment to at least one region that includes the at least one target subject. 9. A computer program product comprising: a non-transitory computer readable storage device; and program code on the computer readable storage device that, when executed by a processor associated with a data processing system, enables the data processing system to provide the functionality of: capturing, by at least one image sensor, a real-time view of a current scene; receiving, in real-time, a natural language input; identifying, within the natural language input, a description of at least one particular subject within the current scene; analyzing the captured real time view of the current scene to identify at least one target subject within the current scene from among the at least one subject that matches the description of the at least one particular subject, the analyzing comprising differentiating among multiple identified subjects from other objects to identify the at least one target subject, based on a respective matching score determined between the description from the natural language input and each of the multiple identified subjects; and in response to identifying the at least one target subject within the current scene; applying, to the real-time view of the current scene, at least one visual identifier that increases a visibility of the at least one target subject within the real-time view of the current scene and providing, to at least one output device, the real-time view of the current scene including the at least one visual identifier. 10. The computer program product of claim 9 , wherein: the current scene includes a plurality of objects; the program code for performing the visual analysis of the current scene further comprising code for identifying: the at least one subject from among the plurality of objects, wherein the at least one subject is a focus of attention in the current scene; and a background of the current scene; and the program code for analyzing the captured real time view of the current scene comprises code for differentiating among the multiple identified subjects within the current scene from other objects, based on at least one prepositional phrase or relationship terms and phrases included in the description. 11. The computer program product of claim 9 , the program code further comprising code for: monitoring a movement of the at least on

Assignees

Inventors

Classifications

  • G06V20/52Primary

    Surveillance or monitoring of activities, e.g. for recognising suspicious objects (recognising microscopic objects G06V20/69) · CPC title

  • in augmented reality scenes · CPC title

  • G06V40/103Primary

    Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title

  • Recognition of whole body movements, e.g. for sport training · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10860847B2 cover?
A method, a system, and a computer program product for visually identifying the at least one target subject within a real-time view of a current scene. The method includes capturing, by at least one image sensor, a real-time view of a current scene. The method further includes performing a visual analysis of the real-time view to identify at least one subject. The method further includes: recei…
Who is the assignee on this patent?
Motorola Mobility Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/52. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 08 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).