Image description generation for screen readers
US-2024013768-A1 · Jan 11, 2024 · US
US10339406B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10339406-B2 |
| Application number | US-201314137522-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 20, 2013 |
| Priority date | Mar 15, 2013 |
| Publication date | Jul 2, 2019 |
| Grant date | Jul 2, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Devices and a method are provided for providing feedback to a user. In one implementation, the method comprises obtaining a plurality of images from an image sensor. The image sensor is configured to be positioned for movement with the user's head. The method further comprises monitoring the images, and determining whether relative motion occurs between a first portion of a scene captured in the plurality of images and other portions of the scene captured in the plurality of images. If the first portion of the scene moves less than at least one other portion of the scene, the method comprises obtaining contextual information from the first portion of the scene. The method further comprises providing the feedback to the user based on at least part of the contextual information.
Opening claim text (preview).
What is claimed is: 1. An apparatus for providing feedback to a user, the apparatus comprising: an image sensor configured to be positioned for movement with a head of the user as the head moves, and to capture real time images from an environment of the user; and at least one processor device for determining contextual information based on the real time images, the processor device being configured to: monitor a plurality of the real time images captured by the image sensor to determine that relative motion occurs between a first portion of a scene captured in the plurality of real time images and other portions of the scene captured in the plurality of real time images; determine that an object appears to be stationary in the first portion of the scene across the monitored real time images while the other portions of the scene appear to be moving across the monitored real time images; determine, based on the stationary appearance of the object in the first portion of the scene across the monitored real time images while the other portions of the scene appear to be moving across the monitored real time images, that the head of the user is tracking the object included in the first portion of the scene; obtain contextual information associated with the object that the head of the user is tracking; and provide the feedback to the user based on at least part of the contextual information. 2. The apparatus of claim 1 , wherein the object that appears to be stationary across the monitored images is in a foreground of the scene, and the other portions of the scene are associated with a background of the scene. 3. The apparatus of claim 2 , wherein contextual information associated with at least one object in the background is used to identify the object that appears to be stationary in the foreground. 4. The apparatus of claim 3 , wherein the at least one processor device is further configured to retrieve the contextual information from the at least one object in the background when the object that appears to be stationary in the foreground is actually moving in the environment of the user relative to the background. 5. The apparatus of claim 4 , wherein the object that appears to be stationary in the foreground includes a branded product and the contextual information associated with the object that appears to be stationary includes information associated with the branded product. 6. The apparatus of claim 1 , wherein the object that appears to be stationary is actually a moving object in the environment of the user. 7. The apparatus of claim 6 , wherein the moving object includes a public transportation vehicle and the contextual information includes information associated with the public transportation vehicle. 8. The apparatus of claim 1 , wherein obtaining contextual information includes performing optical character recognition on an area of at least one of the plurality of images associated with the first portion of the scene. 9. The apparatus of claim 1 , wherein obtaining contextual information includes comparing the first portion of the scene with stored image data. 10. The apparatus of claim 1 , wherein the contextual information is used for selecting an action to execute from a plurality of context-based actions. 11. The apparatus of claim 1 , wherein the image sensor is further configured to capture real time images at a plurality of resolutions, and the contextual information is used for selecting which of the plurality of resolutions to use. 12. The apparatus of claim 1 , wherein the at least one processor device is configured to select between a plurality of differing processing schemes based on the contextual information obtained. 13. The apparatus of claim 12 , wherein the plurality of differing processing schemes includes a processing scheme to identify an object, a processing scheme to identify an individual, a processing scheme to audibly read a text, and a processing scheme to continuously monitor an object. 14. An apparatus for providing feedback to a user, the apparatus comprising: an image sensor for capturing real time images from an environment of the user, the image sensor configured to be positioned for movement with a head of the user as the head moves such that an aiming direction of the image sensor falls within a field of view of the user; and at least one processor device configured to: monitor the real time images captured by the image sensor; determine that an object of interest appears to be stationary in a first portion of a scene across the monitored real time images while the other portions of the scene appear to be moving across the monitored real time images; determine, based on the stationary appearance of the object of interest in the first portion of the scene across the monitored real time images while the other portions of the scene appear to be moving across the monitored real time images, that the head of the user is tracking the object of interest included in the first portion of the scene within the field of view; obtain contextual information associated with the object of interest that the head of the user is tracking; and provide the feedback to the user based on at least part of the contextual information. 15. The apparatus of claim 14 , wherein the at least one processor device is further configured to use the contextual information to identify the object of interest. 16. The apparatus of claim 15 , wherein the object of interest includes an individual or a branded product. 17. The apparatus of claim 14 , wherein the at least one processor device is further configured to initiate performance of an action associated with the object of interest, and suspend performance of the action when the object of interest moves outside a field of view of the image sensor. 18. The apparatus of claim 17 , wherein the object of interest includes text and the action includes optical character recognition. 19. The apparatus of claim 14 , wherein the at least one processor device is further configured to obtain additional contextual information associated with at least one object other than the object of interest, and to use at least part of the additional contextual information to output the feedback. 20. The apparatus of claim 14 , wherein the at least one processor device is further configured to obtain the contextual information when the object of interest lingers within the plurality of real time images for a predetermined period of time. 21. The apparatus of claim 20 , wherein the predetermined period of time depends on a characteristic associated with the object of interest. 22. A method for providing feedback to a user, the method comprising: obtaining from an image sensor a plurality of images, wherein the image sensor is configured to be positioned for movement with a head of the user; monitoring the plurality of images captured by the image sensor; determining that relative motion occurs between a first portion of a scene captured in the plurality of images and other portions of the scene captured in the plurality of images; determining that an object appears to be stationary in the first portion of the scene across the monitored images while the other portions of the scene appear to be moving across the monitored images; determining, based on the stationary appearance of the object in the first portion of the scene across the monitored images while the other portions of the scene appear to be moving across the monitored imag
by compensating for image skew or non-uniform image deformations · CPC title
using audible presentation of the information · CPC title
Constructional details · CPC title
by using two or more images to influence resolution, frame rate or aspect ratio · CPC title
based on recognised objects · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.