Who is the assignee on this patent?

Wexler Yonatan, Shashua Amnon, Orcam Technologies Ltd

What technology area does this patent fall under?

Primary CPC classification G09B21/008. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Mar 06 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Apparatus and method for analyzing images

US9911361B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9911361-B2
Application number	US-201314136876-A
Country	US
Kind code	B2
Filing date	Dec 20, 2013
Priority date	Mar 10, 2013
Publication date	Mar 6, 2018
Grant date	Mar 6, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus is provided for audibly reading text retrieved from a captured image. In one implementation, the apparatus comprises an image sensor configured to capture image data from an environment of a user, and at least one processor. The processor is configured to determine an existence of a pointing trigger in the image data, the trigger being associated with a user's desire to hear text read aloud, and wherein the trigger identifies an intermediate portion of the text a distance from a level break in the text. The processor is further configured to perform a layout analysis on the text to identify a level break associated with the trigger; and cause the text to be read aloud from the level break associated with the trigger.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for audibly reading text retrieved from a captured image, the apparatus comprising: an image sensor configured to capture image data from an environment of a user; and at least one processor configured to: determine an existence of a trigger in the image data, the trigger being associated with a user's desire to hear text read aloud, and wherein the trigger identifies an intermediate portion of the text a distance from a level break in the text; perform a layout analysis on the text to identify a level break associated with the trigger; perform an optical character recognition (OCR) only on a subset of the text in the image data associated with the trigger prior to causing the subset of the text to be read aloud; cause the subset of the text to be read aloud from the level break associated with the trigger; and while the subset of the text is being read aloud, anticipate a subsequent subset of the text to be read aloud and perform an OCR of the subsequent subset of the text in advance. 2. The apparatus of claim 1 , wherein the trigger includes an identification of text within a specific paragraph and wherein the level break is a beginning of a sequential paragraph associated with the specific paragraph. 3. The apparatus of claim 1 , wherein the trigger includes an identification of text within a specific paragraph and wherein the level break is at least one of the following: a beginning of the specific paragraph, a beginning of a specific sentence, a beginning of a specific column, a beginning of a specific page, an end of the specific paragraph, an end of the specific sentence, an end of the specific column, and an end of the specific page. 4. The apparatus of claim 1 , wherein the trigger includes an identification of at least two intermediate portions of the text each a distance from a different level break in the text, and wherein the at least one processor device is further configured to select a level break associated with the trigger and cause the text to be read aloud from the selected level break. 5. The apparatus of claim 4 , wherein the at least one processor device is further configured to select a level break based on context information. 6. The apparatus of claim 1 , wherein the at least one processor device is further configured to begin reading aloud the text prior to completion of a full OCR of the text, and to continue performance of the OCR while reading aloud is occurring. 7. The apparatus of claim 6 , wherein the at least one processor device is further configured to begin reading aloud within less than 4 seconds of initiation of the OCR. 8. The apparatus of claim 6 , wherein the at least one processor device is further configured to begin reading aloud within less than 3 seconds of initiation of the OCR. 9. The apparatus of claim 6 , wherein the at least one processor device is further configured to begin reading aloud within less than 1 second of initiation of the OCR. 10. The apparatus of claim 1 , wherein the image sensor is further configured to capture the image data in various resolutions. 11. The apparatus of claim 10 , wherein the at least one processor device is further configured to operate in a low power consumption mode by performing the layout analysis on image data taken at a resolution lower than a resolution of the image data used for performing the OCR. 12. A method for audibly reading text retrieved from a captured image, the method comprising: capturing real time image data from an environment of a user; determining an existence of a trigger in the image data, the trigger being associated with a desire of the user to hear text read aloud, and wherein the trigger identifies an intermediate portion of the text a distance from a level break in the text; performing a layout analysis on the text to identify the level break associated with the trigger; performing an optical character recognition (OCR) only on a subset of the text in the image data associated with the trigger prior to causing the subset of the text to be read aloud; reading aloud the subset of the text beginning from the level break associated with the trigger; and while the subset of the text is being read aloud, anticipating a subsequent subset of the text to be read aloud and performing an OCR of the subsequent subset of the text in advance. 13. A software product stored on a non-transitory computer readable medium and comprising data and computer implementable instructions for carrying out the method of claim 12 .

Assignees

Inventors

Classifications

H04M2250/52
including functional features of a camera · CPC title
G09B21/006
using audible presentation of the information · CPC title
A61F9/08
Devices or methods enabling eye-patients to replace direct visual perception by another kind of perception · CPC title
G06F3/011
Arrangements for interaction with the human body, e.g. for user immersion in virtual reality (blind teaching G09B21/00) · CPC title
G09B21/008Primary
using visual presentation of the information for the partially sighted · CPC title

Patent family

Related publications grouped by family.

View patent family 51487375

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9911361B2 cover?: An apparatus is provided for audibly reading text retrieved from a captured image. In one implementation, the apparatus comprises an image sensor configured to capture image data from an environment of a user, and at least one processor. The processor is configured to determine an existence of a pointing trigger in the image data, the trigger being associated with a user's desire to hear text r…
Who is the assignee on this patent?: Wexler Yonatan, Shashua Amnon, Orcam Technologies Ltd
What technology area does this patent fall under?: Primary CPC classification G09B21/008. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Mar 06 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Apparatus, method, and computer readable medium for recognizing text on a curved surface

Systems and methods for providing feedback based on the state of an object

Apparatus and method for hierarchical object identification using a camera on glasses

Apparatus and method for providing failed-attempt feedback using a camera on glasses

Systems and methods for audible facial recognition

Systems and methods for performing a triggered action

Frequently asked questions