Voice enabled screen reader

US9620124B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9620124-B2
Application numberUS-201414193590-A
CountryUS
Kind codeB2
Filing dateFeb 28, 2014
Priority dateFeb 28, 2014
Publication dateApr 11, 2017
Grant dateApr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In some embodiments, a system may process a user interface to identify textual or graphical items in the interface, and may prepare a plurality of audio files containing spoken representations of the items. As the user navigates through the interface, different ones of the audio files may be selected and played, to announce text associated with items selected by the user. A computing device may periodically determine whether a cache offering the interface to users stores audio files for all of the interface's textual items, and if the cache is missing any audio files for any of the textual items, the computing device may take steps to have a corresponding audio file created.

First claim

Opening claim text (preview).

We claim: 1. A method comprising: identifying a textual item in a user interface; associating a plurality of different audio files with the textual item, wherein the plurality of different audio files comprise a corresponding plurality of different audio announcements of the textual item; receiving, by a computing device, a first selection of the textual item in the user interface; determining a first one of the plurality of different audio files to use for audibly announcing the textual item, wherein the determining is based on a rule governing audio announcement in response to repeated selection of the textual item; causing playback of the first one of the plurality of different audio files based on the determining and responsive to the first selection of the textual item; receiving, by the computing device, a second selection of the textual item in the user interface; determining whether the second selection is a repeated selection of the textual item; and in response to determining that the second selection is a repeated selection of the textual item, causing playback of a second one of the plurality of different audio files responsive to the repeated selection of the textual item, wherein the second one of the plurality of different audio files comprises a different amount of explanatory speech than the first one of the plurality of different audio files. 2. The method of claim 1 , further comprising playing the first one of the plurality of different audio files. 3. The method of claim 1 , further comprising: identifying a plurality of screens of the user interface; associating a first screen audio file with a first screen of the plurality of screens; and in response to a selection of the first screen for display, retrieving the first screen audio file for playback. 4. The method of claim 1 , further comprising: periodically retrieving a current version of the user interface and determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file. 5. The method of claim 4 , wherein in response to determining that the current version of the user interface comprises one or more textual items for which there is not already a corresponding audio file, the method comprising transmitting an audio file request for the one or more textual items for which there is not already a corresponding audio file. 6. The method of claim 4 , wherein the determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file comprises: transmitting a request for an audio file header corresponding to a textual item under consideration; receiving the audio file header; and determining whether a size indicated in the audio file header is greater than a predetermined minimum size value. 7. The method of claim 4 , wherein the determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file comprises: retrieving a voice announcement identifier for a textual item under consideration; and consulting a database comprising a listing of voice announcement identifiers and corresponding entries indicating whether listed voice announcement identifiers are associated with audio files. 8. The method of claim 1 , wherein the textual item comprises a description of a video program, and wherein the plurality of different audio files comprise: the first one of the plurality of different audio files comprises a full audio reading of the description of the video program; and the second one of the plurality of different audio files comprises an abbreviated reading of the description of the video program, wherein in the abbreviated reading, certain words are shortened. 9. The method of claim 1 , further comprising: presenting the user interface on a first device at a premises, together with audio and video from an audiovisual program; and receiving a request to play the one of the plurality of different audio files on a second device at the premises, and not on the first device. 10. The method of claim 1 , further comprising: presenting the user interface on a first device at a premises, together with audio and video from an audiovisual program; and delivering the first and second ones of the plurality of audio files to corresponding first and second different user devices for playback in response to respective selections of the textual item by the first and second different user devices. 11. The method of claim 1 , wherein the plurality of different audio files correspond to different user expertise levels in navigating a menu corresponding to the user interface. 12. The method of claim 11 , wherein the first one of the plurality of different audio files comprises a greater amount of explanatory speech than the second one of the plurality of different audio files. 13. The method of claim 1 , further comprising storing the plurality of different audio files in a cache in advance of receiving the first selection of the textual item. 14. The method of claim 1 , further comprising: while a user is accessing the user interface, generating a new audio file to correspond with a new textual item in the user interface. 15. The method of claim 14 , further comprising: as the user accesses different screens of the user interface, determining whether the plurality of different audio files are cached in advance of user selection of the textual item in the different screens. 16. The method of claim 1 , wherein the textual item comprises a video program title. 17. A computer-readable medium storing instructions that, when executed, cause the following to occur: identifying a textual item in a user interface; associating a plurality of different audio files with the textual item, wherein the plurality of different audio files comprise a corresponding plurality of different audio announcements of the textual item; receiving, by a computing device, a first selection of the textual item in the user interface; determining a first one of the plurality of different audio files to use for audibly announcing the textual item, wherein the determining is based on a rule governing audio announcement in response to repeated selection of the textual item; causing playback of the first one of the plurality of different audio files based on the determining and responsive to the first selection of the textual item; receiving, by the computing device, a second selection of the textual item in the user interface; determining whether the second selection is a repeated selection of the textual item; and in response to determining that the second selection is a repeated selection of the textual item, causing playback of a second one of the plurality of different audio files responsive to the repeated selection of the textual item, wherein the second one of the plurality of different audio files comprises a different amount of explanatory speech than the first one of the plurality of different audio files. 18. The computer-readable medium of claim 17 , further storing instructions that, when executed, cause playing the first one of the plurality of different audio files. 19. The computer-readable medium of claim 17 , further storing instructions that, when executed, cause the following to occur: identifying a plurality of screens of the user interface; associating a first screen audio file with a first screen of the plurality of

Assignees

Inventors

Classifications

  • Arrangements for controlling cursor position based on codes indicative of cursor displacements from one discrete location to another, e.g. using cursor control keys associated to different directions or using the tab key (arrangements for controlling cursor position based on coordinate signals G06F3/038) · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content {(end-user interfaces for retrieving video data from a database G06F16/739; network services for supporting unicast streaming H04L65/612)} · CPC title

  • based on user interaction within the home (receiver circuitry for displaying additional information being controlled by a remote control apparatus H04N21/42204) · CPC title

  • Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9620124B2 cover?
In some embodiments, a system may process a user interface to identify textual or graphical items in the interface, and may prepare a plurality of audio files containing spoken representations of the items. As the user navigates through the interface, different ones of the audio files may be selected and played, to announce text associated with items selected by the user. A computing device may…
Who is the assignee on this patent?
Comcast Cable Comm Llc
What technology area does this patent fall under?
Primary CPC classification G06F3/04842. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).